SlideShare una empresa de Scribd logo
1 de 49
Exploratory data analysis: OGD user motivations

The Potential of Open Government Data (OGD) as a Tool in Democratic
Engagement and Reform of Public Services: The Case of Data.gov.uk

A presentation of data as an online appendix to Section 4.1 of the written
research report. To be used in conjunction with the written report only.


Tim Davies, MSc Student, Oxford Internet Institute
Exploratory Analysis: July 2010.
www.practicalparticipation.co.uk/odi/ | tim@timdavies.org.uk | @timdavies
What is this presentation
I have used this presentation as a working notebook whilst bringing together my
exploratory analysis of the motivations of OGD users according to the survey data
from the Open Data Impacts Survey.

It is not a presentation of findings, but shows the process I’ve gone through in
exploratory analysis.

Stages are left in, even if they don’t illuminate the final conclusions in order to help
ensure a transparent process.

R commands
Analysis carried out in R. Relevant commands given throughout.

These are mainly an aide-memoir for my own use, but, in conjunction with the shared data, should
also support replication of the results.
Overview:
     • Question: What motivates users of Open Government Data (OGD)


     • Data source: 72 responses from an opportunistically sampled online survey
       delivered between May 11th and June 14th 2010 targeted at users of OGD.


     • Data: Responses to a range of ‘Strongly Agree -> Strongly Disagree’ based
       likert scales on ‘Attitudes’ towards OGD and ‘Motivations’ for working with
       OGD. Also responses to a range of statements about OGD use projects from
       44 individuals within the sample.


     • Analysis: Using correlation analysis, exploratory factor analysis (Bartholomew
       et al. 2008, 7 - 9; Costello & Osborne 2005; Lawley & Maxwell 1962) and
       bootstrapped cluster analysis (Suzuki & Shimodaira 2006) in R.
Bartholomew, D.J. et al., 2008. Analysis of Multivariate Social Science Data, Second Edition (Chapman & Hall) 2nd ed., Chapman and Hall.
Becker, R.A., Chambers, J.M. & Wilks, A.R., 1988. The new S language: a programming environment for data analysis and graphics.
Costello, A.B. & Osborne, J.W., 2005. Best practices in exploratory factor analysis: four recommendations for getting the most from your analysis. Practical Assessment,
      Research & Evaluation, (7), 1–9.
Lawley, D.N. & Maxwell, A.E., 1962. Factor Analysis as a Statistical Method. Journal of the Royal Statistical Society. Series D (The Statistician), 12(3), 209-229.
Suzuki, R. & Shimodaira, H., 2006. Pvclust: an R package for assessing the uncertainty in hierarchical clustering. Bioinformatics, 22(12), 1540.
Caveats

• Sample size: Too small to allow any strong conclusions to be drawn


• Items: Survey items were based on an initial literature review and exploratory
  research, but, due to time constraints, were not subjected to prior pilot testing.
  Refinements of the items and new data collection would strengthen the available
  data for future analysis.


• Holistic interpretation: The interpretation presented draws upon contextual
  information gathered during exploratory research.


• Interactive presentation: I’ve tried to include enough detail in this presentation to
  account for my analysis in Section 4.1 of the related dissertation. However, if the
  reader feels any analysis is not fully explained or does not provide adequate
  justification, questions are welcome on the blog post relating to this presentation at
  http://www.practicalparticipation.co.uk/odi/
1: Correlation between motivations

• Q: “People are interested in open data for many different reasons. Thinking
  about your own engagement with open government data, how important are
  the following motivations for working with or using open government data?”


• A: ‘Not at all important’ to ‘Very Important’ coded on a scale -2 to +2.


• Missing answers: Coded as 0 (neutral)


• Data: OpenDataImpacts-SurveyMotivationData.csv (Available online as Google Spreadsheet)


• Analysis: R correlation matrix, coded and ordered for strength or correlation.
  Clusters interpreted. R commands
                              > library(gclus)
                              > dta<-motivations
                              > dta.r<-abs(cor(dta))
                              > dta.col<-dmat.color(dta.r,colorRampPalette(c("white","orange","red"))(100))
                              > dta.o<-order.single(dta.r)
                              > cpairs(dta,dta.o,panel.colors=dta.col,gap=.5,main="Variables ordered and colored by correlation")
1: Correlation between motivations
1: Correlation between motivations




          1
1: Correlation between motivations
                             1) Government focussed
                             Understanding, Efficiency, Accountability




          1
1: Correlation between motivations
                                                               Also linked to make a
                             1) Government focussed            difference...

                             Understanding, Efficiency, Accountability




          1
1: Correlation between motivations
                                                               Also linked to make a
                             1) Government focussed            difference...

                             Understanding, Efficiency, Accountability




          1
1: Correlation between motivations
                                                               Also linked to make a
                             1) Government focussed            difference...

                             Understanding, Efficiency, Accountability




          1




                   2
1: Correlation between motivations
                                                               Also linked to make a
                             1) Government focussed            difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web




          1




                   2
1: Correlation between motivations
                                                               Also linked to make a
                             1) Government focussed            difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web




          1




                   2


                         3
1: Correlation between motivations
                                                               Also linked to make a
                             1) Government focussed            difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web

                             3) Reward focussed
                             Recognition or profit




          1




                   2


                         3
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...




          1




                   2


                         3
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...




          1


              4
                   2


                         3
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...


                             4) Digitising government focus
                             A computerisation movement?

          1


              4
                   2


                         3
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...


                             4) Digitising government focus
                             A computerisation movement?

          1


              4
                   2


                         3




                                                        5
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
                             Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...


                             4) Digitising government focus
                             A computerisation movement?

          1                  5) Problem solving focus
                             Develop new skills to meet interesting challenge


              4
                   2


                         3




                                                         5
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
6                            Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...


                             4) Digitising government focus
                             A computerisation movement?

          1                  5) Problem solving focus
                             Develop new skills to meet interesting challenge


              4
                   2


                         3




                                                         5
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
6                            Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...


                             4) Digitising government focus
                             A computerisation movement?

          1                  5) Problem solving focus
                             Develop new skills to meet interesting challenge

                             6) Social/public sector enterprise
              4              Companies providing local focussed services

                   2


                         3




                                                         5
1: Correlation between motivations
                                                                    Also linked to make a
                             1) Government focussed                 difference...

                             Understanding, Efficiency, Accountability

                             2) Technology innovation focussed
6                            Creating platforms, with semantic web

                             3) Reward focussed          In open tech community
                                                         recognition is key asset for
                             Recognition or profit        gaining future work...


                             4) Digitising government focus
                             A computerisation movement?

          1                  5) Problem solving focus
                             Develop new skills to meet interesting challenge

                             6) Social/public sector enterprise
              4              Companies providing local focussed services
                                                         E.g. Environmental
                   2                                     mapping; Transport
                                                         service using OGD.




                         3




                                                         5
2: Cluster analysis of motivations
                                             • Bootstrapped cluster analysis
                                               shows similar patterns.


                                             • But no strong p values. I.e.
                                               weak & overlapping clusters as
                                               seen in correlation analysis.


                                             • Adds curiosity to
                                               government cluster. Fits with
                                               participant-observation
                                               evidence that interest in
                                               government is secondary to
                                               interest in technology for
                                               many OGD users.




                              R commands
                              > library(pvclust)
                              > motclust<-pvclust(motivations,nboot=100000)
                              > plot(motclust)
3: Additional analysis
                                                                      R commands: non-graphical screen test solutions
                                                                      > library(nFactors)
• Whilst a number of exploratory factor analysis solutions            > ev<-eigen(cor(motivations))
  can offer insights during interactive exploration of the data,      > ap<-
                                                                      parallel(subject=nrow(motivations),var=ncol(motivations),re
  the non-graphical scree test solution suggests between 8            p=100,cent=0.05)
  and 10 factors, almost as many as there are items: thus             > nS <- nScree(ev$values, ap$eigen$qevpea)
  Factor analysis is of limited additional use.                       > plotnScree(nS)

                                                                      R commands: factor analysis
                                                                      > factmot3<-factanal(motivations,3, rotation="promax",
• kmeans analysis can be used to look at cluster sizes by             scores="regression") # The three factor solution is the
                                                                      most intuitively interpretable.
  discretely dividing the data - but the overlapping nature of        > print(factmot3,cutoff=0.3,sort=T,digits=2)
  the clusters we are identifying suggests this is also of limited
  utility. Interactively exploring different cluster sizes and the    R commands: kmeans analysis
  size of those clusters does however provide some additional         > factkmeans5<-kmeans(t(motivations),5)
                                                                      > #5 kmeans factors are interpretable
  insights which can then be checked against other survey
  data.


• Qualitative reading of particular individual cases in each cluster, and additional exploratory statistical analysis
  can support further identification & description of the different forms of motivations OGD users have.


• E.g. Those motivated by ‘Specific problems’ want ‘bulk Data’ and are not interested in SPARQL and building
  platforms; whereas those interested in platform building and developing new skills have strong interest in
  semantic web technologies.
Analysis of motivations
• O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in
  OGD:

  “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency
  advocates who want a more efficient and honest government, [3] citizen advocates who want services and
  information to make their lives better, [4] open advocates who believe that governments act for the people
  therefore government data should be available for free to the people, and [5] wonks who are hoping that
  releasing datasets ... will deliver…economic benefits to the country”.


• However, we find quite different clusters of motivations in the survey data:
Analysis of motivations
 • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in
   OGD:

    “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency
    advocates who want a more efficient and honest government, [3] citizen advocates who want services and
    information to make their lives better, [4] open advocates who believe that governments act for the people
    therefore government data should be available for free to the people, and [5] wonks who are hoping that
    releasing datasets ... will deliver…economic benefits to the country”.


 • However, we find quite different clusters of motivations in the survey data:


1) Government focussed
Understanding, Efficiency, Accountability
Analysis of motivations
 • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in
   OGD:

    “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency
    advocates who want a more efficient and honest government, [3] citizen advocates who want services and
    information to make their lives better, [4] open advocates who believe that governments act for the people
    therefore government data should be available for free to the people, and [5] wonks who are hoping that
    releasing datasets ... will deliver…economic benefits to the country”.


 • However, we find quite different clusters of motivations in the survey data:


1) Government focussed                                                       2) Technology innovation focussed
Understanding, Efficiency, Accountability                                     Creating platforms, with semantic web
Analysis of motivations
 • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in
   OGD:

    “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency
    advocates who want a more efficient and honest government, [3] citizen advocates who want services and
    information to make their lives better, [4] open advocates who believe that governments act for the people
    therefore government data should be available for free to the people, and [5] wonks who are hoping that
    releasing datasets ... will deliver…economic benefits to the country”.


 • However, we find quite different clusters of motivations in the survey data:


1) Government focussed                                                       2) Technology innovation focussed
Understanding, Efficiency, Accountability                                     Creating platforms, with semantic web


                                                                             3) Reward focussed
                                                                             Recognition or profit
Analysis of motivations
 • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in
   OGD:

    “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency
    advocates who want a more efficient and honest government, [3] citizen advocates who want services and
    information to make their lives better, [4] open advocates who believe that governments act for the people
    therefore government data should be available for free to the people, and [5] wonks who are hoping that
    releasing datasets ... will deliver…economic benefits to the country”.


 • However, we find quite different clusters of motivations in the survey data:


1) Government focussed                     4) Digitising government focus             2) Technology innovation focussed
Understanding, Efficiency, Accountability   A political or computerisation movement?   Creating platforms, with semantic web


                                                                                      3) Reward focussed
                                                                                      Recognition or profit
Analysis of motivations
 • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in
   OGD:

    “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency
    advocates who want a more efficient and honest government, [3] citizen advocates who want services and
    information to make their lives better, [4] open advocates who believe that governments act for the people
    therefore government data should be available for free to the people, and [5] wonks who are hoping that
    releasing datasets ... will deliver…economic benefits to the country”.


 • However, we find quite different clusters of motivations in the survey data:


1) Government focussed                             4) Digitising government focus             2) Technology innovation focussed
Understanding, Efficiency, Accountability           A political or computerisation movement?   Creating platforms, with semantic web


                                                                                              3) Reward focussed
                                                                                              Recognition or profit


5) Problem solving focus
Develop new skills to meet interesting challenge
Analysis of motivations
 • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in
   OGD:

    “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency
    advocates who want a more efficient and honest government, [3] citizen advocates who want services and
    information to make their lives better, [4] open advocates who believe that governments act for the people
    therefore government data should be available for free to the people, and [5] wonks who are hoping that
    releasing datasets ... will deliver…economic benefits to the country”.


 • However, we find quite different clusters of motivations in the survey data:


1) Government focussed                             4) Digitising government focus             2) Technology innovation focussed
Understanding, Efficiency, Accountability           A political or computerisation movement?   Creating platforms, with semantic web


6) Social/public sector enterprise                                                            3) Reward focussed
Companies providing local focussed services                                                   Recognition or profit


5) Problem solving focus
Develop new skills to meet interesting challenge
Further analysis
                                                                R commands: identify clusters & create new matrix
                                                                > c1<-c(7,9,2)
                                                                > c2<-c(13,10,3)
• We can take these clusters and create a new matrix of         > c3<-c(3,11,4)
                                                                > c4<-c(9,2,13,10)
  individuals rankings against them by summing the individual
                                                                > c5<-c(14,8)
  variables.                                                    > c6<-c(12,6)

                                                                > mclusts<-
                                                                cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector(
• We can then look at correlations between these and other      by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[,
  variables.                                                    c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum)
                                                                ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m
                                                                otivations[,c6],c(1:72),sum)))
                                                                > colnames(mclusts)<-c("GovFocus", "TechFocus",
• For example, looking at correlations between preferred        "RewardFocus", "DigitalGov", "ProblemSolving",
  methods of data access and user motivation.                   "Enterprise")


                                                                The additional data used is available in Google Docs linked from
                                                                http://www.practicalparticipation.co.uk/odi/survey

                                                                Download the relevant file as CSV and then import into the correct
                                                                variable name with the command
                                                                > egassociations<-read.csv<-(file=”filename.csv’,headers=T)

                                                                To remove the line numbers, you can use:
                                                                > egassociations<-egassociations[,2:length(assoctest[1,])]
Further analysis
                                                                    R commands: identify clusters & create new matrix
                                                                    > c1<-c(7,9,2)
                                                                    > c2<-c(13,10,3)
    • We can take these clusters and create a new matrix of         > c3<-c(3,11,4)
                                                                    > c4<-c(9,2,13,10)
      individuals rankings against them by summing the individual
                                                                    > c5<-c(14,8)
      variables.                                                    > c6<-c(12,6)

                                                                    > mclusts<-
                                                                    cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector(
    • We can then look at correlations between these and other      by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[,
      variables.                                                    c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum)
                                                                    ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m
                                                                    otivations[,c6],c(1:72),sum)))
                                                                    > colnames(mclusts)<-c("GovFocus", "TechFocus",
    • For example, looking at correlations between preferred        "RewardFocus", "DigitalGov", "ProblemSolving",
      methods of data access and user motivation.                   "Enterprise")


                                                                    The additional data used is available in Google Docs linked from
                                                                    http://www.practicalparticipation.co.uk/odi/survey

                                                                    Download the relevant file as CSV and then import into the correct
                                                                    variable name with the command
                                                                    > egassociations<-read.csv<-(file=”filename.csv’,headers=T)

                                                                    To remove the line numbers, you can use:
                                                                    > egassociations<-egassociations[,2:length(assoctest[1,])]



1) Government focussed
Understanding, Efficiency, Accountability
Further analysis
                                                                       R commands: identify clusters & create new matrix
                                                                       > c1<-c(7,9,2)
                                                                       > c2<-c(13,10,3)
    • We can take these clusters and create a new matrix of            > c3<-c(3,11,4)
                                                                       > c4<-c(9,2,13,10)
      individuals rankings against them by summing the individual
                                                                       > c5<-c(14,8)
      variables.                                                       > c6<-c(12,6)

                                                                       > mclusts<-
                                                                       cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector(
    • We can then look at correlations between these and other         by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[,
      variables.                                                       c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum)
                                                                       ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m
                                                                       otivations[,c6],c(1:72),sum)))
                                                                       > colnames(mclusts)<-c("GovFocus", "TechFocus",
    • For example, looking at correlations between preferred           "RewardFocus", "DigitalGov", "ProblemSolving",
      methods of data access and user motivation.                      "Enterprise")


                                                                       The additional data used is available in Google Docs linked from
                                                                       http://www.practicalparticipation.co.uk/odi/survey

                                                                       Download the relevant file as CSV and then import into the correct
                                                                       variable name with the command
                                                                       > egassociations<-read.csv<-(file=”filename.csv’,headers=T)

                                                                       To remove the line numbers, you can use:
                                                                       > egassociations<-egassociations[,2:length(assoctest[1,])]



1) Government focussed                                              2) Technology innovation focussed
Understanding, Efficiency, Accountability                            Creating platforms, with semantic web
Further analysis
                                                                       R commands: identify clusters & create new matrix
                                                                       > c1<-c(7,9,2)
                                                                       > c2<-c(13,10,3)
    • We can take these clusters and create a new matrix of            > c3<-c(3,11,4)
                                                                       > c4<-c(9,2,13,10)
      individuals rankings against them by summing the individual
                                                                       > c5<-c(14,8)
      variables.                                                       > c6<-c(12,6)

                                                                       > mclusts<-
                                                                       cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector(
    • We can then look at correlations between these and other         by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[,
      variables.                                                       c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum)
                                                                       ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m
                                                                       otivations[,c6],c(1:72),sum)))
                                                                       > colnames(mclusts)<-c("GovFocus", "TechFocus",
    • For example, looking at correlations between preferred           "RewardFocus", "DigitalGov", "ProblemSolving",
      methods of data access and user motivation.                      "Enterprise")


                                                                       The additional data used is available in Google Docs linked from
                                                                       http://www.practicalparticipation.co.uk/odi/survey

                                                                       Download the relevant file as CSV and then import into the correct
                                                                       variable name with the command
                                                                       > egassociations<-read.csv<-(file=”filename.csv’,headers=T)

                                                                       To remove the line numbers, you can use:
                                                                       > egassociations<-egassociations[,2:length(assoctest[1,])]



1) Government focussed                                              2) Technology innovation focussed
Understanding, Efficiency, Accountability                            Creating platforms, with semantic web

                                                                    3) Reward focussed
                                                                    Recognition or profit
Further analysis
                                                                               R commands: identify clusters & create new matrix
                                                                               > c1<-c(7,9,2)
                                                                               > c2<-c(13,10,3)
    • We can take these clusters and create a new matrix of                    > c3<-c(3,11,4)
                                                                               > c4<-c(9,2,13,10)
      individuals rankings against them by summing the individual
                                                                               > c5<-c(14,8)
      variables.                                                               > c6<-c(12,6)

                                                                               > mclusts<-
                                                                               cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector(
    • We can then look at correlations between these and other                 by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[,
      variables.                                                               c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum)
                                                                               ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m
                                                                               otivations[,c6],c(1:72),sum)))
                                                                               > colnames(mclusts)<-c("GovFocus", "TechFocus",
    • For example, looking at correlations between preferred                   "RewardFocus", "DigitalGov", "ProblemSolving",
      methods of data access and user motivation.                              "Enterprise")


                                                                               The additional data used is available in Google Docs linked from
                                                                               http://www.practicalparticipation.co.uk/odi/survey

                                                                               Download the relevant file as CSV and then import into the correct
                                                                               variable name with the command
                                                                               > egassociations<-read.csv<-(file=”filename.csv’,headers=T)

                                                                               To remove the line numbers, you can use:
                                                                               > egassociations<-egassociations[,2:length(assoctest[1,])]



1) Government focussed                     4) Digitising government focus   2) Technology innovation focussed
Understanding, Efficiency, Accountability   A computerisation movement...    Creating platforms, with semantic web

                                                                            3) Reward focussed
                                                                            Recognition or profit
Further analysis
                                                                                       R commands: identify clusters & create new matrix
                                                                                       > c1<-c(7,9,2)
                                                                                       > c2<-c(13,10,3)
    • We can take these clusters and create a new matrix of                            > c3<-c(3,11,4)
                                                                                       > c4<-c(9,2,13,10)
      individuals rankings against them by summing the individual
                                                                                       > c5<-c(14,8)
      variables.                                                                       > c6<-c(12,6)

                                                                                       > mclusts<-
                                                                                       cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector(
    • We can then look at correlations between these and other                         by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[,
      variables.                                                                       c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum)
                                                                                       ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m
                                                                                       otivations[,c6],c(1:72),sum)))
                                                                                       > colnames(mclusts)<-c("GovFocus", "TechFocus",
    • For example, looking at correlations between preferred                           "RewardFocus", "DigitalGov", "ProblemSolving",
      methods of data access and user motivation.                                      "Enterprise")


                                                                                       The additional data used is available in Google Docs linked from
                                                                                       http://www.practicalparticipation.co.uk/odi/survey

                                                                                       Download the relevant file as CSV and then import into the correct
                                                                                       variable name with the command
                                                                                       > egassociations<-read.csv<-(file=”filename.csv’,headers=T)

                                                                                       To remove the line numbers, you can use:
                                                                                       > egassociations<-egassociations[,2:length(assoctest[1,])]



1) Government focussed                             4) Digitising government focus   2) Technology innovation focussed
Understanding, Efficiency, Accountability           A computerisation movement...    Creating platforms, with semantic web

                                                                                    3) Reward focussed
                                                                                    Recognition or profit

5) Problem solving focus
Develop new skills to meet interesting challenge
Further analysis
                                                                                       R commands: identify clusters & create new matrix
                                                                                       > c1<-c(7,9,2)
                                                                                       > c2<-c(13,10,3)
    • We can take these clusters and create a new matrix of                            > c3<-c(3,11,4)
                                                                                       > c4<-c(9,2,13,10)
      individuals rankings against them by summing the individual
                                                                                       > c5<-c(14,8)
      variables.                                                                       > c6<-c(12,6)

                                                                                       > mclusts<-
                                                                                       cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector(
    • We can then look at correlations between these and other                         by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[,
      variables.                                                                       c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum)
                                                                                       ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m
                                                                                       otivations[,c6],c(1:72),sum)))
                                                                                       > colnames(mclusts)<-c("GovFocus", "TechFocus",
    • For example, looking at correlations between preferred                           "RewardFocus", "DigitalGov", "ProblemSolving",
      methods of data access and user motivation.                                      "Enterprise")


                                                                                       The additional data used is available in Google Docs linked from
                                                                                       http://www.practicalparticipation.co.uk/odi/survey

                                                                                       Download the relevant file as CSV and then import into the correct
                                                                                       variable name with the command
                                                                                       > egassociations<-read.csv<-(file=”filename.csv’,headers=T)

                                                                                       To remove the line numbers, you can use:
                                                                                       > egassociations<-egassociations[,2:length(assoctest[1,])]



1) Government focussed                             4) Digitising government focus   2) Technology innovation focussed
Understanding, Efficiency, Accountability           A computerisation movement...    Creating platforms, with semantic web

6) Social/public sector enterprise                                                  3) Reward focussed
Companies providing local focussed services                                         Recognition or profit

5) Problem solving focus
Develop new skills to meet interesting challenge
Further analysis: motivations and attitudes
        • The attitudes dataset records responses against a number of claims about OGD.


        • The table below shows how different motivational clusters correlate with particular statements about
          OGD. Note the strong correlation between ‘Reward focus’ and statements about commercial benefit;
          and the view in Government focussed and problem solving that OGD can drive reform of public
          services.


        • Only Government focussed motivations have any real correlation with claims about accountability.


        • The ‘Enterprise’ correlations suggest a stronger ‘Social Enterprise’ interpretation may be appropriate.

                                    Correlation matrix of generated motivational clusters and response to ‘attitudes 1’ questions




    This row is curious. Little
evidence elsewhere in study that
OGD actually is helping errors to
          be identified...


                                                                                                           R commands:
                                                                                                           > cor(attitudes,mclusts) #display formatted in excel
Further analysis: attitudes 2	

• The attitudes 2 set of questions have far lower correlation. Overall the consensus on
  these items was far stronger, thus the information that can be extracted from their
  correlation with particular motivations less.


• The correlations are not easily interpretable - but suggest that the views within each
  motivational cluster are not homogenous. The small (but greater than others)
  correlation of Tech and DigitalGov clusters against ‘Data only open when clear
  demand’ is surprising.


• Attitudes 2 does little to illuminate the interpretation of these motivational clusters.

                Correlation matrix of generated motivational clusters and response to ‘attitudes 2’ questions




                                                                                  R commands:
                                                                                  > cor(attitudes2,mclusts) #display formatted in excel
Further analysis: associations

• Correlations against associations provide some further confirmation of the
  clusters, suggesting the diversity of the ‘Gov Focus’ users, but relatively
  strong identity of the tech and reward focussed clusters.




                                    Correlation matrix of generated motivational clusters and response to
                                    ‘associations’ questions




                                                       R commands:
                                                       > cor(associations,mclusts) #display formatted in excel
The digital government connection
• Using multi-dimensional scaling can
  support visualisation of the
  relationship between the different
  motivational clusters.



• Given the data feeding into the
  clusters is overlapping (1 overlaps
  4; 2 overlaps 3 and 4) we may
  expect some clustering of these
  when MDS is applied. However, we
  do see (6) closer to (1), (4) and (2)
  than may be anticipated, and (5)
  distant from Digitising Government
  focus.




                                          R: Using a manually created distance matrix of 1-correlation matrix value as dist

                                          > fit<-cmdscale(dist,2)
                                          > plot(fit[,2],fit[,1])
                                          > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
The digital government connection
• Using multi-dimensional scaling can
  support visualisation of the
  relationship between the different      1) Government focussed
  motivational clusters.                  Understanding, Efficiency, Accountability



• Given the data feeding into the
  clusters is overlapping (1 overlaps
  4; 2 overlaps 3 and 4) we may
  expect some clustering of these
  when MDS is applied. However, we
  do see (6) closer to (1), (4) and (2)
  than may be anticipated, and (5)
  distant from Digitising Government
  focus.




                                                      R: Using a manually created distance matrix of 1-correlation matrix value as dist

                                                      > fit<-cmdscale(dist,2)
                                                      > plot(fit[,2],fit[,1])
                                                      > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
The digital government connection
• Using multi-dimensional scaling can
  support visualisation of the
  relationship between the different      1) Government focussed
  motivational clusters.                  Understanding, Efficiency, Accountability



• Given the data feeding into the
  clusters is overlapping (1 overlaps
  4; 2 overlaps 3 and 4) we may
  expect some clustering of these
  when MDS is applied. However, we
  do see (6) closer to (1), (4) and (2)
  than may be anticipated, and (5)
  distant from Digitising Government
  focus.
                                                                                     2) Technology innovation focussed
                                                                                     Creating platforms, with semantic web




                                                      R: Using a manually created distance matrix of 1-correlation matrix value as dist

                                                      > fit<-cmdscale(dist,2)
                                                      > plot(fit[,2],fit[,1])
                                                      > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
The digital government connection
• Using multi-dimensional scaling can
  support visualisation of the
  relationship between the different      1) Government focussed
  motivational clusters.                  Understanding, Efficiency, Accountability



• Given the data feeding into the
  clusters is overlapping (1 overlaps
  4; 2 overlaps 3 and 4) we may
  expect some clustering of these
  when MDS is applied. However, we
  do see (6) closer to (1), (4) and (2)
  than may be anticipated, and (5)
  distant from Digitising Government
  focus.
                                                                                     2) Technology innovation focussed
                                                                                     Creating platforms, with semantic web




                                                                                    3) Reward focussed
                                                                                    Recognition or profit


                                                      R: Using a manually created distance matrix of 1-correlation matrix value as dist

                                                      > fit<-cmdscale(dist,2)
                                                      > plot(fit[,2],fit[,1])
                                                      > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
The digital government connection
• Using multi-dimensional scaling can
  support visualisation of the
  relationship between the different      1) Government focussed
  motivational clusters.                  Understanding, Efficiency, Accountability



• Given the data feeding into the
  clusters is overlapping (1 overlaps                                              4) Digitising government focus
  4; 2 overlaps 3 and 4) we may                                                    A computerisation movement?
  expect some clustering of these
  when MDS is applied. However, we
  do see (6) closer to (1), (4) and (2)
  than may be anticipated, and (5)
  distant from Digitising Government
  focus.
                                                                                     2) Technology innovation focussed
                                                                                     Creating platforms, with semantic web




                                                                                    3) Reward focussed
                                                                                    Recognition or profit


                                                      R: Using a manually created distance matrix of 1-correlation matrix value as dist

                                                      > fit<-cmdscale(dist,2)
                                                      > plot(fit[,2],fit[,1])
                                                      > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
The digital government connection
• Using multi-dimensional scaling can
  support visualisation of the
  relationship between the different                       1) Government focussed
  motivational clusters.                                   Understanding, Efficiency, Accountability



• Given the data feeding into the
  clusters is overlapping (1 overlaps                                                               4) Digitising government focus
  4; 2 overlaps 3 and 4) we may                                                                     A computerisation movement?
  expect some clustering of these
  when MDS is applied. However, we
  do see (6) closer to (1), (4) and (2)
  than may be anticipated, and (5)
  distant from Digitising Government
  focus.
                                                                                                      2) Technology innovation focussed
                                                                                                      Creating platforms, with semantic web

        5) Problem solving focus
        Develop new skills to meet interesting challenge


                                                                                                     3) Reward focussed
                                                                                                     Recognition or profit


                                                                       R: Using a manually created distance matrix of 1-correlation matrix value as dist

                                                                       > fit<-cmdscale(dist,2)
                                                                       > plot(fit[,2],fit[,1])
                                                                       > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
The digital government connection
• Using multi-dimensional scaling can
  support visualisation of the
  relationship between the different                          1) Government focussed
  motivational clusters.                                      Understanding, Efficiency, Accountability



• Given the data feeding into the
  clusters is overlapping (1 overlaps           6) Social/public sector enterprise                     4) Digitising government focus
  4; 2 overlaps 3 and 4) we may                 Companies providing local focussed services            A computerisation movement?
  expect some clustering of these
  when MDS is applied. However, we
  do see (6) closer to (1), (4) and (2)
  than may be anticipated, and (5)
  distant from Digitising Government
  focus.
                                                                                                         2) Technology innovation focussed
                                                                                                         Creating platforms, with semantic web

        5) Problem solving focus
        Develop new skills to meet interesting challenge


                                                                                                        3) Reward focussed
                                                                                                        Recognition or profit


                                                                          R: Using a manually created distance matrix of 1-correlation matrix value as dist

                                                                          > fit<-cmdscale(dist,2)
                                                                          > plot(fit[,2],fit[,1])
                                                                          > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
Analysis

• To be written up....

Más contenido relacionado

Similar a Who is using Open Government Data - Analysis

Emerging Trends in Knowledge Management
Emerging Trends in Knowledge ManagementEmerging Trends in Knowledge Management
Emerging Trends in Knowledge Management
Nikesh Narayanan
 
Knowledge-Centric Paradigm: A New World of IT Solutions
Knowledge-Centric Paradigm: A New World of IT SolutionsKnowledge-Centric Paradigm: A New World of IT Solutions
Knowledge-Centric Paradigm: A New World of IT Solutions
Ed Dodds
 

Similar a Who is using Open Government Data - Analysis (20)

Opendata challeges&opportunites
Opendata challeges&opportunitesOpendata challeges&opportunites
Opendata challeges&opportunites
 
21st Century Skills
21st Century Skills21st Century Skills
21st Century Skills
 
Semantic Technology Solutions For Gov 2 0 Citizen-Friendly Recovery.Gov and D...
Semantic Technology Solutions For Gov 2 0 Citizen-Friendly Recovery.Gov and D...Semantic Technology Solutions For Gov 2 0 Citizen-Friendly Recovery.Gov and D...
Semantic Technology Solutions For Gov 2 0 Citizen-Friendly Recovery.Gov and D...
 
OpenKollab Project Matching
OpenKollab Project MatchingOpenKollab Project Matching
OpenKollab Project Matching
 
Emerging Trends in Knowledge Management
Emerging Trends in Knowledge ManagementEmerging Trends in Knowledge Management
Emerging Trends in Knowledge Management
 
Research Issues in Knowledge Management and Social Media
Research Issues in Knowledge Management and Social MediaResearch Issues in Knowledge Management and Social Media
Research Issues in Knowledge Management and Social Media
 
Towards the intranet of the future
Towards the intranet of the futureTowards the intranet of the future
Towards the intranet of the future
 
Developing a Business Case for Corporate Portals
Developing a Business Case for Corporate PortalsDeveloping a Business Case for Corporate Portals
Developing a Business Case for Corporate Portals
 
Semantic Technology Solutions For Recovery Gov And Data Gov With Transparenc...
Semantic Technology Solutions For Recovery Gov And  Data Gov With Transparenc...Semantic Technology Solutions For Recovery Gov And  Data Gov With Transparenc...
Semantic Technology Solutions For Recovery Gov And Data Gov With Transparenc...
 
Organizational Competences for Open Innovation
Organizational Competences for Open InnovationOrganizational Competences for Open Innovation
Organizational Competences for Open Innovation
 
ICT Vision and Technologies Adoption across Thailand
ICT Vision and Technologies Adoption across ThailandICT Vision and Technologies Adoption across Thailand
ICT Vision and Technologies Adoption across Thailand
 
Pmc
PmcPmc
Pmc
 
Presentation at board DKV Seguros
Presentation at board DKV SegurosPresentation at board DKV Seguros
Presentation at board DKV Seguros
 
Enterprise Techonology Trends
Enterprise Techonology TrendsEnterprise Techonology Trends
Enterprise Techonology Trends
 
E democracy, visualization, open data, digital citizenship
E democracy, visualization, open data, digital citizenshipE democracy, visualization, open data, digital citizenship
E democracy, visualization, open data, digital citizenship
 
2010 Tech America Federal CIO Survey Final Report
2010 Tech America Federal CIO Survey Final Report2010 Tech America Federal CIO Survey Final Report
2010 Tech America Federal CIO Survey Final Report
 
Art of Projects 2018 - Kamil Mroz
Art of Projects 2018 - Kamil MrozArt of Projects 2018 - Kamil Mroz
Art of Projects 2018 - Kamil Mroz
 
Tl Techno Vision 2012
Tl Techno Vision 2012Tl Techno Vision 2012
Tl Techno Vision 2012
 
Knowledge-Centric Paradigm: A New World of IT Solutions
Knowledge-Centric Paradigm: A New World of IT SolutionsKnowledge-Centric Paradigm: A New World of IT Solutions
Knowledge-Centric Paradigm: A New World of IT Solutions
 
Report 2011 Internal Communications Within Italian Banks
Report 2011   Internal Communications Within Italian BanksReport 2011   Internal Communications Within Italian Banks
Report 2011 Internal Communications Within Italian Banks
 

Más de Tim Davies

Open data and e participation copy
Open data and e participation copyOpen data and e participation copy
Open data and e participation copy
Tim Davies
 
Workshop an introduction to open data
Workshop   an introduction to open dataWorkshop   an introduction to open data
Workshop an introduction to open data
Tim Davies
 
Lse open data construction and critiques
Lse   open data construction and critiquesLse   open data construction and critiques
Lse open data construction and critiques
Tim Davies
 

Más de Tim Davies (20)

The State of Open Data - AI, Data Literacy and the Private Sector
The State of Open Data - AI, Data Literacy and the Private SectorThe State of Open Data - AI, Data Literacy and the Private Sector
The State of Open Data - AI, Data Literacy and the Private Sector
 
Participatory public data infrastructure: open data standards and the turn to...
Participatory public data infrastructure: open data standards and the turn to...Participatory public data infrastructure: open data standards and the turn to...
Participatory public data infrastructure: open data standards and the turn to...
 
Open Data and Anti-Corruption: Open Data Charter Packages
Open Data and Anti-Corruption: Open Data Charter PackagesOpen Data and Anti-Corruption: Open Data Charter Packages
Open Data and Anti-Corruption: Open Data Charter Packages
 
Unpacking Open Data: power, politics and the importance of infrastructure
Unpacking Open Data: power, politics and the importance of infrastructureUnpacking Open Data: power, politics and the importance of infrastructure
Unpacking Open Data: power, politics and the importance of infrastructure
 
Open data, decision points and distribution of benefits
Open data, decision points and distribution of benefitsOpen data, decision points and distribution of benefits
Open data, decision points and distribution of benefits
 
Open data and e participation copy
Open data and e participation copyOpen data and e participation copy
Open data and e participation copy
 
Workshop an introduction to open data
Workshop   an introduction to open dataWorkshop   an introduction to open data
Workshop an introduction to open data
 
What does open development mean to you?
What does open development mean to you?What does open development mean to you?
What does open development mean to you?
 
OKFest - Open Development Open Panel
OKFest - Open Development Open PanelOKFest - Open Development Open Panel
OKFest - Open Development Open Panel
 
Supporting open data use through active engagement (Annotated version)
Supporting open data use through active engagement (Annotated version)Supporting open data use through active engagement (Annotated version)
Supporting open data use through active engagement (Annotated version)
 
Open Data Engagement - Using Open Data w3c Workshop
Open Data Engagement - Using Open Data w3c Workshop Open Data Engagement - Using Open Data w3c Workshop
Open Data Engagement - Using Open Data w3c Workshop
 
Lse open data construction and critiques
Lse   open data construction and critiquesLse   open data construction and critiques
Lse open data construction and critiques
 
Nominet Trust Charity Open Data Days - What is open data anyway
Nominet Trust Charity Open Data Days - What is open data anywayNominet Trust Charity Open Data Days - What is open data anyway
Nominet Trust Charity Open Data Days - What is open data anyway
 
Youth Part - eParticipation Ideas and Insights from the UK
Youth Part - eParticipation Ideas and Insights from the UKYouth Part - eParticipation Ideas and Insights from the UK
Youth Part - eParticipation Ideas and Insights from the UK
 
Annotated Version: EU Safer Internet Forum - Rethinking Responses to Young Pe...
Annotated Version: EU Safer Internet Forum - Rethinking Responses to Young Pe...Annotated Version: EU Safer Internet Forum - Rethinking Responses to Young Pe...
Annotated Version: EU Safer Internet Forum - Rethinking Responses to Young Pe...
 
EU Safer Internet Forum - Rethinking Responses to Young People's Online Lives
EU Safer Internet Forum - Rethinking Responses to Young People's Online LivesEU Safer Internet Forum - Rethinking Responses to Young People's Online Lives
EU Safer Internet Forum - Rethinking Responses to Young People's Online Lives
 
Rethinking Responses to Young People and the Internet - beyond Opportunity an...
Rethinking Responses to Young People and the Internet - beyond Opportunity an...Rethinking Responses to Young People and the Internet - beyond Opportunity an...
Rethinking Responses to Young People and the Internet - beyond Opportunity an...
 
Making a Difference with Data - Online Conference - Education Section - Intro...
Making a Difference with Data - Online Conference - Education Section - Intro...Making a Difference with Data - Online Conference - Education Section - Intro...
Making a Difference with Data - Online Conference - Education Section - Intro...
 
Open data campaigning camp - introduction and history
Open data campaigning camp - introduction and historyOpen data campaigning camp - introduction and history
Open data campaigning camp - introduction and history
 
Youth Work Online - Rethinking Responses to Young People's Online Lives
Youth Work Online - Rethinking Responses to Young People's Online LivesYouth Work Online - Rethinking Responses to Young People's Online Lives
Youth Work Online - Rethinking Responses to Young People's Online Lives
 

Último

The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
ssuserdda66b
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Último (20)

Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 

Who is using Open Government Data - Analysis

  • 1. Exploratory data analysis: OGD user motivations The Potential of Open Government Data (OGD) as a Tool in Democratic Engagement and Reform of Public Services: The Case of Data.gov.uk A presentation of data as an online appendix to Section 4.1 of the written research report. To be used in conjunction with the written report only. Tim Davies, MSc Student, Oxford Internet Institute Exploratory Analysis: July 2010. www.practicalparticipation.co.uk/odi/ | tim@timdavies.org.uk | @timdavies
  • 2. What is this presentation I have used this presentation as a working notebook whilst bringing together my exploratory analysis of the motivations of OGD users according to the survey data from the Open Data Impacts Survey. It is not a presentation of findings, but shows the process I’ve gone through in exploratory analysis. Stages are left in, even if they don’t illuminate the final conclusions in order to help ensure a transparent process. R commands Analysis carried out in R. Relevant commands given throughout. These are mainly an aide-memoir for my own use, but, in conjunction with the shared data, should also support replication of the results.
  • 3. Overview: • Question: What motivates users of Open Government Data (OGD) • Data source: 72 responses from an opportunistically sampled online survey delivered between May 11th and June 14th 2010 targeted at users of OGD. • Data: Responses to a range of ‘Strongly Agree -> Strongly Disagree’ based likert scales on ‘Attitudes’ towards OGD and ‘Motivations’ for working with OGD. Also responses to a range of statements about OGD use projects from 44 individuals within the sample. • Analysis: Using correlation analysis, exploratory factor analysis (Bartholomew et al. 2008, 7 - 9; Costello & Osborne 2005; Lawley & Maxwell 1962) and bootstrapped cluster analysis (Suzuki & Shimodaira 2006) in R. Bartholomew, D.J. et al., 2008. Analysis of Multivariate Social Science Data, Second Edition (Chapman & Hall) 2nd ed., Chapman and Hall. Becker, R.A., Chambers, J.M. & Wilks, A.R., 1988. The new S language: a programming environment for data analysis and graphics. Costello, A.B. & Osborne, J.W., 2005. Best practices in exploratory factor analysis: four recommendations for getting the most from your analysis. Practical Assessment, Research & Evaluation, (7), 1–9. Lawley, D.N. & Maxwell, A.E., 1962. Factor Analysis as a Statistical Method. Journal of the Royal Statistical Society. Series D (The Statistician), 12(3), 209-229. Suzuki, R. & Shimodaira, H., 2006. Pvclust: an R package for assessing the uncertainty in hierarchical clustering. Bioinformatics, 22(12), 1540.
  • 4. Caveats • Sample size: Too small to allow any strong conclusions to be drawn • Items: Survey items were based on an initial literature review and exploratory research, but, due to time constraints, were not subjected to prior pilot testing. Refinements of the items and new data collection would strengthen the available data for future analysis. • Holistic interpretation: The interpretation presented draws upon contextual information gathered during exploratory research. • Interactive presentation: I’ve tried to include enough detail in this presentation to account for my analysis in Section 4.1 of the related dissertation. However, if the reader feels any analysis is not fully explained or does not provide adequate justification, questions are welcome on the blog post relating to this presentation at http://www.practicalparticipation.co.uk/odi/
  • 5. 1: Correlation between motivations • Q: “People are interested in open data for many different reasons. Thinking about your own engagement with open government data, how important are the following motivations for working with or using open government data?” • A: ‘Not at all important’ to ‘Very Important’ coded on a scale -2 to +2. • Missing answers: Coded as 0 (neutral) • Data: OpenDataImpacts-SurveyMotivationData.csv (Available online as Google Spreadsheet) • Analysis: R correlation matrix, coded and ordered for strength or correlation. Clusters interpreted. R commands > library(gclus) > dta<-motivations > dta.r<-abs(cor(dta)) > dta.col<-dmat.color(dta.r,colorRampPalette(c("white","orange","red"))(100)) > dta.o<-order.single(dta.r) > cpairs(dta,dta.o,panel.colors=dta.col,gap=.5,main="Variables ordered and colored by correlation")
  • 7. 1: Correlation between motivations 1
  • 8. 1: Correlation between motivations 1) Government focussed Understanding, Efficiency, Accountability 1
  • 9. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 1
  • 10. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 1
  • 11. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 1 2
  • 12. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 1 2
  • 13. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 1 2 3
  • 14. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed Recognition or profit 1 2 3
  • 15. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 1 2 3
  • 16. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 1 4 2 3
  • 17. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 4) Digitising government focus A computerisation movement? 1 4 2 3
  • 18. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 4) Digitising government focus A computerisation movement? 1 4 2 3 5
  • 19. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 4) Digitising government focus A computerisation movement? 1 5) Problem solving focus Develop new skills to meet interesting challenge 4 2 3 5
  • 20. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed 6 Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 4) Digitising government focus A computerisation movement? 1 5) Problem solving focus Develop new skills to meet interesting challenge 4 2 3 5
  • 21. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed 6 Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 4) Digitising government focus A computerisation movement? 1 5) Problem solving focus Develop new skills to meet interesting challenge 6) Social/public sector enterprise 4 Companies providing local focussed services 2 3 5
  • 22. 1: Correlation between motivations Also linked to make a 1) Government focussed difference... Understanding, Efficiency, Accountability 2) Technology innovation focussed 6 Creating platforms, with semantic web 3) Reward focussed In open tech community recognition is key asset for Recognition or profit gaining future work... 4) Digitising government focus A computerisation movement? 1 5) Problem solving focus Develop new skills to meet interesting challenge 6) Social/public sector enterprise 4 Companies providing local focussed services E.g. Environmental 2 mapping; Transport service using OGD. 3 5
  • 23. 2: Cluster analysis of motivations • Bootstrapped cluster analysis shows similar patterns. • But no strong p values. I.e. weak & overlapping clusters as seen in correlation analysis. • Adds curiosity to government cluster. Fits with participant-observation evidence that interest in government is secondary to interest in technology for many OGD users. R commands > library(pvclust) > motclust<-pvclust(motivations,nboot=100000) > plot(motclust)
  • 24. 3: Additional analysis R commands: non-graphical screen test solutions > library(nFactors) • Whilst a number of exploratory factor analysis solutions > ev<-eigen(cor(motivations)) can offer insights during interactive exploration of the data, > ap<- parallel(subject=nrow(motivations),var=ncol(motivations),re the non-graphical scree test solution suggests between 8 p=100,cent=0.05) and 10 factors, almost as many as there are items: thus > nS <- nScree(ev$values, ap$eigen$qevpea) Factor analysis is of limited additional use. > plotnScree(nS) R commands: factor analysis > factmot3<-factanal(motivations,3, rotation="promax", • kmeans analysis can be used to look at cluster sizes by scores="regression") # The three factor solution is the most intuitively interpretable. discretely dividing the data - but the overlapping nature of > print(factmot3,cutoff=0.3,sort=T,digits=2) the clusters we are identifying suggests this is also of limited utility. Interactively exploring different cluster sizes and the R commands: kmeans analysis size of those clusters does however provide some additional > factkmeans5<-kmeans(t(motivations),5) > #5 kmeans factors are interpretable insights which can then be checked against other survey data. • Qualitative reading of particular individual cases in each cluster, and additional exploratory statistical analysis can support further identification & description of the different forms of motivations OGD users have. • E.g. Those motivated by ‘Specific problems’ want ‘bulk Data’ and are not interested in SPARQL and building platforms; whereas those interested in platform building and developing new skills have strong interest in semantic web technologies.
  • 25. Analysis of motivations • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in OGD: “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency advocates who want a more efficient and honest government, [3] citizen advocates who want services and information to make their lives better, [4] open advocates who believe that governments act for the people therefore government data should be available for free to the people, and [5] wonks who are hoping that releasing datasets ... will deliver…economic benefits to the country”. • However, we find quite different clusters of motivations in the survey data:
  • 26. Analysis of motivations • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in OGD: “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency advocates who want a more efficient and honest government, [3] citizen advocates who want services and information to make their lives better, [4] open advocates who believe that governments act for the people therefore government data should be available for free to the people, and [5] wonks who are hoping that releasing datasets ... will deliver…economic benefits to the country”. • However, we find quite different clusters of motivations in the survey data: 1) Government focussed Understanding, Efficiency, Accountability
  • 27. Analysis of motivations • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in OGD: “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency advocates who want a more efficient and honest government, [3] citizen advocates who want services and information to make their lives better, [4] open advocates who believe that governments act for the people therefore government data should be available for free to the people, and [5] wonks who are hoping that releasing datasets ... will deliver…economic benefits to the country”. • However, we find quite different clusters of motivations in the survey data: 1) Government focussed 2) Technology innovation focussed Understanding, Efficiency, Accountability Creating platforms, with semantic web
  • 28. Analysis of motivations • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in OGD: “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency advocates who want a more efficient and honest government, [3] citizen advocates who want services and information to make their lives better, [4] open advocates who believe that governments act for the people therefore government data should be available for free to the people, and [5] wonks who are hoping that releasing datasets ... will deliver…economic benefits to the country”. • However, we find quite different clusters of motivations in the survey data: 1) Government focussed 2) Technology innovation focussed Understanding, Efficiency, Accountability Creating platforms, with semantic web 3) Reward focussed Recognition or profit
  • 29. Analysis of motivations • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in OGD: “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency advocates who want a more efficient and honest government, [3] citizen advocates who want services and information to make their lives better, [4] open advocates who believe that governments act for the people therefore government data should be available for free to the people, and [5] wonks who are hoping that releasing datasets ... will deliver…economic benefits to the country”. • However, we find quite different clusters of motivations in the survey data: 1) Government focussed 4) Digitising government focus 2) Technology innovation focussed Understanding, Efficiency, Accountability A political or computerisation movement? Creating platforms, with semantic web 3) Reward focussed Recognition or profit
  • 30. Analysis of motivations • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in OGD: “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency advocates who want a more efficient and honest government, [3] citizen advocates who want services and information to make their lives better, [4] open advocates who believe that governments act for the people therefore government data should be available for free to the people, and [5] wonks who are hoping that releasing datasets ... will deliver…economic benefits to the country”. • However, we find quite different clusters of motivations in the survey data: 1) Government focussed 4) Digitising government focus 2) Technology innovation focussed Understanding, Efficiency, Accountability A political or computerisation movement? Creating platforms, with semantic web 3) Reward focussed Recognition or profit 5) Problem solving focus Develop new skills to meet interesting challenge
  • 31. Analysis of motivations • O’Reilly blogger Nat Torkington (2010) has suggested there are five types of people with an interested in OGD: “… [1] low-polling governments who want to see a PR win from opening their data, [2] transparency advocates who want a more efficient and honest government, [3] citizen advocates who want services and information to make their lives better, [4] open advocates who believe that governments act for the people therefore government data should be available for free to the people, and [5] wonks who are hoping that releasing datasets ... will deliver…economic benefits to the country”. • However, we find quite different clusters of motivations in the survey data: 1) Government focussed 4) Digitising government focus 2) Technology innovation focussed Understanding, Efficiency, Accountability A political or computerisation movement? Creating platforms, with semantic web 6) Social/public sector enterprise 3) Reward focussed Companies providing local focussed services Recognition or profit 5) Problem solving focus Develop new skills to meet interesting challenge
  • 32. Further analysis R commands: identify clusters & create new matrix > c1<-c(7,9,2) > c2<-c(13,10,3) • We can take these clusters and create a new matrix of > c3<-c(3,11,4) > c4<-c(9,2,13,10) individuals rankings against them by summing the individual > c5<-c(14,8) variables. > c6<-c(12,6) > mclusts<- cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector( • We can then look at correlations between these and other by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[, variables. c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum) ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m otivations[,c6],c(1:72),sum))) > colnames(mclusts)<-c("GovFocus", "TechFocus", • For example, looking at correlations between preferred "RewardFocus", "DigitalGov", "ProblemSolving", methods of data access and user motivation. "Enterprise") The additional data used is available in Google Docs linked from http://www.practicalparticipation.co.uk/odi/survey Download the relevant file as CSV and then import into the correct variable name with the command > egassociations<-read.csv<-(file=”filename.csv’,headers=T) To remove the line numbers, you can use: > egassociations<-egassociations[,2:length(assoctest[1,])]
  • 33. Further analysis R commands: identify clusters & create new matrix > c1<-c(7,9,2) > c2<-c(13,10,3) • We can take these clusters and create a new matrix of > c3<-c(3,11,4) > c4<-c(9,2,13,10) individuals rankings against them by summing the individual > c5<-c(14,8) variables. > c6<-c(12,6) > mclusts<- cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector( • We can then look at correlations between these and other by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[, variables. c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum) ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m otivations[,c6],c(1:72),sum))) > colnames(mclusts)<-c("GovFocus", "TechFocus", • For example, looking at correlations between preferred "RewardFocus", "DigitalGov", "ProblemSolving", methods of data access and user motivation. "Enterprise") The additional data used is available in Google Docs linked from http://www.practicalparticipation.co.uk/odi/survey Download the relevant file as CSV and then import into the correct variable name with the command > egassociations<-read.csv<-(file=”filename.csv’,headers=T) To remove the line numbers, you can use: > egassociations<-egassociations[,2:length(assoctest[1,])] 1) Government focussed Understanding, Efficiency, Accountability
  • 34. Further analysis R commands: identify clusters & create new matrix > c1<-c(7,9,2) > c2<-c(13,10,3) • We can take these clusters and create a new matrix of > c3<-c(3,11,4) > c4<-c(9,2,13,10) individuals rankings against them by summing the individual > c5<-c(14,8) variables. > c6<-c(12,6) > mclusts<- cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector( • We can then look at correlations between these and other by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[, variables. c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum) ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m otivations[,c6],c(1:72),sum))) > colnames(mclusts)<-c("GovFocus", "TechFocus", • For example, looking at correlations between preferred "RewardFocus", "DigitalGov", "ProblemSolving", methods of data access and user motivation. "Enterprise") The additional data used is available in Google Docs linked from http://www.practicalparticipation.co.uk/odi/survey Download the relevant file as CSV and then import into the correct variable name with the command > egassociations<-read.csv<-(file=”filename.csv’,headers=T) To remove the line numbers, you can use: > egassociations<-egassociations[,2:length(assoctest[1,])] 1) Government focussed 2) Technology innovation focussed Understanding, Efficiency, Accountability Creating platforms, with semantic web
  • 35. Further analysis R commands: identify clusters & create new matrix > c1<-c(7,9,2) > c2<-c(13,10,3) • We can take these clusters and create a new matrix of > c3<-c(3,11,4) > c4<-c(9,2,13,10) individuals rankings against them by summing the individual > c5<-c(14,8) variables. > c6<-c(12,6) > mclusts<- cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector( • We can then look at correlations between these and other by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[, variables. c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum) ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m otivations[,c6],c(1:72),sum))) > colnames(mclusts)<-c("GovFocus", "TechFocus", • For example, looking at correlations between preferred "RewardFocus", "DigitalGov", "ProblemSolving", methods of data access and user motivation. "Enterprise") The additional data used is available in Google Docs linked from http://www.practicalparticipation.co.uk/odi/survey Download the relevant file as CSV and then import into the correct variable name with the command > egassociations<-read.csv<-(file=”filename.csv’,headers=T) To remove the line numbers, you can use: > egassociations<-egassociations[,2:length(assoctest[1,])] 1) Government focussed 2) Technology innovation focussed Understanding, Efficiency, Accountability Creating platforms, with semantic web 3) Reward focussed Recognition or profit
  • 36. Further analysis R commands: identify clusters & create new matrix > c1<-c(7,9,2) > c2<-c(13,10,3) • We can take these clusters and create a new matrix of > c3<-c(3,11,4) > c4<-c(9,2,13,10) individuals rankings against them by summing the individual > c5<-c(14,8) variables. > c6<-c(12,6) > mclusts<- cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector( • We can then look at correlations between these and other by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[, variables. c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum) ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m otivations[,c6],c(1:72),sum))) > colnames(mclusts)<-c("GovFocus", "TechFocus", • For example, looking at correlations between preferred "RewardFocus", "DigitalGov", "ProblemSolving", methods of data access and user motivation. "Enterprise") The additional data used is available in Google Docs linked from http://www.practicalparticipation.co.uk/odi/survey Download the relevant file as CSV and then import into the correct variable name with the command > egassociations<-read.csv<-(file=”filename.csv’,headers=T) To remove the line numbers, you can use: > egassociations<-egassociations[,2:length(assoctest[1,])] 1) Government focussed 4) Digitising government focus 2) Technology innovation focussed Understanding, Efficiency, Accountability A computerisation movement... Creating platforms, with semantic web 3) Reward focussed Recognition or profit
  • 37. Further analysis R commands: identify clusters & create new matrix > c1<-c(7,9,2) > c2<-c(13,10,3) • We can take these clusters and create a new matrix of > c3<-c(3,11,4) > c4<-c(9,2,13,10) individuals rankings against them by summing the individual > c5<-c(14,8) variables. > c6<-c(12,6) > mclusts<- cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector( • We can then look at correlations between these and other by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[, variables. c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum) ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m otivations[,c6],c(1:72),sum))) > colnames(mclusts)<-c("GovFocus", "TechFocus", • For example, looking at correlations between preferred "RewardFocus", "DigitalGov", "ProblemSolving", methods of data access and user motivation. "Enterprise") The additional data used is available in Google Docs linked from http://www.practicalparticipation.co.uk/odi/survey Download the relevant file as CSV and then import into the correct variable name with the command > egassociations<-read.csv<-(file=”filename.csv’,headers=T) To remove the line numbers, you can use: > egassociations<-egassociations[,2:length(assoctest[1,])] 1) Government focussed 4) Digitising government focus 2) Technology innovation focussed Understanding, Efficiency, Accountability A computerisation movement... Creating platforms, with semantic web 3) Reward focussed Recognition or profit 5) Problem solving focus Develop new skills to meet interesting challenge
  • 38. Further analysis R commands: identify clusters & create new matrix > c1<-c(7,9,2) > c2<-c(13,10,3) • We can take these clusters and create a new matrix of > c3<-c(3,11,4) > c4<-c(9,2,13,10) individuals rankings against them by summing the individual > c5<-c(14,8) variables. > c6<-c(12,6) > mclusts<- cbind(as.vector(by(motivations[,c1],c(1:72),sum)),as.vector( • We can then look at correlations between these and other by(motivations[,c2],c(1:72),sum)),as.vector(by(motivations[, variables. c3],c(1:72),sum)),as.vector(by(motivations[,c4],c(1:72),sum) ),as.vector(by(motivations[,c5],c(1:72),sum)),as.vector(by(m otivations[,c6],c(1:72),sum))) > colnames(mclusts)<-c("GovFocus", "TechFocus", • For example, looking at correlations between preferred "RewardFocus", "DigitalGov", "ProblemSolving", methods of data access and user motivation. "Enterprise") The additional data used is available in Google Docs linked from http://www.practicalparticipation.co.uk/odi/survey Download the relevant file as CSV and then import into the correct variable name with the command > egassociations<-read.csv<-(file=”filename.csv’,headers=T) To remove the line numbers, you can use: > egassociations<-egassociations[,2:length(assoctest[1,])] 1) Government focussed 4) Digitising government focus 2) Technology innovation focussed Understanding, Efficiency, Accountability A computerisation movement... Creating platforms, with semantic web 6) Social/public sector enterprise 3) Reward focussed Companies providing local focussed services Recognition or profit 5) Problem solving focus Develop new skills to meet interesting challenge
  • 39. Further analysis: motivations and attitudes • The attitudes dataset records responses against a number of claims about OGD. • The table below shows how different motivational clusters correlate with particular statements about OGD. Note the strong correlation between ‘Reward focus’ and statements about commercial benefit; and the view in Government focussed and problem solving that OGD can drive reform of public services. • Only Government focussed motivations have any real correlation with claims about accountability. • The ‘Enterprise’ correlations suggest a stronger ‘Social Enterprise’ interpretation may be appropriate. Correlation matrix of generated motivational clusters and response to ‘attitudes 1’ questions This row is curious. Little evidence elsewhere in study that OGD actually is helping errors to be identified... R commands: > cor(attitudes,mclusts) #display formatted in excel
  • 40. Further analysis: attitudes 2 • The attitudes 2 set of questions have far lower correlation. Overall the consensus on these items was far stronger, thus the information that can be extracted from their correlation with particular motivations less. • The correlations are not easily interpretable - but suggest that the views within each motivational cluster are not homogenous. The small (but greater than others) correlation of Tech and DigitalGov clusters against ‘Data only open when clear demand’ is surprising. • Attitudes 2 does little to illuminate the interpretation of these motivational clusters. Correlation matrix of generated motivational clusters and response to ‘attitudes 2’ questions R commands: > cor(attitudes2,mclusts) #display formatted in excel
  • 41. Further analysis: associations • Correlations against associations provide some further confirmation of the clusters, suggesting the diversity of the ‘Gov Focus’ users, but relatively strong identity of the tech and reward focussed clusters. Correlation matrix of generated motivational clusters and response to ‘associations’ questions R commands: > cor(associations,mclusts) #display formatted in excel
  • 42. The digital government connection • Using multi-dimensional scaling can support visualisation of the relationship between the different motivational clusters. • Given the data feeding into the clusters is overlapping (1 overlaps 4; 2 overlaps 3 and 4) we may expect some clustering of these when MDS is applied. However, we do see (6) closer to (1), (4) and (2) than may be anticipated, and (5) distant from Digitising Government focus. R: Using a manually created distance matrix of 1-correlation matrix value as dist > fit<-cmdscale(dist,2) > plot(fit[,2],fit[,1]) > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
  • 43. The digital government connection • Using multi-dimensional scaling can support visualisation of the relationship between the different 1) Government focussed motivational clusters. Understanding, Efficiency, Accountability • Given the data feeding into the clusters is overlapping (1 overlaps 4; 2 overlaps 3 and 4) we may expect some clustering of these when MDS is applied. However, we do see (6) closer to (1), (4) and (2) than may be anticipated, and (5) distant from Digitising Government focus. R: Using a manually created distance matrix of 1-correlation matrix value as dist > fit<-cmdscale(dist,2) > plot(fit[,2],fit[,1]) > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
  • 44. The digital government connection • Using multi-dimensional scaling can support visualisation of the relationship between the different 1) Government focussed motivational clusters. Understanding, Efficiency, Accountability • Given the data feeding into the clusters is overlapping (1 overlaps 4; 2 overlaps 3 and 4) we may expect some clustering of these when MDS is applied. However, we do see (6) closer to (1), (4) and (2) than may be anticipated, and (5) distant from Digitising Government focus. 2) Technology innovation focussed Creating platforms, with semantic web R: Using a manually created distance matrix of 1-correlation matrix value as dist > fit<-cmdscale(dist,2) > plot(fit[,2],fit[,1]) > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
  • 45. The digital government connection • Using multi-dimensional scaling can support visualisation of the relationship between the different 1) Government focussed motivational clusters. Understanding, Efficiency, Accountability • Given the data feeding into the clusters is overlapping (1 overlaps 4; 2 overlaps 3 and 4) we may expect some clustering of these when MDS is applied. However, we do see (6) closer to (1), (4) and (2) than may be anticipated, and (5) distant from Digitising Government focus. 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed Recognition or profit R: Using a manually created distance matrix of 1-correlation matrix value as dist > fit<-cmdscale(dist,2) > plot(fit[,2],fit[,1]) > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
  • 46. The digital government connection • Using multi-dimensional scaling can support visualisation of the relationship between the different 1) Government focussed motivational clusters. Understanding, Efficiency, Accountability • Given the data feeding into the clusters is overlapping (1 overlaps 4) Digitising government focus 4; 2 overlaps 3 and 4) we may A computerisation movement? expect some clustering of these when MDS is applied. However, we do see (6) closer to (1), (4) and (2) than may be anticipated, and (5) distant from Digitising Government focus. 2) Technology innovation focussed Creating platforms, with semantic web 3) Reward focussed Recognition or profit R: Using a manually created distance matrix of 1-correlation matrix value as dist > fit<-cmdscale(dist,2) > plot(fit[,2],fit[,1]) > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
  • 47. The digital government connection • Using multi-dimensional scaling can support visualisation of the relationship between the different 1) Government focussed motivational clusters. Understanding, Efficiency, Accountability • Given the data feeding into the clusters is overlapping (1 overlaps 4) Digitising government focus 4; 2 overlaps 3 and 4) we may A computerisation movement? expect some clustering of these when MDS is applied. However, we do see (6) closer to (1), (4) and (2) than may be anticipated, and (5) distant from Digitising Government focus. 2) Technology innovation focussed Creating platforms, with semantic web 5) Problem solving focus Develop new skills to meet interesting challenge 3) Reward focussed Recognition or profit R: Using a manually created distance matrix of 1-correlation matrix value as dist > fit<-cmdscale(dist,2) > plot(fit[,2],fit[,1]) > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
  • 48. The digital government connection • Using multi-dimensional scaling can support visualisation of the relationship between the different 1) Government focussed motivational clusters. Understanding, Efficiency, Accountability • Given the data feeding into the clusters is overlapping (1 overlaps 6) Social/public sector enterprise 4) Digitising government focus 4; 2 overlaps 3 and 4) we may Companies providing local focussed services A computerisation movement? expect some clustering of these when MDS is applied. However, we do see (6) closer to (1), (4) and (2) than may be anticipated, and (5) distant from Digitising Government focus. 2) Technology innovation focussed Creating platforms, with semantic web 5) Problem solving focus Develop new skills to meet interesting challenge 3) Reward focussed Recognition or profit R: Using a manually created distance matrix of 1-correlation matrix value as dist > fit<-cmdscale(dist,2) > plot(fit[,2],fit[,1]) > text(fit[,2],fit[,1],labels=colnames(mclusts),pos=4,offset=0.2)
  • 49. Analysis • To be written up....

Notas del editor