RPI Research in Linked Open Government Systems

Linked Open Government Data http://logd.tw.rpi.edu Jim Hendler Tetherless World Professor of Computer and Cognitive Science Assistant Dean of Information Technology and Web Science Rensselaer Polytechnic Institute http://www.cs.rpi.edu/~hendler @jahendler (twitter)

Demo of our site http://logd.tw.rpi.edu

Data.gov community: International

Government Data Sharing January 1, 2009 “Openness will strengthen our democracy and promote efficiency and effectiveness in Government.” (President Obama) Putting Govt Data online: Data.gov.uk beta May 21, 2009 January 19, 2010 data.gov.uk online May 21, 2010 data.gov online data.gov relaunch with semantic web featured June 30, 2009 December 8, 2009 “Open Government Directive” released 2009 2010 … 57 Data Sets ~6000 Data Sets ~2000 Data Sets >305,000 Data Sets

New ways to see data sets David McCandless

Important to the citizens: e.g., education

What’s promising

Moving data.gov to linked data (UK)

Moving data.gov to linked data (US)

Linked Open Data goes beyond govt http://linkeddata.org/ Government Data is currently over ½ the cloud in size (~17B triples), 10s of thousands of links to other data (within and without)

More than 50 of these at http://logd.tw.rpi.edu

Adding some Web magic Web Analytics Social Data Networks External Links

Linking GDP of the US and China. GDP of China (billion Chinese yuan); GDP of the US (billion dollars). [Temporal Mashup] bea.gov + federalreserve.gov + stats.gov.cn. This mashup was built in less than 4 hours, including conversion of data, web interface, and visualization!

Mashups allow comparisons that single data sets cannot: Trends in Smoking Prevalence, Tobacco Policy Coverage and Tobacco Prices (1991-2007)

Our process: Convert, Access, Enhance, Version, SemDiff (diagram edge labels: derive, derive, create, derive, revision)

csv2rdf4lod (from logd.tw.rpi.edu): install csv2rdf4lod
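As a rough illustration of what a CSV-to-RDF conversion step involves, the sketch below turns each CSV row into N-Triples lines using only the Python standard library. It is not the implementation or output format of the converter named above; the base URI, the helper names, and the sample rows are all hypothetical and chosen only for illustration.

```python
import csv
import io
from urllib.parse import quote

# Hypothetical base URI; a real conversion would use the publisher's namespace.
BASE = "http://example.org/dataset/demo"


def escape_literal(value):
    """Escape characters that are not allowed raw inside an N-Triples literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def row_to_triples(row_number, row):
    """Emit one triple per non-empty cell: row URI, column URI, cell value."""
    subject = f"<{BASE}/row/{row_number}>"
    triples = []
    for column, value in row.items():
        if not value:
            continue  # skip empty cells rather than asserting empty literals
        local_name = quote(column.strip().lower().replace(" ", "_"))
        predicate = f"<{BASE}/vocab/{local_name}>"
        triples.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return triples


def csv_to_ntriples(text):
    """Convert CSV text (with a header row) into a list of N-Triples lines."""
    reader = csv.DictReader(io.StringIO(text))
    lines = []
    for number, row in enumerate(reader, start=1):
        lines.extend(row_to_triples(number, row))
    return lines


# Hypothetical sample data, not taken from any real data set.
SAMPLE = "Facility ID,ST\nF-001,42\nF-002,\n"
for line in csv_to_ntriples(SAMPLE):
    print(line)
```

Every cell is emitted as a plain string literal here. Deciding which values should instead become links to other resources is the "Enhance" step of the process.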

Metadata is critical. What kinds of metadata are simple to create, powerful enough for search, and internationalizable (esp. beyond English)?

Work in Progress

RDF encodings from our metadata collection

Bag of words; LED on strings; String match; Various weighted combinations
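The measures listed above could be combined roughly as follows. This is a minimal sketch, not the project's implementation: it reads "LED" as Levenshtein edit distance (an assumption), uses Jaccard overlap as the bag-of-words score, and the weights are arbitrary placeholders.

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def edit_similarity(a, b):
    """Edit distance rescaled to a 0..1 similarity (1.0 means identical)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def bag_of_words_similarity(a, b):
    """Jaccard overlap of the lowercase word sets of two strings."""
    words_a, words_b = set(a.lower().split()), set(b.lower().split())
    if not words_a and not words_b:
        return 1.0
    return len(words_a & words_b) / len(words_a | words_b)


def exact_match(a, b):
    """1.0 if the strings are equal ignoring case and outer whitespace."""
    return 1.0 if a.strip().lower() == b.strip().lower() else 0.0


def combined_similarity(a, b, weights=(0.4, 0.4, 0.2)):
    """Weighted sum of the three scores; the weights are placeholders."""
    w_bag, w_edit, w_exact = weights
    return (w_bag * bag_of_words_similarity(a, b)
            + w_edit * edit_similarity(a.lower(), b.lower())
            + w_exact * exact_match(a, b))


print(levenshtein("kitten", "sitting"))  # 3
```

Trying different weight tuples in `combined_similarity` is one way to explore "various weighted combinations" against a labeled sample.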

Simple Example: EPA Toxic Release Data

Facility ID   …   Latitude    Longitude    ST:val
…             …   40.416944   -75.935      42
…             …   42.955383   -85.480074   26
…             …   43.1698     -88.01829    55
…             …   38.87025    -77.00905    14
…             …   …           …            …

This looks like it could be state identifiers. Look for possible state identifiers:
- Names: “Pennsylvania”, “Michigan”, “Wisconsin”
- Abbr: “PA”, “MI”, “WI”
- FIPS (Federal Information Processing Standards): “42”, “26”, “55”

75% match state identifiers; 14 is “Guam”, which is not a US state. If this meets our threshold, then recommend interpreting as state and integrating with linked data on the web.
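The check in the EPA example above can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the project's code: the lookup sets contain only the three states named in the example, and the function names and the 0.7 threshold are hypothetical.

```python
# Illustrative subset only: the three states named in the example.
# A real lookup table would cover every state.
STATE_NAMES = {"pennsylvania", "michigan", "wisconsin"}
STATE_ABBRS = {"PA", "MI", "WI"}
STATE_FIPS = {"42", "26", "55"}


def looks_like_state(value):
    """True if the value matches a known state name, abbreviation, or FIPS code."""
    text = str(value).strip()
    return (text.lower() in STATE_NAMES
            or text.upper() in STATE_ABBRS
            or text.zfill(2) in STATE_FIPS)


def state_match_ratio(values):
    """Fraction of non-empty values that match a state identifier."""
    values = [v for v in values if str(v).strip()]
    if not values:
        return 0.0
    return sum(looks_like_state(v) for v in values) / len(values)


def recommend_state_column(values, threshold=0.7):
    """Recommend interpreting the column as states if enough values match.

    The default threshold is a placeholder, not a value from the slides.
    """
    return state_match_ratio(values) >= threshold


column = ["42", "26", "55", "14"]      # the ST:val column from the example
print(state_match_ratio(column))       # 0.75
print(recommend_state_column(column))  # True
```

The non-matching value (14) is left as-is. A recommendation only says the column as a whole is probably states; individual mismatches would still need review.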

Results

Next Steps

Challenge

Good news – easy to do comparisons

Good news - Even if not “rationalized” together

Bad news – real comparisons are hard across govts

Presents a challenge Same or different?

Different “ontologies”? Definitely not the expected result!

And many other interesting issues

Summary

Questions? http://logd.tw.rpi.edu

Govt systems can use linked data web for context Correlates fires, acres burned, and agency budgets

Visualization can help identify data errors Were there really no fires in 1985?
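A simple automated check can surface the same kind of anomaly before anyone plots the data. The sketch below flags years that are absent from a yearly series or that report zero. The `suspicious_years` helper and the sample counts are hypothetical, not values from the fire data set.

```python
def suspicious_years(counts_by_year):
    """Return (year, reason) pairs for years that are missing or report zero."""
    if not counts_by_year:
        return []
    first, last = min(counts_by_year), max(counts_by_year)
    flagged = []
    for year in range(first, last + 1):
        if year not in counts_by_year:
            flagged.append((year, "missing"))
        elif counts_by_year[year] == 0:
            flagged.append((year, "zero"))
    return flagged


# Hypothetical counts for illustration only.
sample = {1983: 120, 1984: 135, 1985: 0, 1987: 128}
print(suspicious_years(sample))  # [(1985, 'zero'), (1986, 'missing')]
```

A flagged year is a prompt to check the source, not proof of an error: a zero may be genuine, or it may stand in for a value that was never recorded.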
