"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
Free Webinar: LOD2 Stack - 1st release
1. Creating Knowledge out of Interlinked Data
LOD2 Webinar . 29.11.2011 . Page 1 http://lod2.eu
2. Creating Knowledge out of Interlinked Data
LOD2 is a large-scale integrating project co-funded by the European
Commission within the FP7 Information and Communication Technologies
Work Programme. This 4-year project comprises leading Linked Open
Data technology researchers, companies, and service providers. Coming
from across 12 countries the partners are coordinated by the Agile
Knowledge Engineering and Semantic Web Research Group at the
University of Leipzig, Germany.
LOD2 will integrate and syndicate Linked Data with existing large-scale
applications. The project shows the benefits in the scenarios of Media and
Publishing, Corporate Data intranets and eGovernment.
http://lod2.eu
LOD2 Webinar . 29.11.2011 . Page 2 http://lod2.eu
3. Creating Knowledge out of Interlinked Data
Once per month the LOD2 webinar series offer a free webinar
about tools and services along the Linked Open Data Life Cycle.
Stay with us and learn more about acquisition, editing,
composing, connected applications – and finally publishing Linked
Open Data.
http://lod2.eu
LOD2 Webinar . 29.11.2011 . Page 3 http://lod2.eu
4. Creating Knowledge out of Interlinked Data
A strong partnership
Contact
Address Coordinator
University of Leipzig
Faculty of Mathematics and Computer Science
Institute of Computer Science
Department of Business Information Systems
Postfach 100920
04009 Leipzig
Germany
Thanks for your attention!
LOD2 Webinar . 29.11.2011 . Page 4
http://lod2.eu
http://lod2.eu
5. Creating Knowledge out of Interlinked Data
LOD2 stack anno 2011
An introduction by Bert Van Nuffelen, TenForce
http://lod2.eu
LOD2 Webinar . 29.11.2011 . Page 5 http://lod2.eu
6. Creating Knowledge out of Interlinked Data
Contents
• The LOD2 stack concepts
• Demonstration
– WP7 use case
– Digital Agenda use case
• Summary
• Q&A
LOD2 Webinar . 29.11.2011 . Page 6 http://lod2.eu
7. Creating Knowledge out of Interlinked Data
Situating the component stack in the LOD2 project
• LOD2 project goal:
– improved tool support for publishing Linked Data.
– Easily accessible for the wide public
• Core enabler: the LOD2 stack
LOD2 Webinar . 29.11.2011 . Page 7 http://lod2.eu
8. Creating Knowledge out of Interlinked Data
The Linked Open Data Life Cycle
Inter-
linking/
Fusing
Manual Classifi-
revision/ cation/
authoring Enrichment
Storage/ Quality
Querying Analysis
Evolution /
Extraction
Repair
Search/
Browsing/
Exploration
LOD2 Webinar . 29.11.2011 . Page 8 http://lod2.eu
9. Creating Knowledge out of Interlinked Data
LOD2 stack anno 2011: easing deployment of components
LOD2 stack v2 = 2012
More components + more inter component
integration
2011
LOD2 stack v1 = Debian repository
Sindice Silk 2010
Virtuoso PoolParty
D2R Sigma.EE Dbpedia
CKAN OntoWiki ORE
LOD2 Webinar . 29.11.2011 . Page 9 http://lod2.eu
10. Creating Knowledge out of Interlinked Data
Installing the LOD2 stack – system requirements
• Ubuntu 10.10 or more recent,
• or a Linux distribution which supports Debian packages
• the software in the stack is open-source
– although individual licenses differ
– Some components are also available as commercial product
– The source itself is not (yet) distributed through the LOD2 stack repository.
LOD2 Webinar . 29.11.2011 . Page 10 http://lod2.eu
11. Creating Knowledge out of Interlinked Data
Installing the LOD2 stack – software installation
Setup of LOD components is now a matter of following the next few steps
# the installation of the LOD2 repository package.
wget http://stack.lod2.eu/lod2repository_current_all.deb
sudo dpkg –i lod2repository_current_all.deb
sudo apt-get update
# get some third party software
sudo add-apt-repository ‘deb http://archive.canonical.com/ lucid partner’
sudo apt-get update
# install all LOD components
sudo apt-get install lod2demo
LOD2 Webinar . 29.11.2011 . Page 11 http://lod2.eu
12. Creating Knowledge out of Interlinked Data
Linked Data publishing capabilities currently offered
• Covers most of the LOD publishing cycle
• Combination of
– locally installed software,
– online available software, and
– online available data sources
– about page in the LOD demonstrator (http://demo.lod2.eu/lod2demo)
• Next: small demonstrations
– simplified examples
– Aim at demonstrating capabilities of the LOD2 stack
– Detailed discussions on tools see next webinars
Disclaimer. No harmonized user interface.
LOD2 Webinar . 29.11.2011 . Page 12 http://lod2.eu
13. Creating Knowledge out of Interlinked Data
USE CASE 1 – WP7 WKD
LOD2 Webinar . 29.11.2011 . Page 13 http://lod2.eu
14. Creating Knowledge out of Interlinked Data
WP7 – Media and Publishing usecase
• Wolters Kluwer Deutschland (WKD) contributed large dataset (XML)
WKG Legal & Regulatory
Companies/Brands Products (Examples)
- Carl Heymanns Verlag - IP, Administrative Law
- Luchterhand - Civil, Family, Labor Law
- Werner Verlag - Construction Law
- Carl Link - Publications for Schools/KiTas
- CW Haarfeld - Public Health Insurance WKG is part of Wolters Kluw er n.v .
- Deutscher - Magazin „Personalwirtschaft“ (HR
Wirtschaftsdienst Management)
- AnNoText - SW for Lawyers and Notaries - Customer - Worldwide reach
- Trigon Data orientation - Europe
- Lawyers - North America
- Tax Accountants - Asia/Pacific
- Corporations and SMEs
WKG Tax & Accounting - Fincancial institutions Economic success
- Health Providers - Revenue 2010: EUR 3,6 bln
Companies/Brands - Products (Examples)
- Public Sector - 19.000 Employees
- Akademische Arbeits- - Tax SW for Consumers - Listed Amsterdam SE
gemeinschaft Verlag - SW for Tax Accountants
- Addison Group - SW for SMEs with focus
- Schleupen Tax Controlling and Accounting
- Wago Curadata
• Work package goal: evaluating the LOD2 stack for publishing legal data
– Extracted meta data statements
– created controlled vocabularies
LOD2 Webinar . 29.11.2011 . Page 14 http://lod2.eu
15. Creating Knowledge out of Interlinked Data
WP7 DEMO 1 – extracting meta data from structured data
Virtuoso
XML documents
representing
e.g. journals Extract and upload RDF
about legal
content, store
comments
about laws, etc.
XSL
T
LOD2 Webinar . 29.11.2011 . Page 15 http://lod2.eu
16. Creating Knowledge out of Interlinked Data
WP7 DEMO 2 – Associating concepts to unstructured data
PoolParty
RDF
Store
Virtuoso
Extract and upload RDF
Plain text store
LOD2 Webinar . 29.11.2011 . Page 16 http://lod2.eu
17. Creating Knowledge out of Interlinked Data
The demos support a real world scenario
Tagge Associate
lawyer
d d WKD
personal Taxonomy based annotation
text lawyer document
text s
WKD
Annotated
content
LOD2 Webinar . 29.11.2011 . Page 17 http://lod2.eu
18. Creating Knowledge out of Interlinked Data
DEMO
LOD2 Webinar . 29.11.2011 . Page 18 http://lod2.eu
19. Creating Knowledge out of Interlinked Data
Publishing statistical data
USE CASE 2 – DIGITAL AGENDA
SCOREBOARD
LOD2 Webinar . 29.11.2011 . Page 19 http://lod2.eu
20. Creating Knowledge out of Interlinked Data
The Digital Agenda Scoreboard
• LOD2 has been contacted in context of Publink by EC (DG INFSO) in 2010.
• LOD2 supported the creation of the digital scoreboard where the DG INFSO
data is published as RDF using the DataCube vocabulary (2011).
• The limited effort resulted in a graphical visualization of the statistical data &
machine readable public data.
LOD2 Webinar . 29.11.2011 . Page 20 http://lod2.eu
21. Creating Knowledge out of Interlinked Data
The demo case
• The scoreboard data does not link to external data
• As the observations are about countries it would be nice to see the
evaluation of an indicator in the scoreboard w.r.t. to evolution in average
income.
• % households with access to the internet at home (DAA scoreboard)
• Mean and median income by household type (Eurostat)
LOD2 Webinar . 29.11.2011 . Page 21 http://lod2.eu
22. Creating Knowledge out of Interlinked Data
Available RDF data
LOD2 Webinar . 29.11.2011 . Page 22 http://lod2.eu
23. Creating Knowledge out of Interlinked Data
Available RDF data
DG INFSO Scoreboard.rdf
Observ ation Country Year Unit Measure households
w ith access to the
Internet at home
http://data.lod2.eu/scoreboard/item Netherlands 2005 % lines 0,7825984925
s/h_iacc/HH_TOTAL/hh/Netherlan
ds/2005
LOD2 Webinar . 29.11.2011 . Page 23 http://lod2.eu
24. Creating Knowledge out of Interlinked Data
Available RDF data
DG INFSO Scoreboard.rdf
Observ ation Country Year Unit Measure households
w ith access to the
Internet at home
http://data.lod2.eu/scoreboard/item Netherlands 2005 % lines 0,7825984925
s/h_iacc/HH_TOTAL/hh/Netherlan
ds/2005
Eurostat source ilc_di04
Observ ation Country Year Measure Av erage net
income - total
http://eurostat.linked- http://eurostat.linked- http://eurostat.linked- 17002.3
statistics.org/data/ilc_di04#A,TOTAL,
statistic s.org/dic /geo#NL statistic s.org/dic /time#200
MED_E,EUR,NL,2005
5
LOD2 Webinar . 29.11.2011 . Page 24 http://lod2.eu
25. Creating Knowledge out of Interlinked Data
Target table
Country Year Observ ation Measure % Observ ation Measure
DG INFSO Households w ith Eurostat Av erage net
internet at home income - total
Netherlands 2005 http://data.lod2.eu/scorebo 0,7825984925 http://eurostat.linked- 17002.3
ard/items/h_iacc/HH_TO statistics.org/data/demo_g
TAL/hh/Netherlands/2005 ind#A,AVG,NL,2004
LOD2 Webinar . 29.11.2011 . Page 25 http://lod2.eu
26. Creating Knowledge out of Interlinked Data
Demonstration
Observ ation Country Year Unit Measure DSL lines
share in fixed
broadband
http://data.lod2.eu/scoreboard/item Netherlands 2004 % lines 0,6109657095
s/bb_dsl/TOTAL_FBB/lines/Nether
lands/2004
DG INFSO
Scoreboard
Observ ation Country Year Measure Av erage net
income - total
http://eurostat.linked- http://eurostat.linked- http://eurostat.linked- 17002.3
statistics.org/data/ilc_di04#A,TOTAL,
statistic s.org/dic /geo#NL statistic s.org/dic /time#200
EuroStat
MED_E,EUR,NL,2005
5
(Latc)
LOD2 Webinar . 29.11.2011 . Page 26 http://lod2.eu
28. Creating Knowledge out of Interlinked Data
DEMO
LOD2 Webinar . 29.11.2011 . Page 28 http://lod2.eu
29. Creating Knowledge out of Interlinked Data
SUMMARY
LOD2 Webinar . 29.11.2011 . Page 29 http://lod2.eu
30. Creating Knowledge out of Interlinked Data
Summary
• More tools are available for exploration
– E.g. Sigma.EE, Sindice, CKAN, ORE, etc.
• Contribute your own component
– Check out our HowToContribute document
– Basically: create a Debian package of your application and upload it in our repository.
• More information
– stack.lod2.eu
• You find there also references to supporting documents such as HowToStart and
HowToContribute
• Online exploration can be done at demo.lod2.eu
– support-stack@lod2.eu for technical questions
– lod2@lists.okfn.org general questions on LOD2 and linked data publishing.
LOD2 Webinar . 29.11.2011 . Page 30 http://lod2.eu
31. Creating Knowledge out of Interlinked Data
Q&A
LOD2 Webinar . 29.11.2011 . Page 31 http://lod2.eu
32. Creating Knowledge out of Interlinked Data
LOD2 whishes you a
happy LOD publishing experience
LOD2 Webinar . 29.11.2011 . Page 32 http://lod2.eu
33. Creating Knowledge out of Interlinked Data
Credits
Jingle R.E.M., Martin Kaltenböck, Florian Kondert
Coordination Thomas Thurner
Martin Kaltenböck
Moderation Lambda Verdonckt
Presented by Bert Van Nuffelen
LOD2 Webinar . 29.11.2011 . Page 33 http://lod2.eu
34. Creating Knowledge out of Interlinked Data
Hope you enjoyed staying with us – if you need more detailed
information, visit us at www.lod2.eu and let us know how we can
improve to meet your expectations!
Don’t forget to register for our next webinar
20.12. 2011 - Virtuoso (Open Link Software)
24.01. 2012 - OntoWiki (University of Leipzig, Germany)
Have a great day and don’t forget ...
http://lod2.eu
LOD2 Webinar . 29.11.2011 . Page 34 http://lod2.eu
35. Creating Knowledge out of Interlinked Data
http://lod2.eu
LOD2 Webinar . 29.11.2011 . Page 35 http://lod2.eu
Notas del editor
Ik ben bert Van Nuffelen from the LOD2 partner TenForce, I am leading WP6, the workpackage that is in charge of the integration effort and the construction of the LOD2 stack. today I will tour around the first version of the LOD2 stack. We first introduce you to the base concepts of the LOD2 stack And then demonstrate the current version in the context of 2 small usecases, One based on the WP7 and the other based on a LOD2 publink exercise. Finally we wrap up and answer your questions. At any moment you can to us via the chat window your questions. We will collect them and answer them at the end. VRAAG: hoe kunnen ze stellen, bij Lamdba intro misschien enkele technische richtlijnen meegeven.
The LOD2 stack is the supporting structure to bring those tools and components to the end user (you) so that the publishing process of Linked data is eased. How do we get our content (documents, pdfs, excelsheets, …) in RDF and then part of the Linked Data Web (Toon excel, word, pdf docs) -> toon (RDF doc) -> toon Linked data web From a distance these are the processes involved in the linked data publication
1 Extraction: from doc to RDF 2. Storage: native storage and access, supported excellent standards: RDF, SPARQL, SKOS, OWL, 3. Authoring: management of the data, corrections, tracking changes… 4. Linking: The distinguishing part for LOD data publication. The goal is to related your data with data that is not under your control. 5. Enrichment: if linked new insights can be (semi)automatically be added 6. Quality: asses the overall quality 7. Repair: (=semi)automatic corrections Evolution: adaptations to new business models 8. Explore and browse by the end-user. Disticting is sometimes artificial and order is not strict. The LOD2 stack has industrial strength tool support for the left part. The right part is much more research oriented
Where setup time in 2010 was serveral days, you can do it now in half a day from scratch. When we do the next presentation in a years time, more should be possible in an easier fashion.
Wat do you need to install the lod2 stack
Like the oracle-java compiler.
These offer you the necessary RICHNESS The use case examples are simplified examples from Linked Data publishing experiences in LOD2 which are used to demonstrate the capabilities of the LOD2 stack components. Disclaimer. The LOD2 stack does not come with a single harmonized user interface. So in the demo some steps are made in a technical fashion
This work package aims at evaluating the LOD2 stack and the LOD publishing process for the media and publishing domain. The partner Wolters Kluwer Germany (WKD) is very active in the legal industry and is a global player. They have contributed a large dataset of documents. The documents are in xml format. We have extracted meta data statements about these documents in RDF format. We have created controlled vocabularies ( taxonomies) delimiting the ranges of some meta data annotations Aim: evaluation of the LOD2 stack and the LOD publishing process for the media and publishing domain
Tools used: RDF extractor for XML (valiant/TenForce), Storeage (virtuoso/OpenLink) and Taxonomy management (PoolParty/PoolPartyExtractor/Semantic Web Company) The idea is here to extract from the structured content (XML documents) rdf meta data (structure of the document, annotations like about which legal domain the text is, what the application area of the text is, etc.)
Here the idea is that new not yet annotated documents can get a first annotation based on the taxonomy constructed in PoolParty.
The LOD2 stack can be used by you to distribute your component is a reliable way to potential users. If you have feedback on the webinar contact our webinar responsible.