SlideShare una empresa de Scribd logo
1 de 47
Descargar para leer sin conexión
Towards the Integration of a Research Group
      Website into the Web of Data

 Mikel Emaldi David Buj´n Diego L´pez de Ipi˜a
                         a            o      n
        {m.emaldi, dbujan, dipina}@deusto.es

        Deusto Institute of Technology - DeustoTech




                    November 2011
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




       1 Motivation

       2 Our Solution
               First Approach
               Solution Overview
               Data Extraction
               System Architecture

       3 Linked Data Extension

       4 Conclusions

       5 Future Work


Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Table of Contents


   1 Motivation

   2 Our Solution
       First Approach
       Solution Overview
       Data Extraction
       System Architecture

   3 Linked Data Extension

   4 Conclusions

   5 Future Work
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




Motivation


              The desire of offering our research group website’s
              (http://www.morelab.deusto.es) data as Linked Data
                      Our web is supported by Joomla! CMS
                              The data is unstructured




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




Motivation


              The desire of offering our research group website’s
              (http://www.morelab.deusto.es) data as Linked Data
                      Our web is supported by Joomla! CMS
                              The data is unstructured
              We chose our publications section as first attempt
                      Almost 100 publications
                      Possibility to link them to external datasets
                      We saw the oportunity of centralize group’s FOAF files




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Table of Contents


   1 Motivation

   2 Our Solution
       First Approach
       Solution Overview
       Data Extraction
       System Architecture

   3 Linked Data Extension

   4 Conclusions

   5 Future Work
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



First Approach


First Approach



                 A solution based on Python web-script (mod python)
                 The core code of Joomla! was to be modified
                 Here there was a major problem:
                      When a security update was installed, Joomla! used to destroy
                      our custom code




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Solution Overview


Joomla! Extension




              A solution based on an Extension for Joomla!




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Solution Overview


Joomla! Extension

              A solution based on an Extension for Joomla!
                      Component




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Solution Overview


Joomla! Extension

              A solution based on an Extension for Joomla!
                      Plugin




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Solution Overview


Joomla! Extension




              A solution based on an Extension for Joomla!
                      It offers a feasible solution for analyze published publications
                      and to generate correspondent Linked Data




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation               Our Solution             Linked Data Extension                Conclusions               Future Work



Data Extraction


Joomla! Content Example


              TALISMAN+: Intelligent System for Follow-Up and
              Promotion of Personal Autonomy
                                    o         n      e               ´
              David Aus´ Diego L´pez-de-Ipi˜a, Jos´ Bravo, Miguel Angel Valero, Francisco Fl´rez. TALISMAN+:
                          ın,                                                                 o
              Intelligent System for Follow-Up and Promotion of Personal Autonomy. III International Workshop on
              Ambient Assisted Living - IWAAL 2011. M´laga, Spain. June 2011.
                                                       a

              The TALISMAN+ project, financed by the Spanish Ministry of Science and Innovation, aims to research
              and demonstrate innovative solutions transferable to society which offer services and products based on
              information and communication technologies in order to promote personal autonomy in prevention and
              monitoring scenarios. It will solve critical interoperability problems among systems and emerging
              technologies in a context where heterogeneity brings about accessibility barriers not yet overcome and
              demanded by the scientific, technological or social-health settings.

              Download




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                                          DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Overview




              Data is extracted throught three ways:




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Overview




              Data is extracted throught three ways:
                      User defined Regular Expression




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Overview




              Data is extracted throught three ways:
                      User defined Regular Expression
                      DBLP SPARQL Endpoint




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Overview




              Data is extracted throught three ways:
                      User defined Regular Expression
                      DBLP SPARQL Endpoint
                      Google Scholar search engine




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Regex I


              User defines a regular expression to parse its content
              User has to define used ontologies and their prefixes into the
              admin control panel
              The regex tags are clearly understandable
                      The ontology properties to be mapped are tagged between {}
                      Every delimiter (also the {}) is identified by a 
                      The term {dummy } can be used to ignore content




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation               Our Solution             Linked Data Extension                Conclusions               Future Work



Data Extraction


Regex II

                                    o         n      e               ´
              David Aus´ Diego L´pez-de-Ipi˜a, Jos´ Bravo, Miguel Angel Valero, Francisco Fl´rez. TALISMAN+:
                          ın,                                                                 o
              Intelligent System for Follow-Up and Promotion of Personal Autonomy. III International Workshop on
              Ambient Assisted Living - IWAAL 2011. M´laga, Spain. June 2011.
                                                       a

              The TALISMAN+ project, financed by the Spanish Ministry of Science and Innovation, aims to research
              and demonstrate innovative solutions transferable to society which offer services and products based on
              information and communication technologies in order to promote personal autonomy in prevention and
              monitoring scenarios. It will solve critical interoperability problems among systems and emerging
              technologies in a context where heterogeneity brings about accessibility barriers not yet overcome and
              demanded by the scientific, technological or social-health settings.

              Download




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                                          DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation               Our Solution                Linked Data Extension             Conclusions               Future Work



Data Extraction


Regex II

                                    o         n      e               ´
              David Aus´ Diego L´pez-de-Ipi˜a, Jos´ Bravo, Miguel Angel Valero, Francisco Fl´rez. TALISMAN+:
                          ın,                                                                 o
              Intelligent System for Follow-Up and Promotion of Personal Autonomy. III International Workshop on
              Ambient Assisted Living - IWAAL 2011. M´laga, Spain. June 2011.
                                                       a

              The TALISMAN+ project, financed by the Spanish Ministry of Science and Innovation, aims to research
              and demonstrate innovative solutions transferable to society which offer services and products based on
              information and communication technologies in order to promote personal autonomy in prevention and
              monitoring scenarios. It will solve critical interoperability problems among systems and emerging
              technologies in a context where heterogeneity brings about accessibility barriers not yet overcome and
              demanded by the scientific, technological or social-health settings.

              Download

              {dc : c r e a t o r , s e p ( , ) }  . {dc : t i t l e }.
              { s w r c : s e r i e s }. { s w r c : l o c a t i o n }.
              {dc : d a t e }. { b i b o : a b s t r a c t } Download$




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                                          DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


DBLP I



              Digital Bibliography & Library Project
              > 1.3 million articles
              SPARQL endpoint at:
                      http://dblp.l3s.de/d2r/sparql/
                      http://dblp.l3s.de/d2r/snorql/




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension             Conclusions           Future Work



Data Extraction


DBLP II


              DBLP SPARQL endpoint is used to search data about
              publications
                      SELECT DISTINCT ?uri ?p ?o WHERE {?uri dc:title
                      “title-of-article”ˆˆ<http://www.w3.org/2001/XMLSchema#string>}

              Data is enriched with our own data and saved into the RDF
              store
              We also link members FOAF’s to DBLP authors data
                      <http://www.morelab.deusto.es/resource/dipina> owl:sameAs
                      <http://dblp.l3s.de/d2r/resource/authors/Diego L´pez-de-Ipi˜a> ;
                                                                      o          n




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                                    DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Google Scholar I


              A simple way to broadly search for scholarly literature
              http://scholar.google.com
              It exports data in diferent formats
                      BibTeX
                      EndNote
                      RefMan
                      RefWorks
                      WenXiangWang




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Google Scholar II




              The data from GS is extracted via BibTeX scrapping




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Google Scholar II


              The data from GS is extracted via BibTeX scrapping
                      An HTTP request using an specific cookie to retrieve BibTeX
                      data




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Google Scholar II



              The data from GS is extracted via BibTeX scrapping
                      BibTeX data is retrieved




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Google Scholar II

              The data from GS is extracted via BibTeX scrapping
                      Mapping from BibTeX data to RDF




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension             Conclusions              Future Work



Data Extraction


FOAF

              Every member of our group has its own FOAF file
                      http://www.morelab.deusto.es/resource/member-alias
              Every publication is linked to its author’s URI
                      <http://www.morelab.deusto.es/resource/imhotep-an-approach-to-user-and-device-conscious-
                      mobile-applications> dc:creator
                      <http://www.morelab.deusto.es/resource/dipina>


                      This is done automatically looking for author’s nicknames




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                                      DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



Data Extraction


Flowchart




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Overview




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Overview




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Overview




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Overview




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Overview




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Overview




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Joseki + SDB


              Joseki
                      A SPARQL server for Jena
                      Storage into RDF files and relational databases
                      It allows SPARQL Updates
                      It is private for our system




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Joseki + SDB


              Joseki
                      A SPARQL server for Jena
                      Storage into RDF files and relational databases
                      It allows SPARQL Updates
                      It is private for our system
              SDB
                      A component of Jena
                      It provides:
                              Scalable storage
                              Query of RDF datasets using conventional SQL databases



Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Pubby
              Pubby adds Linked Data interfaces to SPARQL endpoints
              It allows content negotiation among these formats:
                      HTML
                      RDF/XML
                      N3




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work



System Architecture


Snorql

              An AJAXy front-end for exploring RDF SPARQL endpoints
              More usable than Joseki
              It is MoreLab’s public SPARQL endpoint




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Table of Contents


   1 Motivation

   2 Our Solution
       First Approach
       Solution Overview
       Data Extraction
       System Architecture

   3 Linked Data Extension

   4 Conclusions

   5 Future Work
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




Admin Overview


              Dataset Creation:




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




Admin Overview
              Ontology Prefix Definition:




              Regex Definition:




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




User Overview




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Table of Contents


   1 Motivation

   2 Our Solution
       First Approach
       Solution Overview
       Data Extraction
       System Architecture

   3 Linked Data Extension

   4 Conclusions

   5 Future Work
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




Conclusions




              This solution integrates our data into Web of Data easily
              Provides a reusable solution
              Opens the door to more extendable solutions




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Table of Contents


   1 Motivation

   2 Our Solution
       First Approach
       Solution Overview
       Data Extraction
       System Architecture

   3 Linked Data Extension

   4 Conclusions

   5 Future Work
Motivation              Our Solution              Linked Data Extension    Conclusions           Future Work




Future Work



              Link our datasets with more external datasets
                      DBPedia
                      Geonames
              RDF and SPARQL search form
              Externalize linked data sources
                      Building the Extension modularly




Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a
                       a          o          n                                           DeustoTech - Internet
Towards the Integration of a Research Group Website into the Web of Data
Towards the Integration of a Research Group
      Website into the Web of Data

 Mikel Emaldi David Buj´n Diego L´pez de Ipi˜a
                         a            o      n
        {m.emaldi, dbujan, dipina}@deusto.es

        Deusto Institute of Technology - DeustoTech




                    November 2011

Más contenido relacionado

Similar a Towards the Integration of Research Group Website into the Web of Data

Department of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsDepartment of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsBrand Niemann
 
Toward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docxToward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docxjuliennehar
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfAlan Morrison
 
Research Methodology (how to choose Datasets ).pptx
Research Methodology (how to choose Datasets ).pptxResearch Methodology (how to choose Datasets ).pptx
Research Methodology (how to choose Datasets ).pptxZainab Alhassani
 
Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...
Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...
Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...Inside Analysis
 
Mike2.0 Methodology Overview
Mike2.0 Methodology OverviewMike2.0 Methodology Overview
Mike2.0 Methodology Overviewsean.mcclowry
 
How To Connect To Your Customers, Partners Securely, Privately and Effectively
How To Connect To Your Customers, Partners Securely, Privately and EffectivelyHow To Connect To Your Customers, Partners Securely, Privately and Effectively
How To Connect To Your Customers, Partners Securely, Privately and EffectivelyAndy Harjanto
 
Business Intelligence for normal people
Business Intelligence for normal peopleBusiness Intelligence for normal people
Business Intelligence for normal peoplemark madsen
 
Kbk group presentation
Kbk group presentationKbk group presentation
Kbk group presentationAlexeyKudashov
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project GuidanceVarad Meru
 
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop FrameworkIRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET- Youtube Data Sensitivity and Analysis using Hadoop FrameworkIRJET Journal
 
Forecast 2012 Panel: Big Data in the Cloud Das Kamhout
Forecast 2012 Panel: Big Data in the Cloud Das KamhoutForecast 2012 Panel: Big Data in the Cloud Das Kamhout
Forecast 2012 Panel: Big Data in the Cloud Das KamhoutOpen Data Center Alliance
 
GeoNode Motivation, Design, and Challenges
GeoNode Motivation, Design, and ChallengesGeoNode Motivation, Design, and Challenges
GeoNode Motivation, Design, and ChallengesSebastian Benthall
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOADemed L'Her
 
Response needed 1The paper is well placed on the issues of the.docx
Response needed 1The paper is well placed on the issues of the.docxResponse needed 1The paper is well placed on the issues of the.docx
Response needed 1The paper is well placed on the issues of the.docxaudeleypearl
 

Similar a Towards the Integration of Research Group Website into the Web of Data (20)

Research paper on big data and hadoop
Research paper on big data and hadoopResearch paper on big data and hadoop
Research paper on big data and hadoop
 
Department of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsDepartment of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data Dashboards
 
Toward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docxToward a System Building Agenda for Data Integration(and Dat.docx
Toward a System Building Agenda for Data Integration(and Dat.docx
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdf
 
Research Methodology (how to choose Datasets ).pptx
Research Methodology (how to choose Datasets ).pptxResearch Methodology (how to choose Datasets ).pptx
Research Methodology (how to choose Datasets ).pptx
 
What Is Mike2.0
What Is Mike2.0What Is Mike2.0
What Is Mike2.0
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...
Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...
Drinking from the Fire Hose: Practical Approaches to Big Data Preparation and...
 
Mike2.0 Methodology Overview
Mike2.0 Methodology OverviewMike2.0 Methodology Overview
Mike2.0 Methodology Overview
 
How To Connect To Your Customers, Partners Securely, Privately and Effectively
How To Connect To Your Customers, Partners Securely, Privately and EffectivelyHow To Connect To Your Customers, Partners Securely, Privately and Effectively
How To Connect To Your Customers, Partners Securely, Privately and Effectively
 
Business Intelligence for normal people
Business Intelligence for normal peopleBusiness Intelligence for normal people
Business Intelligence for normal people
 
Kbk group presentation
Kbk group presentationKbk group presentation
Kbk group presentation
 
Final Year Project Guidance
Final Year Project GuidanceFinal Year Project Guidance
Final Year Project Guidance
 
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop FrameworkIRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
 
Forecast 2012 Panel: Big Data in the Cloud Das Kamhout
Forecast 2012 Panel: Big Data in the Cloud Das KamhoutForecast 2012 Panel: Big Data in the Cloud Das Kamhout
Forecast 2012 Panel: Big Data in the Cloud Das Kamhout
 
Good enough
Good enoughGood enough
Good enough
 
GeoNode Motivation, Design, and Challenges
GeoNode Motivation, Design, and ChallengesGeoNode Motivation, Design, and Challenges
GeoNode Motivation, Design, and Challenges
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOA
 
Response needed 1The paper is well placed on the issues of the.docx
Response needed 1The paper is well placed on the issues of the.docxResponse needed 1The paper is well placed on the issues of the.docx
Response needed 1The paper is well placed on the issues of the.docx
 

Último

Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 

Último (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 

Towards the Integration of Research Group Website into the Web of Data

  • 1. Towards the Integration of a Research Group Website into the Web of Data Mikel Emaldi David Buj´n Diego L´pez de Ipi˜a a o n {m.emaldi, dbujan, dipina}@deusto.es Deusto Institute of Technology - DeustoTech November 2011
  • 2. Motivation Our Solution Linked Data Extension Conclusions Future Work 1 Motivation 2 Our Solution First Approach Solution Overview Data Extraction System Architecture 3 Linked Data Extension 4 Conclusions 5 Future Work Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 3. Table of Contents 1 Motivation 2 Our Solution First Approach Solution Overview Data Extraction System Architecture 3 Linked Data Extension 4 Conclusions 5 Future Work
  • 4. Motivation Our Solution Linked Data Extension Conclusions Future Work Motivation The desire of offering our research group website’s (http://www.morelab.deusto.es) data as Linked Data Our web is supported by Joomla! CMS The data is unstructured Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 5. Motivation Our Solution Linked Data Extension Conclusions Future Work Motivation The desire of offering our research group website’s (http://www.morelab.deusto.es) data as Linked Data Our web is supported by Joomla! CMS The data is unstructured We chose our publications section as first attempt Almost 100 publications Possibility to link them to external datasets We saw the oportunity of centralize group’s FOAF files Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 6. Table of Contents 1 Motivation 2 Our Solution First Approach Solution Overview Data Extraction System Architecture 3 Linked Data Extension 4 Conclusions 5 Future Work
  • 7. Motivation Our Solution Linked Data Extension Conclusions Future Work First Approach First Approach A solution based on Python web-script (mod python) The core code of Joomla! was to be modified Here there was a major problem: When a security update was installed, Joomla! used to destroy our custom code Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 8. Motivation Our Solution Linked Data Extension Conclusions Future Work Solution Overview Joomla! Extension A solution based on an Extension for Joomla! Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 9. Motivation Our Solution Linked Data Extension Conclusions Future Work Solution Overview Joomla! Extension A solution based on an Extension for Joomla! Component Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 10. Motivation Our Solution Linked Data Extension Conclusions Future Work Solution Overview Joomla! Extension A solution based on an Extension for Joomla! Plugin Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 11. Motivation Our Solution Linked Data Extension Conclusions Future Work Solution Overview Joomla! Extension A solution based on an Extension for Joomla! It offers a feasible solution for analyze published publications and to generate correspondent Linked Data Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 12. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Joomla! Content Example TALISMAN+: Intelligent System for Follow-Up and Promotion of Personal Autonomy o n e ´ David Aus´ Diego L´pez-de-Ipi˜a, Jos´ Bravo, Miguel Angel Valero, Francisco Fl´rez. TALISMAN+: ın, o Intelligent System for Follow-Up and Promotion of Personal Autonomy. III International Workshop on Ambient Assisted Living - IWAAL 2011. M´laga, Spain. June 2011. a The TALISMAN+ project, financed by the Spanish Ministry of Science and Innovation, aims to research and demonstrate innovative solutions transferable to society which offer services and products based on information and communication technologies in order to promote personal autonomy in prevention and monitoring scenarios. It will solve critical interoperability problems among systems and emerging technologies in a context where heterogeneity brings about accessibility barriers not yet overcome and demanded by the scientific, technological or social-health settings. Download Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 13. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Overview Data is extracted throught three ways: Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 14. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Overview Data is extracted throught three ways: User defined Regular Expression Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 15. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Overview Data is extracted throught three ways: User defined Regular Expression DBLP SPARQL Endpoint Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 16. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Overview Data is extracted throught three ways: User defined Regular Expression DBLP SPARQL Endpoint Google Scholar search engine Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 17. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Regex I User defines a regular expression to parse its content User has to define used ontologies and their prefixes into the admin control panel The regex tags are clearly understandable The ontology properties to be mapped are tagged between {} Every delimiter (also the {}) is identified by a The term {dummy } can be used to ignore content Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 18. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Regex II o n e ´ David Aus´ Diego L´pez-de-Ipi˜a, Jos´ Bravo, Miguel Angel Valero, Francisco Fl´rez. TALISMAN+: ın, o Intelligent System for Follow-Up and Promotion of Personal Autonomy. III International Workshop on Ambient Assisted Living - IWAAL 2011. M´laga, Spain. June 2011. a The TALISMAN+ project, financed by the Spanish Ministry of Science and Innovation, aims to research and demonstrate innovative solutions transferable to society which offer services and products based on information and communication technologies in order to promote personal autonomy in prevention and monitoring scenarios. It will solve critical interoperability problems among systems and emerging technologies in a context where heterogeneity brings about accessibility barriers not yet overcome and demanded by the scientific, technological or social-health settings. Download Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 19. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Regex II o n e ´ David Aus´ Diego L´pez-de-Ipi˜a, Jos´ Bravo, Miguel Angel Valero, Francisco Fl´rez. TALISMAN+: ın, o Intelligent System for Follow-Up and Promotion of Personal Autonomy. III International Workshop on Ambient Assisted Living - IWAAL 2011. M´laga, Spain. June 2011. a The TALISMAN+ project, financed by the Spanish Ministry of Science and Innovation, aims to research and demonstrate innovative solutions transferable to society which offer services and products based on information and communication technologies in order to promote personal autonomy in prevention and monitoring scenarios. It will solve critical interoperability problems among systems and emerging technologies in a context where heterogeneity brings about accessibility barriers not yet overcome and demanded by the scientific, technological or social-health settings. Download {dc : c r e a t o r , s e p ( , ) } . {dc : t i t l e }. { s w r c : s e r i e s }. { s w r c : l o c a t i o n }. {dc : d a t e }. { b i b o : a b s t r a c t } Download$ Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 20. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction DBLP I Digital Bibliography & Library Project > 1.3 million articles SPARQL endpoint at: http://dblp.l3s.de/d2r/sparql/ http://dblp.l3s.de/d2r/snorql/ Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 21. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction DBLP II DBLP SPARQL endpoint is used to search data about publications SELECT DISTINCT ?uri ?p ?o WHERE {?uri dc:title “title-of-article”ˆˆ<http://www.w3.org/2001/XMLSchema#string>} Data is enriched with our own data and saved into the RDF store We also link members FOAF’s to DBLP authors data <http://www.morelab.deusto.es/resource/dipina> owl:sameAs <http://dblp.l3s.de/d2r/resource/authors/Diego L´pez-de-Ipi˜a> ; o n Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 22. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Google Scholar I A simple way to broadly search for scholarly literature http://scholar.google.com It exports data in diferent formats BibTeX EndNote RefMan RefWorks WenXiangWang Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 23. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Google Scholar II The data from GS is extracted via BibTeX scrapping Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 24. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Google Scholar II The data from GS is extracted via BibTeX scrapping An HTTP request using an specific cookie to retrieve BibTeX data Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 25. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Google Scholar II The data from GS is extracted via BibTeX scrapping BibTeX data is retrieved Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 26. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Google Scholar II The data from GS is extracted via BibTeX scrapping Mapping from BibTeX data to RDF Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 27. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction FOAF Every member of our group has its own FOAF file http://www.morelab.deusto.es/resource/member-alias Every publication is linked to its author’s URI <http://www.morelab.deusto.es/resource/imhotep-an-approach-to-user-and-device-conscious- mobile-applications> dc:creator <http://www.morelab.deusto.es/resource/dipina> This is done automatically looking for author’s nicknames Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 28. Motivation Our Solution Linked Data Extension Conclusions Future Work Data Extraction Flowchart Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 29. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Overview Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 30. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Overview Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 31. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Overview Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 32. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Overview Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 33. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Overview Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 34. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Overview Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 35. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Joseki + SDB Joseki A SPARQL server for Jena Storage into RDF files and relational databases It allows SPARQL Updates It is private for our system Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 36. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Joseki + SDB Joseki A SPARQL server for Jena Storage into RDF files and relational databases It allows SPARQL Updates It is private for our system SDB A component of Jena It provides: Scalable storage Query of RDF datasets using conventional SQL databases Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 37. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Pubby Pubby adds Linked Data interfaces to SPARQL endpoints It allows content negotiation among these formats: HTML RDF/XML N3 Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 38. Motivation Our Solution Linked Data Extension Conclusions Future Work System Architecture Snorql An AJAXy front-end for exploring RDF SPARQL endpoints More usable than Joseki It is MoreLab’s public SPARQL endpoint Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 39. Table of Contents 1 Motivation 2 Our Solution First Approach Solution Overview Data Extraction System Architecture 3 Linked Data Extension 4 Conclusions 5 Future Work
  • 40. Motivation Our Solution Linked Data Extension Conclusions Future Work Admin Overview Dataset Creation: Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 41. Motivation Our Solution Linked Data Extension Conclusions Future Work Admin Overview Ontology Prefix Definition: Regex Definition: Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 42. Motivation Our Solution Linked Data Extension Conclusions Future Work User Overview Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 43. Table of Contents 1 Motivation 2 Our Solution First Approach Solution Overview Data Extraction System Architecture 3 Linked Data Extension 4 Conclusions 5 Future Work
  • 44. Motivation Our Solution Linked Data Extension Conclusions Future Work Conclusions This solution integrates our data into Web of Data easily Provides a reusable solution Opens the door to more extendable solutions Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 45. Table of Contents 1 Motivation 2 Our Solution First Approach Solution Overview Data Extraction System Architecture 3 Linked Data Extension 4 Conclusions 5 Future Work
  • 46. Motivation Our Solution Linked Data Extension Conclusions Future Work Future Work Link our datasets with more external datasets DBPedia Geonames RDF and SPARQL search form Externalize linked data sources Building the Extension modularly Mikel Emaldi, David Buj´n, Diego L´pez de Ipi˜a a o n DeustoTech - Internet Towards the Integration of a Research Group Website into the Web of Data
  • 47. Towards the Integration of a Research Group Website into the Web of Data Mikel Emaldi David Buj´n Diego L´pez de Ipi˜a a o n {m.emaldi, dbujan, dipina}@deusto.es Deusto Institute of Technology - DeustoTech November 2011