SlideShare una empresa de Scribd logo
1 de 20
Harvesting and semantically tagging media releases from political websites using web services   Peter Neish, Systems Officer Victorian Parliamentary Library @peterneish
What will be covered ,[object Object],[object Object],[object Object],[object Object],[object Object]
About the library ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Media Releases ,[object Object],[object Object],[object Object],[object Object]
Number of Media Releases per year 0 1000 2000 3000 4000 5000 6000 7000 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
Media releases by party 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 ALP Coalition Green Independent
Project aims ,[object Object],[object Object],[object Object]
Part 1: Automation ,[object Object],[object Object],[object Object],[object Object]
What we built Polls RSS feed for links DB Textworks wkhtml2pdf Servlet Metadata
Technologies used ,[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Part 2: Semantic Tagging ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Open Calais ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Example Open Calais ,[object Object]
Number of Tags assigned by OpenCalais 0 500 1000 1500 2000 2500 3000 3500 4000 4500 0 20 40 60 80 100 120 Tags per item Total number
User interface
Tag Quality 85% 4% 6% 5% Correct Tags Incorrect Tags Repeated Tags Redundant Tags
Problems - disambiguation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Photo by  http://www.flickr.com/photos/meckimac/2971992/   Photo by  http://www.flickr.com/photos/eclogite/257560117/
Linked Data ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Conclusion ,[object Object],[object Object],[object Object],[object Object]

Más contenido relacionado

La actualidad más candente

Socialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINA
Socialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINASocialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINA
Socialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINACIGScotland
 
Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History Terry Reese
 
New ways to communicate in science: perspectives from biodiversity research
New ways to communicate in science: perspectives from biodiversity researchNew ways to communicate in science: perspectives from biodiversity research
New ways to communicate in science: perspectives from biodiversity researchVince Smith
 
VALA 2016 L-Plate session on Linked Open Data
VALA 2016 L-Plate session on Linked Open DataVALA 2016 L-Plate session on Linked Open Data
VALA 2016 L-Plate session on Linked Open DataPeter Neish
 
Deployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_webDeployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_webSTIinnsbruck
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)OpenAIRE
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamEnno Meijers
 
Free UKSG webinar: Exploring how emerging open science services can enhance i...
Free UKSG webinar: Exploring how emerging open science services can enhance i...Free UKSG webinar: Exploring how emerging open science services can enhance i...
Free UKSG webinar: Exploring how emerging open science services can enhance i...UKSG: connecting the knowledge community
 
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)OpenAIRE
 
THOR Workshop - Services EBI
THOR Workshop - Services EBITHOR Workshop - Services EBI
THOR Workshop - Services EBIMaaike Duine
 
Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...petrknoth
 
Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...petrknoth
 
Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Repository Fringe
 
2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk CambridgeMagnus Manske
 

La actualidad más candente (19)

Socialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINA
Socialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINASocialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINA
Socialising in the Sun 245$a / Fred Guy, Suncat Service Manager, EDINA
 
Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History Reframing Public Housing: Visualization and Data Analytics in History
Reframing Public Housing: Visualization and Data Analytics in History
 
New ways to communicate in science: perspectives from biodiversity research
New ways to communicate in science: perspectives from biodiversity researchNew ways to communicate in science: perspectives from biodiversity research
New ways to communicate in science: perspectives from biodiversity research
 
VALA 2016 L-Plate session on Linked Open Data
VALA 2016 L-Plate session on Linked Open DataVALA 2016 L-Plate session on Linked Open Data
VALA 2016 L-Plate session on Linked Open Data
 
Deployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_webDeployment of rd_fa_microdata_microformats_on_the_web
Deployment of rd_fa_microdata_microformats_on_the_web
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
A distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics AmsterdamA distributed network of digital heritage information - Semantics Amsterdam
A distributed network of digital heritage information - Semantics Amsterdam
 
Free UKSG webinar: Exploring how emerging open science services can enhance i...
Free UKSG webinar: Exploring how emerging open science services can enhance i...Free UKSG webinar: Exploring how emerging open science services can enhance i...
Free UKSG webinar: Exploring how emerging open science services can enhance i...
 
Linked Data
Linked DataLinked Data
Linked Data
 
Wikidata
WikidataWikidata
Wikidata
 
Finnie NISO-ICSTI Joint Webinar
Finnie NISO-ICSTI Joint WebinarFinnie NISO-ICSTI Joint Webinar
Finnie NISO-ICSTI Joint Webinar
 
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
EOSC-hub and OpenAIRE-Advance collaboration (Presentation at RDA 11th plenary)
 
20200901 ECCB M. Kutmon
20200901 ECCB M. Kutmon20200901 ECCB M. Kutmon
20200901 ECCB M. Kutmon
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
 
THOR Workshop - Services EBI
THOR Workshop - Services EBITHOR Workshop - Services EBI
THOR Workshop - Services EBI
 
Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...
 
Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...Better together: building services for public good on top of content from the...
Better together: building services for public good on top of content from the...
 
Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...Integration - the heart of researcher centric research data management system...
Integration - the heart of researcher centric research data management system...
 
2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge2014-02-27 Wikidata talk Cambridge
2014-02-27 Wikidata talk Cambridge
 

Similar a Harvesting and semantically tagging media releases from political websites using web services

Exploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal ArchivesExploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal ArchivesJon Voss
 
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”bridgingworlds2008
 
Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...
Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...
Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...BlueFish
 
Library 2.0: A New Version for the Future
Library 2.0: A New Version for the FutureLibrary 2.0: A New Version for the Future
Library 2.0: A New Version for the Futurepddsnn
 
Realising Potential Of Web 2 0
Realising Potential Of Web 2 0Realising Potential Of Web 2 0
Realising Potential Of Web 2 0lisbk
 
Little Falls Library Digitization Project
Little Falls Library Digitization ProjectLittle Falls Library Digitization Project
Little Falls Library Digitization ProjectKevin Andreano
 
Revolutionising Library Management
Revolutionising Library ManagementRevolutionising Library Management
Revolutionising Library ManagementMichelle McLean
 
Biocatalogue, FileQuirks, MyExperiment
Biocatalogue, FileQuirks, MyExperimentBiocatalogue, FileQuirks, MyExperiment
Biocatalogue, FileQuirks, MyExperimentJerzy
 
Web2 UKOLN MLA Workshop
Web2 UKOLN MLA WorkshopWeb2 UKOLN MLA Workshop
Web2 UKOLN MLA WorkshopUKOLN_MLA
 
Mashups & Data Visualizations: The New Breed of Web Applications
Mashups & Data Visualizations: The New Breed of Web ApplicationsMashups & Data Visualizations: The New Breed of Web Applications
Mashups & Data Visualizations: The New Breed of Web ApplicationsDarlene Fichter
 
Defining Web 2.0 and RIA
Defining Web 2.0 and RIADefining Web 2.0 and RIA
Defining Web 2.0 and RIAArielladog
 
Semanticommunity.net: Community Infrastructure Sandbox for 2008
Semanticommunity.net: Community Infrastructure Sandbox for 2008 Semanticommunity.net: Community Infrastructure Sandbox for 2008
Semanticommunity.net: Community Infrastructure Sandbox for 2008 webhostingguy
 
Web 2.0 Challenges for appraisal
Web 2.0 Challenges for appraisalWeb 2.0 Challenges for appraisal
Web 2.0 Challenges for appraisalArian Ravanbakhsh
 
JISC Access and Identity Management: Future Directions
JISC Access and Identity Management: Future DirectionsJISC Access and Identity Management: Future Directions
JISC Access and Identity Management: Future DirectionsJISC.AM
 
Acs Presentation Thinking Outside Of Inbox V2
Acs Presentation   Thinking Outside Of Inbox V2Acs Presentation   Thinking Outside Of Inbox V2
Acs Presentation Thinking Outside Of Inbox V2Johnny Teoh
 
Web 2.0 - principles and implications
Web 2.0 - principles and implicationsWeb 2.0 - principles and implications
Web 2.0 - principles and implicationsMartin Weller
 
Dissmark Ii Social Software
Dissmark Ii Social SoftwareDissmark Ii Social Software
Dissmark Ii Social Softwaredavidroethler
 
Mattsslides
MattsslidesMattsslides
Mattsslidesmgallon
 

Similar a Harvesting and semantically tagging media releases from political websites using web services (20)

Exploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal ArchivesExploring the Use of Linked Data to Bridge State and Federal Archives
Exploring the Use of Linked Data to Bridge State and Federal Archives
 
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
 
Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...
Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...
Power to the People- Enabling Ever US Citizen to Participate in Federal Rule ...
 
Library 2.0: A New Version for the Future
Library 2.0: A New Version for the FutureLibrary 2.0: A New Version for the Future
Library 2.0: A New Version for the Future
 
Realising Potential Of Web 2 0
Realising Potential Of Web 2 0Realising Potential Of Web 2 0
Realising Potential Of Web 2 0
 
data.ac.uk briefing paper
data.ac.uk briefing paperdata.ac.uk briefing paper
data.ac.uk briefing paper
 
Little Falls Library Digitization Project
Little Falls Library Digitization ProjectLittle Falls Library Digitization Project
Little Falls Library Digitization Project
 
Revolutionising Library Management
Revolutionising Library ManagementRevolutionising Library Management
Revolutionising Library Management
 
Web 2.0 workshop
Web 2.0 workshopWeb 2.0 workshop
Web 2.0 workshop
 
Biocatalogue, FileQuirks, MyExperiment
Biocatalogue, FileQuirks, MyExperimentBiocatalogue, FileQuirks, MyExperiment
Biocatalogue, FileQuirks, MyExperiment
 
Web2 UKOLN MLA Workshop
Web2 UKOLN MLA WorkshopWeb2 UKOLN MLA Workshop
Web2 UKOLN MLA Workshop
 
Mashups & Data Visualizations: The New Breed of Web Applications
Mashups & Data Visualizations: The New Breed of Web ApplicationsMashups & Data Visualizations: The New Breed of Web Applications
Mashups & Data Visualizations: The New Breed of Web Applications
 
Defining Web 2.0 and RIA
Defining Web 2.0 and RIADefining Web 2.0 and RIA
Defining Web 2.0 and RIA
 
Semanticommunity.net: Community Infrastructure Sandbox for 2008
Semanticommunity.net: Community Infrastructure Sandbox for 2008 Semanticommunity.net: Community Infrastructure Sandbox for 2008
Semanticommunity.net: Community Infrastructure Sandbox for 2008
 
Web 2.0 Challenges for appraisal
Web 2.0 Challenges for appraisalWeb 2.0 Challenges for appraisal
Web 2.0 Challenges for appraisal
 
JISC Access and Identity Management: Future Directions
JISC Access and Identity Management: Future DirectionsJISC Access and Identity Management: Future Directions
JISC Access and Identity Management: Future Directions
 
Acs Presentation Thinking Outside Of Inbox V2
Acs Presentation   Thinking Outside Of Inbox V2Acs Presentation   Thinking Outside Of Inbox V2
Acs Presentation Thinking Outside Of Inbox V2
 
Web 2.0 - principles and implications
Web 2.0 - principles and implicationsWeb 2.0 - principles and implications
Web 2.0 - principles and implications
 
Dissmark Ii Social Software
Dissmark Ii Social SoftwareDissmark Ii Social Software
Dissmark Ii Social Software
 
Mattsslides
MattsslidesMattsslides
Mattsslides
 

Último

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 

Último (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Harvesting and semantically tagging media releases from political websites using web services

  • 1. Harvesting and semantically tagging media releases from political websites using web services Peter Neish, Systems Officer Victorian Parliamentary Library @peterneish
  • 2.
  • 3.
  • 4.
  • 5. Number of Media Releases per year 0 1000 2000 3000 4000 5000 6000 7000 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
  • 6. Media releases by party 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 ALP Coalition Green Independent
  • 7.
  • 8.
  • 9. What we built Polls RSS feed for links DB Textworks wkhtml2pdf Servlet Metadata
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15. Number of Tags assigned by OpenCalais 0 500 1000 1500 2000 2500 3000 3500 4000 4500 0 20 40 60 80 100 120 Tags per item Total number
  • 17. Tag Quality 85% 4% 6% 5% Correct Tags Incorrect Tags Repeated Tags Redundant Tags
  • 18.
  • 19.
  • 20.

Notas del editor

  1. Talk about the Parliamentary Library: Established in 1851, building itself 1858–60
  2. What will be covered in today’s talk