SlideShare una empresa de Scribd logo
1 de 16
The Future is All Mine
Text and Data Mining
Projects in Europe
@openminted_eu @futuretdm
@openminted_eu
@futuretdm
Funded by:
Projects funded by
@openminted_eu
@futuretdm
Text and data mining is
the future
“Text and data mining (TDM) is the
process of deriving information from
machine-read material. It works by
copying large quantities of material,
extracting the data, and recombining it
to identify patterns.”
JISC
Projects funded by
@openminted_eu
@futuretdm
Text and data mining
helps us understand the
past
Mining historical
books:
the evolution of
language
Source: http://www.sciencemag.org/content/331/6014/176 (Baylor College of Medicine, Houston)
Projects funded by
@openminted_eu
@futuretdm
Text and data mining
predicts the future
Mining newspapers:
Predicts revolutions
Source: http://journals.uic.edu/ojs/index.php/fm/article/view/3663/3040 (University of Illinois)
Projects funded by
@openminted_eu
@futuretdm
Text and data mining
saves the future
Mining scientific
publications about
diseases:
Save lives
Source: http://dl.acm.org/citation.cfm?id=2623667 (Baylor College of Medicine, Houston)
Projects funded by
@openminted_eu
@futuretdm
Text mining – it seems so easy:
Linguistic
Analysis:
Entity
Recognition
Data Mining
Knowledge
Discovery
Information
Extraction
STAGE 1 STAGE 2 STAGE 3 STAGE 4
Information
Retrieval
Projects funded by
@openminted_eu
@futuretdm
But it actually poses many
challenges…
?
?
?
?
?
?
?
??
?? ?
?
??
?
?
How do I
make my texts
readable by
machines?
?Which mining
method to
use?
STAGE 1 STAGE 2 STAGE 3 STAGE 4
Where do I
find data?
Projects funded by
@openminted_eu
@futuretdm
9
Current Barriers in Europe
Awareness across Institutions & Stakeholders
 Lack of awareness among research
communities
 Lack of guidance to uncover TDM potential
Skills and Tools
 Availability and accessibility across disciplines
 Gap in skills across various sectors
Licensing & Open Access
 License proliferation and interoperability
issues
 License barriers to transparent open access
Copyright and Data Protection
 TDM activities infringing current copyright laws
 Legal and policy limitations and barriers for
TDM
Projects funded by
@openminted_eu
@futuretdm
EU PROJECTS on TDM
FutureTDM
Identify TDM
barriers and
policy solutions
Open mine
Build a TDM
eInfrastructure
Projects funded by
@openminted_eu
@futuretdm
ELABORATE a legal and
policy framework for future
TDM and specify a research
agenda to foster the spread
of TDM
BUILD a website: a
Collaborative
Knowledge Base and
an Open Information
Hub combined
ANALYSE current
application areas and best
practices in TDM
ASSESS existing
studies, legal
regulations and
policies on TDM
Main Objectives of FutureTDM
INVOLVE all key
stakeholders to
identify practices,
requirements, and
specific challenges
INCREASE
awareness of
TDM to attract
new target
groups and
science domains
@openminted_eu
@futuretdm
This project has received funding from the European Union’s Horizon 2020
Research and Innovation Programme under Grant Agreement No 665940.
Bottom-up
approach:
Stakeholder
workshops and
knowledge cafes
throughout Europe
FutureTDM
@openminted_eu
@futuretdm
This project has received funding from the European Union’s Horizon 2020
Research and Innovation Programme under Grant Agreement No 665940.
Data centre Data centre Data centre Data centre
in public cloud
Publisher text
corpus
OpenAIRE/CORE text
corpus
PMC text
corpus
Other text
corpora
Other text
corpora
Other text
corpora
Other types of text
corpora
Layer 3:
Interoperability
to shared storage and
computing resources
Language resources
Language resources
Language resources Language resources
Layer 2:
Interoperability of
language resources
& corpora
Layer 1:
Interoperability
of text mining services
(platforms or
components)
Language resources and corpora registry service
Platform services Registry Workflow ManagementAuth2 & Policy management Annotator Accounting
Mining Platforms Mining Platforms Mining Platforms
Proprietary architectures
Mining Platforms
Objective of OpenMinTeD
@openminted_eu
Projects funded by@futuretdm
OpenMinTeD brings together:
14
ACCESSIBLE
CONTENT
DISCOVERABLE
SERVICES
EFFICIENT
PROCESSING
TDM
COMMUNITIES
VALUE ADDED
APPS
Via standardised programmatic
interfaces and access rules
Easily discoverable text mining
services and workflows which
process, analyse and annotate text
Operate on public e-Infrastructures
via standarized APIs
Different scientific communities
have different challenges
Community-driven applications to
illustrate the value of the
infastructure. Engage with industry.
OPENMINTED = The Open Mining Infrastructure for Text and Data
Become involved
Follow us on Twitter for the latest updates and blogs
@openminted_eu
@futuretdm
Follow our websites
www.openminted.eu
www.futuretdm.eu
Projects funded by
@openminted_eu
@futuretdm
THANK YOU
• Athena RIC
• Univ. of Manchester (NacTem)
• Univ. of Darmstadt
• INRA
• EMBL-EBI
• Agro-Know
• LIBER
• Univ. of Amsterdam
• Open University UK
• EPFL
• CNIO
• Univ. of Sheffield (GATE)
• GESIS
• GRNET
• Frontiers
• Univ. of Stirling
PARTNERS OPENMINTEDPARTNERS FUTURETDM
• SYNYO GmbH (SYNYO)
• LIBER Europe
• Open Knowledge Foundation
LBG (OK/CM)
• Radboud Univ. Nijmegen
• The British Library Board
• Univ. of Amsterdam
• Athena RIC
• Ubiquity Press
• Fundacja Projekt: Polska (FPP)

Más contenido relacionado

La actualidad más candente

re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data RepositoriesHeinz Pampel
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
 
FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...
FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...
FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...EUDAT
 
Making Research Data Repositories Visible – The re3data.org Registry
Making Research Data Repositories Visible – The re3data.org RegistryMaking Research Data Repositories Visible – The re3data.org Registry
Making Research Data Repositories Visible – The re3data.org RegistryHeinz Pampel
 
Libraries at the centre of the debate on copyright and text and data mining: ...
Libraries at the centre of the debate on copyright and text and data mining: ...Libraries at the centre of the debate on copyright and text and data mining: ...
Libraries at the centre of the debate on copyright and text and data mining: ...LIBER Europe
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Open content opens up new avenues of research
Open content opens up new avenues of researchOpen content opens up new avenues of research
Open content opens up new avenues of researchFelix Lohmeier
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhurymaredata
 
Understanding the users of the Parliamentary Web Archive: a user research pro...
Understanding the users of the Parliamentary Web Archive: a user research pro...Understanding the users of the Parliamentary Web Archive: a user research pro...
Understanding the users of the Parliamentary Web Archive: a user research pro...Peter Webster
 
Zenodo - The catch-all repository
Zenodo - The catch-all repository Zenodo - The catch-all repository
Zenodo - The catch-all repository OpenAccessBelgium
 
Eva Méndez: Política europea y EOSC
Eva Méndez: Política europea y EOSCEva Méndez: Política europea y EOSC
Eva Méndez: Política europea y EOSCmaredata
 
Aggregating Research papers from Publishers' Systems to Support Text and Data...
Aggregating Research papers from Publishers' Systems to Support Text and Data...Aggregating Research papers from Publishers' Systems to Support Text and Data...
Aggregating Research papers from Publishers' Systems to Support Text and Data...petrknoth
 
Library Science Talk: Tensions between copyright and knowledge discovery
Library Science Talk: Tensions between copyright and knowledge discoveryLibrary Science Talk: Tensions between copyright and knowledge discovery
Library Science Talk: Tensions between copyright and knowledge discoveryLIBER Europe
 
Horizon 2020: Outline of a Pilot for Open Research Data
Horizon 2020: Outline of a Pilot for Open Research Data  Horizon 2020: Outline of a Pilot for Open Research Data
Horizon 2020: Outline of a Pilot for Open Research Data LIBER Europe
 
Rebecca Grant - DRI Training Series: 1. Organising Your Collection
Rebecca Grant - DRI Training Series: 1. Organising Your Collection Rebecca Grant - DRI Training Series: 1. Organising Your Collection
Rebecca Grant - DRI Training Series: 1. Organising Your Collection dri_ireland
 

La actualidad más candente (20)

Connecting Museums with Linked Data
Connecting Museums with Linked DataConnecting Museums with Linked Data
Connecting Museums with Linked Data
 
re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositories
 
Elab 16 5-13-re3data-scholze-final
Elab 16 5-13-re3data-scholze-finalElab 16 5-13-re3data-scholze-final
Elab 16 5-13-re3data-scholze-final
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...
FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...
FREYA - Connected Open Identifiers for Discovery, Access and Use of Research ...
 
Making Research Data Repositories Visible – The re3data.org Registry
Making Research Data Repositories Visible – The re3data.org RegistryMaking Research Data Repositories Visible – The re3data.org Registry
Making Research Data Repositories Visible – The re3data.org Registry
 
Libraries at the centre of the debate on copyright and text and data mining: ...
Libraries at the centre of the debate on copyright and text and data mining: ...Libraries at the centre of the debate on copyright and text and data mining: ...
Libraries at the centre of the debate on copyright and text and data mining: ...
 
Scholze goportis 4-11-14
Scholze goportis 4-11-14Scholze goportis 4-11-14
Scholze goportis 4-11-14
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
Imac 090924
Imac 090924Imac 090924
Imac 090924
 
Scholze imcw 2014-11-25
Scholze imcw 2014-11-25Scholze imcw 2014-11-25
Scholze imcw 2014-11-25
 
Open content opens up new avenues of research
Open content opens up new avenues of researchOpen content opens up new avenues of research
Open content opens up new avenues of research
 
Gobinda Chowdhury
Gobinda ChowdhuryGobinda Chowdhury
Gobinda Chowdhury
 
Understanding the users of the Parliamentary Web Archive: a user research pro...
Understanding the users of the Parliamentary Web Archive: a user research pro...Understanding the users of the Parliamentary Web Archive: a user research pro...
Understanding the users of the Parliamentary Web Archive: a user research pro...
 
Zenodo - The catch-all repository
Zenodo - The catch-all repository Zenodo - The catch-all repository
Zenodo - The catch-all repository
 
Eva Méndez: Política europea y EOSC
Eva Méndez: Política europea y EOSCEva Méndez: Política europea y EOSC
Eva Méndez: Política europea y EOSC
 
Aggregating Research papers from Publishers' Systems to Support Text and Data...
Aggregating Research papers from Publishers' Systems to Support Text and Data...Aggregating Research papers from Publishers' Systems to Support Text and Data...
Aggregating Research papers from Publishers' Systems to Support Text and Data...
 
Library Science Talk: Tensions between copyright and knowledge discovery
Library Science Talk: Tensions between copyright and knowledge discoveryLibrary Science Talk: Tensions between copyright and knowledge discovery
Library Science Talk: Tensions between copyright and knowledge discovery
 
Horizon 2020: Outline of a Pilot for Open Research Data
Horizon 2020: Outline of a Pilot for Open Research Data  Horizon 2020: Outline of a Pilot for Open Research Data
Horizon 2020: Outline of a Pilot for Open Research Data
 
Rebecca Grant - DRI Training Series: 1. Organising Your Collection
Rebecca Grant - DRI Training Series: 1. Organising Your Collection Rebecca Grant - DRI Training Series: 1. Organising Your Collection
Rebecca Grant - DRI Training Series: 1. Organising Your Collection
 

Similar a The Future of Text and Data Mining in Europe

New trends in ontological engineering, practices and tools
New trends in ontological engineering, practices and toolsNew trends in ontological engineering, practices and tools
New trends in ontological engineering, practices and toolsMaría Poveda Villalón
 
WEBINAR: "How to manage your data to make them open and fair"
WEBINAR:  "How to manage your data to make them open and fair"  WEBINAR:  "How to manage your data to make them open and fair"
WEBINAR: "How to manage your data to make them open and fair" OpenAIRE
 
Open Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWOOpen Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWOOpenAccessBelgium
 
FutureTDM: Increasing Uptake of Text and Data Mining in the EU
FutureTDM: Increasing Uptake of Text and Data Mining in the EUFutureTDM: Increasing Uptake of Text and Data Mining in the EU
FutureTDM: Increasing Uptake of Text and Data Mining in the EUBrian Hole
 
Eu policy on open access april 2019 tsoukala
Eu policy on open access april 2019 tsoukalaEu policy on open access april 2019 tsoukala
Eu policy on open access april 2019 tsoukalaVictoria Tsoukala
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresguest0dc425
 
Climate Change and Human Migration
Climate Change and Human MigrationClimate Change and Human Migration
Climate Change and Human Migrationpetermurrayrust
 
Infrastructures for Open, Digital Science
Infrastructures for Open, Digital ScienceInfrastructures for Open, Digital Science
Infrastructures for Open, Digital ScienceCarl-Christian Buhr
 
Workshop Fraunhofer Portugal on Open Science in Horizon 2020
Workshop Fraunhofer Portugal on Open Science in Horizon 2020Workshop Fraunhofer Portugal on Open Science in Horizon 2020
Workshop Fraunhofer Portugal on Open Science in Horizon 2020Pedro Príncipe
 
Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...
Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...
Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...Victoria Tsoukala
 
NordForsk Open Access Reykjavik 14-15/8-2014: H2020
NordForsk Open Access Reykjavik 14-15/8-2014: H2020NordForsk Open Access Reykjavik 14-15/8-2014: H2020
NordForsk Open Access Reykjavik 14-15/8-2014: H2020NordForsk
 
Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...
Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...
Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...Heinz Pampel
 
Online promises beyond the policies: what's under the skin
Online promises beyond the policies: what's under the skin Online promises beyond the policies: what's under the skin
Online promises beyond the policies: what's under the skin Nicolaie Constantinescu
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Peter Löwe
 
Open access: What's in there for me? And some ideas for advocacy programmes
Open access:  What's in there for me?  And some ideas for advocacy programmesOpen access:  What's in there for me?  And some ideas for advocacy programmes
Open access: What's in there for me? And some ideas for advocacy programmesIryna Kuchma
 
e-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE Francee-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE FranceJean-François Lutz
 

Similar a The Future of Text and Data Mining in Europe (20)

Open, Digital Science in Europe
Open, Digital Science in EuropeOpen, Digital Science in Europe
Open, Digital Science in Europe
 
New trends in ontological engineering, practices and tools
New trends in ontological engineering, practices and toolsNew trends in ontological engineering, practices and tools
New trends in ontological engineering, practices and tools
 
WEBINAR: "How to manage your data to make them open and fair"
WEBINAR:  "How to manage your data to make them open and fair"  WEBINAR:  "How to manage your data to make them open and fair"
WEBINAR: "How to manage your data to make them open and fair"
 
Open Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWOOpen Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWO
 
FutureTDM: Increasing Uptake of Text and Data Mining in the EU
FutureTDM: Increasing Uptake of Text and Data Mining in the EUFutureTDM: Increasing Uptake of Text and Data Mining in the EU
FutureTDM: Increasing Uptake of Text and Data Mining in the EU
 
Eu policy on open access april 2019 tsoukala
Eu policy on open access april 2019 tsoukalaEu policy on open access april 2019 tsoukala
Eu policy on open access april 2019 tsoukala
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructures
 
Climate Change and Human Migration
Climate Change and Human MigrationClimate Change and Human Migration
Climate Change and Human Migration
 
Infrastructures for Open, Digital Science
Infrastructures for Open, Digital ScienceInfrastructures for Open, Digital Science
Infrastructures for Open, Digital Science
 
Workshop Fraunhofer Portugal on Open Science in Horizon 2020
Workshop Fraunhofer Portugal on Open Science in Horizon 2020Workshop Fraunhofer Portugal on Open Science in Horizon 2020
Workshop Fraunhofer Portugal on Open Science in Horizon 2020
 
Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...
Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...
Fit for Purpose! Shaping Open Access and Open Science Policies for Horizon Eu...
 
NordForsk Open Access Reykjavik 14-15/8-2014: H2020
NordForsk Open Access Reykjavik 14-15/8-2014: H2020NordForsk Open Access Reykjavik 14-15/8-2014: H2020
NordForsk Open Access Reykjavik 14-15/8-2014: H2020
 
Katarzyna Szkuta: "The European Open Science Cloud and the Open Science Policy"
Katarzyna Szkuta: "The European Open Science Cloud and the Open Science Policy"Katarzyna Szkuta: "The European Open Science Cloud and the Open Science Policy"
Katarzyna Szkuta: "The European Open Science Cloud and the Open Science Policy"
 
Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...
Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...
Pampel & Kindling: Repositorien für Forschungsdaten - Infrastrukturen für die...
 
Online promises beyond the policies: what's under the skin
Online promises beyond the policies: what's under the skin Online promises beyond the policies: what's under the skin
Online promises beyond the policies: what's under the skin
 
Rdaeu russia_fg_1_july2014_final
Rdaeu  russia_fg_1_july2014_finalRdaeu  russia_fg_1_july2014_final
Rdaeu russia_fg_1_july2014_final
 
European Perspectives on Open Science Policy/JC Burgelman
European Perspectives on Open Science Policy/JC BurgelmanEuropean Perspectives on Open Science Policy/JC Burgelman
European Perspectives on Open Science Policy/JC Burgelman
 
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
 
Open access: What's in there for me? And some ideas for advocacy programmes
Open access:  What's in there for me?  And some ideas for advocacy programmesOpen access:  What's in there for me?  And some ideas for advocacy programmes
Open access: What's in there for me? And some ideas for advocacy programmes
 
e-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE Francee-infrastructures supporting open knowledge circulation - OpenAIRE France
e-infrastructures supporting open knowledge circulation - OpenAIRE France
 

Más de openminted_eu

Supporting the uptake of TDM
Supporting the uptake of TDMSupporting the uptake of TDM
Supporting the uptake of TDMopenminted_eu
 
OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017openminted_eu
 
Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...openminted_eu
 
Seamless access to the world's open access research papers via resources sync
Seamless access to the world's open access research papers via resources syncSeamless access to the world's open access research papers via resources sync
Seamless access to the world's open access research papers via resources syncopenminted_eu
 
Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...openminted_eu
 
Legal issues Text and Data Mining
Legal issues Text and Data MiningLegal issues Text and Data Mining
Legal issues Text and Data Miningopenminted_eu
 
Tentative steps in mining UK theses
Tentative steps in mining UK thesesTentative steps in mining UK theses
Tentative steps in mining UK thesesopenminted_eu
 
Jisc Text Mining Capabilities
Jisc Text Mining CapabilitiesJisc Text Mining Capabilities
Jisc Text Mining Capabilitiesopenminted_eu
 
OpenMinTeD - Une infrastructure text-mining au service des scientifiques
OpenMinTeD - Une infrastructure text-mining au service des scientifiquesOpenMinTeD - Une infrastructure text-mining au service des scientifiques
OpenMinTeD - Une infrastructure text-mining au service des scientifiquesopenminted_eu
 
Infrastructure crossroads... and the way we walked them in DKPro
Infrastructure crossroads... and the way we walked them in DKProInfrastructure crossroads... and the way we walked them in DKPro
Infrastructure crossroads... and the way we walked them in DKProopenminted_eu
 
Experiences of Text Mining; the National Library of Austria perspective
Experiences of Text Mining; the National Library of Austria perspectiveExperiences of Text Mining; the National Library of Austria perspective
Experiences of Text Mining; the National Library of Austria perspectiveopenminted_eu
 
Text and Data Mining at the Royal Library in the Netherlands
Text and Data Mining at the Royal Library in the NetherlandsText and Data Mining at the Royal Library in the Netherlands
Text and Data Mining at the Royal Library in the Netherlandsopenminted_eu
 

Más de openminted_eu (12)

Supporting the uptake of TDM
Supporting the uptake of TDMSupporting the uptake of TDM
Supporting the uptake of TDM
 
OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017
 
Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...Resource sync overview and real-world use cases for discovery, harvesting, an...
Resource sync overview and real-world use cases for discovery, harvesting, an...
 
Seamless access to the world's open access research papers via resources sync
Seamless access to the world's open access research papers via resources syncSeamless access to the world's open access research papers via resources sync
Seamless access to the world's open access research papers via resources sync
 
Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...Webinar slides: Interoperability between resources involved in TDM at the lev...
Webinar slides: Interoperability between resources involved in TDM at the lev...
 
Legal issues Text and Data Mining
Legal issues Text and Data MiningLegal issues Text and Data Mining
Legal issues Text and Data Mining
 
Tentative steps in mining UK theses
Tentative steps in mining UK thesesTentative steps in mining UK theses
Tentative steps in mining UK theses
 
Jisc Text Mining Capabilities
Jisc Text Mining CapabilitiesJisc Text Mining Capabilities
Jisc Text Mining Capabilities
 
OpenMinTeD - Une infrastructure text-mining au service des scientifiques
OpenMinTeD - Une infrastructure text-mining au service des scientifiquesOpenMinTeD - Une infrastructure text-mining au service des scientifiques
OpenMinTeD - Une infrastructure text-mining au service des scientifiques
 
Infrastructure crossroads... and the way we walked them in DKPro
Infrastructure crossroads... and the way we walked them in DKProInfrastructure crossroads... and the way we walked them in DKPro
Infrastructure crossroads... and the way we walked them in DKPro
 
Experiences of Text Mining; the National Library of Austria perspective
Experiences of Text Mining; the National Library of Austria perspectiveExperiences of Text Mining; the National Library of Austria perspective
Experiences of Text Mining; the National Library of Austria perspective
 
Text and Data Mining at the Royal Library in the Netherlands
Text and Data Mining at the Royal Library in the NetherlandsText and Data Mining at the Royal Library in the Netherlands
Text and Data Mining at the Royal Library in the Netherlands
 

Último

原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 

Último (20)

原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 

The Future of Text and Data Mining in Europe

  • 1. The Future is All Mine Text and Data Mining Projects in Europe @openminted_eu @futuretdm @openminted_eu @futuretdm Funded by:
  • 3. Text and data mining is the future “Text and data mining (TDM) is the process of deriving information from machine-read material. It works by copying large quantities of material, extracting the data, and recombining it to identify patterns.” JISC Projects funded by @openminted_eu @futuretdm
  • 4. Text and data mining helps us understand the past Mining historical books: the evolution of language Source: http://www.sciencemag.org/content/331/6014/176 (Baylor College of Medicine, Houston) Projects funded by @openminted_eu @futuretdm
  • 5. Text and data mining predicts the future Mining newspapers: Predicts revolutions Source: http://journals.uic.edu/ojs/index.php/fm/article/view/3663/3040 (University of Illinois) Projects funded by @openminted_eu @futuretdm
  • 6. Text and data mining saves the future Mining scientific publications about diseases: Save lives Source: http://dl.acm.org/citation.cfm?id=2623667 (Baylor College of Medicine, Houston) Projects funded by @openminted_eu @futuretdm
  • 7. Text mining – it seems so easy: Linguistic Analysis: Entity Recognition Data Mining Knowledge Discovery Information Extraction STAGE 1 STAGE 2 STAGE 3 STAGE 4 Information Retrieval Projects funded by @openminted_eu @futuretdm
  • 8. But it actually poses many challenges… ? ? ? ? ? ? ? ?? ?? ? ? ?? ? ? How do I make my texts readable by machines? ?Which mining method to use? STAGE 1 STAGE 2 STAGE 3 STAGE 4 Where do I find data? Projects funded by @openminted_eu @futuretdm
  • 9. 9 Current Barriers in Europe Awareness across Institutions & Stakeholders  Lack of awareness among research communities  Lack of guidance to uncover TDM potential Skills and Tools  Availability and accessibility across disciplines  Gap in skills across various sectors Licensing & Open Access  License proliferation and interoperability issues  License barriers to transparent open access Copyright and Data Protection  TDM activities infringing current copyright laws  Legal and policy limitations and barriers for TDM Projects funded by @openminted_eu @futuretdm
  • 10. EU PROJECTS on TDM FutureTDM Identify TDM barriers and policy solutions Open mine Build a TDM eInfrastructure Projects funded by @openminted_eu @futuretdm
  • 11. ELABORATE a legal and policy framework for future TDM and specify a research agenda to foster the spread of TDM BUILD a website: a Collaborative Knowledge Base and an Open Information Hub combined ANALYSE current application areas and best practices in TDM ASSESS existing studies, legal regulations and policies on TDM Main Objectives of FutureTDM INVOLVE all key stakeholders to identify practices, requirements, and specific challenges INCREASE awareness of TDM to attract new target groups and science domains @openminted_eu @futuretdm This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant Agreement No 665940.
  • 12. Bottom-up approach: Stakeholder workshops and knowledge cafes throughout Europe FutureTDM @openminted_eu @futuretdm This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant Agreement No 665940.
  • 13. Data centre Data centre Data centre Data centre in public cloud Publisher text corpus OpenAIRE/CORE text corpus PMC text corpus Other text corpora Other text corpora Other text corpora Other types of text corpora Layer 3: Interoperability to shared storage and computing resources Language resources Language resources Language resources Language resources Layer 2: Interoperability of language resources & corpora Layer 1: Interoperability of text mining services (platforms or components) Language resources and corpora registry service Platform services Registry Workflow ManagementAuth2 & Policy management Annotator Accounting Mining Platforms Mining Platforms Mining Platforms Proprietary architectures Mining Platforms Objective of OpenMinTeD @openminted_eu Projects funded by@futuretdm
  • 14. OpenMinTeD brings together: 14 ACCESSIBLE CONTENT DISCOVERABLE SERVICES EFFICIENT PROCESSING TDM COMMUNITIES VALUE ADDED APPS Via standardised programmatic interfaces and access rules Easily discoverable text mining services and workflows which process, analyse and annotate text Operate on public e-Infrastructures via standarized APIs Different scientific communities have different challenges Community-driven applications to illustrate the value of the infastructure. Engage with industry. OPENMINTED = The Open Mining Infrastructure for Text and Data
  • 15. Become involved Follow us on Twitter for the latest updates and blogs @openminted_eu @futuretdm Follow our websites www.openminted.eu www.futuretdm.eu Projects funded by @openminted_eu @futuretdm
  • 16. THANK YOU • Athena RIC • Univ. of Manchester (NacTem) • Univ. of Darmstadt • INRA • EMBL-EBI • Agro-Know • LIBER • Univ. of Amsterdam • Open University UK • EPFL • CNIO • Univ. of Sheffield (GATE) • GESIS • GRNET • Frontiers • Univ. of Stirling PARTNERS OPENMINTEDPARTNERS FUTURETDM • SYNYO GmbH (SYNYO) • LIBER Europe • Open Knowledge Foundation LBG (OK/CM) • Radboud Univ. Nijmegen • The British Library Board • Univ. of Amsterdam • Athena RIC • Ubiquity Press • Fundacja Projekt: Polska (FPP)