Text and data mining - the opportunities and the EU conundrum - why aren’t we doing more?
1. FutureTDM
Text & Data Mining- the opportunities and the EU conundrum
September 27, 2016,
Brussels, Belgium
Susan Reilly
LIBER, the Association of European
Research Libraries
Horizon 2020
Coordination and Support Action
GARRI-3-2014 Scientific Information in the
Digital Age: Text and Data Mining (TDM)
2. Improve the Uptake of Text & Data
Mining (TDM) in Europe…
Why are we here?
2
3. “the discovery by computer of new, previously
unknown information, by automatically extracting
and relating information from different (…)resources,
to reveal otherwise hidden meanings” (Hearst, 1999)
Why TDM? Why now?
3
ICT
Availability
Comput
ing
Power
Data
16 trillion
gigabytes of data
by 2020 (236%
growth)
Doubles every 2
years (Moores
Law, 1965)
Over 80% EU
citizens have
internet access
(Eurostat 2014)
4. • Addressing grand challenges such as climate change and global
epidemics
• Improving population health, wealth and development
• Creating new jobs and employment
• Exponentially increasing the speed and progress of science through new
insights and greater efficiency of research
• Increasing transparency of governments and their actions
• Fostering innovation and collaboration and boosting the impact of open
science
• Creating tools for education and research
• Providing new and richer cultural insights
• Speeding economic and social development in all parts of the globe
(The Hague Declaration on Knowledge Discovery)
Potential of TDM
4
6. TDM is not a homogeneous, self-contained, scientific
domain, but rather a diverse and complex set of
methods and technologies deployed in the
framework of diverse disciplines and business
activities
The Key Challenge
6
10. Enablers
10
Technical
- Automation of analysis
- Volume of data
Legal
-Data protection
-Regional disparities in copyright law
Skills
-Need to marry discipline specific with technical
-Information literacy
Content
-More diverse data sources
▪ Any kind of outcomes e.g. deliverables, research highlights, KCs, events, publications,
presentations, booths, factsheets, …
11. 11
ELABORATE a legal and policy
framework for future TDM, define
policy priorities, specify a research
agenda to foster the spread of TDM in
various research fields within the EU
BUILD a Collaborative
Knowledge Base and an
Open Information Hub
combined on a web-based
platform including intuitive
tools
ANALYSE current application areas
and trends in TDM including
statistics and key figures, collect
relevant research and industrial
projects and best practices
ASSESS existing studies, legal
regulations and policies on
TDM within the European
Union
Main Objectives of FutureTDM
INVOLVE all key stakeholders
to identify practices,
requirements, and specific
challenges in the field of
TDM
INCREASE awareness of
TDM to attract new
target groups and
science domains
Process
12. 12
Join the Community!
FutureTDM Collaborative Knowledge Base and Open Information Hub
Platform: http://futuretdm.eu/
Expert Blog Best Practice Catalogue
Project Database
Tool Selector
Baseline Reports
Comparative Analysis
FutureTDM Guidelines
Leading Projects Area
(promoting/knowledge exchange - e.g. OpenMinTeD)
Landscape Overview
15. 15
Impact
Remove existing legal, technological and skill barriers that prevent TDM technology
from being adopted within the EU.
Increase awareness about the social, economic and scientific benefits of
TDM.
Increase the Union’s competitiveness with other high-tech
economies (like Japan, South Korea, US) by enhancing TDM adoption.
Foster the adoption of TDM in science and economy.
Lead to Research & Innovation policy that is more relevant and responsive to society
Impact
16. 16
Contents of this presentation are cc-by 4.0 unless otherwise indicated
@skreilly @futureTDM