Presentation of Time Machine at the Dataverse Community Meeting 2019, Harvard University: standards, data management and networked services. https://sched.co/PdxI
Time Machine for the Web
1. Dataverse Community meeting 2019
Harvard University
Time Machine 21.06.2019
Vyacheslav Tykhonov
Frédéric Kaplan
2. Time Machine is …
• An international collaboration to bring
5000 years of European history to life
• Digitising millions of historical
documents, paintings and monuments
• The largest computer simulation
ever developed
• An open access, interactive resource
3. How are we creating it?
The technology used to develop Time Machine
5. Time Machine is
formed by …
• 300+ consortium members from 32 countries
• 95 of Europe's top academic and
research institutions
• Private sector partners from SMEs to
international companies
• Internationally-acclaimed galleries,
libraries, archives and museums
• European institutional bodies
• Civil society and industry associations
6. The Time Machine Organisation
• Leading international organisation
for cooperation in technology,
science and cultural heritage
• The institutional framework ensuring
economic independence as well as
cross-sectoral communication and
partnerships
• An association under Austrian law,
headquartered in Vienna
8. “Our focus is on the joint efforts on Big Data,
artificial intelligence, augmented reality and 3D
and the development of European platforms in
line with European values.”
9. “We will develop tools, forms of analysis and
modelling procedures that combine Big Data
from multiple sources to explain phenomena
that extend over large periods of time, and/or
affect extended regions of populations.”
10. TM Black Box
Data is the basis of any research and therefore
should be managed and curated in a way that
allows any other discipline to connect and get
involved easily.
Dataverse is a perfect candidate to become a
transparent “black box” in the Time Machine.
11. Data Management
• Primary Data (Objects) should be preserved in
the Digital Archive with persistent identifiers
(Trusted Digital Repository)
• Secondary Data will be stored in the research
infrastructure, which keeps data versioning and
provenance information (Dataverse)
• Linked Open Data Cloud (LOD) will provide the
layer of interoperability
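As an illustration of how versioned secondary data can be reached through a persistent identifier, the sketch below builds the request URL for the Dataverse native API call that lists the versions of a dataset. The server address and the DOI are placeholders chosen for this example, not real Time Machine resources, and no request is actually sent.

```python
from urllib.parse import urlencode


def dataset_versions_url(server_url: str, persistent_id: str) -> str:
    """Build the URL that lists all versions of a dataset by its persistent ID."""
    base = server_url.rstrip("/")
    # The persistent identifier is passed as a URL-encoded query parameter.
    query = urlencode({"persistentId": persistent_id})
    return f"{base}/api/datasets/:persistentId/versions?{query}"


# Placeholder server and DOI, for illustration only.
print(dataset_versions_url("https://dataverse.example.org/", "doi:10.5072/FK2/EXAMPLE"))
```

Keeping the persistent identifier as the only key means the same call works regardless of which installation or storage layer holds the files.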
12. Policy
• In most cases, EU countries have their own
data management policies
• The usual requirement is to keep all primary data inside
the country on local servers, but metadata can be
shared with partners from other countries
• A data repository should be able to support the selected
policy and be flexible enough to switch the Storage
layer (Inside/Outside) or Access levels
(Open/Restricted) if the policy changes
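One way to picture such a switchable policy is as a small configuration record. The following is a minimal sketch under assumptions: the class names, field names and the `may_replicate_abroad` helper are hypothetical and do not correspond to any Dataverse setting.

```python
from dataclasses import dataclass, replace
from enum import Enum


class StorageLayer(Enum):
    INSIDE = "inside"    # primary data stays on servers within the country
    OUTSIDE = "outside"  # primary data may be stored abroad


class AccessLevel(Enum):
    OPEN = "open"
    RESTRICTED = "restricted"


@dataclass(frozen=True)
class DataPolicy:
    """Hypothetical per-country policy record."""
    country: str
    storage: StorageLayer
    access: AccessLevel
    share_metadata: bool = True  # metadata can usually be shared with partners


def may_replicate_abroad(policy: DataPolicy) -> bool:
    """Primary data may leave the country only if outside storage is allowed."""
    return policy.storage is StorageLayer.OUTSIDE


policy = DataPolicy("NL", StorageLayer.INSIDE, AccessLevel.RESTRICTED)
# A policy change is modelled as a new record; the old one is left untouched.
updated = replace(policy, storage=StorageLayer.OUTSIDE, access=AccessLevel.OPEN)
print(may_replicate_abroad(policy), may_replicate_abroad(updated))
```

Treating the policy as immutable data keeps a change of Storage layer or Access level explicit and easy to audit.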
13. Standards
All tools supported by Time Machine must have the highest
level of maturity to be accepted as networked services.
Interoperability and sustainability of data services are key
problems and should be managed by the Time Machine
transparent “black box” operating in the distributed
network.
Problem: the TM consortium should agree on all
standards that will be supported by data repositories and
accepted by TM partners.
14. Networked Services
• Dataverse as a TM Shared Service
• data preview and visualizations:
2D/3D/4D, maps, text, spreadsheet/CSV, PDF, HTML, images, video,
audio, JSON, XML, DDI, …
• API endpoints with external controlled vocabularies
• Linked Open Data Cloud with SPARQL/GraphQL
• data processing, federated and migration services
(CLARIAH as a service, …)
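To make the SPARQL access point more concrete, here is a minimal sketch of preparing a query request in the style of the SPARQL 1.1 Protocol (query via HTTP GET). The endpoint URL is a placeholder assumed for this example, the query is a generic one, and the request is only constructed, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request


def sparql_select_request(endpoint: str, query: str) -> Request:
    """Prepare a GET request carrying the query in the 'query' parameter."""
    params = urlencode({"query": query})
    return Request(
        f"{endpoint}?{params}",
        # Ask for the standard JSON serialisation of SPARQL results.
        headers={"Accept": "application/sparql-results+json"},
    )


# Generic query listing a few triples; the endpoint is a placeholder.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 10"
req = sparql_select_request("https://lod.example.org/sparql", QUERY)
print(req.full_url)
```

The prepared request could then be passed to `urllib.request.urlopen` against a real endpoint.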