SlideShare una empresa de Scribd logo
1 de 19
Toward universal information
access on the digital object cloud
Kei Kurakawa
National Institute of Informatics
orcid.org/0000-0002-7031-1846
1
Presentation slide for this:
Kei Kurakawa, Toward universal information access on the digital object cloud, In book of abstracts of International Workshop on Data Science -
Present & Future of Open Data & Open Science -, p.57-59, November 12-15, 2018, Mishima Citizens Cultural Hall & Joint Support-Center for Data
Science Research, Mishima, Shizuoka, Japan
Research Data Alliance
• Founded in 2013
• Motto
– Research data sharing without barriers
• Participants
– Domain scientist, Information specialist, disciplinary data manager, curator, engineer, librarian,
software engineer, policy maker, etc.
• Plenaries
– RDA1 Gothenburg, Sweden, March 2013
– RDA2 Washington DC, US, September 2013
– RDA3 Dublin, Ireland, March 2014
– RDA4 Amsterdam, the Netherlands, September 2014
– *RDA5 San Diego, US, March 2015
– *RDA6 Paris, France, September 2015
– *RDA7 Tokyo, Japan, March 2016
– *RDA8 Denver, US, September 2016
– *RDA9 Barcelona, Spain, April 2017
– RDA10 Montreal, Canada, September 2017
– *RDA11 Berlin, Germany, March 2018
– RDA12 Gaborone, Botswana, November 2018
2
* indicates that I participated in it.
Outline
• Chronological overview of universal
information access
• Professional data community
• Digital object cloud (DOC) and linked data (LD)
• Persistent identifiers (PID)
• PID centric approach data management and
access
• Conclusions and future work
3
Universal information access
• The quest for universal information access in networks began around 1960
and over the years yielded a set of principles to fully support universal
information access. [Denning and Kahn, 2010]
– Memex (Vannevar Bush, “As we may think” Atlantic Monthly, 1945)
• The first visionary speculation. It stored documents on microfilm and allowed
annotations and cross links.
– Xanadu (Ted Nelson, early 1960s)
• It introduced topics such as hypertext, hyperlinks, automatic version management,
automatic inclusion of referenced items, and small payments to authors for use of their
materials.
– NLS (Doug Engelbart, middle 1960s)
• It is the first working hypertext system with graphical user interface, mouse, and
collaboration tools.
– World Wide Web (Tim Berners-Lee, late 1980s)
• It is a potential means to Implement Bush’s, Nelson’s and Engelbart’s ideas of knowledge
representation in the Internet
– Digital Object Architecture (DOA) (CNRI, late 1980s) [Kahn and Wilensky, 1995]
• It shows key principles on information access, culled out and unified from digital library
projects, in a network environment (the Internet)
Denning, P. J., & Kahn, R. E. (2010). The long quest for universal information access. Communications of the ACM, 53(12), 34. http://doi.org/10.1145/1859204.1859218
4
Information scienceComputer science
RDA as a professional data community
Library
Knowledge 5
Founded in 2013
Premises in the professional data
community
• Computational data format should not be
complicated, ever lasting, and independent on
computer technology changes.
• Data scheme and data attributes are
complicated at some professional levels.
• Only the professionals can deal with data
processing and management.
• Of course, the professionals have good
knowledge of the domain.
6
Domain knowledge
Minimal
computational
complexity of
the data format
Theory of the domain
Knowledge of the experimental settings
What makes data valuable
7
Practices in the professional data
community
8
Data Fabric IG, Group details, https://www.rd-alliance.org/group/data-fabric-ig.html
The data cycle is based on the multi-disciplinary survey on the nature, the creation and the
usage of Persistent Identifiers (PIDs)
Peter Wittenburg, Margareta Hellström, and Carlo-Maria Zwölf (eds.), Hossein Abroshan, Ari Asmi, Giuseppe Di Bernardo, Danielle Couvreur, Tamas
Gaizer, Petr Holub, Rob Hooft, Ingemar Häggström, Manfred Kohler, Dimitris Koureas, Wolfgang Kuchinke, Luciano Milanesi, Joseph Padfield, Antonio
Rosato, Christine Staiger, Dieter van Uytvanck and Tobias Weigel (2017): Persistent identifiers: Consolidated assertions. DOI: 10.15497/RDA00027.
Digital Object Architecture (DOA)
[Kahn and Wilensky, 1995]
• Digital object (DO)
– Any unit of information represented in digital form may be structured as a digital object within
the Internet.
– The structure of a DO, including metadata, is machine and platform independent.
• A unique, persistent identifier (called a “handle”)
– Every DO has a unique identifier that can distinguish a DO from every other object, present,
past, or future.
• Handle System
– The “resolution” system maps handles to state information that includes location,
authentication, rights specifications, allowed operations, and object attributes.
• DO repositories
– DOs can be stored in DO Repositories, which are searchable systems.
– Accesses to an instance of DO Repository are made via a standard DO protocol (DOP) that
restrict actions to those.
• DO registries
– They allow users to reference, federate, and otherwise manage collections across multiple
repositories and allow for full access control.
9
Kahn, R. E. and Wilensky, R. A framework for distributed digital object services. International Journal on Digital Libraries 6, 2 (2006). DOI: 10.1007/s00799-
005-0128-x. (First made available on the Internet in 1995 and reprinted in 2006 as part of a collection of seminal papers on digital libraries).
Digital object cloud
10
Larry Lannom, Peter Wittenburg, Global Digital Object Cloud (DOC) - A Guiding Vision, 11 September 2016,
http://hdl.handle.net/11304/a8877a1a-9010-428f-b2ce-5863cec4aff3
Linked data
11
Linked Data - Connect Distributed Data across the Web
http://linkeddata.org https://www.w3.org/2007/03/layerCake.png
Semantic technology layer cake
Mixture of DOC and LD on the Internet
information space
Persistent identifiers is a key to bridge the gap between
the digital object cloud (DOC) and the linked data (LD)
Digital object cloud
Linked data
The Internet
A node represents a resource with a persistent identifier.
12
Varieties of academic persistent
identifiers and management systems
13
Handle System
ORCID
DOI
Digital object (DO), Research data, Research sample, Research instrument?,
Concept, Taxonomy?, Classification
Researcher
Organization
CrossRef, DataCite, etc
Grant
Federated
Identity
Management
eduPersonOrgOrcid eduPersonOrgDN
ISNI?
GRID?
Ringgold?
PID (persistent identifiers) entity type
Research resource
ARKPURL
URI / URN
Meta-resolver / Handle, DOI, ARK, PURL resolver
Publisher articles / figures,
Data citation,
IGSN, etc
OrgID?、Project?
ISBN,
LSID,
ChEBI,
Perma.cc,
etc
ePIC,
etc
CrossRef Funder?
Data consumer scenario
14
Data discovery
&
Automatic data processing
Dynamic data citation
Data fabric
PID (Persistent Identifier)
Data typing
Data versioning
Data provenance
Data collection
Data trustworthy
On the Global Digital Object Cloud
Google dataset search (Beta)
15
It was released on 2018-09-05.
Data discovery paradigm IG of RDA discussed with Dr. Natasha Noy from Research at
Google in Nov of 2017.
https://www.blog.google/products/search/making-it-easier-discover-datasets/
http://g.co/datasetsearch
Data providers are expected to prepare a descriptive metadata of
schema.org for the site to be discoverable.
PID centric approach to data
management and access
16
Data type registries Kernel information on Local Handle Service
Broeder, D., & Lannom, L. (2014). Data Type Registries: A Research Data
Alliance Working Group. D-Lib Magazine, 20(1/2).
http://doi.org/10.1045/january2014-broeder
Tobias Weigel, Beth Plale, Mark Parsons, Gabriel Zhou, Yu Luo, Ulrich
Schwardmann, Robert Quick, Margareta Hellström, Kei Kurakawa, “RDA
Recommendation on PID Kernel Information (Draft)”, https://www.rd-
alliance.org/sites/default/files/RDA%20Recommendation%20on%20PID%20K
ernel%20Information.pdf
Data providing with data types
• In parallel, we need data typing.
• Data providing maturity levels
– Level 1
• Data providers build their data in a community standard
– The data is packed in a commonly used format, i.e. XML, JSON, netCDF, CSV as
well as application dependent such as Microsoft EXCEL format.
– Some data are shipped with a document describing data meaning, data types,
and data format.
– Level 2
• Data providers use more complicated data format to assert data types
– A set of Handle server of DOI objects with Kernel Information profile and Data
Type Registry is a recommended candidate for a variety of domain community
to assert their data types in addition to their data sources in a community
standard format.
– On the other hand, linked data community uses RDF/XML, JSON-LD and other
linked data formats, or a kind of mixture format of data type and value.
– Common vocabularies are provided in a public server, e.g. schema.org.
17
On the Digital Object Cloud
• Kernel information connects data and data types.
• We need to handle with a graph structure of the data.
18
Attribute augmented graph
Data layer
Data type layer
Kernel Information metadata layer
Kei Kurakawa, Takayuki Sekiya, Yasumasa Baba, Making data typing efforts or automatically detecting data types
for automatic data processing?, Research Data Alliance 11th Plenary Meeting, Berlin, Germany, 2018.03.21-23
https://www.rd-alliance.org/sites/default/files/rda11_poster_20180321_kurakawa.pdf
Conclusions and future work
• Two types of accessing data on the Internet
– Digital object cloud (DOC)
– Linked data (LD)
• PID centric approach data management and access
– Data consumer scenario
• Data search
• Automatic data processing
• We need more functional research on
– data typing,
– handling with a graph structure of data resources,
– case studies
– on the digital object cloud.
19

Más contenido relacionado

La actualidad más candente

Digital library and metadata
Digital library and metadataDigital library and metadata
Digital library and metadataramncsi
 
Towards FAIR Open Science with PID Kernel Information: RPID Testbed
Towards FAIR Open Science with PID Kernel Information: RPID TestbedTowards FAIR Open Science with PID Kernel Information: RPID Testbed
Towards FAIR Open Science with PID Kernel Information: RPID TestbedBeth Plale
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Taggingpauloshea
 
Digital Library Initiatives in India : An Overview
Digital Library Initiatives in  India : An OverviewDigital Library Initiatives in  India : An Overview
Digital Library Initiatives in India : An OverviewManoj Kumar Sinha
 
Digital Curation in Libraries: An innovative way of content preservation and...
Digital Curation in Libraries:  An innovative way of content preservation and...Digital Curation in Libraries:  An innovative way of content preservation and...
Digital Curation in Libraries: An innovative way of content preservation and...Bhojaraju Gunjal
 
Metadata approaches for digital presentation
Metadata approaches for digital presentationMetadata approaches for digital presentation
Metadata approaches for digital presentationMichael Day
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curationMichael Day
 
Bahan digital library
Bahan digital libraryBahan digital library
Bahan digital libraryMimi Ahmad
 
Qatar Digital Library Project Workshop
Qatar Digital Library Project WorkshopQatar Digital Library Project Workshop
Qatar Digital Library Project WorkshopAsad Nafees
 
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
3LD: Towards high quality, industry-ready Linguistic Linked Licensed DataDaniel Vila Suero
 

La actualidad más candente (17)

Digital library and metadata
Digital library and metadataDigital library and metadata
Digital library and metadata
 
General concepts: DDI
General concepts: DDIGeneral concepts: DDI
General concepts: DDI
 
Towards FAIR Open Science with PID Kernel Information: RPID Testbed
Towards FAIR Open Science with PID Kernel Information: RPID TestbedTowards FAIR Open Science with PID Kernel Information: RPID Testbed
Towards FAIR Open Science with PID Kernel Information: RPID Testbed
 
Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015Digital Curation Technology: JHU Summit, October 2015
Digital Curation Technology: JHU Summit, October 2015
 
2013.05 - LDOW 2013 @ WWW 2013
2013.05 - LDOW 2013 @ WWW 20132013.05 - LDOW 2013 @ WWW 2013
2013.05 - LDOW 2013 @ WWW 2013
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Tagging
 
Digital Library Initiatives in India : An Overview
Digital Library Initiatives in  India : An OverviewDigital Library Initiatives in  India : An Overview
Digital Library Initiatives in India : An Overview
 
Digital Curation in Libraries: An innovative way of content preservation and...
Digital Curation in Libraries:  An innovative way of content preservation and...Digital Curation in Libraries:  An innovative way of content preservation and...
Digital Curation in Libraries: An innovative way of content preservation and...
 
Torsten Reimer
Torsten ReimerTorsten Reimer
Torsten Reimer
 
Metadata approaches for digital presentation
Metadata approaches for digital presentationMetadata approaches for digital presentation
Metadata approaches for digital presentation
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curation
 
Hci
HciHci
Hci
 
Bahan digital library
Bahan digital libraryBahan digital library
Bahan digital library
 
Dlindia
DlindiaDlindia
Dlindia
 
Qatar Digital Library Project Workshop
Qatar Digital Library Project WorkshopQatar Digital Library Project Workshop
Qatar Digital Library Project Workshop
 
Digital Library UNIT-3
Digital Library UNIT-3Digital Library UNIT-3
Digital Library UNIT-3
 
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
3LD: Towards high quality, industry-ready Linguistic Linked Licensed Data
 

Similar a Toward universal information access on the digital object cloud

Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs vty
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things PayamBarnaghi
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...OpenAIRE
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...Edward Curry
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Projectvty
 
Sands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked KnowledgeSands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked Knowledgesandsfish
 
EuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEnno Meijers
 
FAIR data: LOUD for all audiences
FAIR data: LOUD for all audiencesFAIR data: LOUD for all audiences
FAIR data: LOUD for all audiencesAlessandro Adamou
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...aceas13tern
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph IntroductionSören Auer
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaEnno Meijers
 
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...Eric Stephan
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentationekansa
 
lodlam summit session browsable linked data
lodlam summit session browsable linked datalodlam summit session browsable linked data
lodlam summit session browsable linked dataEnno Meijers
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAResearch Data Alliance
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked dataLaura Po
 

Similar a Toward universal information access on the digital object cloud (20)

Aggregation as tactic sm new
Aggregation as tactic sm newAggregation as tactic sm new
Aggregation as tactic sm new
 
Aggregation as Tactic
Aggregation as TacticAggregation as Tactic
Aggregation as Tactic
 
Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Project
 
Sands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked KnowledgeSands Fish - Knowing in the Age of Networked Knowledge
Sands Fish - Knowing in the Age of Networked Knowledge
 
EuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage informationEuropeanaTech 2018: A distributed network of digital heritage information
EuropeanaTech 2018: A distributed network of digital heritage information
 
FAIR data: LOUD for all audiences
FAIR data: LOUD for all audiencesFAIR data: LOUD for all audiences
FAIR data: LOUD for all audiences
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
Linked Data to Improve the OER Experience
Linked Data to Improve the OER ExperienceLinked Data to Improve the OER Experience
Linked Data to Improve the OER Experience
 
A distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL IndiaA distributed network of digital heritage information - Unesco/NDL India
A distributed network of digital heritage information - Unesco/NDL India
 
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
A Linked Fusion of Things, Services, and Data to Support a Collaborative Data...
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentation
 
lodlam summit session browsable linked data
lodlam summit session browsable linked datalodlam summit session browsable linked data
lodlam summit session browsable linked data
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDA
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Introduction to linked data
Introduction to linked dataIntroduction to linked data
Introduction to linked data
 

Más de National Institute of Informatics

Application of a Novel Subject Classification Scheme for a Bibliographic Data...
Application of a Novel Subject Classification Scheme for a Bibliographic Data...Application of a Novel Subject Classification Scheme for a Bibliographic Data...
Application of a Novel Subject Classification Scheme for a Bibliographic Data...National Institute of Informatics
 
Applying a new subject classification scheme for a database by a data-driven ...
Applying a new subject classification scheme for a database by a data-driven ...Applying a new subject classification scheme for a database by a data-driven ...
Applying a new subject classification scheme for a database by a data-driven ...National Institute of Informatics
 
Making data typing efforts or automatically detecting data types for automat...
Making data typing efforts or automatically detecting data types  for automat...Making data typing efforts or automatically detecting data types  for automat...
Making data typing efforts or automatically detecting data types for automat...National Institute of Informatics
 
Applying tensor decompositions to author name disambiguation of common Japane...
Applying tensor decompositions to author name disambiguation of common Japane...Applying tensor decompositions to author name disambiguation of common Japane...
Applying tensor decompositions to author name disambiguation of common Japane...National Institute of Informatics
 
Emerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksEmerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksNational Institute of Informatics
 
テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較
テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較
テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較National Institute of Informatics
 
離散一般化ベータ分布を仮定した研究分野マッピングの導出
離散一般化ベータ分布を仮定した研究分野マッピングの導出離散一般化ベータ分布を仮定した研究分野マッピングの導出
離散一般化ベータ分布を仮定した研究分野マッピングの導出National Institute of Informatics
 
レコードリンケージに基づく科研費分野-WoS分野マッピングの導出
レコードリンケージに基づく科研費分野-WoS分野マッピングの導出レコードリンケージに基づく科研費分野-WoS分野マッピングの導出
レコードリンケージに基づく科研費分野-WoS分野マッピングの導出National Institute of Informatics
 
レコードリンケージに基づく科研費分野-WoS分野マッピング
レコードリンケージに基づく科研費分野-WoS分野マッピングレコードリンケージに基づく科研費分野-WoS分野マッピング
レコードリンケージに基づく科研費分野-WoS分野マッピングNational Institute of Informatics
 
科研費分野-トピック分類マトリックスへの主成分分析の適用
科研費分野-トピック分類マトリックスへの主成分分析の適用科研費分野-トピック分類マトリックスへの主成分分析の適用
科研費分野-トピック分類マトリックスへの主成分分析の適用National Institute of Informatics
 
学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -
学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -
学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -National Institute of Informatics
 
機械学習を用いたWeb上の産学連携関連文書の抽出
機械学習を用いたWeb上の産学連携関連文書の抽出機械学習を用いたWeb上の産学連携関連文書の抽出
機械学習を用いたWeb上の産学連携関連文書の抽出National Institute of Informatics
 
科研費データベースの分野分類とトピック分類の比較分析
科研費データベースの分野分類とトピック分類の比較分析科研費データベースの分野分類とトピック分類の比較分析
科研費データベースの分野分類とトピック分類の比較分析National Institute of Informatics
 
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...National Institute of Informatics
 
Researcher Identifiers and National Federated Search Portal for Japanese Inst...
Researcher Identifiers and National Federated Search Portal for Japanese Inst...Researcher Identifiers and National Federated Search Portal for Japanese Inst...
Researcher Identifiers and National Federated Search Portal for Japanese Inst...National Institute of Informatics
 
著者の同定・識別について- JAIRO著者名検索プロジェクトへ -
著者の同定・識別について- JAIRO著者名検索プロジェクトへ -著者の同定・識別について- JAIRO著者名検索プロジェクトへ -
著者の同定・識別について- JAIRO著者名検索プロジェクトへ -National Institute of Informatics
 
1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張
1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張
1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張National Institute of Informatics
 
なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~
なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~
なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~National Institute of Informatics
 
ORCIDのプロトタイプシステムと著者ID関連技術の動向
ORCIDのプロトタイプシステムと著者ID関連技術の動向ORCIDのプロトタイプシステムと著者ID関連技術の動向
ORCIDのプロトタイプシステムと著者ID関連技術の動向National Institute of Informatics
 

Más de National Institute of Informatics (20)

Application of a Novel Subject Classification Scheme for a Bibliographic Data...
Application of a Novel Subject Classification Scheme for a Bibliographic Data...Application of a Novel Subject Classification Scheme for a Bibliographic Data...
Application of a Novel Subject Classification Scheme for a Bibliographic Data...
 
Applying a new subject classification scheme for a database by a data-driven ...
Applying a new subject classification scheme for a database by a data-driven ...Applying a new subject classification scheme for a database by a data-driven ...
Applying a new subject classification scheme for a database by a data-driven ...
 
Making data typing efforts or automatically detecting data types for automat...
Making data typing efforts or automatically detecting data types  for automat...Making data typing efforts or automatically detecting data types  for automat...
Making data typing efforts or automatically detecting data types for automat...
 
Applying tensor decompositions to author name disambiguation of common Japane...
Applying tensor decompositions to author name disambiguation of common Japane...Applying tensor decompositions to author name disambiguation of common Japane...
Applying tensor decompositions to author name disambiguation of common Japane...
 
Emerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksEmerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networks
 
テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較
テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較
テンソル分解の著者名寄せへの応用と潜在変数を持つモデルとの比較
 
研究者識別子の重要性とORCIDアップデート
研究者識別子の重要性とORCIDアップデート研究者識別子の重要性とORCIDアップデート
研究者識別子の重要性とORCIDアップデート
 
離散一般化ベータ分布を仮定した研究分野マッピングの導出
離散一般化ベータ分布を仮定した研究分野マッピングの導出離散一般化ベータ分布を仮定した研究分野マッピングの導出
離散一般化ベータ分布を仮定した研究分野マッピングの導出
 
レコードリンケージに基づく科研費分野-WoS分野マッピングの導出
レコードリンケージに基づく科研費分野-WoS分野マッピングの導出レコードリンケージに基づく科研費分野-WoS分野マッピングの導出
レコードリンケージに基づく科研費分野-WoS分野マッピングの導出
 
レコードリンケージに基づく科研費分野-WoS分野マッピング
レコードリンケージに基づく科研費分野-WoS分野マッピングレコードリンケージに基づく科研費分野-WoS分野マッピング
レコードリンケージに基づく科研費分野-WoS分野マッピング
 
科研費分野-トピック分類マトリックスへの主成分分析の適用
科研費分野-トピック分類マトリックスへの主成分分析の適用科研費分野-トピック分類マトリックスへの主成分分析の適用
科研費分野-トピック分類マトリックスへの主成分分析の適用
 
学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -
学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -
学術情報流通のための識別子とメタデータDBを対象とした融合研究シーズ探索 - 超高層物理学分野における観測データを例として -
 
機械学習を用いたWeb上の産学連携関連文書の抽出
機械学習を用いたWeb上の産学連携関連文書の抽出機械学習を用いたWeb上の産学連携関連文書の抽出
機械学習を用いたWeb上の産学連携関連文書の抽出
 
科研費データベースの分野分類とトピック分類の比較分析
科研費データベースの分野分類とトピック分類の比較分析科研費データベースの分野分類とトピック分類の比較分析
科研費データベースの分野分類とトピック分類の比較分析
 
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
A SVM Applied Text Categorization of Academia-Industry Collaborative Research...
 
Researcher Identifiers and National Federated Search Portal for Japanese Inst...
Researcher Identifiers and National Federated Search Portal for Japanese Inst...Researcher Identifiers and National Federated Search Portal for Japanese Inst...
Researcher Identifiers and National Federated Search Portal for Japanese Inst...
 
著者の同定・識別について- JAIRO著者名検索プロジェクトへ -
著者の同定・識別について- JAIRO著者名検索プロジェクトへ -著者の同定・識別について- JAIRO著者名検索プロジェクトへ -
著者の同定・識別について- JAIRO著者名検索プロジェクトへ -
 
1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張
1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張
1.研究者リゾルバーとJAIRO著者名検索、2.KAKENデータベースの機能拡張
 
なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~
なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~
なぜ研究者の名寄せが必要か ~ 世界の動向と研究者リゾルバー ~
 
ORCIDのプロトタイプシステムと著者ID関連技術の動向
ORCIDのプロトタイプシステムと著者ID関連技術の動向ORCIDのプロトタイプシステムと著者ID関連技術の動向
ORCIDのプロトタイプシステムと著者ID関連技術の動向
 

Último

THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONjhunlian
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdfHafizMudaserAhmad
 
Risk Management in Engineering Construction Project
Risk Management in Engineering Construction ProjectRisk Management in Engineering Construction Project
Risk Management in Engineering Construction ProjectErbil Polytechnic University
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxsiddharthjain2303
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfRajuKanojiya4
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
"Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ..."Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ...Erbil Polytechnic University
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionMebane Rash
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Erbil Polytechnic University
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingBootNeck1
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - GuideGOPINATHS437943
 
Configuration of IoT devices - Systems managament
Configuration of IoT devices - Systems managamentConfiguration of IoT devices - Systems managament
Configuration of IoT devices - Systems managamentBharaniDharan195623
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgsaravananr517913
 

Último (20)

THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf
 
Designing pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptxDesigning pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptx
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Risk Management in Engineering Construction Project
Risk Management in Engineering Construction ProjectRisk Management in Engineering Construction Project
Risk Management in Engineering Construction Project
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptx
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdf
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
"Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ..."Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ...
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
US Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of ActionUS Department of Education FAFSA Week of Action
US Department of Education FAFSA Week of Action
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event Scheduling
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - Guide
 
Configuration of IoT devices - Systems managament
Configuration of IoT devices - Systems managamentConfiguration of IoT devices - Systems managament
Configuration of IoT devices - Systems managament
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
 

Toward universal information access on the digital object cloud

  • 1. Toward universal information access on the digital object cloud Kei Kurakawa National Institute of Informatics orcid.org/0000-0002-7031-1846 1 Presentation slide for this: Kei Kurakawa, Toward universal information access on the digital object cloud, In book of abstracts of International Workshop on Data Science - Present & Future of Open Data & Open Science -, p.57-59, November 12-15, 2018, Mishima Citizens Cultural Hall & Joint Support-Center for Data Science Research, Mishima, Shizuoka, Japan
  • 2. Research Data Alliance • Founded in 2013 • Motto – Research data sharing without barriers • Participants – Domain scientist, Information specialist, disciplinary data manager, curator, engineer, librarian, software engineer, policy maker, etc. • Plenaries – RDA1 Gothenburg, Sweden, March 2013 – RDA2 Washington DC, US, September 2013 – RDA3 Dublin, Ireland, March 2014 – RDA4 Amsterdam, the Netherlands, September 2014 – *RDA5 San Diego, US, March 2015 – *RDA6 Paris, France, September 2015 – *RDA7 Tokyo, Japan, March 2016 – *RDA8 Denver, US, September 2016 – *RDA9 Barcelona, Spain, April 2017 – RDA10 Montreal, Canada, September 2017 – *RDA11 Berlin, Germany, March 2018 – RDA12 Gaborone, Botswana, November 2018 2 * indicates that I participated in it.
  • 3. Outline • Chronological overview of universal information access • Professional data community • Digital object cloud (DOC) and linked data (LD) • Persistent identifiers (PID) • PID centric approach data management and access • Conclusions and future work 3
  • 4. Universal information access • The quest for universal information access in networks began around 1960 and over the years yielded a set of principles to fully support universal information access. [Denning and Kahn, 2010] – Memex (Vannevar Bush, “As we may think” Atlantic Monthly, 1945) • The first visionary speculation. It stored documents on microfilm and allowed annotations and cross links. – Xanadu (Ted Nelson, early 1960s) • It introduced topics such as hypertext, hyperlinks, automatic version management, automatic inclusion of referenced items, and small payments to authors for use of their materials. – NLS (Doug Engelbart, middle 1960s) • It is the first working hypertext system with graphical user interface, mouse, and collaboration tools. – World Wide Web (Tim Berners-Lee, late 1980s) • It is a potential means to Implement Bush’s, Nelson’s and Engelbart’s ideas of knowledge representation in the Internet – Digital Object Architecture (DOA) (CNRI, late 1980s) [Kahn and Wilensky, 1995] • It shows key principles on information access, culled out and unified from digital library projects, in a network environment (the Internet) Denning, P. J., & Kahn, R. E. (2010). The long quest for universal information access. Communications of the ACM, 53(12), 34. http://doi.org/10.1145/1859204.1859218 4
  • 5. Information scienceComputer science RDA as a professional data community Library Knowledge 5 Founded in 2013
  • 6. Premises in the professional data community • Computational data format should not be complicated, ever lasting, and independent on computer technology changes. • Data scheme and data attributes are complicated at some professional levels. • Only the professionals can deal with data processing and management. • Of course, the professionals have good knowledge of the domain. 6
  • 7. Domain knowledge Minimal computational complexity of the data format Theory of the domain Knowledge of the experimental settings What makes data valuable 7
  • 8. Practices in the professional data community 8 Data Fabric IG, Group details, https://www.rd-alliance.org/group/data-fabric-ig.html The data cycle is based on the multi-disciplinary survey on the nature, the creation and the usage of Persistent Identifiers (PIDs) Peter Wittenburg, Margareta Hellström, and Carlo-Maria Zwölf (eds.), Hossein Abroshan, Ari Asmi, Giuseppe Di Bernardo, Danielle Couvreur, Tamas Gaizer, Petr Holub, Rob Hooft, Ingemar Häggström, Manfred Kohler, Dimitris Koureas, Wolfgang Kuchinke, Luciano Milanesi, Joseph Padfield, Antonio Rosato, Christine Staiger, Dieter van Uytvanck and Tobias Weigel (2017): Persistent identifiers: Consolidated assertions. DOI: 10.15497/RDA00027.
  • 9. Digital Object Architecture (DOA) [Kahn and Wilensky, 1995] • Digital object (DO) – Any unit of information represented in digital form may be structured as a digital object within the Internet. – The structure of a DO, including metadata, is machine and platform independent. • A unique, persistent identifier (called a “handle”) – Every DO has a unique identifier that can distinguish a DO from every other object, present, past, or future. • Handle System – The “resolution” system maps handles to state information that includes location, authentication, rights specifications, allowed operations, and object attributes. • DO repositories – DOs can be stored in DO Repositories, which are searchable systems. – Accesses to an instance of DO Repository are made via a standard DO protocol (DOP) that restrict actions to those. • DO registries – They allow users to reference, federate, and otherwise manage collections across multiple repositories and allow for full access control. 9 Kahn, R. E. and Wilensky, R. A framework for distributed digital object services. International Journal on Digital Libraries 6, 2 (2006). DOI: 10.1007/s00799- 005-0128-x. (First made available on the Internet in 1995 and reprinted in 2006 as part of a collection of seminal papers on digital libraries).
  • 10. Digital object cloud 10 Larry Lannom, Peter Wittenburg, Global Digital Object Cloud (DOC) - A Guiding Vision, 11 September 2016, http://hdl.handle.net/11304/a8877a1a-9010-428f-b2ce-5863cec4aff3
  • 11. Linked data 11 Linked Data - Connect Distributed Data across the Web http://linkeddata.org https://www.w3.org/2007/03/layerCake.png Semantic technology layer cake
  • 12. Mixture of DOC and LD on the Internet information space Persistent identifiers is a key to bridge the gap between the digital object cloud (DOC) and the linked data (LD) Digital object cloud Linked data The Internet A node represents a resource with a persistent identifier. 12
  • 13. Varieties of academic persistent identifiers and management systems 13 Handle System ORCID DOI Digital object (DO), Research data, Research sample, Research instrument?, Concept, Taxonomy?, Classification Researcher Organization CrossRef, DataCite, etc Grant Federated Identity Management eduPersonOrgOrcid eduPersonOrgDN ISNI? GRID? Ringgold? PID (persistent identifiers) entity type Research resource ARKPURL URI / URN Meta-resolver / Handle, DOI, ARK, PURL resolver Publisher articles / figures, Data citation, IGSN, etc OrgID?、Project? ISBN, LSID, ChEBI, Perma.cc, etc ePIC, etc CrossRef Funder?
  • 14. Data consumer scenario 14 Data discovery & Automatic data processing Dynamic data citation Data fabric PID (Persistent Identifier) Data typing Data versioning Data provenance Data collection Data trustworthy On the Global Digital Object Cloud
  • 15. Google dataset search (Beta) 15 It was released on 2018-09-05. Data discovery paradigm IG of RDA discussed with Dr. Natasha Noy from Research at Google in Nov of 2017. https://www.blog.google/products/search/making-it-easier-discover-datasets/ http://g.co/datasetsearch Data providers are expected to prepare a descriptive metadata of schema.org for the site to be discoverable.
  • 16. PID centric approach to data management and access 16 Data type registries Kernel information on Local Handle Service Broeder, D., & Lannom, L. (2014). Data Type Registries: A Research Data Alliance Working Group. D-Lib Magazine, 20(1/2). http://doi.org/10.1045/january2014-broeder Tobias Weigel, Beth Plale, Mark Parsons, Gabriel Zhou, Yu Luo, Ulrich Schwardmann, Robert Quick, Margareta Hellström, Kei Kurakawa, “RDA Recommendation on PID Kernel Information (Draft)”, https://www.rd- alliance.org/sites/default/files/RDA%20Recommendation%20on%20PID%20K ernel%20Information.pdf
  • 17. Data providing with data types • In parallel, we need data typing. • Data providing maturity levels – Level 1 • Data providers build their data in a community standard – The data is packed in a commonly used format, i.e. XML, JSON, netCDF, CSV as well as application dependent such as Microsoft EXCEL format. – Some data are shipped with a document describing data meaning, data types, and data format. – Level 2 • Data providers use more complicated data format to assert data types – A set of Handle server of DOI objects with Kernel Information profile and Data Type Registry is a recommended candidate for a variety of domain community to assert their data types in addition to their data sources in a community standard format. – On the other hand, linked data community uses RDF/XML, JSON-LD and other linked data formats, or a kind of mixture format of data type and value. – Common vocabularies are provided in a public server, e.g. schema.org. 17
  • 18. On the Digital Object Cloud • Kernel information connects data and data types. • We need to handle with a graph structure of the data. 18 Attribute augmented graph Data layer Data type layer Kernel Information metadata layer Kei Kurakawa, Takayuki Sekiya, Yasumasa Baba, Making data typing efforts or automatically detecting data types for automatic data processing?, Research Data Alliance 11th Plenary Meeting, Berlin, Germany, 2018.03.21-23 https://www.rd-alliance.org/sites/default/files/rda11_poster_20180321_kurakawa.pdf
  • 19. Conclusions and future work • Two types of accessing data on the Internet – Digital object cloud (DOC) – Linked data (LD) • PID centric approach data management and access – Data consumer scenario • Data search • Automatic data processing • We need more functional research on – data typing, – handling with a graph structure of data resources, – case studies – on the digital object cloud. 19