1. 超高層物理学を試験環境とした学術情報基盤の考察
Consideration of the scholarly information
infrastructure on upper atmospheric research field as a
test bed
Yukinobu KOYAMA
orcid:0000-0001-5363-3870
Transdisciplinary Research Integration Center
/National Institute of Informatics,
Research Organization Information and Systems.
1
8. Origin of Journal Culture
Royal society of London
philosophical transactions
started to published in 1665.
Basically, the format is not
changed for 350 years!
Imcompleteness:
Data Citation,
Metadata of Datasets,
Description of the derivation
process,
Sharing problem of data
visualization and analysis
software.
8
R. Boyle, doi:10.1098/rstl.1665.0007
9. Introduction 1
Number of articles and quantity of data
9
NISTEP, 2013
http:-reports/idc-digital-universe-2014.pdf
Total storage capacity in 2013: 4.4ZB
(kilo, mega, giga, tera, peta, exa, zetta, yotta)
It's increased 40 percent a year.
[Q] Articles & Data is increasing suddenly.
Papers which have no reproducibility are generating.
Is the current scholarly communication infrastructure enough?
Unable to validate the relevant
preclinical research for almost
two-thirds [Wadman, 2013]
サイエンスは国家公務員
がやるもの?
10. To simplify the issue
We consider Upper Atmospheric Research field to stay
away from
Ethical, Legal, Social Issues.
10
http://www.nipr.ac.jp/jare/now/20150901.h
ml
12. 12
Japan Link Center (JaLC)
JaLC is the 9th registration agency of DOI in the world.
Koyama is a member of External Committee of JaLC.
JaLC started to mint DOI into Research Data in 2014
14. Japanese Usecase
Landing Page of DOI
Our WDS/WDC group in
Japan minted a DOI to
mesospheric wind velocity
data observed by NICT.
This is the first case in “ DOI
REGISTRATION
EXPERIMENTAL PROJECT
TO RESEARCH DATA” by
JaLC.
This DOI have already refered
from JGR paper.
(doi:10.1002/2014JD022647)
doi:10.17591/55838dbd6c0ad 14
17. Upper Atmospheric
Domain Specific Metadata Database
(IUGONET Metadata DB)
http://search.iugonet.org/
(Customized Dspace 1.7.2)
Instantiation
Insert into DB
17
18. Data Handling
in Upper Atmospheric Research
Upper Atmospheric Field
Variety issues in Big Data.
Data Format is not unified.
To unify it is too difficult.
Data Analysis absorb the difference of data format.
18W3 CSV on the web working group.
19. 5 Stars OPEN DATA
⭐️
make your stuff available on the Web (whatever
format) under an open
license.
⭐️⭐️
make it available as structured data
(e.g., Excel instead of image scan of a table).
⭐️⭐️⭐️
make it avaibalbe in a non-proprietary open
format (e.g., CSV as well as of Excel).
⭐️⭐️⭐️⭐️
use URIs to denote things, so that people
can point at your stuff.
⭐️⭐️⭐️⭐️⭐️
link your data to other data to provide
context.
19
20. Upper Atmopsheric Domain Specific
Data Visualization & Analysis Software
(SPEDAS)
IDL is needed:
$2,500/license in Japan.
Can’t use CLI on free VM.
IDL: Popular soft. in Astro.
However, SPEDAS conflicts with SolarSoft
in Astronomy because of name space.
Confliction because of no name space.
Not enough for Big Data Analysis to
use many core because of limitation
of number of licenses.
For domain researcher mainly.
Not good choice for neighbor field scientist,
Data Scientist, scientist in
Development Country, Citizens?
20
SPEDAS
22. The Open Definition
by opendefinition.org
Open means anyone can freely access, use, modify,
and share for any purpose.
Open data and content can be freely used, modified,
and shared by anyone for any purpose.
Open Format:
Specifically, data should be machine-readable, available
in bulk, and provided in an open format, at the very least,
can be processed with at least one free/libre/open-source
software tool.
22
32. Conclusion
We summarized ideal scholarly information
infrastructure.
We indicated the current achievement situation
in upper atmospheric research field.
We suggest the importance of free data analysis
software.
Building the 100% free Data Visualization and Analyze
software which is called “JudasFX”.
32
33. RDAのご案内
2016/03/01-03: Research Data Allianceが、東京(一ツ橋
会館)で開かれます。
2/29 にプレイベントがあります。
九大の方のイベントと重なる可能性もありますが、お手すき
の方は、ぜひ参加することをお勧めします。
キーワード: オープンサイエンス、データ中心科学、
CODATA、WDS、データ出版、データ引用、provenance
33