Global Agricultural Concept Scheme
The collaborative integration of three thesauri
Prof Dr Thomas Baker [1]
Dr Osma Suominen [2]
DINI Jahrestagung
“Linked Data – Vision und Wirklichkeit”
Frankfurt, 28 October 2015
[1] Sungkyunkwan University (Korea) and Dublin Core Metadata Initiative
[2] National Library of Finland
http://www.iskouk.org/content/great-debate
Concept-based thesauri, published with modern tools
and available as Linked Data, are useful!
Three big thesauri in agriculture
Three thesauri of terms and concepts related to agriculture -- concepts
like rice, ricefield aquaculture, and plant pests.
● FAO – Food and Agriculture Organization of the United Nations
● CABI – Centre for Agriculture and Bioscience International (UK)
● NAL – National Agricultural Library (US)
Separate thesauri
Separate databases
Create GACS as glue linking them together
Global Agricultural Concept Scheme (GACS)
1. Improve semantic interoperability of the thesauri
2. Provide core concepts.
3. Achieve efficiencies through cooperative maintenance.
Requirements
1. Integrated view
2. Reuse of work, such as translations
3. Compatibility with existing databases
4. Based on RDF technologies: URIs, SKOS...
5. Available as Linked Open Data
Based on, mapped to, but independent of, its three source
thesauri.
AGROVOC CAB Thesaurus NAL Thesaurus
140,000 concepts, >1.4M labels
32,000 concepts, >1.2M labels
53,000 concepts, >200k labels
English, Spanish, Portuguese, German, Czech, Persian, Polish, Hindi, French, Italian, Russian, Japanese, Hungarian, Chinese, Slovak, Thai, Lao, Turkish, Korean, Arabic, Telugu ...
English, Spanish, Portuguese, Dutch + many languages with lower coverage
English, Spanish
First step: represent all three in SKOS
Obtained via automatic mappings created using AgreementMakerLight
First rough estimate
Long tail distribution (in AGRIS)
10,000 concepts cover nearly 99% of occurrences in metadata
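The coverage figure above can be reproduced for any indexing database by ranking concepts by usage and summing the share taken by the top N. The following is a minimal sketch, not the method used by the GACS partners; the function name `coverage_of_top_n` and the toy counts are assumptions made for illustration.

```python
from collections import Counter


def coverage_of_top_n(usage_counts, n):
    """Return the share of all indexing occurrences covered by the n most used concepts."""
    total = sum(usage_counts.values())
    if total == 0:
        return 0.0
    # Sort the per-concept counts from most to least used and keep the top n.
    top = sorted(usage_counts.values(), reverse=True)[:n]
    return sum(top) / total


# Hypothetical toy data: concept id -> number of records indexed with it.
toy_counts = Counter({"c1": 500, "c2": 300, "c3": 150, "c4": 40, "c5": 10})
share = coverage_of_top_n(toy_counts, 3)
print(f"Top 3 concepts cover {share:.0%} of occurrences")
```

With a long-tailed distribution, the share rises quickly for small N and flattens afterwards, which is why a core set far smaller than the full thesauri can still cover most metadata.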
Top 10,000 concepts from each
Each partner organization provided
the 10,000 concepts most frequently
used in their respective databases.
Added:
● all countries
● all higher-level organisms
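The selection step described above (most frequently used concepts, plus categories that are always added) might be sketched as follows. The function `select_core_concepts`, its parameters, and the sample data are hypothetical; the slides do not say how each partner actually produced its list.

```python
def select_core_concepts(usage_counts, n, always_include=()):
    """Pick the n most used concepts, then add concepts that must be present regardless of usage."""
    # Rank by descending usage; break ties by concept id so the result is deterministic.
    ranked = sorted(usage_counts, key=lambda concept: (-usage_counts[concept], concept))
    selected = set(ranked[:n])
    # Forced additions, e.g. all countries and all higher-level organisms.
    selected.update(always_include)
    return selected


# Hypothetical toy data: concept label -> usage count in one partner's database.
toy_usage = {"rice": 90, "wheat": 70, "maize": 50, "Finland": 2}
core = select_core_concepts(toy_usage, 2, always_include={"Finland"})
print(sorted(core))
```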
Automated mappings
Created using AgreementMakerLight software
between the full thesauri, for completeness
Human evaluation of mappings
Created Google Docs spreadsheets using the lists of selected concepts and
the auto-generated mappings. Three sheets with circa 10,700 rows each.
Mappings manually evaluated by
staff of partner organizations.
Evaluated 60 to 150 rows/hour.
Evaluation took 500 to 600 hours
for GACS Beta.
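As a rough plausibility check of these figures, the arithmetic can be worked through in a few lines. This sketch assumes that every row in the three sheets was evaluated exactly once, which the slides do not state; the constant and function names are illustrative.

```python
SHEETS = 3
ROWS_PER_SHEET = 10_700  # "circa 10,700 rows each" in the slides


def hours_needed(rows, rows_per_hour):
    """Hours required to evaluate the given number of rows at a constant rate."""
    return rows / rows_per_hour


total_rows = SHEETS * ROWS_PER_SHEET
slowest = hours_needed(total_rows, 60)   # lower bound of the reported rate
fastest = hours_needed(total_rows, 150)  # upper bound of the reported rate
print(f"{total_rows} rows: between {fastest:.0f} and {slowest:.0f} hours")
```

Under that single-pass assumption, the reported 500 to 600 hours sits at the slower end of the reported rate range.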
Starting point
October 2014
30,000 mappings later...
January 2015
4,689 mappings later...
February 2015
5,522 mappings later...
March 2015
Forming GACS concepts
by merging the source concepts and aggregating their information
rice
UF paddy
UF paddy rice
cereals
UF feed cereals
UF small grain cereals (grain)
Oryza sativa
UF Oryza glutinosa
UF Oryza indica
UF Oryza japonica
UF Oryza sativa … (subsp, var etc.)
Oryza
UF Padia
UF rice (plant)
agrovoc:c_5435
cabt:82917
nalt:56271
exactMatch
agrovoc:c_5438
cabt:82935
nalt:56277
exactMatch
agrovoc:c_1474
cabt:26247
exactMatch
agrovoc:c_6599
cabt:101613
nalt:56293
exactMatch
(Note: GACS uses SKOS, not traditional thesaurus tags)
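Merging source concepts that are linked by exactMatch, and pooling their labels, can be sketched with a small union-find structure. This is an illustrative sketch only, not the GACS tooling: the function `merge_concepts`, the concept identifiers (`agrovoc:x1` and so on), and the assignment of labels to individual thesauri are all hypothetical.

```python
from collections import defaultdict


def merge_concepts(labels_by_concept, exact_matches):
    """Group concepts connected by exactMatch links and aggregate their labels."""
    parent = {concept: concept for concept in labels_by_concept}

    def find(concept):
        # Follow parent links to the cluster representative, shortening the path on the way.
        while parent[concept] != concept:
            parent[concept] = parent[parent[concept]]
            concept = parent[concept]
        return concept

    for a, b in exact_matches:
        parent.setdefault(a, a)
        parent.setdefault(b, b)
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            # Keep the lexically smaller id as representative for deterministic output.
            parent[max(root_a, root_b)] = min(root_a, root_b)

    clusters = defaultdict(lambda: {"sources": set(), "labels": set()})
    for concept in parent:
        cluster = clusters[find(concept)]
        cluster["sources"].add(concept)
        cluster["labels"].update(labels_by_concept.get(concept, ()))
    return list(clusters.values())


# Hypothetical identifiers and label assignments, for illustration only.
toy_labels = {
    "agrovoc:x1": {"rice"},
    "cabt:x1": {"rice", "paddy"},
    "nalt:x1": {"paddy rice"},
    "agrovoc:x2": {"cereals"},
    "cabt:x2": {"cereals", "feed cereals"},
}
toy_matches = [
    ("agrovoc:x1", "cabt:x1"),
    ("cabt:x1", "nalt:x1"),
    ("agrovoc:x2", "cabt:x2"),
]
merged = merge_concepts(toy_labels, toy_matches)
for cluster in merged:
    print(sorted(cluster["sources"]), sorted(cluster["labels"]))
```

Each resulting cluster corresponds to one candidate merged concept that keeps links back to its source concepts and carries the union of their labels.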
Lumps
clusters of concepts mapped one-to-several, several-to-one, or in spirals
15,090 concepts; 972 lumps
Lumps
March 2015
15,278 concepts; 339 lumps
Lumps
April 2015
15,411 concepts; 84 lumps
Lumps
October 2015
15,406 concepts; no lumps
Last week
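One way to detect lumps automatically is to treat the mappings as an undirected graph and flag every connected cluster in which a single source thesaurus contributes more than one concept. That reading of "one-to-several, several-to-one, or in spirals" is an assumption, as are the function name `find_lumps` and the `prefix:id` identifier convention used below.

```python
from collections import defaultdict, deque


def find_lumps(mappings):
    """Return clusters of mapped concepts that contain two or more concepts from the same source."""
    neighbours = defaultdict(set)
    for a, b in mappings:
        neighbours[a].add(b)
        neighbours[b].add(a)

    seen = set()
    lumps = []
    for start in sorted(neighbours):
        if start in seen:
            continue
        # Breadth-first search collects one connected cluster of mapped concepts.
        component = set()
        queue = deque([start])
        seen.add(start)
        while queue:
            node = queue.popleft()
            component.add(node)
            for nxt in neighbours[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        # The source thesaurus is taken from the prefix before the first colon.
        sources = [concept.split(":", 1)[0] for concept in component]
        if len(sources) != len(set(sources)):
            lumps.append(component)
    return lumps


# Hypothetical mappings: the second cluster maps two AGROVOC concepts to one CABT concept.
toy_mappings = [
    ("agrovoc:x1", "cabt:x1"),
    ("cabt:x1", "nalt:x1"),
    ("agrovoc:x2", "cabt:x2"),
    ("agrovoc:x3", "cabt:x2"),
]
print(find_lumps(toy_mappings))
```

A clean cluster has at most one concept per source; anything else is reported for manual resolution.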
Polyhierarchy?
countries
developing countries
Argentina
Buenos Aires
development
socioeconomic development
economic development
sciences
social sciences
economics
Concept types?
Concept types!
Plus Product
Towards GACS roll-out (2016)
• Concept scheme as Linked Data. Own publication and editorial
platform.
• Quality improvements. Inconsistencies in hierarchy, choice of
labels, scope notes and definitions.
• Own semantic structure. Common vs scientific names, custom
relationships, concept types.
VocBench for editing
Skosmos for display and browsing
Size of GACS
GACS Beta 1.1
• 15,406 concepts
• 398,216 labels in 28 languages
Extension module for what remains?
AGROVOC and NALT may be phased out
Extension module?
GACS
CABT
GACS
Agrisemantics
http://aims.fao.org/sites/default/files/Report_workshop_Agrisemantics.pdf
Global food security and climate change
• GACS as hub for agricultural code lists,
taxonomies, statistical indicators...
• Simplify data normalization and integration
• More coherent datasets and research results
• Help farmers become more efficient
Reports available on the FAO AIMS site
http://aims.fao.org/community/agrovoc/blogs/phase-one-gacs-approved-read-reports
http://aims.fao.org/sites/default/files/Report_workshop_Agrisemantics.pdf
osma.suominen@helsinki.fi
tom@tombaker.org
Abstract
The Food and Agriculture Organization of the United Nations (FAO), CAB International (CABI), and the USDA National
Agricultural Library (NAL), maintainers of three large thesauri of agricultural terminology that largely overlap in scope, have
partnered to create a shared Global Agricultural Concept Scheme (GACS). Duplication of effort has proven to be both inefficient
and a barrier to users wishing to search across databases indexed with their terms. Expressing AGROVOC, CAB Thesaurus, and
NAL Thesaurus in RDF and SKOS, as Linked Data, facilitates mappings, but mappings among three large, continually moving
targets are difficult to maintain.
Starting with algorithmically generated mappings among three sets of the terms most frequently used to index the AGRICOLA,
CAB Abstracts, and AGRIS databases, thesaurus managers in the GACS Working Group have manually vetted the mappings for
quality and are currently correcting logical inconsistencies. In a final iteration, these mappings will be used to generate a Global
Agricultural Concept Scheme with its own identifiers, and GACS will be moved into its own distributed editorial environment and
jointly maintained by the three partners.
Targeted for beta release in early 2016, GACS aggregates the complementary strengths of its sources, such as expertise in
particular areas and labels in twenty languages. Formulating consistent policies for GACS on issues such as scientific versus
common names for organisms requires balancing scientific, commercial, educational, and mass-market perspectives. The
challenge of global food security under conditions of climate change will require the integration of data at all levels. GACS can
serve as a focal point in the broader ecosystem of vocabularies, code lists, database schemas, ontologies, statistical indicators,
and taxonomies required to drive agricultural research and innovation.

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

Global Agricultural Concept Scheme The collaborative integration of three thesauri

  • 1. Global Agricultural Concept Scheme The collaborative integration of three thesauri Prof Dr Thomas Baker [1] Dr Osma Suominen [2] DINI Jahrestagung “Linked Data – Vision und Wirklichkeit” Frankfurt, 28 October 2015 [1] Sungkyunkwan University (Korea) and Dublin Core Metadata Initiative [2] National Library of Finland
  • 3. http://www.iskouk.org/content/great-debate Concept-based thesauri, published with modern tools and available as Linked Data, are useful!
  • 4. Three big thesauri in agriculture Three thesauri of terms and concepts related to agriculture -- concepts like rice, ricefield aquaculture, and plant pests. ● FAO – Food and Agriculture Organization of the United Nations ● CABI – Centre for Agriculture and Bioscience International (UK) ● NAL – National Agricultural Library (US)
  • 6. Create GACS as glue linking them together
  • 7. Global Agricultural Concept Scheme (GACS) 1. Improve semantic interoperability of the thesauri 2. Provide core concepts. 3. Achieve efficiencies through cooperative maintenance.
  • 8. Requirements 1. Integrated view 2. Reuse of work, such as translations 3. Compatibility with existing databases 4. Based on RDF technologies: URIs, SKOS... 5. Available as Linked Open Data Based on, mapped to, but independent of, its three source thesauri.
  • 9. AGROVOC CAB Thesaurus NAL Thesaurus 140,000 concepts, >1.4M labels 32,000 concepts, >1.2M labels 53,000 concepts, >200k labels English, Spanish, Portuguese, German, Czech, Persian, Polish, Hindi, French, Italian, Russian, Japanese, Hungarian, Chinese, Slovak, Thai, Lao, Turkish, Korean, Arabic, Telugu ... English, Spanish, Portuguese, Dutch + many languages with lower coverage English, Spanish First step: represent all three in SKOS
  • 10. Obtained via automatic mappings created using AgreementMakerLight First rough estimate
  • 11. Long tail distribution (in AGRIS) 10,000 concepts cover nearly 99% of occurrences in metadata
  • 12. Top 10,000 concepts from each Each partner organization provided the 10,000 concepts most frequently used in their respective databases. Added: ● all countries ● all higher-level organisms
  • 13. Automated mappings Created using AgreementMakerLight software between the full thesauri, for completeness
  • 14. Human evaluation of mappings Created Google Docs spreadsheets using the lists of selected concepts and the auto-generated mappings. Three sheets with circa 10,700 rows each. Mappings manually evaluated by staff of partner organizations. Evaluated 60 to 150 rows/hour. Evaluation took 500 to 600 hours for GACS Beta.
  • 19. Forming GACS concepts by merging the source concepts and aggregating their information rice UF paddy UF paddy rice cereals UF feed cereals UF small grain cereals (grain) Oryza sativa UF Oryza glutinosa UF Oryza indica UF Oryza japonica UF Oryza sativa … (subsp, var etc.) Oryza UF Padia UF rice (plant) agrovoc:c_5435 cabt:82917 nalt:56271 exactMatch agrovoc:c_5438 cabt:82935 nalt:56277 exactMatch agrovoc:c_1474 cabt:26247 exactMatch agrovoc:c_6599 cabt:101613 nalt:56293 exactMatch (Note: GACS uses SKOS, not traditional thesaurus tags)
  • 20. Lumps: clusters of concepts mapped one-to-several, several-to-one, or in spirals
  • 21. Lumps, March 2015: 15,090 concepts; 972 lumps
  • 22. Lumps, April 2015: 15,278 concepts; 339 lumps
  • 23. Lumps, October 2015: 15,411 concepts; 84 lumps
  • 24. Last week: 15,406 concepts; no lumps
  • 28. Towards GACS roll-out (2016) • Concept scheme as Linked Data. Own publication and editorial platform. • Quality improvements. Inconsistencies in hierarchy, choice of labels, scope notes and definitions. • Own semantic structure. Common vs scientific names, custom relationships, concept types.
  • 30. Skosmos for display and browsing
  • 31. Size of GACS: GACS Beta 1.1 • 15,406 concepts • 398,216 labels in 28 languages
  • 32. Extension module for what remains?
  • 33. AGROVOC and NALT may be phased out. Extension module? (Diagram: GACS, CABT)
  • 35. Global food security and climate change • GACS as hub for agricultural code lists, taxonomies, statistical indicators... • Simplify data normalization and integration • More coherent datasets and research results • Help farmers become more efficient
  • 36. Reports available on the FAO AIMS site http://aims.fao.org/community/agrovoc/blogs/phase-one-gacs-approved-read-reports http://aims.fao.org/sites/default/files/Report_workshop_Agrisemantics.pdf osma.suominen@helsinki.fi tom@tombaker.org
  • 37. Abstract The Food and Agriculture Organization of the United Nations (FAO), CAB International (CABI), and the USDA National Agricultural Library (NAL), maintainers of three large thesauri of agricultural terminology that largely overlap in scope, have partnered to create a shared Global Agricultural Concept Scheme (GACS). Duplication of effort has proven to be both inefficient and a barrier to users wishing to search across databases indexed with their terms. Expressing AGROVOC, CAB Thesaurus, and NAL Thesaurus in RDF and SKOS, as Linked Data, facilitates mappings, but mappings among three large, continually moving targets are difficult to maintain. Starting with algorithmically generated mappings among three sets of the terms most frequently used to index the AGRICOLA, CAB Abstracts, and AGRIS databases, thesaurus managers in the GACS Working Group have manually vetted the mappings for quality and are currently correcting logical inconsistencies. In a final iteration, these mappings will be used to generate a Global Agricultural Concept Scheme with its own identifiers, and GACS will be moved into its own distributed editorial environment and jointly maintained by the three partners. Targeted for beta release in early 2016, GACS aggregates the complementary strengths of its sources, such as expertise in particular areas and labels in twenty languages. Formulating consistent policies for GACS on issues such as scientific versus common names for organisms requires balancing scientific, commercial, educational, and mass-market perspectives. The challenge of global food security under conditions of climate change will require the integration of data at all levels. GACS can serve as a focal point in the broader ecosystem of vocabularies, code lists, database schemas, ontologies, statistical indicators, and taxonomies required to drive agricultural research and innovation.
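
The "lumps" on slides 20 to 24 (clusters of concepts mapped one-to-several or several-to-one) could be detected along the following lines. This is a minimal sketch, not the GACS Working Group's actual tooling: it assumes that a lump is any group of concepts connected by exactMatch links that contains more than one concept from the same source thesaurus, and the function name `find_lumps` and the identifier `cabt:00000` are hypothetical. The other identifiers are taken from slide 19.

```python
from collections import defaultdict


def find_lumps(mappings):
    """Return clusters of exactMatch-linked concepts that contain more
    than one concept from the same source thesaurus (assumed definition
    of a "lump")."""
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the two sides of every mapping into one cluster.
    for a, b in mappings:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    # Collect the members of each cluster under its root.
    clusters = defaultdict(set)
    for concept in list(parent):
        clusters[find(concept)].add(concept)

    lumps = []
    for members in clusters.values():
        # The source is the prefix before the colon, e.g. "cabt".
        sources = [m.split(":", 1)[0] for m in members]
        if len(sources) != len(set(sources)):
            lumps.append(sorted(members))
    return sorted(lumps)


mappings = [
    ("agrovoc:c_5435", "cabt:82917"),
    ("cabt:82917", "nalt:56271"),
    ("agrovoc:c_5438", "cabt:82935"),
    ("agrovoc:c_5438", "cabt:00000"),  # hypothetical second CABT match
]
lumps = find_lumps(mappings)
print(lumps)
```

Under this definition, the first three-way cluster is clean (one concept per source), while the second is reported as a lump because two CABT concepts map to the same AGROVOC concept.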