SlideShare una empresa de Scribd logo
1 de 44
IFLA - Lyon, France 19 August 2014 
Multilingualism in 
WorldCat and 
VIAF 
Janifer Gatenby 
Working with Karen Smith-Yoshimura, Robert Bremer, Eric Childress, Jean Godby, 
Richard Greene, JD Shipengrover, Gail Thornburg, Jenny Toves, Diane Vizine 
Goetz, Shenghui Wang, Jay Weitz
WorldCat Today 
• Resources in nearly all 
languages 
• Contributed by more 
than 20,000 libraries 
worldwide 
• More than half the 
database is for works 
not in English 
Languages 
English 
German 
French 
Spanish 
Chinese 
Dutch 
Japanese 
Russian 
Arabic 
469 others
WorldCat Today 
• Bibliographic Records 
– Hybrid records 
– Parallel records 
• Clustered at Work 
level (FRBR)
Existing Architecture 
Authors 
Authors 
Authors 
Subj 
Classif 
Subj 
Classif Subj 
Classif 
Holding 
Holding 
Holdings 
Bibliographic 
record 
Work 
cluster 
Content 
cluster 
Manifes 
tation 
cluster
Complementary Initiatives 
Work Level 
Record 
GLIMIR 
Manifestation & 
Content Clusters 
Multi-lingual 
Bibliographic 
Structure
Create a consolidated 
metadata summary for the 
content of a work 
Objective: Work Level Record
Work Level Record 
http://www.oclc.org/research/activities/workrecs.html 
Coming Q1 
2015
Create better work 
presentations 
GLIMIR: Objective
Users like C 
• The Content Cluster 
GLIMIR 
– Enables better work record displays by reducing the number of 
lines that display for large works 
– Enables a choice of format and presents the formats that could be 
acceptable substitutes 
– Consolidates holdings for identical content 
• The Manifestation Cluster is important 
– Consolidates holdings at manifestation level 
– In the short term allows the record catalogued in the language of 
the interface to be chosen for display 
– Reduces apparent duplication 
– Allows a more accurate count of the number of manifestations in 
WorldCat (as opposed to the number of records) 
Cataloguers 
& scholars 
like C
Manifestation Clustering 
So far 103 million records processed (about 30%)
Manifestation Cluster Opened
SRU Search: 
Loti Pêcheur d’islande 
(Work ID 21536567) 
Records Holdings 
Work 18 148 
Content 14 143 
Manifestation 7 115
Multilingual Bibliographic 
Structure Project 
Objective: Improve displays; surface translations
Multilingual Bibliographic Structure Project 
Creates true multi-lingual displays 
– At work and manifestation levels 
– Using all available data instead of “most appropriate record” 
– Generates data 
Corrects many of the 28 million records coded 
“und” 
Better control and linking of translations 
Input to refinement of work clusters 
Smarter data storage
“Most appropriate” questioned 
• Worldcat.org selects the most appropriate 
record to show to a user as representative of 
the work in the short result list and beyond 
• The end result will not be very satisfactory from a 
multi-lingual viewpoint… here’s why
Which record is better to present to a German speaker?
Incomplete Swedish Record
Hybrid record
Most appropriate 
display 
Build the display from all available data
Multilingual Bibliographic Structure Project 
• Work level data, mined from all associated 
bibliographic records will be displayed 
supplemented with expression / manifestation 
level data as the user drills through the short 
to fuller versions of the metadata. 
End user interface will show works and manifestations not bibliographic records; the 
cataloguing client will also show bibliographic records
Proposed new 
architecture 
jpn 
Work 
eng 
fre 
ger 
jpn 
Manif 
eng 
Manif 
eng 
Manif 
eng 
Manif 
eng Manif 
Manif 
eng 
engA 
oN oftrees 
Contents 
++ 
Holding 
Holding 
Holding 
Subj 
Classif 
Holding 
Subj 
sif 
eng 
fre 
ger 
jpn 
Authors 
Authors 
eng Authors 
fre 
ger 
eng 
fre 
ger 
jpn 
fre 
eng 
ger 
jpn 
Translations 
(Language of work) 
Manif 
fre 
Holding
Important principles 
• Language tagging of elements, particularly 
– Summaries (M21 520) 
– Subject headings 
• Display in script preferred by the user if data is 
available 
• Improve translated interfaces 
• Show consolidated holdings as appropriate
Translations 
Surfacing the “cream”
Great works are translated 
• The cream of the world’s cultural and 
knowledge heritage is shared by being 
translated 
• WorldCat contains many rich cataloguing 
records for these translations 
GOAL: Data mine the really good records to 
improve clustering, presentation, authority records 
and linked data
Ιλιάδα 
The Iliad 紅樓夢 
Dream of the Red Chamber 
ঘরে বাইরে 
The Home and the World 
زقاق المدق 
Midaq Alley 
Война и миръ 
War and Peace 
The Tale of Genji 
דער בעל-תשובה 
The Penitent 
સત્યના પ્રયોગો અથવા આત્મકથા 
源 
氏 
物 
語 
The Story of My Experiments with Truth [Gandhi autobiography]
Translations 
Leo Tolstoy: 32 languages 
Homer: 28 languages 
Rabindranath Tagore: 21 
Isaac Bashevis Singer: 17 
Najīb Maḥfūẓ: 12 languages 
Cao Xueqin: 9 languages 
Mahatma Gandhi: 7 languages 
Murasaki Shikabu: 7 languages
Improving work clustering 
• Inconsistencies cause work clusters to be 
incomplete resulting in less than optimal 
search results 
– Titles without subtitles 
– Missing or different forms of uniform title 
– Inverted title 
– Different coding of original and translated 
information 
Generated uniform title authority records will overcome most of these differences 
without needing to edit individual records
Addition of xR records to VIAF 
Before 
After
UNESCO Translation Database
XR VIAF Record 
VIAF ID for Author 
Translated title 
Translator
IFLA - Lyon, France 19 August 2014 
VIAF Linked Data 
New Information
Title: 西遊記 
Language: Chinese 
Author: 吳承恩 
Created: 1592 
HasTranslation: 
Title: Journey to the West 
Language: English 
Translator: Anthony C. Yu 
Date: 1977 
IsTranslationOf: 
Title: Journey to the West 
Language: English 
Translator: W. J. F. Jenner 
Date: 1982-1984 
IsTranslationOf: 
Title: Tây du ký bình khảo 
Language: Vietnamese 
Translator: Phan Quân 
Date: 1980 
IsTranslationOf: 
Title: Monkeys Pilgerfahrt 
Language: German 
Translator: Georgette Boner 
Date: 1983 
IsTranslationOf: 
Title: 西遊記 
Language: Japanese 
Translator: 中野美代子 
Date: 1986 
IsTranslationOf:
Markup for the Semantic Web 
# Original Work (in Chinese) 
<http://worldcat.org/entity/work/id/1215997> 
a schema:CreativeWork; 
schema:creator <http://viaf.org/viaf/102266649> ; # "Gao, Xingjian” 
schema:inLanguage "zh"; 
schema:name "靈山"@zh; 
. 
# Translated Work (in English) 
<http://worldcat.org/entity/work/id/145209748> 
a schema:CreativeWork; 
schema:creator <http://viaf.org/viaf/102266649> ; # "Gao, Xingjian“ 
[new]:translator <http://viaf.org/viaf/81663420> ; # "Lee, Mabel" 
schema:inLanguage "en"; 
schema:name "Soul Mountain"@en ; 
[new]:translationOfWork <http://worldcat.org/entity/work/id/1215997> “
Understanding information sharing 
across cultures 
• What percentage of non-English works are translations of 
English works, and vice-versa? 
• Which authors are translated the most? 
• Which works have been translated into the most languages? 
• Which countries translate the most English works, the most 
non-English works? 
• Which countries translate a new 
work the fastest? 
Etc. 
http://www.oclc.org/research/activities/multilingual-bib-structure.html
Where are we now? 
Clustering 
• Work clusters done; ongoing refinement 
• GLIMIR clustering done for all [simple] text; 
– 103 million records have GLIMIR IDs 
• Working on collected works 
Displays 
• Working on VIAF expression displays 
• Work level displays in WorldCat.org ++ 
Data Mining for translations
Janifer Gatenby 
EMEA Program Manager Metadata 
Janifer.gatenby@oclc.orgoclc.org 
Explore. Share. Magnify.

Más contenido relacionado

La actualidad más candente

Schema.org - An Extending Influence
Schema.org - An Extending InfluenceSchema.org - An Extending Influence
Schema.org - An Extending InfluenceRichard Wallis
 
LD4L OCLC Data Strategy
LD4L OCLC Data StrategyLD4L OCLC Data Strategy
LD4L OCLC Data StrategyRichard Wallis
 
Signposting for Repositories
Signposting for RepositoriesSignposting for Repositories
Signposting for RepositoriesMartin Klein
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library DataRichard Wallis
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataRichard Wallis
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesRichard Wallis
 
Linked Data Snowball, or Why We Need Reconciliation
Linked Data Snowball, or Why We Need ReconciliationLinked Data Snowball, or Why We Need Reconciliation
Linked Data Snowball, or Why We Need ReconciliationRobert Sanderson
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.orgJoshua Shinavier
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Oscar Corcho
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in LibrariesRichard Wallis
 
Designing Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesDesigning Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesRichard Wallis
 
Cogapp Open Studios 2012 - Adventures with Linked Data
Cogapp Open Studios 2012 - Adventures with Linked DataCogapp Open Studios 2012 - Adventures with Linked Data
Cogapp Open Studios 2012 - Adventures with Linked DataCogapp
 
OCLC Linked Data Progress
OCLC Linked Data ProgressOCLC Linked Data Progress
OCLC Linked Data ProgressRichard Wallis
 
The Web of Data is Our Oyster
The Web of Data is Our OysterThe Web of Data is Our Oyster
The Web of Data is Our OysterRichard Wallis
 
Schema.org - Extending Benefits
Schema.org - Extending BenefitsSchema.org - Extending Benefits
Schema.org - Extending BenefitsRichard Wallis
 
The Cultural Linked Data Backbone
The Cultural Linked Data BackboneThe Cultural Linked Data Backbone
The Cultural Linked Data BackboneRichard Wallis
 
ARPHA: Next-Generation Journal Publishing
ARPHA: Next-Generation Journal PublishingARPHA: Next-Generation Journal Publishing
ARPHA: Next-Generation Journal PublishingPensoft Publishers
 
From Open Access to Open Science: from the Viewpoint of a Scholarly Publisher
From Open Access to Open Science: from the Viewpoint of a Scholarly PublisherFrom Open Access to Open Science: from the Viewpoint of a Scholarly Publisher
From Open Access to Open Science: from the Viewpoint of a Scholarly PublisherPensoft Publishers
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemWiLS
 

La actualidad más candente (20)

Schema.org - An Extending Influence
Schema.org - An Extending InfluenceSchema.org - An Extending Influence
Schema.org - An Extending Influence
 
LD4L OCLC Data Strategy
LD4L OCLC Data StrategyLD4L OCLC Data Strategy
LD4L OCLC Data Strategy
 
Signposting for Repositories
Signposting for RepositoriesSignposting for Repositories
Signposting for Repositories
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library Data
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library Data
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
Linked Data Snowball, or Why We Need Reconciliation
Linked Data Snowball, or Why We Need ReconciliationLinked Data Snowball, or Why We Need Reconciliation
Linked Data Snowball, or Why We Need Reconciliation
 
semantic markup using schema.org
semantic markup using schema.orgsemantic markup using schema.org
semantic markup using schema.org
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?
 
Linked Data in Libraries
Linked Data in LibrariesLinked Data in Libraries
Linked Data in Libraries
 
Designing Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for LibrariesDesigning Linked Data Software & Services for Libraries
Designing Linked Data Software & Services for Libraries
 
Cogapp Open Studios 2012 - Adventures with Linked Data
Cogapp Open Studios 2012 - Adventures with Linked DataCogapp Open Studios 2012 - Adventures with Linked Data
Cogapp Open Studios 2012 - Adventures with Linked Data
 
OCLC Linked Data Progress
OCLC Linked Data ProgressOCLC Linked Data Progress
OCLC Linked Data Progress
 
The Web of Data is Our Oyster
The Web of Data is Our OysterThe Web of Data is Our Oyster
The Web of Data is Our Oyster
 
Schema.org - Extending Benefits
Schema.org - Extending BenefitsSchema.org - Extending Benefits
Schema.org - Extending Benefits
 
The Cultural Linked Data Backbone
The Cultural Linked Data BackboneThe Cultural Linked Data Backbone
The Cultural Linked Data Backbone
 
ARPHA: Next-Generation Journal Publishing
ARPHA: Next-Generation Journal PublishingARPHA: Next-Generation Journal Publishing
ARPHA: Next-Generation Journal Publishing
 
Extending Schema.org
Extending Schema.orgExtending Schema.org
Extending Schema.org
 
From Open Access to Open Science: from the Viewpoint of a Scholarly Publisher
From Open Access to Open Science: from the Viewpoint of a Scholarly PublisherFrom Open Access to Open Science: from the Viewpoint of a Scholarly Publisher
From Open Access to Open Science: from the Viewpoint of a Scholarly Publisher
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library System
 

Destacado

ILN Library 2.014 presentation
ILN Library 2.014 presentationILN Library 2.014 presentation
ILN Library 2.014 presentationInterLibNet
 
IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...
IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...
IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...Walter Schlect
 
International Librarianship 101
International Librarianship 101International Librarianship 101
International Librarianship 101InterLibNet
 
Going to the Next Level / Presentation at IFLA 2014, Lyon
Going to the Next Level / Presentation at IFLA 2014, LyonGoing to the Next Level / Presentation at IFLA 2014, Lyon
Going to the Next Level / Presentation at IFLA 2014, LyonMartin Morris
 
Anna Stoute FLC Presentation IFLA 2014 PDF Final
Anna Stoute FLC Presentation IFLA 2014 PDF FinalAnna Stoute FLC Presentation IFLA 2014 PDF Final
Anna Stoute FLC Presentation IFLA 2014 PDF FinalAnna Stoute
 
Developing an international perspective on librarianship
Developing an international perspective on librarianshipDeveloping an international perspective on librarianship
Developing an international perspective on librarianshipMichael Crumpton
 
Smart libraries create smart communities
Smart libraries create smart communitiesSmart libraries create smart communities
Smart libraries create smart communitiesJan Holmquist
 
Crossing borders for professional development
Crossing borders for professional developmentCrossing borders for professional development
Crossing borders for professional developmentInterLibNet
 
Librarians without borders: the International Librarians Network experience
Librarians without borders: the International Librarians Network experienceLibrarians without borders: the International Librarians Network experience
Librarians without borders: the International Librarians Network experienceInterLibNet
 
Similarities in SEA
Similarities in SEA Similarities in SEA
Similarities in SEA SirimasS
 

Destacado (14)

ILN Library 2.014 presentation
ILN Library 2.014 presentationILN Library 2.014 presentation
ILN Library 2.014 presentation
 
IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...
IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...
IFLA Art Libraries section 2014 presentation on NYARC webarchiving project wi...
 
International Librarianship 101
International Librarianship 101International Librarianship 101
International Librarianship 101
 
Going to the Next Level / Presentation at IFLA 2014, Lyon
Going to the Next Level / Presentation at IFLA 2014, LyonGoing to the Next Level / Presentation at IFLA 2014, Lyon
Going to the Next Level / Presentation at IFLA 2014, Lyon
 
Inspirez-vous des bibliothèques du monde
Inspirez-vous des bibliothèques du mondeInspirez-vous des bibliothèques du monde
Inspirez-vous des bibliothèques du monde
 
Anna Stoute FLC Presentation IFLA 2014 PDF Final
Anna Stoute FLC Presentation IFLA 2014 PDF FinalAnna Stoute FLC Presentation IFLA 2014 PDF Final
Anna Stoute FLC Presentation IFLA 2014 PDF Final
 
Developing an international perspective on librarianship
Developing an international perspective on librarianshipDeveloping an international perspective on librarianship
Developing an international perspective on librarianship
 
Smart libraries create smart communities
Smart libraries create smart communitiesSmart libraries create smart communities
Smart libraries create smart communities
 
Congrès IFLA 2014 - The crpd and its impact on library services
Congrès IFLA 2014 - The crpd and its impact on library servicesCongrès IFLA 2014 - The crpd and its impact on library services
Congrès IFLA 2014 - The crpd and its impact on library services
 
Comparative Librarianship
Comparative LibrarianshipComparative Librarianship
Comparative Librarianship
 
Crossing borders for professional development
Crossing borders for professional developmentCrossing borders for professional development
Crossing borders for professional development
 
Madre gestante
Madre gestanteMadre gestante
Madre gestante
 
Librarians without borders: the International Librarians Network experience
Librarians without borders: the International Librarians Network experienceLibrarians without borders: the International Librarians Network experience
Librarians without borders: the International Librarians Network experience
 
Similarities in SEA
Similarities in SEA Similarities in SEA
Similarities in SEA
 

Similar a Journey to Multilingualism

Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Janifer Gatenby
 
Preparing an Open Source Documentation Repository for Translations
Preparing an Open Source Documentation Repository for TranslationsPreparing an Open Source Documentation Repository for Translations
Preparing an Open Source Documentation Repository for TranslationsHPCC Systems
 
Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...
Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...
Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...Lucidworks
 
How community software supports language documentation and data analysis
How community software supports language documentation and data analysisHow community software supports language documentation and data analysis
How community software supports language documentation and data analysisPeter Bouda
 
A Closer Look at the Changing Dynamics of DBpedia Mappings
A Closer Look at the Changing Dynamics of DBpedia MappingsA Closer Look at the Changing Dynamics of DBpedia Mappings
A Closer Look at the Changing Dynamics of DBpedia MappingsMaribel Acosta Deibe
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana
 
plone.app.multilingual
plone.app.multilingual plone.app.multilingual
plone.app.multilingual Ramon Navarro
 
Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Tobias Wunner
 
Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012François Belleau
 
Using the Semantic Web to Improve Knowledge of Translations
Using the Semantic Web to Improve Knowledge of TranslationsUsing the Semantic Web to Improve Knowledge of Translations
Using the Semantic Web to Improve Knowledge of TranslationsOCLC
 
FEC2017-Introduction-to-programming
FEC2017-Introduction-to-programmingFEC2017-Introduction-to-programming
FEC2017-Introduction-to-programmingHenrikki Tenkanen
 
2016 bioinformatics i_python_part_1_wim_vancriekinge
2016 bioinformatics i_python_part_1_wim_vancriekinge2016 bioinformatics i_python_part_1_wim_vancriekinge
2016 bioinformatics i_python_part_1_wim_vancriekingeProf. Wim Van Criekinge
 
Elasticsearch Basics
Elasticsearch BasicsElasticsearch Basics
Elasticsearch BasicsShifa Khan
 
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h442010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44Alain Désilets
 
From Semantics to Self-supervised Learning for Speech and Beyond
From Semantics to Self-supervised Learning for Speech and BeyondFrom Semantics to Self-supervised Learning for Speech and Beyond
From Semantics to Self-supervised Learning for Speech and Beyondlinshanleearchive
 

Similar a Journey to Multilingualism (20)

Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19Multilingual presentation ifla 2013 08-19
Multilingual presentation ifla 2013 08-19
 
Preparing an Open Source Documentation Repository for Translations
Preparing an Open Source Documentation Repository for TranslationsPreparing an Open Source Documentation Repository for Translations
Preparing an Open Source Documentation Repository for Translations
 
Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...
Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...
Embracing Diversity: Searching over Multiple Languages - Suneel Marthi, Red H...
 
Php packages
Php packagesPhp packages
Php packages
 
How community software supports language documentation and data analysis
How community software supports language documentation and data analysisHow community software supports language documentation and data analysis
How community software supports language documentation and data analysis
 
A Closer Look at the Changing Dynamics of DBpedia Mappings
A Closer Look at the Changing Dynamics of DBpedia MappingsA Closer Look at the Changing Dynamics of DBpedia Mappings
A Closer Look at the Changing Dynamics of DBpedia Mappings
 
20110728 datalift-rpi-troy
20110728 datalift-rpi-troy20110728 datalift-rpi-troy
20110728 datalift-rpi-troy
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
 
Swift vs. Language X
Swift vs. Language XSwift vs. Language X
Swift vs. Language X
 
plone.app.multilingual
plone.app.multilingual plone.app.multilingual
plone.app.multilingual
 
P1 2017 python
P1 2017 pythonP1 2017 python
P1 2017 python
 
Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1
 
Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012Bio2RDF presentation at Combine 2012
Bio2RDF presentation at Combine 2012
 
Using the Semantic Web to Improve Knowledge of Translations
Using the Semantic Web to Improve Knowledge of TranslationsUsing the Semantic Web to Improve Knowledge of Translations
Using the Semantic Web to Improve Knowledge of Translations
 
FEC2017-Introduction-to-programming
FEC2017-Introduction-to-programmingFEC2017-Introduction-to-programming
FEC2017-Introduction-to-programming
 
2016 bioinformatics i_python_part_1_wim_vancriekinge
2016 bioinformatics i_python_part_1_wim_vancriekinge2016 bioinformatics i_python_part_1_wim_vancriekinge
2016 bioinformatics i_python_part_1_wim_vancriekinge
 
Elasticsearch Basics
Elasticsearch BasicsElasticsearch Basics
Elasticsearch Basics
 
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h442010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44
2010 06-u maryland-crowd_sourcing-workshop-v2010-06-16-10h44
 
From Semantics to Self-supervised Learning for Speech and Beyond
From Semantics to Self-supervised Learning for Speech and BeyondFrom Semantics to Self-supervised Learning for Speech and Beyond
From Semantics to Self-supervised Learning for Speech and Beyond
 

Más de Janifer Gatenby

Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411Janifer Gatenby
 
Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11Janifer Gatenby
 
Art discovery group catalogue: Usage, content and new horizons
Art discovery group catalogue:  Usage, content and new horizonsArt discovery group catalogue:  Usage, content and new horizons
Art discovery group catalogue: Usage, content and new horizonsJanifer Gatenby
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveJanifer Gatenby
 
Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15Janifer Gatenby
 
Art discovery view to future content and interface ifla lyon 20140820
Art discovery view to future content and interface   ifla lyon 20140820Art discovery view to future content and interface   ifla lyon 20140820
Art discovery view to future content and interface ifla lyon 20140820Janifer Gatenby
 
Viaf and isni ifla 2013 08-16
Viaf and isni  ifla 2013 08-16Viaf and isni  ifla 2013 08-16
Viaf and isni ifla 2013 08-16Janifer Gatenby
 

Más de Janifer Gatenby (7)

Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411
 
Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11
 
Art discovery group catalogue: Usage, content and new horizons
Art discovery group catalogue:  Usage, content and new horizonsArt discovery group catalogue:  Usage, content and new horizons
Art discovery group catalogue: Usage, content and new horizons
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspective
 
Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15
 
Art discovery view to future content and interface ifla lyon 20140820
Art discovery view to future content and interface   ifla lyon 20140820Art discovery view to future content and interface   ifla lyon 20140820
Art discovery view to future content and interface ifla lyon 20140820
 
Viaf and isni ifla 2013 08-16
Viaf and isni  ifla 2013 08-16Viaf and isni  ifla 2013 08-16
Viaf and isni ifla 2013 08-16
 

Último

Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxmibuzondetrabajo
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predieusebiomeyer
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书rnrncn29
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxMario
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119APNIC
 
TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxAndrieCagasanAkio
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxNIMMANAGANTI RAMAKRISHNA
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书rnrncn29
 
Cybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best PracticesCybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best PracticesLumiverse Solutions Pvt Ltd
 

Último (9)

Unidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptxUnidad 4 – Redes de ordenadores (en inglés).pptx
Unidad 4 – Redes de ordenadores (en inglés).pptx
 
SCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is prediSCM Symposium PPT Format Customer loyalty is predi
SCM Symposium PPT Format Customer loyalty is predi
 
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
『澳洲文凭』买詹姆士库克大学毕业证书成绩单办理澳洲JCU文凭学位证书
 
Company Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptxCompany Snapshot Theme for Business by Slidesgo.pptx
Company Snapshot Theme for Business by Slidesgo.pptx
 
IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119IP addressing and IPv6, presented by Paul Wilson at IETF 119
IP addressing and IPv6, presented by Paul Wilson at IETF 119
 
TRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptxTRENDS Enabling and inhibiting dimensions.pptx
TRENDS Enabling and inhibiting dimensions.pptx
 
ETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptxETHICAL HACKING dddddddddddddddfnandni.pptx
ETHICAL HACKING dddddddddddddddfnandni.pptx
 
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
『澳洲文凭』买拉筹伯大学毕业证书成绩单办理澳洲LTU文凭学位证书
 
Cybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best PracticesCybersecurity Threats and Cybersecurity Best Practices
Cybersecurity Threats and Cybersecurity Best Practices
 

Journey to Multilingualism

  • 1. IFLA - Lyon, France 19 August 2014 Multilingualism in WorldCat and VIAF Janifer Gatenby Working with Karen Smith-Yoshimura, Robert Bremer, Eric Childress, Jean Godby, Richard Greene, JD Shipengrover, Gail Thornburg, Jenny Toves, Diane Vizine Goetz, Shenghui Wang, Jay Weitz
  • 2. WorldCat Today • Resources in nearly all languages • Contributed by more than 20,000 libraries worldwide • More than half the database is for works not in English Languages English German French Spanish Chinese Dutch Japanese Russian Arabic 469 others
  • 3. WorldCat Today • Bibliographic Records – Hybrid records – Parallel records • Clustered at Work level (FRBR)
  • 4. Existing Architecture Authors Authors Authors Subj Classif Subj Classif Subj Classif Holding Holding Holdings Bibliographic record Work cluster Content cluster Manifes tation cluster
  • 5. Complementary Initiatives Work Level Record GLIMIR Manifestation & Content Clusters Multi-lingual Bibliographic Structure
  • 6. Create a consolidated metadata summary for the content of a work Objective: Work Level Record
  • 7. Work Level Record http://www.oclc.org/research/activities/workrecs.html Coming Q1 2015
  • 8. Create better work presentations GLIMIR: Objective
  • 9. Users like C • The Content Cluster GLIMIR – Enables better work record displays by reducing the number of lines that display for large works – Enables a choice of format and presents the formats that could be acceptable substitutes – Consolidates holdings for identical content • The Manifestation Cluster is important – Consolidates holdings at manifestation level – In the short term allows the record catalogued in the language of the interface to be chosen for display – Reduces apparent duplication – Allows a more accurate count of the number of manifestations in WorldCat (as opposed to the number of records) Cataloguers & scholars like C
  • 10. Manifestation Clustering So far 103 million records processed (about 30%)
  • 12. SRU Search: Loti Pêcheur d’islande (Work ID 21536567) Records Holdings Work 18 148 Content 14 143 Manifestation 7 115
  • 13. Multilingual Bibliographic Structure Project Objective: Improve displays; surface translations
  • 14. Multilingual Bibliographic Structure Project Creates true multi-lingual displays – At work and manifestation levels – Using all available data instead of “most appropriate record” – Generates data Corrects many of the 28 million records coded “und” Better control and linking of translations Input to refinement of work clusters Smarter data storage
  • 15. “Most appropriate” questioned • Worldcat.org selects the most appropriate record to show to a user as representative of the work in the short result list and beyond • The end result will not be very satisfactory from a multi-lingual viewpoint… here’s why
  • 16. Which record is better to present to a German speaker?
  • 19. Most appropriate display Build the display from all available data
  • 20. Multilingual Bibliographic Structure Project • Work level data, mined from all associated bibliographic records will be displayed supplemented with expression / manifestation level data as the user drills through the short to fuller versions of the metadata. End user interface will show works and manifestations not bibliographic records; the cataloguing client will also show bibliographic records
  • 21. Proposed new architecture jpn Work eng fre ger jpn Manif eng Manif eng Manif eng Manif eng Manif Manif eng engA oN oftrees Contents ++ Holding Holding Holding Subj Classif Holding Subj sif eng fre ger jpn Authors Authors eng Authors fre ger eng fre ger jpn fre eng ger jpn Translations (Language of work) Manif fre Holding
  • 22. Important principles • Language tagging of elements, particularly – Summaries (M21 520) – Subject headings • Display in script preferred by the user if data is available • Improve translated interfaces • Show consolidated holdings as appropriate
  • 23.
  • 24.
  • 25.
  • 26.
  • 28. Great works are translated • The cream of the world’s cultural and knowledge heritage is shared by being translated • WorldCat contains many rich cataloguing records for these translations GOAL: Data mine the really good records to improve clustering, presentation, authority records and linked data
  • 29. Ιλιάδα The Iliad 紅樓夢 Dream of the Red Chamber ঘরে বাইরে The Home and the World زقاق المدق Midaq Alley Война и миръ War and Peace The Tale of Genji דער בעל-תשובה The Penitent સત્યના પ્રયોગો અથવા આત્મકથા 源 氏 物 語 The Story of My Experiments with Truth [Gandhi autobiography]
  • 30. Translations Leo Tolstoy: 32 languages Homer: 28 languages Rabindranath Tagore: 21 Isaac Bashevis Singer: 17 Najīb Maḥfūẓ: 12 languages Cao Xueqin: 9 languages Mahatma Gandhi: 7 languages Murasaki Shikabu: 7 languages
  • 31. Improving work clustering • Inconsistencies cause work clusters to be incomplete resulting in less than optimal search results – Titles without subtitles – Missing or different forms of uniform title – Inverted title – Different coding of original and translated information Generated uniform title authority records will overcome most of these differences without needing to edit individual records
  • 32. Addition of xR records to VIAF Before After
  • 34.
  • 35. XR VIAF Record VIAF ID for Author Translated title Translator
  • 36.
  • 37.
  • 38.
  • 39. IFLA - Lyon, France 19 August 2014 VIAF Linked Data New Information
  • 40. Title: 西遊記 Language: Chinese Author: 吳承恩 Created: 1592 HasTranslation: Title: Journey to the West Language: English Translator: Anthony C. Yu Date: 1977 IsTranslationOf: Title: Journey to the West Language: English Translator: W. J. F. Jenner Date: 1982-1984 IsTranslationOf: Title: Tây du ký bình khảo Language: Vietnamese Translator: Phan Quân Date: 1980 IsTranslationOf: Title: Monkeys Pilgerfahrt Language: German Translator: Georgette Boner Date: 1983 IsTranslationOf: Title: 西遊記 Language: Japanese Translator: 中野美代子 Date: 1986 IsTranslationOf:
  • 41. Markup for the Semantic Web # Original Work (in Chinese) <http://worldcat.org/entity/work/id/1215997> a schema:CreativeWork; schema:creator <http://viaf.org/viaf/102266649> ; # "Gao, Xingjian” schema:inLanguage "zh"; schema:name "靈山"@zh; . # Translated Work (in English) <http://worldcat.org/entity/work/id/145209748> a schema:CreativeWork; schema:creator <http://viaf.org/viaf/102266649> ; # "Gao, Xingjian“ [new]:translator <http://viaf.org/viaf/81663420> ; # "Lee, Mabel" schema:inLanguage "en"; schema:name "Soul Mountain"@en ; [new]:translationOfWork <http://worldcat.org/entity/work/id/1215997> “
  • 42. Understanding information sharing across cultures • What percentage of non-English works are translations of English works, and vice-versa? • Which authors are translated the most? • Which works have been translated into the most languages? • Which countries translate the most English works, the most non-English works? • Which countries translate a new work the fastest? Etc. http://www.oclc.org/research/activities/multilingual-bib-structure.html
  • 43. Where are we now? Clustering • Work clusters done; ongoing refinement • GLIMIR clustering done for all [simple] text; – 103 million records have GLIMIR IDs • Working on collected works Displays • Working on VIAF expression displays • Work level displays in WorldCat.org ++ Data Mining for translations
  • 44. Janifer Gatenby EMEA Program Manager Metadata Janifer.gatenby@oclc.orgoclc.org Explore. Share. Magnify.

Notas del editor

  1. This project seeks to leverage the multilingual content of WorldCat. There are more than 300 million records in WorldCat today representing holdings of the world’s libraries. There are also another 200 million articles. Of the 300 million, more than half are in languages other than English.
  2. The 300 million records are clustered into works. The records may be in just one language of cataloguing or they may have a mixture of language of cataloguing, e.g. subject heading in more than one language. Parallel records may exist, i.e. records in different languages of cataloguing describing the same resource.
  3. The existing architecture is bibliographic record centric. There may be links to author and subject authority records. The work, manifestation and content clusters only contain identifiers, no metadata at present.
  4. Three complementary initiatives are in progress concerning multi-lingualism and WorldCat.
  5. The first project – generation of metadata at work level to make better presentations. Implementation Q1 2015.
  6. The GLIMIR project creates clusters of parallel records for the same manifestation (manifestation cluster) and also clusters for the same content, though the form may be different (print, microform, digital). We are in the process of trawling thorugh the database to make these clusters. Only the part in blue will be changed by the multi-lingual bibliographic structure approach
  7. Image from Connexion client that shows the short list selection benefing from GLIMIR. So far 103 million records have GLIMIR Ids.
  8. Different records in different languages of cataloguing are clustered. The « D »in the last line indicates an LC record. The manifestation cluster includes different records in different languages of cataloguing for the same manifestation. It also includes apparent duplicates that the de-duplication program was not able to confidently merge.
  9. Searching with SRU it is possible to search on workID, Content ID (Reproduction1) and manifestation ID. During the development of GLIMIR it was determined that it would be possible in some cases to identify equal content and thus present originals, micorform,re-prints, and electronic editions clustered together.
  10. Concerning correcting records coded as “und” = “undetermined”, we expect that we can correct about 7 million by searching for the same title string in other records.
  11. Currently when the short list and full displays are created, the system selects the most appropriate record for display. The most appropriate record is determined by its size and the number of associated holdings and it was envisaged to extend this to include the language of cataloguing. But we think we can do much better by questioning the “most appropriate record” concept.
  12. Here the first record is catalogued in German but has no significant German content – no subject headings and no notes. There are other richer records that could inform the user better.
  13. Hyrid records also mean that significant linguistic information may be buried in other records.
  14. This record has subject headings in 3 languages.
  15. Build the information to display to users from all available records and show them all relevant holdings. Do not display just one selected record from the work set. Cataloguers too will benefit from this, being able to drill down to actual records where appropriate.
  16. This is the theoretical bibliographic structure that will no longer be bibliographic record centric but work centric. All information – authors, notes, summaries, subject headings will be flagged with the language of cataloguing.
  17. The work level metadata will be tagged with language for each data element. Instead of always showing the data from the main title in the record, the alternative script fields may be chosen for display, depending on what the system can determine about the user, e.g. from IP address range or expressed preference. Worldcat.org already includes the ability to change language of display, but the numbers of fields that change will be enhanced and the tabulation of several displays will be improved. Consolidate holdings from all records that re applicable to a display (work, content, manifestation level)
  18. An Iceland Fisherman, original in French displayed in English UI with Translations table.
  19. Same record with Chinese interface
  20. The Grand Design; English in Italian UI with Translations table.
  21. Same in Japanese interface
  22. Instead of working directly with lesser quality records to improve the quality in WorldCat and instead of working with the long tail, we are turning our attention to the most important works and working ways to use the good records to improve the quality.
  23. And here’s one title written by each of those “cream” authors – in the original language and script. Likely you’ve heard of all or most of them? But probably in English (click). Acknowledgments: Karen Smith-Yoshimura
  24. Acknowledgment: Karen Smith-Yoshimura We have been focusing on the content that is most likely to be of interest to the most people – translations. The cream of the world’s cultural and knowledge heritage is shared by being translated and WorldCat contains many rich cataloguing records for these translations. The Virtual International Authority File, an aggregation of authority records from over 30 agencies worldwide, identifies 15 million unique persons. When we datamine WorldCat, only 7% have written works that have been translated into at least one other language. Only 7,000 have had their works translated into 10 or more languages. This is the “short head” of works that have the most impact on readers worldwide.
  25. Translated titles not always consistent, causing work grouping failure.  Sometimes: caused by titles without sub titles, caused by different forms of uniform title, i.e. in Gujarati and in English (several forms) caused by inverting the titles, by placing the name Gandhi before “Autobiography”.  Some figures:  French – 15 records, 6 work sets; German -  9 records, 9 work sets; Italian 7 records, 5 work sets; Spanish 8 records, 4 work sets
  26. We have been datamining WorldCat to generate work-translation ("expression level") records—including the translated title and translator with links to the original work and the author—and adding them to VIAF (Virtual International Authority File), flagged as "xR". Here is a concrete example of the impact – Jane Austen’s Pride and Prejudice originally had 13 translations associated with it; after we generated the xR records for other translations as datamined from WorldCat the number increased to 50. So far we have added 2 million xR records to VIAF. About the same as the UNESCO database after 25 years (!)
  27. UNESCO has also been interested in identifying and aggregating translations world-wide for an “international bibliography of translations” – they have been at it for over 80 years! Their database represents entries contributed from UNESCO members since 1979 and contains 2 million entries.
  28. Yet when we compare the results for a specific work, like Anatole France’s Crime de Silvestre Bonnard, we can see that the contributions OCLC member libraries have made, through WorldCat, is far greater that all the work invested so far by the UNESCO member states. VIAF, with the xR records datamined from libraries’ contributions to WorldCat supplementing those from VIAF contributors, lists 29 translations in 28 different languages.
  29. We are now working on enhancing the VIAF display to make the contents of title records more visible. This is a mockup of a VIAF Consolidated display – shows work with expressions summary. It shows the title as it is translated in each language.
  30. This is a mockup of the full expressions summary for the work Pêcheur d’Islande by Pierre Loti.
  31. This is a mockup of a VIAF Full display – showing different translations of the title, plus the translator and the earliest determined publication date of each.
  32. Acknowlegments: Karen Smith-Yoshimura & Dan Benson The relationship of a work (with an author) and its associated translations (with their respective translators) is relatively straight-forward. This diagram captures the relationship in two reciprocal links, one from the original Chinese book (blue box) through its property HasTranslation pointing (with blue arrows) to each translation (red boxes), and one from each translation back to the original Chinese work (red arrows) through the property IsTranslationOf . A work can have any number of translations, and there can be multiple translations into the same language which is why identifying the translator is so important.
  33. Acknowledgements: Karen Smith-Yoshimura Machines access VIAF far more than humans. To leverage all the work done by the OCLC cooperative we want to share the relationships we’ve established between original works and their associated translations with the semantic Web. Here is a sample markup of an original Chinese work written by Gao Xingjian, a Chinese Nobel Prize laureate for literature, and one of the translations of his work into English. We marked this up with schema.org but there are two new terms we are proposing, shown here: translator and translationOfWork.
  34. Acknowledgements: Karen Smith-Yoshimura This work will help us better understand the extent to which information is shared across cultures. Here are some questions we should be able to answer soon! More information on this project is available on the OCLC Research website at the link shown. Just think. How many of the 150+ million records that are for non English language works are actually translations of English language records. And how many of the English language records are also translations? It could be as high as 25%? Once we have these records generated, it will open many new possibilities. Also, we know we have 300 million records, but how many real resources do we have? GLIMIR will produce these figures. We are just starting…
  35. VIAF expression level displays will include GLIMIR Ids.