SlideShare a Scribd company logo
1 of 26
Download to read offline
Your spoken paper cannot be the same as your written paper Read more: Museums and the Web 2011 (MW2011): Presentation Guidelines | conference.archimuse.com
Computational Linguistics in Museums: Applications for Cultural Datasets Klavans Judith Susan Robert Chun Stein Guerra Raul
ComputationalLinguistics Language  - Words, Words, Words Use Meaning Syntax Shape of words Sounds
Applications Speech synthesis – 1980’s Talking Machines for the Blind Intelligent search – pre-google Finding names – who, what, where Translation Speech recognition Answering Questions – What is Watson?
Domains for Computational Linguistics Healthcare – interpreting patient records Government – helping people find information International Affairs – cross-language translation Law – analyzing Enron scandal email Marketing – Opinions on products Museums – analyzing text and tags associated with objects for better access
Computational  Linguistics for Metadata Building +
Computational Linguistics in Museums: Applications for Cultural Datasets Klavans Judith Susan Robert Chun Stein Guerra Raul
InterdisciplinaryResearch Computational Linguistics in Museums
Text, Tags, Trust Funded in 2008 by IMLS With the University of Maryland, and collaborative of museum partners Studying the relationships between social tags, scholarly text and resources, and the application of trust networks to improve access to museum collections.
MW 2011 Contributions		 Which Computational Linguistic tools can or should be applied to tags? How do these tools impact tag analysis? What results differ from the initial steve.museum results from Trant 2007? So what – for CL? So what – for Museums?
Hard  Challenges ,[object Object]
  How can tags be related to other tags? 		across languages 		across users ,[object Object]
   How can they be used?  ,[object Object]
Gallery Label This canvas was the first one Gauguin painted during the two months he spent in Provence.... Gauguin had rebelled against Impressionism's reliance on the visible world, and he altered nature's shapes and colors to suggest his own more subjective reaction to the landscape. While the rural subject and acidic colors show the influence of van Gogh, this image is more indebted to Paul Cézanne. In his careful integration of the haystack and farm buildings, Gauguin has echoed Cézanne's emphasis on geometric form.
Tools for Tags Morphological Analysis – Conflate when possible Cats, cat Haystacks, haystack Painting, paint ? What words are verbs, nouns, adjectives? How should multi-word tags be handled?
Raw Tags or Tokens
Results		 25%  93%  68%
1. NN=25205 2. JJ=6319 3. NNS=4041 4. NN_NN=2257 5. JJ_NN=1792 6. VBG=1043 7. VBN=727 8. NP=708 9. OD_NN=454 10. JJ_NNS=413
Top 10 POS Patterns: 1. NN=6706 2. NN_NN=1713 3. JJ_NN=1194 4. JJ=921 5. NNS=757 6. JJ_NNS=303 7. NN_NNS=300 8. VBG=238 9. NP=209 10. VBN_NN=202
Hard  Challenges ,[object Object]
  How can tags be related to other tags? 		across languages 		across users ,[object Object]
   How can they be used?  ,[object Object]
Irecursor to parsing.
   However, for social tags, parsing is not a meaningful step.  Research: ,[object Object]
  Link part of speech information with other lexical resources for disambiguation,[object Object]
What About “New England” Idioms / lexicalized phrases are more difficult Heuristic comparison to Wikipedia Titles matched 46% (30% distinct) of multiword tags E.g. “Grapes of Wrath”, “Irish Wolfhound”, “Franco-Prussian War” *Klavans and Golbeck, 2010

More Related Content

Similar to MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications for Cultural Datasets

An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsTye Rausch
 
Literacy Integration Presentation
Literacy Integration PresentationLiteracy Integration Presentation
Literacy Integration PresentationNAFCareerAcads
 
Antropologia/anthropology
Antropologia/anthropologyAntropologia/anthropology
Antropologia/anthropologyWilmer Carrion
 
Ounl Celstec Presentation
Ounl Celstec PresentationOunl Celstec Presentation
Ounl Celstec PresentationRiina Vuorikari
 
Vuorikari Multilingual Tagging behaviour by teachers
Vuorikari Multilingual Tagging behaviour by teachersVuorikari Multilingual Tagging behaviour by teachers
Vuorikari Multilingual Tagging behaviour by teachersRiina Vuorikari
 
MacroMicroZoom.pdf
MacroMicroZoom.pdfMacroMicroZoom.pdf
MacroMicroZoom.pdfMartin Wynne
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language ProcessingMichel Bruley
 
Graphic literacies for a digital age the survival of layout
Graphic literacies for a digital age the survival of layoutGraphic literacies for a digital age the survival of layout
Graphic literacies for a digital age the survival of layoutAsliza Hamzah
 
Technologies and englishes
Technologies and englishesTechnologies and englishes
Technologies and englishesTariq Usman
 
Reading Street
Reading StreetReading Street
Reading Streetcavalcic
 
Finding and Citing Online Images & Sources
Finding and Citing Online Images & SourcesFinding and Citing Online Images & Sources
Finding and Citing Online Images & SourcesWendy DeGroat
 
Exploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic EnlightenmentExploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic EnlightenmentMartin Wynne
 
Animal Essay.pdf
Animal Essay.pdfAnimal Essay.pdf
Animal Essay.pdfAmi Hall
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Toby Burrows
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with PythonBenjamin Bengfort
 
eMargin Presentation given to Skills Funding Agency
eMargin Presentation given to Skills Funding AgencyeMargin Presentation given to Skills Funding Agency
eMargin Presentation given to Skills Funding AgencyRDUES
 
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...Bradley Allen
 
Vocabulary 2010 rubena
Vocabulary 2010 rubenaVocabulary 2010 rubena
Vocabulary 2010 rubenaDaf
 

Similar to MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications for Cultural Datasets (20)

An Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical SemanticsAn Outline Of Type-Theoretical Approaches To Lexical Semantics
An Outline Of Type-Theoretical Approaches To Lexical Semantics
 
Literacy Integration Presentation
Literacy Integration PresentationLiteracy Integration Presentation
Literacy Integration Presentation
 
Diachronic Analysis
Diachronic AnalysisDiachronic Analysis
Diachronic Analysis
 
Antropologia/anthropology
Antropologia/anthropologyAntropologia/anthropology
Antropologia/anthropology
 
Ounl Celstec Presentation
Ounl Celstec PresentationOunl Celstec Presentation
Ounl Celstec Presentation
 
Vuorikari Multilingual Tagging behaviour by teachers
Vuorikari Multilingual Tagging behaviour by teachersVuorikari Multilingual Tagging behaviour by teachers
Vuorikari Multilingual Tagging behaviour by teachers
 
MacroMicroZoom.pdf
MacroMicroZoom.pdfMacroMicroZoom.pdf
MacroMicroZoom.pdf
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
 
Graphic literacies for a digital age the survival of layout
Graphic literacies for a digital age the survival of layoutGraphic literacies for a digital age the survival of layout
Graphic literacies for a digital age the survival of layout
 
Technologies and englishes
Technologies and englishesTechnologies and englishes
Technologies and englishes
 
Reading Street
Reading StreetReading Street
Reading Street
 
Finding and Citing Online Images & Sources
Finding and Citing Online Images & SourcesFinding and Citing Online Images & Sources
Finding and Citing Online Images & Sources
 
Exploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic EnlightenmentExploring rhetoric in the Electronic Enlightenment
Exploring rhetoric in the Electronic Enlightenment
 
Class14
Class14Class14
Class14
 
Animal Essay.pdf
Animal Essay.pdfAnimal Essay.pdf
Animal Essay.pdf
 
Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...Ontologies and the humanities: some issues affecting the design of digital in...
Ontologies and the humanities: some issues affecting the design of digital in...
 
Natural Language Processing with Python
Natural Language Processing with PythonNatural Language Processing with Python
Natural Language Processing with Python
 
eMargin Presentation given to Skills Funding Agency
eMargin Presentation given to Skills Funding AgencyeMargin Presentation given to Skills Funding Agency
eMargin Presentation given to Skills Funding Agency
 
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
Faceted Navigation of User-Generated Metadata (Calit2 Rescue Seminar Series 2...
 
Vocabulary 2010 rubena
Vocabulary 2010 rubenaVocabulary 2010 rubena
Vocabulary 2010 rubena
 

More from museums and the web

How to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting SiuHow to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting Siumuseums and the web
 
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...museums and the web
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museummuseums and the web
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...museums and the web
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...museums and the web
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...museums and the web
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guidemuseums and the web
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...museums and the web
 
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor TrackingMW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Trackingmuseums and the web
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...museums and the web
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...museums and the web
 
MW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data SculptingMW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data Sculptingmuseums and the web
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...museums and the web
 
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...museums and the web
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...museums and the web
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network museums and the web
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...museums and the web
 
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...museums and the web
 
MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...
MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...
MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...museums and the web
 

More from museums and the web (20)

How to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting SiuHow to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting Siu
 
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museum
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
 
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor TrackingMW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
 
MW2011 Best of the Web Awards
MW2011 Best of the Web AwardsMW2011 Best of the Web Awards
MW2011 Best of the Web Awards
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
 
MW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data SculptingMW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data Sculpting
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
 
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
 
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
 
MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...
MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...
MW2010: E. Bachta and R. Stein, Breaking the Bottleneck: Using Pseudo-Wikis t...
 

Recently uploaded

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...Karmanjay Verma
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Mark Simos
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...amber724300
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...BookNet Canada
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFMichael Gough
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentMahmoud Rabie
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 

Recently uploaded (20)

[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDF
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career Development
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 

MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications for Cultural Datasets

  • 1. Your spoken paper cannot be the same as your written paper Read more: Museums and the Web 2011 (MW2011): Presentation Guidelines | conference.archimuse.com
  • 2. Computational Linguistics in Museums: Applications for Cultural Datasets Klavans Judith Susan Robert Chun Stein Guerra Raul
  • 3. ComputationalLinguistics Language - Words, Words, Words Use Meaning Syntax Shape of words Sounds
  • 4. Applications Speech synthesis – 1980’s Talking Machines for the Blind Intelligent search – pre-google Finding names – who, what, where Translation Speech recognition Answering Questions – What is Watson?
  • 5. Domains for Computational Linguistics Healthcare – interpreting patient records Government – helping people find information International Affairs – cross-language translation Law – analyzing Enron scandal email Marketing – Opinions on products Museums – analyzing text and tags associated with objects for better access
  • 6. Computational Linguistics for Metadata Building +
  • 7. Computational Linguistics in Museums: Applications for Cultural Datasets Klavans Judith Susan Robert Chun Stein Guerra Raul
  • 9. Text, Tags, Trust Funded in 2008 by IMLS With the University of Maryland, and collaborative of museum partners Studying the relationships between social tags, scholarly text and resources, and the application of trust networks to improve access to museum collections.
  • 10. MW 2011 Contributions Which Computational Linguistic tools can or should be applied to tags? How do these tools impact tag analysis? What results differ from the initial steve.museum results from Trant 2007? So what – for CL? So what – for Museums?
  • 11.
  • 12.
  • 13.
  • 14. Gallery Label This canvas was the first one Gauguin painted during the two months he spent in Provence.... Gauguin had rebelled against Impressionism's reliance on the visible world, and he altered nature's shapes and colors to suggest his own more subjective reaction to the landscape. While the rural subject and acidic colors show the influence of van Gogh, this image is more indebted to Paul Cézanne. In his careful integration of the haystack and farm buildings, Gauguin has echoed Cézanne's emphasis on geometric form.
  • 15. Tools for Tags Morphological Analysis – Conflate when possible Cats, cat Haystacks, haystack Painting, paint ? What words are verbs, nouns, adjectives? How should multi-word tags be handled?
  • 16. Raw Tags or Tokens
  • 17. Results 25% 93% 68%
  • 18. 1. NN=25205 2. JJ=6319 3. NNS=4041 4. NN_NN=2257 5. JJ_NN=1792 6. VBG=1043 7. VBN=727 8. NP=708 9. OD_NN=454 10. JJ_NNS=413
  • 19. Top 10 POS Patterns: 1. NN=6706 2. NN_NN=1713 3. JJ_NN=1194 4. JJ=921 5. NNS=757 6. JJ_NNS=303 7. NN_NNS=300 8. VBG=238 9. NP=209 10. VBN_NN=202
  • 20.
  • 21.
  • 22.
  • 24.
  • 25.
  • 26. What About “New England” Idioms / lexicalized phrases are more difficult Heuristic comparison to Wikipedia Titles matched 46% (30% distinct) of multiword tags E.g. “Grapes of Wrath”, “Irish Wolfhound”, “Franco-Prussian War” *Klavans and Golbeck, 2010
  • 27. Wish List - Better ways to tame the proliferation of rich but “noisy” content Clustering over tags for similarity Clustering over tags and terms from text Matching over existing terms to identify meaningful units Apply machine learning techniques to guess meaning Bigrams, Trigram, Thesauri, Corpus Analysis
  • 28. Acknowledgements Steve.museum project members T3 and steve.museum museum partners University of Maryland, T3 group IMA Museum ……and other participants

Editor's Notes

  1. Take this seriously.
  2. IN presenting this paper, start with something not in the paper.
  3. Still need to finish
  4. Words,words, words.