
Shorthouse - Authority Management of People Names Workshop

Presented at workshop, "Authority Management of People Names" organized by Elspeth Haston & Quentin Groom in Sofia, Bulgaria 12-13 March, 2018 in conjunction with the DiSSCo/MOBILISE kick-off meetings.


  1. 1. What are the Drivers? …and how do we intend to meaningfully respond to them? David P. Shorthouse, Canadian Museum of Nature; Agriculture & Agri-Food Canada (April 1)
  2. 2. “Is it possible that the lack of recognition in the academic assessment system of these forms of productivity has contributed to the diminished status—indeed even the near disappearance from many academic departments—of traditional systematics…” Collecting Curating Identifying Naming
  3. 3. Natural History Museums Desperately Want: Brand Awareness; Meaningful Measures of Impact
  4. 4. “…trust in an aggregator is not just a feature of the data signal quality provided by the sources to the aggregator, but also a consequence of the social design of the aggregation process and the resulting power balance between individual data contributors and aggregators.”
  5. 5. How Do We Fix This?
  6. 6. Recognition for taxonomist; recognition for host institution; recognition for taxonomists’ institution. Fully automated; quantifiable
  7. 7. Ingredients to Make This Happen: Newly digitized specimen; IRI; identifiedBy; institutionCode; ORCID: Ringgold, GRID; dateIdentified; ORCID: employment/education start/end date; GRBIO ?; not sameAs
  8. 8. Are There Other Drivers?
  9. 9. Newly digitized specimen IRI identifiedBy
  10. 10.
  11. 11. For the Deceased; For the Living
  12. 12. Cautionary Tale
  13. 13. Retrospective & Prospective Approaches
  14. 14. Retrospective Approach…Layers of Dirt • Strings to things • Parsing, e.g. Ruby gems Namae, DwcAgent • Entity extraction, e.g. Rosette, Watson Natural Language • Similarity scoring, e.g. R.D.M. Page <=> Roderic Page <=> Roderic D.M. Page • Search logic • Disambiguation • Co-author, co-collector networks • Collector codes • Hand-crafted heuristics, e.g. birth/death/collection dates, taxa, places
  15. 15. Prospective Approach…Clean Dirt: RDA / TDWG Metadata Standards for attribution of physical and digital collections stewardship. Chairs: Anne Thessen, Matt Woodburn, Dimitris Koureas
  16. 16. Final Recommendations:
  17. 17. What “Actions” Do We Care About? • authored • borrowed • catalogued • collected • conserved • contributed • created • curated • … • georeferenced • reviewed
  18. 18. Wishlist • Test suite for parsing lists of names: text file with expected JSON response • Charles R. Darwin Esq. [{"family": "Darwin", "given": "Charles R.", "title": "Esq."}] • leg. A. Chuvilin [{"family": "Chuvilin", "given": "A."}] • N. Navarro, G. Gómez y A Ferreira [{"family": "Navarro", "given": "N."}, {"family": "Gómez", "given": "G."}, {"family": "Ferreira", "given": "A"}]
  19. 19. Wishlist • Common, consistent way to handle search • Elasticsearch, Solr plugin • Services • Input: raw string of name(s), optional parameters • Output: parsed name, identifiers, likelihood score • Actions for inclusion in a DwC extension
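
The test suite wished for on slide 18 could be sketched roughly as follows. This is only an illustration, not something shown in the talk: the file layout (a raw name string on one line, its expected JSON array on the next), the function names `load_cases` and `run_suite`, and the deliberately naive `naive_parser` stand-in are all assumptions. The three cases are the ones from the slide, with the family name of the second case spelled as in its input string.

```python
import json
from typing import Callable, Dict, List, Tuple

ParsedName = Dict[str, str]
Parser = Callable[[str], List[ParsedName]]
Case = Tuple[str, List[ParsedName]]

# Assumed layout: raw string on one line, expected JSON array on the next.
SUITE = """
Charles R. Darwin Esq.
[{"family": "Darwin", "given": "Charles R.", "title": "Esq."}]
leg. A. Chuvilin
[{"family": "Chuvilin", "given": "A."}]
N. Navarro, G. Gómez y A Ferreira
[{"family": "Navarro", "given": "N."}, {"family": "Gómez", "given": "G."}, {"family": "Ferreira", "given": "A"}]
"""


def load_cases(text: str) -> List[Case]:
    """Pair each raw name string with the JSON array on the line after it."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 2:
        raise ValueError("every raw string needs an expected-JSON line")
    return [(lines[i], json.loads(lines[i + 1])) for i in range(0, len(lines), 2)]


def run_suite(parser: Parser, cases: List[Case]) -> List[Tuple[str, list, list]]:
    """Return (raw, expected, actual) for every case the parser gets wrong."""
    failures = []
    for raw, expected in cases:
        actual = parser(raw)
        if actual != expected:
            failures.append((raw, expected, actual))
    return failures


def naive_parser(raw: str) -> List[ParsedName]:
    """Hypothetical stand-in: treats the last token as the family name."""
    given, _, family = raw.rpartition(" ")
    return [{"family": family, "given": given}]


if __name__ == "__main__":
    for raw, expected, actual in run_suite(naive_parser, load_cases(SUITE)):
        print(f"FAIL {raw!r}: expected {expected}, got {actual}")
```

Any real parser (for example a thin wrapper around one of the gems named on slide 14) could be passed to `run_suite` in place of the stand-in, so the same text file can be shared across implementations.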