Named Entity Recognition - ACL 2011 Presentation

Richard Littauer

Given for the Multiw

The Web is not a PERSON, Berners-Lee is not an ORGANIZATION, and African-Americans are not LOCATIONS: An Analysis of the Performance of Named-Entity Recognition
Robert Krovetz (Lexicalresearch.com), Paul Deane, Nitin Madnani (ETS)

A Review by Richard Littauer (UdS)

The Background
• Named-Entity Recognition (NER) is normally judged in the context of Information Extraction (IE)
• Various competitions
• Recently:
◦ non-English languages
◦ improving unsupervised learning methods

The Background
• “There are no well-established standards for evaluation of NER.”
◦ Criteria for NER systems change between competitions
◦ Proprietary software

The Background
• KDM (Krovetz, Deane and Madnani) wanted to identify multiword expressions (MWEs)…
… but false positives and tagging inconsistencies stopped this.

• IE derives Recall and Precision from Information Retrieval
• NER is just a small part of this, so it is rarely evaluated independently
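
As a rough illustration of how Precision and Recall carry over from Information Retrieval to NER, the sketch below scores a set of predicted entities against a gold set. The (start, end, label) span representation, the function name, and the example spans are assumptions made for this illustration; they are not taken from the paper.

```python
def precision_recall(gold, predicted):
    """Return (precision, recall) for two collections of (start, end, label) spans."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)  # same span and same label
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical spans: token offsets and labels are invented for illustration.
gold = {(0, 2, "PERSON"), (5, 6, "LOCATION"), (9, 11, "ORGANIZATION")}
predicted = {
    (0, 2, "PERSON"),
    (5, 6, "ORGANIZATION"),   # right span, wrong label
    (9, 11, "ORGANIZATION"),
    (13, 14, "PERSON"),       # false positive
}

p, r = precision_recall(gold, predicted)
print(f"precision={p:.2f} recall={r:.2f}")
```

Exact span-and-label matching is only one possible scoring criterion; competitions differ on this, which is part of the standards problem the slides describe.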

The Background
• So, they want to test NER systems, and provide a unit test based on the problems encountered

Evaluation
Compared three NER taggers:
• Stanford:
◦ CRF, 100m training corpus;
• University of Illinois (LBJ):
◦ Regularized averaged perceptron, Reuters 1996 News Corpus;
• BBN IdentiFinder (IdentiFinder):
◦ HMMs, commercial

Evaluation
• Agreement on Classification
• Ambiguity in Discourse

• Stanford vs. LBJ on internal ETS 425m corpus
• All three on American National Corpus
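
A minimal sketch of what "agreement on classification" between two taggers could look like in code. It assumes each tagger's output has already been reduced to a dict from entity string to label; the function name and the example outputs are hypothetical and do not reproduce any tagger's real behaviour.

```python
from collections import Counter


def agreement(tags_a, tags_b):
    """Compare two taggers' labels for the entity strings both of them found.

    Returns (n_shared, n_agreed, confusions), where confusions counts the
    (label_a, label_b) pairs on which the taggers disagree.
    """
    shared = tags_a.keys() & tags_b.keys()
    n_agreed = 0
    confusions = Counter()
    for entity in shared:
        if tags_a[entity] == tags_b[entity]:
            n_agreed += 1
        else:
            confusions[(tags_a[entity], tags_b[entity])] += 1
    return len(shared), n_agreed, confusions


# Invented outputs for two hypothetical taggers.
tagger_a = {"Berners-Lee": "PERSON", "Amherst": "LOCATION",
            "ETS": "ORGANIZATION", "Reuters": "ORGANIZATION"}
tagger_b = {"Berners-Lee": "ORGANIZATION", "Amherst": "LOCATION",
            "ETS": "ORGANIZATION", "Clinton": "PERSON"}

n_shared, n_agreed, confusions = agreement(tagger_a, tagger_b)
print(n_shared, n_agreed, dict(confusions))
```

Entities found by only one tagger are left out of the comparison here; counting them separately would show the difference in overall entity counts.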

Stanford vs. LBJ
• NER reported as 85-95% accurate.
• Similar number of entities for both: 1.95m for Stanford, 1.8m for LBJ (7.6% difference)
• However, errors:

Stanford vs. LBJ vs. IdentiFinder
• Agreement:

Stanford vs. LBJ vs. IdentiFinder
• Differences:
◦ How they are tokenized
◦ Number of entities recognized overall

Stanford vs. LBJ vs. IdentiFinder
• Ambiguity:

Unit Test
• Created two documents that can be used as tests
◦ Different cases for true positives of PERSON, LOCATION, ORGANIZATION
◦ Entirely upper-case terms that are not NEs (e.g. AAARGH)
◦ Punctuated terms that are not NEs
◦ Terms with initials
◦ Acronyms (some expanded, some not)
◦ Last names in close proximity to first names
◦ Terms with prepositions (Mass. Inst. of Tech.)
◦ Terms with location and organization (Amherst College)

• Provided freely online.
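
The idea of a unit test for taggers can be sketched as a small harness. Everything here is an assumption for illustration: the tagger interface (a callable returning a dict from term to label), the two test sentences, and the deliberately naive stand-in tagger. The real test documents are the ones the authors provide online.

```python
def run_unit_test(tagger, cases):
    """Return the cases where the tagger's label for `term` differs from `expected`.

    tagger: callable mapping a text to a dict {term: label}.
    cases: list of (text, term, expected); expected is None when the
           term should not be tagged as a named entity at all.
    """
    failures = []
    for text, term, expected in cases:
        actual = tagger(text).get(term)
        if actual != expected:
            failures.append((term, expected, actual))
    return failures


def naive_tagger(text):
    """Toy stand-in: labels every all-upper-case token as ORGANIZATION."""
    tokens = [t.strip(".,!?") for t in text.split()]
    return {t: "ORGANIZATION" for t in tokens if len(t) > 1 and t.isupper()}


# Hypothetical cases modelled on two of the categories listed above.
CASES = [
    ("He joined ETS last year.", "ETS", "ORGANIZATION"),  # acronym, true positive
    ("AAARGH, not again!", "AAARGH", None),                # upper case, not an NE
]

failures = run_unit_test(naive_tagger, CASES)
print(failures)
```

The naive tagger passes the acronym case and fails the AAARGH case, which is the kind of false positive the unit test is meant to expose.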

One NE Tag per Discourse
• Unusual for multiple occurrences of a token in a document to be different entities
• True for homonyms
• An exception: Location + sports team

One NE Tag per Discourse
• Stanford, LBJ have features for non-local dependencies (NLD) to help with this.
• KDM: Two other uses for NLD:
◦ Source of error in evaluation
◦ A way to identify semantically related entities

• These should be treated as exceptions
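
One way to check the one-tag-per-discourse assumption on a tagged document is to collect the labels each token received and report tokens with more than one. This is a sketch under the assumption that a document is available as a list of (token, label) pairs; the function name and the example document are invented.

```python
from collections import Counter, defaultdict


def conflicting_tags(tagged_tokens):
    """Return {token: Counter of labels} for tokens given more than one label.

    tagged_tokens: iterable of (token, label) pairs from a single document.
    """
    labels = defaultdict(Counter)
    for token, label in tagged_tokens:
        labels[token][label] += 1
    return {tok: counts for tok, counts in labels.items() if len(counts) > 1}


# Hypothetical tagger output for one document.
document = [
    ("Clinton", "PERSON"),
    ("Clinton", "LOCATION"),
    ("Clinton", "PERSON"),
    ("Arkansas", "LOCATION"),
]

conflicts = conflicting_tags(document)
for token, counts in conflicts.items():
    majority_label = counts.most_common(1)[0][0]
    print(token, dict(counts), "majority:", majority_label)
```

Picking the majority label is only one possible way to resolve a conflict, and it would be wrong for genuine exceptions such as a location name that also refers to a sports team.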

Discussion
• There are guidelines for NER, but we need standards.
• The community should focus on PERSON, ORGANISATION, LOCATION, and MISC.
◦ Harder to deal with than Dates, Times.
◦ Disagreement between taggers.
◦ MISC is necessary.
◦ These have important value elsewhere.

Discussion
• To improve intrinsic evaluation for NER:
1. Create test sets for diverse domains.
2. Use standardized sets for different phenomena.
3. Report accuracy for PERSON, ORGANISATION, and LOCATION (POL) separately.
4. Establish uncertainty in the tagging system.
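
Point 3 can be illustrated with a short sketch that reports accuracy per class instead of a single overall figure. It assumes two aligned label sequences (gold and predicted) for the same entities; the labels below are invented to show how one number can hide per-class differences.

```python
from collections import defaultdict


def per_class_accuracy(gold_labels, predicted_labels):
    """Accuracy per gold class for two aligned label sequences."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for gold, pred in zip(gold_labels, predicted_labels):
        total[gold] += 1
        if gold == pred:
            correct[gold] += 1
    return {label: correct[label] / total[label] for label in total}


# Hypothetical gold and predicted labels for eight entities.
gold = ["PERSON", "PERSON", "ORGANIZATION", "ORGANIZATION",
        "LOCATION", "LOCATION", "LOCATION", "LOCATION"]
pred = ["PERSON", "ORGANIZATION", "ORGANIZATION", "LOCATION",
        "LOCATION", "LOCATION", "LOCATION", "LOCATION"]

by_class = per_class_accuracy(gold, pred)
overall = sum(g == p for g, p in zip(gold, pred)) / len(gold)
print(by_class, "overall:", overall)
```

In this made-up example the overall accuracy is 0.75, while PERSON and ORGANIZATION are each at 0.5 and LOCATION is at 1.0.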

Conclusion
• 90% accuracy not real.
• We need to use only entities that are agreed on by multiple taggers.
• Even in cases where they both disagree (Hint: Future work.)

• Unit test downloadable.
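
Keeping only the entities that multiple taggers agree on could be sketched as a simple consensus filter. The dict-per-tagger representation, the function name, and the example outputs are assumptions for illustration, not the authors' procedure.

```python
def consensus_entities(*tagger_outputs):
    """Keep entities that every tagger found and labelled identically.

    Each argument is a dict {entity: label} from one tagger.
    """
    if not tagger_outputs:
        return {}
    first, *rest = tagger_outputs
    return {
        entity: label
        for entity, label in first.items()
        if all(other.get(entity) == label for other in rest)
    }


# Invented outputs for three hypothetical taggers.
out_a = {"Berners-Lee": "PERSON", "Amherst College": "ORGANIZATION", "Web": "PERSON"}
out_b = {"Berners-Lee": "PERSON", "Amherst College": "LOCATION", "Web": "PERSON"}
out_c = {"Berners-Lee": "PERSON", "Amherst College": "ORGANIZATION"}

agreed = consensus_entities(out_a, out_b, out_c)
print(agreed)
```

Agreement is not the same as correctness: had the third tagger also labelled "Web" as PERSON, the filter would have kept that error.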

Cheers/PERSON

Richard/ORGANISATION thanks the Mword Class/LOCATION for listening to his talk about Berners-Lee/MISC

Editor's notes

NER: The aim is to recognize and classify different types of entities (names, organizations, locations, dates, etc.)
Not sure why they focused on competitions, to be honest. But they mention the Message Understanding Conference, and CoNLL.
They give two possible reasons for this:
Part of the problem is that
No Gold Standards for any of these. So, they compared on two levels
How well do they work on PERSON, ORGANIZATION, and LOCATION? How much do they agree? What mistakes do they make?
How frequently does each tagger produce multiple classifications for the same entity in a single document? Clinton as a person, and place, for instance.
ANC tagged for IdentiFinder already.
However, this was often not consistent
IdentiFinder tagged many more entities as ORGANISATION than the others. It also uses an extra class, Geo-Political Entity.
Existing taggers treat the non-local dependencies as a way of dealing with the sparse data problem, and as a way to resolve tagging differences by looking at how often one token is classified as one type versus another.
1. They didn’t do this. 2. And actually use them, not just one of them. 3. Report accuracy rates separately for the three major classes. Accuracy rates should be further broken down according to the items in the unit test that are designed to assess mistakes: orthography, acronym processing, frequent false positives, and knowledge-based classification. They go on to say that ANC is doing it right, but is too small, hence their ETS corpus.