Named Entity Recognition - ACL 2011 Presentation

Richard Littauer

Given for the Multiw

The Web is not a PERSON, Berners-Lee is not an ORGANIZATION, and
African-Americans are not LOCATIONS:
An Analysis of the Performance of Named-Entity Recognition
Robert Krovetz (Lexicalresearch.com), Paul Deane, Nitin Madnani (ETS)

A Review by Richard Littauer (UdS)

The Background
• Named-Entity Recognition (NER) is normally judged in the context of Information Extraction (IE)
• Various competitions
• Recently:
◦ non-English languages
◦ improving unsupervised learning methods

The Background
• “There are no well-established standards for evaluation of NER.”
◦ Criteria for NER systems change from competition to competition
◦ Proprietary software

The Background
• KDM (Krovetz, Deane, Madnani) wanted to identify multiword expressions (MWEs)…
… but false positives and tagging inconsistencies stopped this.

• IE derives Recall and Precision from Information Retrieval
• NER is just a small part of IE, so it is rarely evaluated independently
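
As a minimal sketch of the two Information Retrieval measures named above: the function name, the (start, end, label) span format, and the sample data are all assumptions made for illustration, and the sketch presumes a gold annotation exists, which the talk notes is often not the case.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) for two collections of entity spans."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Precision: share of predicted entities that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of gold entities that were found.
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Invented example data: spans are (start_token, end_token, label).
gold = {(0, 2, "PERSON"), (5, 6, "LOCATION"), (9, 11, "ORGANIZATION")}
predicted = {(0, 2, "PERSON"), (5, 6, "ORGANIZATION")}
p, r = precision_recall(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f}")
```

A span counts as correct here only if both its boundaries and its label match; looser matching criteria are possible and would give different numbers.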

The Background
• So, they want to test NER systems, and provide a unit test based on the problems encountered

Evaluation
Compared three NER taggers:
• Stanford:
◦ CRF, 100m training corpus;
• University of Illinois (LBJ):
◦ Regularized averaged perceptron, Reuters 1996 News Corpus;
• BBN IdentiFinder (IdentiFinder):
◦ HMMs, commercial

Evaluation
• Agreement on Classification
• Ambiguity in Discourse

• Stanford vs. LBJ on internal ETS 425m corpus
• All three on American National Corpus
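
One way the "agreement on classification" comparison could be sketched, with no gold standard needed: take the entities that two taggers both recognized and check how often they assign the same class. The dictionary format, function name, and sample outputs below are hypothetical and are not results from the paper.

```python
from collections import Counter


def classification_agreement(tags_a, tags_b):
    """Compare two taggers on the entities that both of them recognized.

    tags_a, tags_b: dicts mapping an entity string to its predicted label.
    Returns (agreement_rate, disagreements), where disagreements counts
    each differing (label_a, label_b) pair.
    """
    shared = tags_a.keys() & tags_b.keys()
    disagreements = Counter(
        (tags_a[e], tags_b[e]) for e in shared if tags_a[e] != tags_b[e]
    )
    agreed = len(shared) - sum(disagreements.values())
    rate = agreed / len(shared) if shared else 0.0
    return rate, disagreements


# Made-up tagger outputs for illustration only.
tagger_a = {"Berners-Lee": "PERSON", "Amherst": "LOCATION", "ETS": "ORGANIZATION"}
tagger_b = {"Berners-Lee": "ORGANIZATION", "Amherst": "LOCATION", "Web": "PERSON"}
rate, diffs = classification_agreement(tagger_a, tagger_b)
print(rate, dict(diffs))
```

Entities found by only one tagger are ignored by this measure, so it would need to be reported alongside the overall entity counts.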

Stanford vs. LBJ
• NER reported as 85-95% accurate.
• Similar number of entities for both: 1.95m for Stanford, 1.8m for LBJ (7.6% difference)
• However, errors:

Stanford vs. LBJ vs. IdentiFinder
• Agreement:

Stanford vs. LBJ vs. IdentiFinder
• Differences:
◦ How they are tokenized
◦ Number of entities recognized overall

Stanford vs. LBJ vs. IdentiFinder
• Ambiguity:

Unit Test
• Created two documents that can be used as tests
◦ Different cases for true positives of PERSON, LOCATION, ORGANIZATION
◦ Entirely upper case not NE (Ex. AAARGH)
◦ Punctuated terms not NE
◦ Terms with Initials
◦ Acronyms (some expanded, some not)
◦ Last names in close proximity to first names

Unit Test
• Created two documents that can be used as tests
◦ Terms with prepositions (Mass. Inst. of Tech.)
◦ Terms with location and organization (Amherst College)

• Provided freely online.
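
The idea of a unit test for taggers might be sketched as a table of phrases with expected labels, run against any tagger. This is not the authors' test documents: the toy lexicon tagger, the case list, and the expected labels are assumptions invented here to show the shape of such a check.

```python
# Hypothetical stand-in for a real tagger: maps a phrase to a label or None.
TOY_LEXICON = {"Amherst College": "ORGANIZATION", "Amherst": "LOCATION"}


def toy_tagger(phrase):
    return TOY_LEXICON.get(phrase)


# (phrase, expected label); None means "should not be tagged as an entity".
# The expected labels are assumptions made for this sketch.
CASES = [
    ("AAARGH", None),                     # entirely upper case, not an NE
    ("Amherst College", "ORGANIZATION"),  # location name inside an organization
    ("Amherst", "LOCATION"),
]


def run_cases(tagger, cases):
    """Return the (phrase, expected, actual) triples that the tagger got wrong."""
    return [(p, exp, tagger(p)) for p, exp in cases if tagger(p) != exp]


failures = run_cases(toy_tagger, CASES)
print(failures)
```

Grouping the cases by phenomenon (case, punctuation, acronyms, and so on) would let failures be reported per category rather than as one overall score.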

One NE Tag per Discourse
• Unusual for multiple occurrences of a token in a document to be different entities
• True for homonyms
• An exception: Location + sports team
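
A rough sketch of how violations of "one tag per discourse" could be counted within a single document. The (token, label) input format and the function name are assumptions; the Clinton example echoes the speaker notes, but the labels are invented.

```python
from collections import defaultdict


def multiply_tagged(tagged_tokens):
    """Return {token: set_of_labels} for tokens given more than one label.

    tagged_tokens: iterable of (token, label) pairs from a single document.
    """
    labels = defaultdict(set)
    for token, label in tagged_tokens:
        labels[token].add(label)
    return {tok: labs for tok, labs in labels.items() if len(labs) > 1}


# Illustrative output of one tagger on one document.
doc = [
    ("Clinton", "PERSON"),
    ("Clinton", "LOCATION"),
    ("Paris", "LOCATION"),
    ("Paris", "LOCATION"),
]
ambiguous = multiply_tagged(doc)
print(ambiguous)
```

A flagged token is not necessarily an error, since genuine exceptions such as a location and its sports team exist; the count only marks candidates for inspection.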

One NE Tag per Discourse
• Stanford, LBJ have features for non-local dependencies (NLD) to help with this.
• KDM: Two other uses for NLD:
◦ Source of error in evaluation
◦ A way to identify semantically related entities

• These should be treated as exceptions

Discussion
• There are guidelines for NER, but we need standards.
• The community should focus on PERSON, ORGANIZATION, LOCATION, and MISC.
◦ Harder to deal with than Dates, Times.
◦ Disagreement between taggers.
◦ MISC is necessary.
◦ These have important value elsewhere.

Discussion
• To improve intrinsic evaluation for NER:
1. Create test sets for diverse domains.
2. Use standardized sets for different phenomena.
3. Report accuracy for PERSON, ORGANIZATION, and LOCATION (POL) separately.
4. Establish uncertainty in the tagging system.
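
Reporting accuracy separately per class, as the third recommendation suggests, could look roughly like the following. The aligned label lists and the function name are hypothetical, and the numbers are made up.

```python
from collections import Counter


def per_class_accuracy(gold_labels, predicted_labels):
    """Accuracy computed separately for each gold class."""
    total, correct = Counter(), Counter()
    for gold, pred in zip(gold_labels, predicted_labels):
        total[gold] += 1
        if gold == pred:
            correct[gold] += 1
    return {label: correct[label] / total[label] for label in total}


# Invented labels for five entities, aligned position by position.
gold = ["PERSON", "PERSON", "ORGANIZATION", "LOCATION", "LOCATION"]
pred = ["PERSON", "ORGANIZATION", "ORGANIZATION", "LOCATION", "PERSON"]
scores = per_class_accuracy(gold, pred)
print(scores)
```

In this toy case the overall accuracy is 60%, which hides the fact that one class is tagged perfectly while the other two are right only half the time.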

Conclusion
• 90% accuracy not real.
• We need to use only entities that are agreed on by multiple taggers.
• Even in cases where they both disagree (Hint: Future work.)

• Unit test downloadable.
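
Keeping only the entities that multiple taggers agree on might be sketched as a simple filter. The dictionary representation and the function name are assumptions made for this example, not part of the paper.

```python
def agreed_entities(*tagger_outputs):
    """Keep entities that every tagger recognized with the same label.

    Each argument is a dict mapping an entity string to a predicted label.
    """
    if not tagger_outputs:
        return {}
    first, rest = tagger_outputs[0], tagger_outputs[1:]
    return {
        entity: label
        for entity, label in first.items()
        if all(other.get(entity) == label for other in rest)
    }


# Made-up outputs from three hypothetical taggers.
a = {"ETS": "ORGANIZATION", "Web": "PERSON"}
b = {"ETS": "ORGANIZATION", "Web": "MISC"}
c = {"ETS": "ORGANIZATION"}
kept = agreed_entities(a, b, c)
print(kept)
```

This trades coverage for reliability: anything a single tagger misses or labels differently is dropped.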

Cheers/PERSON

Richard/ORGANISATION thanks the
Mword Class/LOCATION for listening to
his talk about Berners-Lee/MISC

Editor's Notes

NER: The Aim is to recognize and classify different types of entities (names, organizations, locations, dates, etc.)
Not sure why they focused on competitions, to be honest. But they mention the Message Understanding Conference, and CoNLL.
They give two possible reasons for this:
Part of the problem is that
No Gold Standards for any of these. So, they compared on two levels
How well do they work on PERSON, ORGANIZATION, and LOCATION? How much do they agree? What mistakes?
How frequently does each tagger produce multiple classifications for the same entity in a single document? Clinton as a person, and place, for instance.
ANC tagged for IdentiFinder already.
However, this was often not consistent
IdentiFinder got many more ORGANIZATION entities than the others. It also uses an extra class, Geo-Political Entity.
Existing taggers treat the non-local dependencies as a way of dealing with the sparse data problem, and as a way to resolve tagging differences by looking at how often one token is classified as one type versus another.
1. They didn’t do this. 2. And actually use them, not just one of them. 3. Report accuracy rates separately for the three major classes. Accuracy rates should be further broken down according to the items in the unit test that are designed to assess mistakes: orthography, acronym processing, frequent false positives, and knowledge-based classification. They go on to say that ANC is doing it right, but is too small, hence their ETS corpus.
