SlideShare una empresa de Scribd logo
1 de 18
The Named Entity Recognition (NER)
• Al-Shehri ,Aisha
• Almutairi ,Shaikhah
• Alswelim ,Haya
KINGDOM OF SAUDI ARABIA
Ministry of Higher Education
Al-Imam Muhammad Ibn Saud Islamic
University
College of Computer and Information Sciences
Abstract
Name Entity Recognition is an important part of many natural
language processing tasks .
There are different type of name entity such as people ,
location and organization .
Introduction
• The Named Entity Recognition is the identification and
classification of Named Entities within an open-domain text.
• The task of named entity recognition was defined as three
subtasks:
• ENAMEX.
• TIMEX, and NUMEX.
• We present the attempt at the recognition and
extraction of the most important proper name entity, that is,
the person name, for the Arabic language(PERA).
Components of an Arabic Full Name:
divided into five main categories, Ibn Auda (2003):
1. An ism (pronounced IZM).
2. A kunya (pronounced COON-yah).
3. By a nasab (pronounced NAH-sahb).
4. A laqab (pronounced LAH-kahb).
5. A nisba (pronounced NISS-bah).
Methodology
1-Parallel Corpora .
a-Reliability
b-Representativeness
2-Previously developed tools for other languages .
a-Person names
b-Location names (Geographical locations and Toponyms)
c-Organizations (Political of Administrative Entities)
d-Position (job titles)
e-Acronyms
Challenges
• 1- There is no capital letters or a specific signal in the
orthography like many other language.
• 2-The Arabic has different meaning
• 3-Abiguity
Ambiguous example
example CorrectIncorrectEnglish
translation
Ambiguous example
DatePerson15th of
Ramadan Al
karim 2005
CompanyLocationSaudi Aramco
Features
• Machine-learning features Word-Length.
• Noun-Flag
• Speech-Tag
• Type-Current
• Type-Left.
• Type-Right.
SystemArchitectureand Implementation
• Architecture of the NERA System:
SystemArchitectureand Implementation
• Gazetteers.
• Grammar.
• Filter.
SystemArchitectureand Implementation
1)Gazetteers:
Gazetteer containing: lists of known named entities.
White list:
The White list plays the role of fixed static dictionaries of
various NE.
SystemArchitectureand Implementation
2) Grammar:
The grammar performs recognition and extraction of Arabic
named entities from the input text based on derived rules.
The following are examples of indicators used within rules:
• Job title: (the doctor), (the sciences
professor).
• Person title: (Mr.) , (Mrs.).
SystemArchitectureand Implementation
3) Filter:
filter rules hels in dealing with recognition
ambiguity between named entities.
filtration mechanism is used that serves two different
purposes:revision of the NE extractor results and
disambiguation
of matches returned by different NE extractors.
Example:
variation
Typographic
Entity typeEnglish
translation
Arabic
example
Two dots removed from taa
marbouta
LocationSaudi
Arabia
Drop of the letter madda from the
aleph
LocationAsia
The Experiment
Results
Conclusion
• 1-We tried in the majority of cases to follow more general
criteria, applicable on English-Arabic transliteration or
French-Arabic transliteration.
• 2-This work is part of a new system for Arabic NER. It has
several ongoing activities.
References
• Sherief Abdallah, Khaled Shaalan, and Muhammad Shoaib ,
Integrating Rule-Based System with Classification for Arabic
Named Entity Recognition, 2012
• Yassine Benajiba , Mona Diab , and Paolo Rosso ,Using
Language Independent and Language Specific Features to
Enhance Arabic Named Entity Recognition, 2009
• Yassine Benajiba , Mona Diab , and Paolo Rosso , Arabic
Named Entity Recognition: AN SVM-BASED APPROACH, 2009
• Doaa Samy, Antonio Moreno, and José Mª Guirao, A Proposal
For An Arabic Named Entity Tagger Leveraging aParallel
Corpus,2005
• Khaled Shaalan, Hafsa Raza, Person Name Entity Recognition
for Arabic,2009

Más contenido relacionado

Destacado

Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...
Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...
Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...
Olivier Grisel
 
A Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in OntologiesA Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in Ontologies
Panos Alexopoulos
 

Destacado (20)

A Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
A Semi-Automatic Annotation Tool For Arabic Online Handwritten TextA Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
A Semi-Automatic Annotation Tool For Arabic Online Handwritten Text
 
Exploring Linked Data content through network analysis
Exploring Linked Data content through network analysisExploring Linked Data content through network analysis
Exploring Linked Data content through network analysis
 
Automatic Term Ambiguity Detection
Automatic Term Ambiguity DetectionAutomatic Term Ambiguity Detection
Automatic Term Ambiguity Detection
 
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific VocabularyA Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
 
Entity Search Engine
Entity Search Engine Entity Search Engine
Entity Search Engine
 
Linked Data: What’s the Story?
Linked Data: What’s the Story?Linked Data: What’s the Story?
Linked Data: What’s the Story?
 
Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...
Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...
Universal Topic Classification - Named Entity Disambiguation (IKS Workshop Pa...
 
Multlingual Linked Data Patterns
Multlingual Linked Data PatternsMultlingual Linked Data Patterns
Multlingual Linked Data Patterns
 
QER : query entity recognition
QER : query entity recognitionQER : query entity recognition
QER : query entity recognition
 
Text mining
Text miningText mining
Text mining
 
RDF and other linked data standards — how to make use of big localization data
RDF and other linked data standards — how to make use of big localization dataRDF and other linked data standards — how to make use of big localization data
RDF and other linked data standards — how to make use of big localization data
 
Interaction with Linked Data
Interaction with Linked DataInteraction with Linked Data
Interaction with Linked Data
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Dynamically Optimizing Queries over Large Scale Data Platforms
Dynamically Optimizing Queries over Large Scale Data PlatformsDynamically Optimizing Queries over Large Scale Data Platforms
Dynamically Optimizing Queries over Large Scale Data Platforms
 
Discoverers of Surface Analysis
Discoverers of Surface AnalysisDiscoverers of Surface Analysis
Discoverers of Surface Analysis
 
Enhancing Entity Linking by Combining NER Models
Enhancing Entity Linking by Combining NER ModelsEnhancing Entity Linking by Combining NER Models
Enhancing Entity Linking by Combining NER Models
 
Natural language procssing
Natural language procssing Natural language procssing
Natural language procssing
 
Recipes for PhD
Recipes for PhDRecipes for PhD
Recipes for PhD
 
A Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in OntologiesA Vague Sense Classifier for Detecting Vague Definitions in Ontologies
A Vague Sense Classifier for Detecting Vague Definitions in Ontologies
 
Exploiting Linked Open Data and Natural Language Processing for Classificati...
Exploiting Linked Open Data  and Natural Language Processing for Classificati...Exploiting Linked Open Data  and Natural Language Processing for Classificati...
Exploiting Linked Open Data and Natural Language Processing for Classificati...
 

Similar a The named entity recognition (ner)2

Sentiment analysis of arabic,a survey
Sentiment analysis of arabic,a surveySentiment analysis of arabic,a survey
Sentiment analysis of arabic,a survey
Arabic_NLP_ImamU2013
 
A New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic TextA New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic Text
CSCJournals
 
XMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageXMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic Language
Waqas Tariq
 

Similar a The named entity recognition (ner)2 (20)

Exploring the effects of stemming on
Exploring the effects of stemming onExploring the effects of stemming on
Exploring the effects of stemming on
 
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATIONEFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
EFFECTIVE ARABIC STEMMER BASED HYBRID APPROACH FOR ARABIC TEXT CATEGORIZATION
 
Named Entity Recognition for Telugu Using Conditional Random Field
Named Entity Recognition for Telugu Using Conditional Random FieldNamed Entity Recognition for Telugu Using Conditional Random Field
Named Entity Recognition for Telugu Using Conditional Random Field
 
Keywords- Based on Arabic Information Retrieval Using Light Stemmer
Keywords- Based on Arabic Information Retrieval Using Light Stemmer Keywords- Based on Arabic Information Retrieval Using Light Stemmer
Keywords- Based on Arabic Information Retrieval Using Light Stemmer
 
almisbarIEEE-1
almisbarIEEE-1almisbarIEEE-1
almisbarIEEE-1
 
A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models A Survey of Arabic Text Classification Models
A Survey of Arabic Text Classification Models
 
A survey of named entity recognition in assamese and other indian languages
A survey of named entity recognition in assamese and other indian languagesA survey of named entity recognition in assamese and other indian languages
A survey of named entity recognition in assamese and other indian languages
 
Sentiment analysis of arabic,a survey
Sentiment analysis of arabic,a surveySentiment analysis of arabic,a survey
Sentiment analysis of arabic,a survey
 
UNDERSTANDING PEOPLE TITLE PROPERTIES TO IMPROVE INFORMATION EXTRACTION PROCESS
UNDERSTANDING PEOPLE TITLE PROPERTIES TO IMPROVE INFORMATION EXTRACTION PROCESSUNDERSTANDING PEOPLE TITLE PROPERTIES TO IMPROVE INFORMATION EXTRACTION PROCESS
UNDERSTANDING PEOPLE TITLE PROPERTIES TO IMPROVE INFORMATION EXTRACTION PROCESS
 
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMSCOMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
COMPARATIVE ANALYSIS OF ARABIC STEMMING ALGORITHMS
 
Using automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityUsing automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivity
 
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
DEVELOPMENT OF ARABIC NOUN PHRASE EXTRACTOR (ANPE)
 
Multitier holistic Approach for urdu Nastaliq Recognition
Multitier holistic Approach for urdu Nastaliq RecognitionMultitier holistic Approach for urdu Nastaliq Recognition
Multitier holistic Approach for urdu Nastaliq Recognition
 
04. 9990 16097-1-ed (edited arf)
04. 9990 16097-1-ed (edited arf)04. 9990 16097-1-ed (edited arf)
04. 9990 16097-1-ed (edited arf)
 
SIMILAR THESAURUS BASED ON ARABIC DOCUMENT: AN OVERVIEW AND COMPARISON
SIMILAR THESAURUS BASED ON ARABIC DOCUMENT: AN OVERVIEW AND COMPARISONSIMILAR THESAURUS BASED ON ARABIC DOCUMENT: AN OVERVIEW AND COMPARISON
SIMILAR THESAURUS BASED ON ARABIC DOCUMENT: AN OVERVIEW AND COMPARISON
 
A New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic TextA New Concept Extraction Method for Ontology Construction From Arabic Text
A New Concept Extraction Method for Ontology Construction From Arabic Text
 
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITYUSING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
USING AUTOMATED LEXICAL RESOURCES IN ARABIC SENTENCE SUBJECTIVITY
 
Arabic words stemming approach using arabic wordnet
Arabic words stemming approach using arabic wordnetArabic words stemming approach using arabic wordnet
Arabic words stemming approach using arabic wordnet
 
XMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic LanguageXMODEL: An XML-based Morphological Analyzer for Arabic Language
XMODEL: An XML-based Morphological Analyzer for Arabic Language
 
Adopting Quadrilateral Arabic Roots in Search Engine of E-library System
Adopting Quadrilateral Arabic Roots in Search Engine of E-library SystemAdopting Quadrilateral Arabic Roots in Search Engine of E-library System
Adopting Quadrilateral Arabic Roots in Search Engine of E-library System
 

Más de Arabic_NLP_ImamU2013

Más de Arabic_NLP_ImamU2013 (16)

Arabic tokenization and stemming
Arabic tokenization and  stemmingArabic tokenization and  stemming
Arabic tokenization and stemming
 
Speech recognition for arabic
Speech recognition for arabicSpeech recognition for arabic
Speech recognition for arabic
 
Arabic spell checking approaches
Arabic spell checking approachesArabic spell checking approaches
Arabic spell checking approaches
 
Arabic spell checkers
Arabic spell  checkersArabic spell  checkers
Arabic spell checkers
 
Discourse annotation for arabic 3
Discourse annotation for arabic 3Discourse annotation for arabic 3
Discourse annotation for arabic 3
 
Syntactic parsing for arabic
Syntactic parsing for arabicSyntactic parsing for arabic
Syntactic parsing for arabic
 
Arabic to-english machine translation
Arabic to-english machine translationArabic to-english machine translation
Arabic to-english machine translation
 
Discourse annotation
Discourse annotationDiscourse annotation
Discourse annotation
 
Arabic speech recognition
Arabic speech recognitionArabic speech recognition
Arabic speech recognition
 
Discourse annotation for arabic 2
Discourse annotation for arabic 2Discourse annotation for arabic 2
Discourse annotation for arabic 2
 
Arabic question answering ‫‬
Arabic question answering ‫‬Arabic question answering ‫‬
Arabic question answering ‫‬
 
Part of speech tagging for Arabic
Part of speech tagging for ArabicPart of speech tagging for Arabic
Part of speech tagging for Arabic
 
Coreference recognition in arabic
Coreference recognition in arabicCoreference recognition in arabic
Coreference recognition in arabic
 
Building corpus from www for arabic
Building corpus from www for arabicBuilding corpus from www for arabic
Building corpus from www for arabic
 
Discourse annotation for arabic
Discourse annotation for arabicDiscourse annotation for arabic
Discourse annotation for arabic
 
Automatic summaraitztion for_arabic
Automatic summaraitztion for_arabicAutomatic summaraitztion for_arabic
Automatic summaraitztion for_arabic
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 

The named entity recognition (ner)2

  • 1. The Named Entity Recognition (NER) • Al-Shehri ,Aisha • Almutairi ,Shaikhah • Alswelim ,Haya KINGDOM OF SAUDI ARABIA Ministry of Higher Education Al-Imam Muhammad Ibn Saud Islamic University College of Computer and Information Sciences
  • 2. Abstract Name Entity Recognition is an important part of many natural language processing tasks . There are different type of name entity such as people , location and organization .
  • 3. Introduction • The Named Entity Recognition is the identification and classification of Named Entities within an open-domain text. • The task of named entity recognition was defined as three subtasks: • ENAMEX. • TIMEX, and NUMEX.
  • 4. • We present the attempt at the recognition and extraction of the most important proper name entity, that is, the person name, for the Arabic language(PERA). Components of an Arabic Full Name: divided into five main categories, Ibn Auda (2003): 1. An ism (pronounced IZM). 2. A kunya (pronounced COON-yah). 3. By a nasab (pronounced NAH-sahb). 4. A laqab (pronounced LAH-kahb). 5. A nisba (pronounced NISS-bah).
  • 5. Methodology 1-Parallel Corpora . a-Reliability b-Representativeness 2-Previously developed tools for other languages . a-Person names b-Location names (Geographical locations and Toponyms) c-Organizations (Political of Administrative Entities) d-Position (job titles) e-Acronyms
  • 6. Challenges • 1- There is no capital letters or a specific signal in the orthography like many other language. • 2-The Arabic has different meaning • 3-Abiguity
  • 7. Ambiguous example example CorrectIncorrectEnglish translation Ambiguous example DatePerson15th of Ramadan Al karim 2005 CompanyLocationSaudi Aramco
  • 8. Features • Machine-learning features Word-Length. • Noun-Flag • Speech-Tag • Type-Current • Type-Left. • Type-Right.
  • 11. SystemArchitectureand Implementation 1)Gazetteers: Gazetteer containing: lists of known named entities. White list: The White list plays the role of fixed static dictionaries of various NE.
  • 12. SystemArchitectureand Implementation 2) Grammar: The grammar performs recognition and extraction of Arabic named entities from the input text based on derived rules. The following are examples of indicators used within rules: • Job title: (the doctor), (the sciences professor). • Person title: (Mr.) , (Mrs.).
  • 13. SystemArchitectureand Implementation 3) Filter: filter rules hels in dealing with recognition ambiguity between named entities. filtration mechanism is used that serves two different purposes:revision of the NE extractor results and disambiguation of matches returned by different NE extractors.
  • 14. Example: variation Typographic Entity typeEnglish translation Arabic example Two dots removed from taa marbouta LocationSaudi Arabia Drop of the letter madda from the aleph LocationAsia
  • 17. Conclusion • 1-We tried in the majority of cases to follow more general criteria, applicable on English-Arabic transliteration or French-Arabic transliteration. • 2-This work is part of a new system for Arabic NER. It has several ongoing activities.
  • 18. References • Sherief Abdallah, Khaled Shaalan, and Muhammad Shoaib , Integrating Rule-Based System with Classification for Arabic Named Entity Recognition, 2012 • Yassine Benajiba , Mona Diab , and Paolo Rosso ,Using Language Independent and Language Specific Features to Enhance Arabic Named Entity Recognition, 2009 • Yassine Benajiba , Mona Diab , and Paolo Rosso , Arabic Named Entity Recognition: AN SVM-BASED APPROACH, 2009 • Doaa Samy, Antonio Moreno, and José Mª Guirao, A Proposal For An Arabic Named Entity Tagger Leveraging aParallel Corpus,2005 • Khaled Shaalan, Hafsa Raza, Person Name Entity Recognition for Arabic,2009