SlideShare una empresa de Scribd logo
1 de 43
Drop me a mail:Drop me a mail: rushdecoder@yahoo.comrushdecoder@yahoo.com
Visit me at:Visit me at: http://http://rushdishams.googlepages.comrushdishams.googlepages.com
1Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh
Translation Approach
 The translation process may be stated as:
1. Decoding the meaning of the source text
2. Re-encoding this meaning in the target
language.
 Machine translation can use a method
based on linguistic rules-
 words will be translated in a linguistic way
 the most suitable words of the target language
will replace the ones in the source language.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 2
Translation Approach
 The success of machine translation requires
the problem of natural language
understanding to be solved first.
 Generally, rule-based methods
parse a text,
usually creating an intermediary, symbolic
representation,
from which the text in the target language is
generated.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 3
Translation Approach
 According to the nature of the intermediary
representation, an approach is described as
interlingual machine translation or
transfer-based machine translation.
 These methods require extensive
lexicons with
morphological,
syntactic, and
semantic information, and
large sets of rules.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 4
Translation Approach
 Machine translation programs often work
well enough for
a native speaker of one language to get the
approximate meaning of what is written by the
other native speaker.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 5
Translation Approach
 the large multilingual corpus of data needed
for statistical methods to work is not
necessary for the grammar-based methods.
 But then, the grammar methods need a
skilled linguist to carefully design the
grammar that they use.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 6
Types of Machine Translation
Text
Generation
Syntactic
Parsing
Semantic
Analysis
Sentence
Planning
Source
(Arabic)
Target
(English)
Transfer
Rules
Direct: SMT, EBMT
Interlingua
Rule based MT
 The rule-based machine translation
paradigm includes
1. transfer-based machine translation,
2. interlingual machine translation and
3. dictionary-based machine translation
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 8
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 9
Transfer based MT
 Itis necessary to have an intermediate
representation that captures the "meaning"
of the original sentence in order to generate
the correct translation
 In interlingua-based MT this intermediate
representation must be independent of the
languages in question, whereas in transfer-
based MT, it has some dependence on the
language pair involved.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 10
Transfer based MT
 The original text is first analyzed
morphologically and
syntactically
in order to obtain a syntactic
representation.
 This representation can then be refined to a
more abstract level putting emphasis on the
parts relevant for translation and ignoring
other types of information.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 11
Transfer based MT
 The transfer process then converts this final
representation (still in the original
language) to a representation of the same
level of abstraction in the target language.
 These two representations are referred to
as "intermediate" representations.
 From the target language representation,
the stages are then applied in reverse.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 12
Transfer based MT
Transformation process
 Morphological analysis
Surface forms of the input text are classified as
○ to part-of-speech (e.g. noun, verb, etc.) and
○ sub-category (number, gender, tense, etc.)
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 14
Transformation process
 Lexical categorization
In any given text some of the words may have
more than one meaning, causing ambiguity in
analysis.
Lexical categorization looks at the context of a
word to try and determine the correct meaning
in the context of the input.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 15
Transformation process
 Lexical transfer
This is basically dictionary translation
the source language lemma (perhaps with sense
information) is looked up in a bilingual
dictionary and the translation is chosen.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 16
Transformation process
 Structural transfer
While the previous stages deal with words, this
stage deals with larger constituents
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 17
Transformation process
 Morphological generation
 From the output of the structural transfer
stage, the target language surface forms are
generated.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 18
Transfer Types
 Superficial transfer (or syntactic)
This level is characterized by transferring
"syntactic structures" between the source and
target languages.
It is suitable for languages in the same family or
of the same type.
for example in the Romance languages between
Spanish, Catalan, French, Italian, etc.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 19
Transfer Types
 Deep transfer (or semantic)
This level constructs a semantic representation that is
dependent on the source language.
This representation can consist of a series of structures
which represent the meaning.
In these transfer systems predicates are typically
produced.
The translation also typically requires structural
transfer.
This level is used to translate between more distantly
related languages (e.g. Spanish-English or Spanish-
Basque, etc.)
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 20
Dependency Grammar
Case Grammar
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 23
Interlingual MT
 the source language, i.e. the text to be
translated is transformed into an
interlingua, i.e., an abstract language-
independent representation.
 The target language is then generated from
the interlingua.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 24
Interlingual MT
 In the direct approach, words are translated
directly without passing through an additional
representation.
 In the transfer approach the source language is
transformed into an abstract, less language-
specific representation.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 25
Interlingual MT
Advantage and disadvantage
 The advantage in multilingual machine
translations is that no transfer component
has to be created for each language pair
 The obvious disadvantage is that the
definition of an interlingua is difficult and
maybe even impossible for a wider domain.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 27
Components
 Dictionaries for analysis and generation
 A conceptual lexicon, which is
the knowledge base about events and
entities known in the domain.
 A set of projection rules (specific to the
domain and the languages).
 Grammars for the analysis and generation
of the languages involved.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 28
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 29
Dictionary-based MT
 The words will be translated as a dictionary does
— word by word, usually without much
correlation of meaning between them
 Dictionary lookups may be done with or without
morphological analysis or lemmatisation
 used to expedite manual translation, if the person
carrying it out is fluent in both languages and
therefore capable of correcting syntax and
grammar.
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 30
Dictionary-based MT
Dictionary-based MT
Example-based MT
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 33
Example-based MT
 characterized by its use of a bilingual corpus
with parallel texts as its main knowledge
base
 It is essentially a translation by analogy and
can be viewed as an implementation of
case-based reasoning approach of machine
learning
Example-based MT
 characterized by its use of a bilingual corpus
with parallel texts as its main knowledge
base
 It is essentially a translation by analogy and
can be viewed as an implementation of
case-based reasoning approach of machine
learning
Example-based MT
Example-based MT
 bilingual parallel corpora contain sentence
pairs like the example shown in the table.
 How much is that X ? corresponds to Ano X
wa ikura desu ka.
 red umbrella corresponds to akai kasa
 small camera corresponds to chiisai kamera
Example-based MT
 President Kennedy was shot dead during the
parade. and The convict escaped on July
15th. We could translate the sentence The
convict was shot dead during the parade. by
substituting the appropriate parts of the
sentences.
Statistical MT
Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 39
Statistical MT
 The idea behind statistical machine
translation comes from information theory.
 A document is translated according to the
probability distribution p(e | f) that a string
e in the target language (for example,
English) is the translation of a string f in the
source language (for example, French).
Statistical MT
 The problem of modeling the probability
distribution p(e | f) has been approached in
a number of ways. One intuitive approach is
to apply Bayes Theorem
where the translation model p(f | e) is the probability that the
source string is the translation of the target string, and the
language model p(e) is the probability of seeing that target
language string string.
Statistical MT
 Finding the best translation is done by picking up
the one that gives the highest probability

Más contenido relacionado

La actualidad más candente

Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approach
vini89
 
Equivalencein translation
Equivalencein translationEquivalencein translation
Equivalencein translation
Dorina Moisa
 
Theory of translation
Theory of translationTheory of translation
Theory of translation
ytsogzolmaa
 
History of translstudies
History of translstudiesHistory of translstudies
History of translstudies
Muhmmad Asif
 

La actualidad más candente (20)

Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?
 
What Is Translation? , by Dr. Shadia Yousef Banjar
What Is Translation? , by Dr. Shadia Yousef BanjarWhat Is Translation? , by Dr. Shadia Yousef Banjar
What Is Translation? , by Dr. Shadia Yousef Banjar
 
A tutorial on Machine Translation
A tutorial on Machine TranslationA tutorial on Machine Translation
A tutorial on Machine Translation
 
Presentation of text linguistics
Presentation of text linguisticsPresentation of text linguistics
Presentation of text linguistics
 
Translation Studies
Translation StudiesTranslation Studies
Translation Studies
 
Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approach
 
Katharina reiss
Katharina reissKatharina reiss
Katharina reiss
 
Translation studies slides 1st Lecture.ppt
Translation studies slides 1st Lecture.pptTranslation studies slides 1st Lecture.ppt
Translation studies slides 1st Lecture.ppt
 
Equivalencein translation
Equivalencein translationEquivalencein translation
Equivalencein translation
 
Machine Translation Introduction
Machine Translation IntroductionMachine Translation Introduction
Machine Translation Introduction
 
Relevance theory part 1
Relevance theory part 1Relevance theory part 1
Relevance theory part 1
 
Theory of translation
Theory of translationTheory of translation
Theory of translation
 
Translation Strategies, by Dr. Shadia Y. Banjar
Translation Strategies, by Dr. Shadia Y. BanjarTranslation Strategies, by Dr. Shadia Y. Banjar
Translation Strategies, by Dr. Shadia Y. Banjar
 
Challenges of Translation
Challenges of TranslationChallenges of Translation
Challenges of Translation
 
Translation techniques and text types
Translation techniques and text typesTranslation techniques and text types
Translation techniques and text types
 
Corpus linguistics the basics
Corpus linguistics the basicsCorpus linguistics the basics
Corpus linguistics the basics
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
Approaches of translation
Approaches of translationApproaches of translation
Approaches of translation
 
Intro to Trans 350 methods of translation
Intro to Trans 350 methods of translationIntro to Trans 350 methods of translation
Intro to Trans 350 methods of translation
 
History of translstudies
History of translstudiesHistory of translstudies
History of translstudies
 

Similar a Types of machine translation

Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
Shashank Shisodia
 
Direct Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete UnitsDirect Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete Units
IJCI JOURNAL
 

Similar a Types of machine translation (20)

ReseachPaper
ReseachPaperReseachPaper
ReseachPaper
 
Machine Translation System: Chhattisgarhi to Hindi
Machine Translation System: Chhattisgarhi to HindiMachine Translation System: Chhattisgarhi to Hindi
Machine Translation System: Chhattisgarhi to Hindi
 
Hindi –tamil text translation
Hindi –tamil text translationHindi –tamil text translation
Hindi –tamil text translation
 
Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
Punjabi to Hindi Transliteration System for Proper Nouns Using Hybrid Approach
Punjabi to Hindi Transliteration System for Proper Nouns Using Hybrid ApproachPunjabi to Hindi Transliteration System for Proper Nouns Using Hybrid Approach
Punjabi to Hindi Transliteration System for Proper Nouns Using Hybrid Approach
 
E-Translation
E-TranslationE-Translation
E-Translation
 
The methodology of translation with special reference to sanskrit to persian
The methodology of translation   with special reference to sanskrit to persianThe methodology of translation   with special reference to sanskrit to persian
The methodology of translation with special reference to sanskrit to persian
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design Aspects
 
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
 
SMT3
SMT3SMT3
SMT3
 
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
IMPROVING THE QUALITY OF GUJARATI-HINDI MACHINE TRANSLATION THROUGH PART-OF-S...
 
I026050054
I026050054I026050054
I026050054
 
Pxc3898474
Pxc3898474Pxc3898474
Pxc3898474
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 
Direct Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete UnitsDirect Punjabi to English Speech Translation using Discrete Units
Direct Punjabi to English Speech Translation using Discrete Units
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
 
A performance of svm with modified lesk approach for word sense disambiguatio...
A performance of svm with modified lesk approach for word sense disambiguatio...A performance of svm with modified lesk approach for word sense disambiguatio...
A performance of svm with modified lesk approach for word sense disambiguatio...
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
 
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTSLEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
LEARNING CROSS-LINGUAL WORD EMBEDDINGS WITH UNIVERSAL CONCEPTS
 

Más de Rushdi Shams

L1 l2 l3 introduction to machine translation
L1 l2 l3  introduction to machine translationL1 l2 l3  introduction to machine translation
L1 l2 l3 introduction to machine translation
Rushdi Shams
 
Syntax and semantics
Syntax and semanticsSyntax and semantics
Syntax and semantics
Rushdi Shams
 
Propositional logic
Propositional logicPropositional logic
Propositional logic
Rushdi Shams
 
Probabilistic logic
Probabilistic logicProbabilistic logic
Probabilistic logic
Rushdi Shams
 
Knowledge structure
Knowledge structureKnowledge structure
Knowledge structure
Rushdi Shams
 
Knowledge representation
Knowledge representationKnowledge representation
Knowledge representation
Rushdi Shams
 
L5 understanding hacking
L5  understanding hackingL5  understanding hacking
L5 understanding hacking
Rushdi Shams
 
L2 Intrusion Detection System (IDS)
L2  Intrusion Detection System (IDS)L2  Intrusion Detection System (IDS)
L2 Intrusion Detection System (IDS)
Rushdi Shams
 
L2 l3 l4 software process models
L2 l3 l4  software process modelsL2 l3 l4  software process models
L2 l3 l4 software process models
Rushdi Shams
 

Más de Rushdi Shams (20)

Research Methodology and Tips on Better Research
Research Methodology and Tips on Better ResearchResearch Methodology and Tips on Better Research
Research Methodology and Tips on Better Research
 
Common evaluation measures in NLP and IR
Common evaluation measures in NLP and IRCommon evaluation measures in NLP and IR
Common evaluation measures in NLP and IR
 
Machine learning with nlp 101
Machine learning with nlp 101Machine learning with nlp 101
Machine learning with nlp 101
 
Semi-supervised classification for natural language processing
Semi-supervised classification for natural language processingSemi-supervised classification for natural language processing
Semi-supervised classification for natural language processing
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
L1 l2 l3 introduction to machine translation
L1 l2 l3  introduction to machine translationL1 l2 l3  introduction to machine translation
L1 l2 l3 introduction to machine translation
 
Syntax and semantics
Syntax and semanticsSyntax and semantics
Syntax and semantics
 
Propositional logic
Propositional logicPropositional logic
Propositional logic
 
Probabilistic logic
Probabilistic logicProbabilistic logic
Probabilistic logic
 
L15 fuzzy logic
L15  fuzzy logicL15  fuzzy logic
L15 fuzzy logic
 
Knowledge structure
Knowledge structureKnowledge structure
Knowledge structure
 
Knowledge representation
Knowledge representationKnowledge representation
Knowledge representation
 
First order logic
First order logicFirst order logic
First order logic
 
Belief function
Belief functionBelief function
Belief function
 
L5 understanding hacking
L5  understanding hackingL5  understanding hacking
L5 understanding hacking
 
L4 vpn
L4  vpnL4  vpn
L4 vpn
 
L3 defense
L3  defenseL3  defense
L3 defense
 
L2 Intrusion Detection System (IDS)
L2  Intrusion Detection System (IDS)L2  Intrusion Detection System (IDS)
L2 Intrusion Detection System (IDS)
 
L1 phishing
L1  phishingL1  phishing
L1 phishing
 
L2 l3 l4 software process models
L2 l3 l4  software process modelsL2 l3 l4  software process models
L2 l3 l4 software process models
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Types of machine translation

  • 1. Drop me a mail:Drop me a mail: rushdecoder@yahoo.comrushdecoder@yahoo.com Visit me at:Visit me at: http://http://rushdishams.googlepages.comrushdishams.googlepages.com 1Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh
  • 2. Translation Approach  The translation process may be stated as: 1. Decoding the meaning of the source text 2. Re-encoding this meaning in the target language.  Machine translation can use a method based on linguistic rules-  words will be translated in a linguistic way  the most suitable words of the target language will replace the ones in the source language. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 2
  • 3. Translation Approach  The success of machine translation requires the problem of natural language understanding to be solved first.  Generally, rule-based methods parse a text, usually creating an intermediary, symbolic representation, from which the text in the target language is generated. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 3
  • 4. Translation Approach  According to the nature of the intermediary representation, an approach is described as interlingual machine translation or transfer-based machine translation.  These methods require extensive lexicons with morphological, syntactic, and semantic information, and large sets of rules. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 4
  • 5. Translation Approach  Machine translation programs often work well enough for a native speaker of one language to get the approximate meaning of what is written by the other native speaker. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 5
  • 6. Translation Approach  the large multilingual corpus of data needed for statistical methods to work is not necessary for the grammar-based methods.  But then, the grammar methods need a skilled linguist to carefully design the grammar that they use. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 6
  • 7. Types of Machine Translation Text Generation Syntactic Parsing Semantic Analysis Sentence Planning Source (Arabic) Target (English) Transfer Rules Direct: SMT, EBMT Interlingua
  • 8. Rule based MT  The rule-based machine translation paradigm includes 1. transfer-based machine translation, 2. interlingual machine translation and 3. dictionary-based machine translation Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 8
  • 9. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 9
  • 10. Transfer based MT  Itis necessary to have an intermediate representation that captures the "meaning" of the original sentence in order to generate the correct translation  In interlingua-based MT this intermediate representation must be independent of the languages in question, whereas in transfer- based MT, it has some dependence on the language pair involved. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 10
  • 11. Transfer based MT  The original text is first analyzed morphologically and syntactically in order to obtain a syntactic representation.  This representation can then be refined to a more abstract level putting emphasis on the parts relevant for translation and ignoring other types of information. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 11
  • 12. Transfer based MT  The transfer process then converts this final representation (still in the original language) to a representation of the same level of abstraction in the target language.  These two representations are referred to as "intermediate" representations.  From the target language representation, the stages are then applied in reverse. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 12
  • 14. Transformation process  Morphological analysis Surface forms of the input text are classified as ○ to part-of-speech (e.g. noun, verb, etc.) and ○ sub-category (number, gender, tense, etc.) Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 14
  • 15. Transformation process  Lexical categorization In any given text some of the words may have more than one meaning, causing ambiguity in analysis. Lexical categorization looks at the context of a word to try and determine the correct meaning in the context of the input. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 15
  • 16. Transformation process  Lexical transfer This is basically dictionary translation the source language lemma (perhaps with sense information) is looked up in a bilingual dictionary and the translation is chosen. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 16
  • 17. Transformation process  Structural transfer While the previous stages deal with words, this stage deals with larger constituents Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 17
  • 18. Transformation process  Morphological generation  From the output of the structural transfer stage, the target language surface forms are generated. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 18
  • 19. Transfer Types  Superficial transfer (or syntactic) This level is characterized by transferring "syntactic structures" between the source and target languages. It is suitable for languages in the same family or of the same type. for example in the Romance languages between Spanish, Catalan, French, Italian, etc. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 19
  • 20. Transfer Types  Deep transfer (or semantic) This level constructs a semantic representation that is dependent on the source language. This representation can consist of a series of structures which represent the meaning. In these transfer systems predicates are typically produced. The translation also typically requires structural transfer. This level is used to translate between more distantly related languages (e.g. Spanish-English or Spanish- Basque, etc.) Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 20
  • 23. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 23
  • 24. Interlingual MT  the source language, i.e. the text to be translated is transformed into an interlingua, i.e., an abstract language- independent representation.  The target language is then generated from the interlingua. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 24
  • 25. Interlingual MT  In the direct approach, words are translated directly without passing through an additional representation.  In the transfer approach the source language is transformed into an abstract, less language- specific representation. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 25
  • 27. Advantage and disadvantage  The advantage in multilingual machine translations is that no transfer component has to be created for each language pair  The obvious disadvantage is that the definition of an interlingua is difficult and maybe even impossible for a wider domain. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 27
  • 28. Components  Dictionaries for analysis and generation  A conceptual lexicon, which is the knowledge base about events and entities known in the domain.  A set of projection rules (specific to the domain and the languages).  Grammars for the analysis and generation of the languages involved. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 28
  • 29. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 29
  • 30. Dictionary-based MT  The words will be translated as a dictionary does — word by word, usually without much correlation of meaning between them  Dictionary lookups may be done with or without morphological analysis or lemmatisation  used to expedite manual translation, if the person carrying it out is fluent in both languages and therefore capable of correcting syntax and grammar. Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 30
  • 33. Example-based MT Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 33
  • 34. Example-based MT  characterized by its use of a bilingual corpus with parallel texts as its main knowledge base  It is essentially a translation by analogy and can be viewed as an implementation of case-based reasoning approach of machine learning
  • 35. Example-based MT  characterized by its use of a bilingual corpus with parallel texts as its main knowledge base  It is essentially a translation by analogy and can be viewed as an implementation of case-based reasoning approach of machine learning
  • 37. Example-based MT  bilingual parallel corpora contain sentence pairs like the example shown in the table.  How much is that X ? corresponds to Ano X wa ikura desu ka.  red umbrella corresponds to akai kasa  small camera corresponds to chiisai kamera
  • 38. Example-based MT  President Kennedy was shot dead during the parade. and The convict escaped on July 15th. We could translate the sentence The convict was shot dead during the parade. by substituting the appropriate parts of the sentences.
  • 39. Statistical MT Rushdi Shams, Lecturer, Dept of CSE, KUET, Bangladesh 39
  • 40. Statistical MT  The idea behind statistical machine translation comes from information theory.  A document is translated according to the probability distribution p(e | f) that a string e in the target language (for example, English) is the translation of a string f in the source language (for example, French).
  • 41. Statistical MT  The problem of modeling the probability distribution p(e | f) has been approached in a number of ways. One intuitive approach is to apply Bayes Theorem
  • 42. where the translation model p(f | e) is the probability that the source string is the translation of the target string, and the language model p(e) is the probability of seeing that target language string string.
  • 43. Statistical MT  Finding the best translation is done by picking up the one that gives the highest probability