SlideShare una empresa de Scribd logo
1 de 23
An Overview
of Machine
Translation




         A
    Presentation
        by:
Outline
2



     Introduction
     A brief introduction to Translation technology
     Interest in MT
     Problems Involved in Machine Translation
    Translation Technology
     Knowledge-based systems
     Statistical machine translation systems
     Rule-Based vs. Statistical MT
     Current State of Machine Translation in Use
    Personal Speech-to-Speech Translators


                                 Machine Translation
Introduction
3




•These factors have increased both the
demand for translation services and
interest in computerized translation
technology.




•Some industry observers say machine
translation, a largely experimental
technology that has been around since
the late 1950s, is now ready to become
commercially viable.
                                   Machine Translation
Definition

                               The sub-domain of artificial
                               intelligence concerned with the
                               task of developing programs
                               possessing some capability of
              NLP                            ’ a natural language
                               in order to achieve some specific
                               goal.




A transformation from one
representation (the input text) to               Understanding
another (internal representation)



                               Machine Translation
Introduction:
5



Machine Translation :



The use of computers to
translate from one language
to another.


One of the oldest dreams of
NLP, AI, and CS
(first system in 1954).




                              Machine Translation
6




    Why Machine Translation?

    •Cheap, universal access to
    world’s online information
    regardless of original language.

    (That’s the goal)




                               Machine Translation
Interest in MT
7




                                 Interest in MT


    Commercial interest                                             Academic interest


                                 challenging problems in     Requires knowledge from
      U.S. has invested in MT                                                               transferring resources from one
                                      NLP research            many NLP sub-areas
                                                                                                  language to another

      MT is popular on the web                  lexical semantics            parsing


      EU spends more than $1                                                  statistical
                                                morphological analysis
        billion on translation                                                modeling


         (Semi-)automated
            translation


                                                      Machine Translation
Problems Involved in Machine
       Translation
8


            Ambiguity

            syntactic irregularity

            multiple word meanings

            the influence of context
    are the main problems faced by MT systems.

    A classic example is illustrated in the following pair of
    sentences:


    Time flies like an arrow.


    Fruit flies like an apple.
                                              Machine Translation
How can a machine understand these
    differences?
9


      Get   the cat with the gloves.




                             Machine Translation
Outline
10



      Introduction
      A brief introduction to Translation technology
      Interest in MT
      Problems Involved in Machine Translation
     Translation Technology
      Knowledge-based systems
      Statistical machine translation systems
      Rule-Based vs. Statistical MT
      Current State of Machine Translation in Use
     Personal Speech-to-Speech Translators


                                  Machine Translation
TRANSLATION TECHNOLOGY
11



           •There are two kinds of machine translation:


                                            •Knowledge-based systems


                                           •Statistical machine translation
         •Knowledge-based systems

     Traditional translation technology takes a knowledge-
     based approach.
     These expert systems—used by vendors such as Fujitsu,
     Logos, and Systran—translate documents by converting
     words and grammar directly from one language into
     another.

                                           Machine Translation
Knowledge-based systems
12

How they work.
                                                                       Hmm, every time he sees
Knowledge based systems                                                ―banco‖, he either types
rely on programmers to enter                                           ―bank‖ or ―bench‖ … but if
various languages’ vocabulary      Man, this is so boring.             he sees ―banco de…‖,
                                                                       he always types ―bank‖,
and syntax information into                                            never ―bench‖…
data bases.

The programmers then write
lists of rules that describe the
possible relationships
between a language’s parts of
speech.

The software, which can run
                                                             Translated documents
on a high-powered PC,
analyzes a document and
examines the rules for both the             Machine Translation                12
text’s language and the target
Statistical machine translation systems
13


                                    Rather than using the knowledge based system’s
       Statistical machine translation
                                    direct word-by-word translation techniques, statistical
                                    approaches translate documents by statistically
                                    analyzing entire phrases and, over time, ―learning‖
                                    how various languages work.

     How it works. Statistical systems
     start with minimal dictionary and language
     resources. Users then must train the
     system before they can work with it on
     extensive translations.
     During the training, researchers feed the
     system documents for which they already
     have accurate human translations.
     The system then uses its resources to
     guess at
      the documents’ meanings.
                                           Machine Translation
Statistical machine translation
14
     systems
 Statistical systems generally work by
 dividing documents into N-grams, with N
 the number of words, usually three, in a
 phrase. N-grams are statistical translation’s
 building blocks.




 Analyzing N-grams helps improve
 translation accuracy and performance
 because, while a word by itself may have
 many definitions, it has far fewer potential
 meanings when used as part of a phrase.
                                       Machine Translation
Statistical machine translation
15
       systems


                                        Machine
                                        Learning
                                         Magic


        Books in       Same books,
        English        in Farsi                               P(F|E) model

     Statistical machine translation (SMT) can be defined as the process of
     maximizing the probability of a sentence s in the source language
     matching a sentence t in the target language. We call collections stored
     in two languages parallel corpora or parallel texts.
                                        Machine Translation
Statistical machine translation
16
     systems
 Statistical machine translation systems, which
 statistically analyze entire phrases and ―learn‖ how
 various languages work, frequently work with other
 types of systems to improve output quality.

 The lexicon system provides translated words and their
 variations.

 The alignment system assures that phrases from the
 source language are converted to the proper phrases
 and presented in the proper order in the target
 language.

 The language system performs a morphological
 analysis of individual words or a syntactic analysis of
 sentences and thereby produces translations that read
 properly.                                 Machine Translation
Rule-Based vs. Statistical MT
17

    Rule-based MT:
        very labour intensive, time-consuming, and expensive
        Rules can be based on lexical or structural transfer
        Each program must be customized for each language-pair it works with.

        Pro: firm grip on complex translation phenomena
        Con: time-consuming, and expensive,Often very labor-intensive -> lack
         of robustness

    Statistical MT
        Mainly word or phrase-based translations
        Translation are learned from actual data
        In general, in statistical machine translation, if more data will be
         provided for learning; higher will be the quality of translation.

        Pro: Translations are learned automatically
        Con: Difficult to model complex translation phenomena
                                           Machine Translation
Current State of Machine Translation in
     Use
18




     Google Translate is a service provided by
     Google Inc. to translate a section of text, or a
     webpage, into another language, with limits to
     the number of paragraphs, or range of
     technical terms, translated. For some
     languages, users are asked for alternative
     translations, such as for technical terms, to be
     included for future updates to the translation
     process. Google translate is based on an
     approach called statistical machine translation.


                                          Machine Translation
Current State of Machine Translation in Use
     cont.
19




            SYSTRAN's methodology is a sentence by sentence approach,
            concentrating on individual words and their dictionary data, then
            on the parse of the sentence unit, followed by the translation of
            the parsed sentence.

           AltaVista’s Babel fish
           Babel Fish is a web-based application developed by AltaVista
           (now part of Yahoo!) which automatically translates text or
           web pages from one of several languages into another. The
           translation technology for Babel Fish is provided by
           SYSTRAN, whose technology also powers a number of other
           sites and portals.


                                    Machine Translation
Current State of Machine Translation in Use
     cont.
20



           is a Los Angeles, California–based company that was founded in 2002
           by the University of Southern California's Kevin Knight and Daniel
           Marcu, to commercialize a statistical approach to automatic |language
           translation and natural language processing - now known globally as
           statistical machine translation software (SMTS)
           Language Weaver’s statistically-based translation software is an
           instance of a recent advance in automated translation.


             is a service provided by Microsoft as part of its
             Windows Live services which allow users to translate
             texts or entire web pages into different languages.
             Computer-related texts are translated by Microsoft's
             own statistical machine translation technology for eight
             supported languages

                                   Machine Translation
Personal Speech-to-Speech
Translators
21


•One of the newest research areas in machine translation is the personal speech to-
speech translator. People on business or personal trips could use these devices to
translate on the fly.
Speech-to-speech translation, which is still in the experimental
stage, is a complex process requiring speech-recognition
technology that converts speech to text, machine translation of the text, and then text-
to-speech conversion.
•IBM is working on the handheld multilingual automatic speech-to-speech translator
(Mastor), which uses a hybrid statistical/knowledge-base engine to translate the
content. Mastor tries to determine the general meaning of a phrase, rather than its
exact translation. This approach requires less database capacity, which makes it more
suitable for small devices.




                                         Machine Translation
LOOKING AHEAD
22




     •Because of ongoing
     demand for better
     translation systems,
     research money will
     continue to flow into the
     field. In addition,
     companies are likely to
     develop and release more
     commercial products.
                            Machine Translation
Questions ?
23

      http://www.youtube.com/watch?v=jZCecsdlM7Q




                                  Machine Translation

Más contenido relacionado

La actualidad más candente

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processingsaurabhnarhe
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)Kuppusamy P
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.netwww.myassignmenthelp.net
 
Natural language processing (NLP)
Natural language processing (NLP) Natural language processing (NLP)
Natural language processing (NLP) ASWINKP11
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingRishikese MR
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingYasir Khan
 
Natural language processing
Natural language processingNatural language processing
Natural language processingHansi Thenuwara
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AISaurav Shrestha
 
Natural language processing
Natural language processingNatural language processing
Natural language processingAbash shah
 
Natural language processing PPT presentation
Natural language processing PPT presentationNatural language processing PPT presentation
Natural language processing PPT presentationSai Mohith
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: ParsingRushdi Shams
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introductionRobert Lujo
 
Natural language processing
Natural language processingNatural language processing
Natural language processingBasha Chand
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingBhavya Chawla
 
Natural lanaguage processing
Natural lanaguage processingNatural lanaguage processing
Natural lanaguage processinggulshan kumar
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingMercy Rani
 

La actualidad más candente (20)

Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.net
 
Natural language processing (NLP)
Natural language processing (NLP) Natural language processing (NLP)
Natural language processing (NLP)
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing in AI
Natural Language Processing in AINatural Language Processing in AI
Natural Language Processing in AI
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural language processing PPT presentation
Natural language processing PPT presentationNatural language processing PPT presentation
Natural language processing PPT presentation
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
NLP
NLPNLP
NLP
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introduction
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural lanaguage processing
Natural lanaguage processingNatural lanaguage processing
Natural lanaguage processing
 
Nlp
NlpNlp
Nlp
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 

Similar a Machine Tanslation

Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationChamani Shiranthika
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...write4
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...write5
 
CH 01.pptx
CH 01.pptxCH 01.pptx
CH 01.pptxObsa2
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)TAUS - The Language Data Network
 
Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approachvini89
 
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...AishwaryaChemate
 
Machine Translation
Machine TranslationMachine Translation
Machine TranslationUwe Muegge
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsIOSR Journals
 
The Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine TranslationThe Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine TranslationIconic Translation Machines
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Iconic Translation Machines
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...John Tinsley
 
Lecture_1_Introduction_to_Programming.pptx
Lecture_1_Introduction_to_Programming.pptxLecture_1_Introduction_to_Programming.pptx
Lecture_1_Introduction_to_Programming.pptxChewe Lulembo
 
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...ijnlc
 

Similar a Machine Tanslation (20)

Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translation
 
E-Translation
E-TranslationE-Translation
E-Translation
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...
 
Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...Jawaharlal Nehru Technological University Natural Language Processing Capston...
Jawaharlal Nehru Technological University Natural Language Processing Capston...
 
CH 01.pptx
CH 01.pptxCH 01.pptx
CH 01.pptx
 
Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
 
Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approach
 
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
 
Machine Translation
Machine TranslationMachine Translation
Machine Translation
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design Aspects
 
Introduction to programming languages part 1
Introduction to programming languages   part 1Introduction to programming languages   part 1
Introduction to programming languages part 1
 
sample PPT.pptx
sample PPT.pptxsample PPT.pptx
sample PPT.pptx
 
NLP unit-VI.pptx
NLP unit-VI.pptxNLP unit-VI.pptx
NLP unit-VI.pptx
 
The Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine TranslationThe Latest Advances in Patent Machine Translation
The Latest Advances in Patent Machine Translation
 
machine transaltion
machine transaltionmachine transaltion
machine transaltion
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
Lecture_1_Introduction_to_Programming.pptx
Lecture_1_Introduction_to_Programming.pptxLecture_1_Introduction_to_Programming.pptx
Lecture_1_Introduction_to_Programming.pptx
 
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
 

Último

Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 

Último (20)

Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 

Machine Tanslation

  • 2. Outline 2  Introduction  A brief introduction to Translation technology  Interest in MT  Problems Involved in Machine Translation Translation Technology  Knowledge-based systems  Statistical machine translation systems  Rule-Based vs. Statistical MT  Current State of Machine Translation in Use Personal Speech-to-Speech Translators Machine Translation
  • 3. Introduction 3 •These factors have increased both the demand for translation services and interest in computerized translation technology. •Some industry observers say machine translation, a largely experimental technology that has been around since the late 1950s, is now ready to become commercially viable. Machine Translation
  • 4. Definition The sub-domain of artificial intelligence concerned with the task of developing programs possessing some capability of NLP ’ a natural language in order to achieve some specific goal. A transformation from one representation (the input text) to Understanding another (internal representation) Machine Translation
  • 5. Introduction: 5 Machine Translation : The use of computers to translate from one language to another. One of the oldest dreams of NLP, AI, and CS (first system in 1954). Machine Translation
  • 6. 6 Why Machine Translation? •Cheap, universal access to world’s online information regardless of original language. (That’s the goal) Machine Translation
  • 7. Interest in MT 7 Interest in MT Commercial interest Academic interest challenging problems in Requires knowledge from U.S. has invested in MT transferring resources from one NLP research many NLP sub-areas language to another MT is popular on the web lexical semantics parsing EU spends more than $1 statistical morphological analysis billion on translation modeling (Semi-)automated translation Machine Translation
  • 8. Problems Involved in Machine Translation 8 Ambiguity syntactic irregularity multiple word meanings the influence of context are the main problems faced by MT systems. A classic example is illustrated in the following pair of sentences: Time flies like an arrow. Fruit flies like an apple. Machine Translation
  • 9. How can a machine understand these differences? 9  Get the cat with the gloves. Machine Translation
  • 10. Outline 10  Introduction  A brief introduction to Translation technology  Interest in MT  Problems Involved in Machine Translation Translation Technology  Knowledge-based systems  Statistical machine translation systems  Rule-Based vs. Statistical MT  Current State of Machine Translation in Use Personal Speech-to-Speech Translators Machine Translation
  • 11. TRANSLATION TECHNOLOGY 11 •There are two kinds of machine translation: •Knowledge-based systems •Statistical machine translation •Knowledge-based systems Traditional translation technology takes a knowledge- based approach. These expert systems—used by vendors such as Fujitsu, Logos, and Systran—translate documents by converting words and grammar directly from one language into another. Machine Translation
  • 12. Knowledge-based systems 12 How they work. Hmm, every time he sees Knowledge based systems ―banco‖, he either types rely on programmers to enter ―bank‖ or ―bench‖ … but if various languages’ vocabulary Man, this is so boring. he sees ―banco de…‖, he always types ―bank‖, and syntax information into never ―bench‖… data bases. The programmers then write lists of rules that describe the possible relationships between a language’s parts of speech. The software, which can run Translated documents on a high-powered PC, analyzes a document and examines the rules for both the Machine Translation 12 text’s language and the target
  • 13. Statistical machine translation systems 13 Rather than using the knowledge based system’s Statistical machine translation direct word-by-word translation techniques, statistical approaches translate documents by statistically analyzing entire phrases and, over time, ―learning‖ how various languages work. How it works. Statistical systems start with minimal dictionary and language resources. Users then must train the system before they can work with it on extensive translations. During the training, researchers feed the system documents for which they already have accurate human translations. The system then uses its resources to guess at the documents’ meanings. Machine Translation
  • 14. Statistical machine translation 14 systems Statistical systems generally work by dividing documents into N-grams, with N the number of words, usually three, in a phrase. N-grams are statistical translation’s building blocks. Analyzing N-grams helps improve translation accuracy and performance because, while a word by itself may have many definitions, it has far fewer potential meanings when used as part of a phrase. Machine Translation
  • 15. Statistical machine translation 15 systems Machine Learning Magic Books in Same books, English in Farsi P(F|E) model Statistical machine translation (SMT) can be defined as the process of maximizing the probability of a sentence s in the source language matching a sentence t in the target language. We call collections stored in two languages parallel corpora or parallel texts. Machine Translation
  • 16. Statistical machine translation 16 systems Statistical machine translation systems, which statistically analyze entire phrases and ―learn‖ how various languages work, frequently work with other types of systems to improve output quality. The lexicon system provides translated words and their variations. The alignment system assures that phrases from the source language are converted to the proper phrases and presented in the proper order in the target language. The language system performs a morphological analysis of individual words or a syntactic analysis of sentences and thereby produces translations that read properly. Machine Translation
  • 17. Rule-Based vs. Statistical MT 17  Rule-based MT:  very labour intensive, time-consuming, and expensive  Rules can be based on lexical or structural transfer  Each program must be customized for each language-pair it works with.  Pro: firm grip on complex translation phenomena  Con: time-consuming, and expensive,Often very labor-intensive -> lack of robustness  Statistical MT  Mainly word or phrase-based translations  Translation are learned from actual data  In general, in statistical machine translation, if more data will be provided for learning; higher will be the quality of translation.  Pro: Translations are learned automatically  Con: Difficult to model complex translation phenomena Machine Translation
  • 18. Current State of Machine Translation in Use 18 Google Translate is a service provided by Google Inc. to translate a section of text, or a webpage, into another language, with limits to the number of paragraphs, or range of technical terms, translated. For some languages, users are asked for alternative translations, such as for technical terms, to be included for future updates to the translation process. Google translate is based on an approach called statistical machine translation. Machine Translation
  • 19. Current State of Machine Translation in Use cont. 19 SYSTRAN's methodology is a sentence by sentence approach, concentrating on individual words and their dictionary data, then on the parse of the sentence unit, followed by the translation of the parsed sentence. AltaVista’s Babel fish Babel Fish is a web-based application developed by AltaVista (now part of Yahoo!) which automatically translates text or web pages from one of several languages into another. The translation technology for Babel Fish is provided by SYSTRAN, whose technology also powers a number of other sites and portals. Machine Translation
  • 20. Current State of Machine Translation in Use cont. 20 is a Los Angeles, California–based company that was founded in 2002 by the University of Southern California's Kevin Knight and Daniel Marcu, to commercialize a statistical approach to automatic |language translation and natural language processing - now known globally as statistical machine translation software (SMTS) Language Weaver’s statistically-based translation software is an instance of a recent advance in automated translation. is a service provided by Microsoft as part of its Windows Live services which allow users to translate texts or entire web pages into different languages. Computer-related texts are translated by Microsoft's own statistical machine translation technology for eight supported languages Machine Translation
  • 21. Personal Speech-to-Speech Translators 21 •One of the newest research areas in machine translation is the personal speech to- speech translator. People on business or personal trips could use these devices to translate on the fly. Speech-to-speech translation, which is still in the experimental stage, is a complex process requiring speech-recognition technology that converts speech to text, machine translation of the text, and then text- to-speech conversion. •IBM is working on the handheld multilingual automatic speech-to-speech translator (Mastor), which uses a hybrid statistical/knowledge-base engine to translate the content. Mastor tries to determine the general meaning of a phrase, rather than its exact translation. This approach requires less database capacity, which makes it more suitable for small devices. Machine Translation
  • 22. LOOKING AHEAD 22 •Because of ongoing demand for better translation systems, research money will continue to flow into the field. In addition, companies are likely to develop and release more commercial products. Machine Translation
  • 23. Questions ? 23 http://www.youtube.com/watch?v=jZCecsdlM7Q Machine Translation

Notas del editor

  1.  Statistical machine translation (SMT) can be defined as the process of maximizing the probability of a sentence s in the source language matching a sentence t in the target language.
  2. the knowledge-based approach’s is very labour intensive, time-consuming, and expensive. And even after decades of work, the systems don’t generally provide more than the basic idea of a document’s meaning. However, until recently, knowledge based systems were still preferred by many researchers who contended that the statistical approach was too simple to effectively handle a complex task like translation. In addition, statistical systems require fast processors and large amounts of RAM, which were not readily and inexpensively available until several years ago.
  3. It is critical to continue research and development in any field, knowing the current state of the technology, rather than re-inventing the wheel. Existing translation engines will be explained in the following slides.
  4. Even as technology opens up e-commerce opportunities, companies must overcome language barriers to reach new potential customers and business partners. For example, many companies have decided to develop Web sites in the languages of the countries in which their customers and partners live.
  5. Building machines to automate tasks requiring intelligent behaviour. Machine Translation (MT) is a subfield of natural language processing that involves automatic translation of sentences from one natural language to another.The sub-domain of artificial intelligence concerned with the task of developing programs possessing some capability of ‘understanding’ a natural language in order to achieve some specific goal.
  6. Machine Translation also known as Automatic Translation is the process that translates one human language to another. A Machine Translation Systems can be thought of as a compiler. A compiler translates a high-level programming language like C++, Java and the like to low-level languages like assembly and machine language. The only difference being that the grammar of a natural language like English or Hindi is much more complex compared to the grammar of a programming language.
  7. A Machine Translation Systems can be thought of as a compiler. A compiler translates a high-level programming language like C++, Java and the like to low-level languages like assembly and machine language. The only difference being that the grammar of a natural language like English or Hindi is much more complex compared to the grammar of a programming language.
  8. Commercial interest:U.S. has invested in MT for intelligence purposesMT is popular on the web—it is the most used of Google’s special featuresEU spends more than $1 billion on translation costs each year.(Semi-)automated translation could lead to huge savingsAcademic interest:One of the most challenging problems in NLP researchRequires knowledge from many NLP sub-areas, e.g., lexical semantics, parsing, morphological analysis, statistical modeling,…Being able to establish links between two languages allows for transferring resources from one language to another
  9. The sentence construction is parallel, but the meanings are entirely different: the first is a figure of speech involving a metaphor and the second is a literal description. And the identical words in the sentences - flies and like - are used in different grammatical categories