TAUS USER CONFERENCE 2010
LANGUAGE BUSINESS INNOVATION
4 – 6 OCTOBER / PORTLAND (OR), USA




TUESDAY 5 OCTOBER / 11.15

THE DEEP HYBRID MACHINE TRANSLATION ENGINE
Olga Beregovaya, PROMT
Company Profile

• Experienced. Founded in 1991

• International. Offices in US, Germany, Russia

• Innovative. 150 employees, 80 of them are in R&D

• Widely used. Over 120 million hits per month on our online
   translation sites
Enterprise MT User Challenges
Market need: translated content built of fluent and relevant sentences that
   preserve metadata information, branding, tone of voice and terminology.

Source: This error occurs in SQL Partner products when code in a trigger cancels
    the operation using the SQL RAISE function, or if the SQLConnection.cancel
    or SQLStatement.cancel methods are called when a statement is executed
    using SQLStatement.execute or SQLStatement.next.
Not-so-good Target: Este error se produce en SQL productos de socios cuando
    el código en un desencadenador se cancela la operación utilizando la
    función de SQL levantar, o si el o los métodos SQLConnection.cancel
    SQLStatement.cancel se llama cuando una declaración se ejecuta utilizando
    SQLStatement.execute o SQLStatement.next.

Known challenges:
RBMT limitations: fluency, terminology, engine customization effort
SMT limitations: sentence structure, duplicates and omissions, over-normalization
   both at training and at run-time
PROMT DeepHybrid Engine: Taking on the Challenge
PROMT DeepHybrid: the rule-based and statistical approaches work side by side, providing
   the best possible choice at each step of the translation process

•  Fluency: PROMT DeepHybrid approach increases the fluency of the final translation by letting
   the corpus make translation choices – both grammatical and lexical
• Sentence structure: PROMT DeepHybrid preserves the syntactic accuracy and predictability
   of the RBMT engine output
• Relevance: PROMT DeepHybrid combines terminology management capabilities of RBMT
   systems with SMT corpus-based terminology validation
PROMT DeepHybrid also supports and enhances existing key product features:
• Style integrity: PROMT’s Virtual Style Guide technology automates the preservation of
   tone of voice and corporate identity through automated rule selection
• Extensive Metadata Support (Translation Anchors): the rule-based core engine takes on all
   heavy-duty metadata processing and preservation
PROMT DeepHybrid Engine Flowchart

[Flowchart]

Resources feeding the engine:
• Dictionaries: PROMT General Dictionary and Domain Dictionaries (PROMT assets);
   Client Dictionaries and Glossaries (client assets)
• Rules: PROMT Transfer Rules (PROMT assets); Client Translation Preferences (client assets)
• Corpora: PROMT Corpora, Parallel and Monolingual (PROMT assets); Client Corpora and TMs
   (client assets)
• Post-edited TM

Pipeline:
Source -> Customized Branching Transfer -> Translation Candidates (1, 2, ... X) ->
   Statistical Post-Editing -> Translation Candidates (1, 2, ... X) ->
   Language Model Selection -> Best Target
PROMT DeepHybrid at a Glance
Branching Transfer
PROMT Branching Transfer is a sequence of rule-based algorithmic steps enhanced by
    client-specific statistical input
• Translation choices largely depend on the context; the PROMT engine makes only the most
    apparent forced choices; otherwise, all probable instances are generated
• PROMT translation model usually produces 4-12 candidates for a 10-15 word sentence after
    tree pruning techniques are applied
• Each step during lexical analysis relies on terminology from client TMs in addition to baseline
    PROMT dictionaries
• Each step during syntactic analysis relies on PROMT rule library enhanced by syntactic
    patterns mined from client TMs
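
The branching idea above can be sketched in a few lines. This is only an illustration, not PROMT's algorithm: the `branch_candidates` function, the slot structure, and the weights are all hypothetical, and a simple top-N cut stands in for the tree pruning the slide mentions.

```python
from itertools import product


def branch_candidates(slots, max_candidates=12):
    """Expand every combination of per-slot options, then keep the top N.

    `slots` is a list of lists of (text, weight) pairs. A slot with a single
    option models a forced choice; a slot with several options models a
    point where the engine keeps all probable variants. The weights are
    hypothetical scores, not values from the PROMT engine.
    """
    scored = []
    for combo in product(*slots):
        text = " ".join(option for option, _ in combo)
        score = 1.0
        for _, weight in combo:
            score *= weight
        scored.append((score, text))
    # Stand-in for tree pruning: keep only the highest-scoring candidates.
    scored.sort(key=lambda item: item[0], reverse=True)
    return [text for _, text in scored[:max_candidates]]


# Hypothetical weights; the Spanish variants come from the examples slide.
slots = [
    [("Se usa", 0.5), ("Es usado", 0.3), ("Está usado", 0.2)],
    [("para información sobre los pacientes", 1.0)],  # forced choice
]
result = branch_candidates(slots, max_candidates=2)
for candidate in result:
    print(candidate)
```

A real engine would prune while building the tree rather than after full expansion, since the number of combinations grows multiplicatively with each ambiguous slot.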
Statistical Post-editing
Before being fed to the language model, the candidate translations undergo statistical
    post-editing based on the sub-sentential parsing of both MT output and client data
Language Model:
•   General corpora - billions of words are available
•   In-domain corpora – pooling TDA data is helpful because of the thin domain space
•   Client corpora provide probability skew in favor of client-specific choices
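
One common way to obtain such a skew is linear interpolation of a general and a client language model. The slides do not say how PROMT combines its corpora, so the sketch below is an assumption: the unigram tables, the probabilities, and the `client_weight` parameter are invented for illustration.

```python
import math


def interpolated_prob(token, general, client, client_weight=0.5, floor=1e-6):
    """Mix general and client unigram probabilities (hypothetical models)."""
    p_general = general.get(token, floor)
    p_client = client.get(token, floor)
    return client_weight * p_client + (1 - client_weight) * p_general


def perplexity(tokens, general, client, client_weight=0.5):
    """Perplexity of a token sequence under the interpolated model."""
    log_sum = sum(
        math.log(interpolated_prob(t, general, client, client_weight))
        for t in tokens
    )
    return math.exp(-log_sum / len(tokens))


# Invented probabilities: the general corpus has no preference between the
# two verbs, while the client corpus strongly prefers "incluye".
general = {"presenta": 0.02, "incluye": 0.02}
client = {"presenta": 0.001, "incluye": 0.05}

ppl_presenta = perplexity(["presenta"], general, client)
ppl_incluye = perplexity(["incluye"], general, client)
print(ppl_incluye < ppl_presenta)
```

With `client_weight=0.0` the two verbs score identically, so in this toy setup the preference for the client's wording comes entirely from the client corpus.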
PROMT DeepHybrid Engine: Candidate Selection Examples

Example 1: Syntactic choice
Source: It is used for patient information, lab results, reports, images, and clinical data.
RBMT translation: Es usado para información sobre los pacientes, resultados del laboratorio,
    informes, imágenes, y datos clínicos.
Hybrid engine candidates:
    a) Es usado para información sobre los pacientes, resultados del laboratorio, informes,
       imágenes, y datos clínicos.
       ppl = 791.4319204909
    b) Se usa para información sobre los pacientes, resultados del laboratorio, informes,
       imágenes, y datos clínicos.
       ppl = 424.83820234214
    c) Está usado para información sobre los pacientes, resultados del laboratorio, informes,
       imágenes, y datos clínicos.
       ppl = 814.24328845084
Hybrid outcome: Se usa para información sobre los pacientes, resultados del laboratorio,
    informes, imágenes, y datos clínicos.

Example 2: Lexical choice
Source: The "Nehalem" system architecture features an integrated memory controller
RBMT translation: La arquitectura del sistema "Nehalem" presenta un controlador de memoria
    integrado
Hybrid engine candidates:
    a) La arquitectura del sistema "Nehalem" presenta un controlador de memoria integrado
       ppl = 288.17916810444
    b) La arquitectura del sistema "Nehalem" incluye un controlador de memoria integrado
       ppl = 234.86938828311
Hybrid outcome: La arquitectura del sistema "Nehalem" incluye un controlador de memoria
    integrado




• Hybrid engine chooses the candidate with the lowest perplexity (ppl)
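
The selection rule can be written as a minimal sketch using the perplexity values shown on this slide. The `select_best` helper is a hypothetical name; only the sentences and ppl figures come from the slide.

```python
def select_best(candidates):
    """Return the candidate text with the lowest perplexity.

    `candidates` is a list of (text, perplexity) pairs.
    """
    return min(candidates, key=lambda pair: pair[1])[0]


tail = ("para información sobre los pacientes, resultados del laboratorio, "
        "informes, imágenes, y datos clínicos.")
example_1 = [
    ("Es usado " + tail, 791.4319204909),
    ("Se usa " + tail, 424.83820234214),
    ("Está usado " + tail, 814.24328845084),
]
example_2 = [
    ('La arquitectura del sistema "Nehalem" presenta un controlador '
     'de memoria integrado', 288.17916810444),
    ('La arquitectura del sistema "Nehalem" incluye un controlador '
     'de memoria integrado', 234.86938828311),
]

best_1 = select_best(example_1)
best_2 = select_best(example_2)
print(best_1)
print(best_2)
```

Both results match the hybrid outcomes on the slide: the "Se usa" variant for Example 1 and the "incluye" variant for Example 2.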
Statistical Post-Editing at runtime

Example 1
Source: The following options were included with this subscription:
Before post-editing: Las opciones siguientes se incluyeron con esta suscripción:
After post-editing: Las siguientes opciones se han incluido con esta suscripción:
Reference human translation: Las siguientes opciones se han incluido con su suscripción:

Example 2
Source: To meet financial service industry regulations, we need to confirm some of your personal
    information.
Before post-editing: Para encontrar normas de la industria del servicio financiero, tenemos que
    confirmar un poco de su información personal.
After post-editing: Para cumplir las normas de la industria de servicios financieros, necesitamos
    confirmar parte de información personal.
Reference human translation: Para cumplir las normas de la industria de servicios financieros,
    necesitamos confirmar información personal suya.
PROMT 9.0 vs. PROMT DeepHybrid BLEU Scores

    BLEU scores, English-Spanish

    Engine status       Sample 1         Sample 2         Sample 3
                        (~1,800 words)   (~2,000 words)   (~2,500 words)
    Out-of-the-box      31.80            26.74            29.02
    Customized RBMT     39.00            34.30            36.50
    PROMT DeepHybrid    46.20            41.02            43.65
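
To read the table as gains rather than absolute scores, the differences can be computed directly. The snippet below is a small sketch that uses only the figures from the table; the `gains` helper and the dictionary layout are illustrative.

```python
# BLEU scores copied from the table (English-Spanish, Samples 1 to 3).
scores = {
    "Out-of-the-box": [31.80, 26.74, 29.02],
    "Customized RBMT": [39.00, 34.30, 36.50],
    "PROMT DeepHybrid": [46.20, 41.02, 43.65],
}


def gains(baseline, system):
    """Per-sample BLEU difference between two engine configurations."""
    return [
        round(s - b, 2)
        for b, s in zip(scores[baseline], scores[system])
    ]


hybrid_vs_custom = gains("Customized RBMT", "PROMT DeepHybrid")
custom_vs_baseline = gains("Out-of-the-box", "Customized RBMT")
print(hybrid_vs_custom)
print(custom_vs_baseline)
```

On all three samples DeepHybrid adds roughly 7 BLEU points over the customized RBMT engine, about the same size as the gain that customization gives over the out-of-the-box engine.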
PROMT DeepHybrid: Bridging the Post-Editing Gap

Post-editing throughput for PROMT customized translation is now reported to range
   from 4,000 to 8,000 words a day

Typical post-editing effort consists of:
• Correcting sentence structure
• Correcting part of speech errors
• Correcting general grammar errors
• Looking up terminology
• Reordering meta-tags

PROMT DeepHybrid technology addresses the above challenges, which will have an
   even greater impact on post-editors’ productivity
PROMT DeepHybrid: Putting the Puzzle Together

Not-so-good target: Este error se produce en SQL productos de socios cuando el código en un
    desencadenador se cancela la operación utilizando la función de SQL levantar, o si el o los
    métodos SQLConnection.cancel SQLStatement.cancel se llama cuando una declaración se
    ejecuta utilizando SQLStatement.execute o SQLStatement.next.
Good sentence: Este error ocurre en los productos SQL Partner cuando código en un activador
    cancela la operación llamando a la función SQL RAISE, o si los métodos SQLConnection.cancel
    o SQLStatement.cancel son llamados cuando la declaración es ejecutada usando
    SQLStatement.execute o SQLStatement.next.


                PROMT DeepHybrid – up to the challenge!
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine

Más contenido relacionado

Destacado

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...TAUS - The Language Data Network
 
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...TAUS - The Language Data Network
 
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agendaTAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agendaTAUS - The Language Data Network
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS - The Language Data Network
 
Summary of Rule-based Reordering Space in Statistical Machine Translation
Summary of Rule-based Reordering Space in Statistical Machine TranslationSummary of Rule-based Reordering Space in Statistical Machine Translation
Summary of Rule-based Reordering Space in Statistical Machine TranslationHiroshi Matsumoto
 
A statistical approach to machine translation
A statistical approach to machine translationA statistical approach to machine translation
A statistical approach to machine translationHiroshi Matsumoto
 
TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS - The Language Data Network
 
Statistical machine translation in a few slides
Statistical machine translation in a few slidesStatistical machine translation in a few slides
Statistical machine translation in a few slidesForcada Mikel
 
Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approachvini89
 
Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?Multilizer
 
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...TAUS - The Language Data Network
 
How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)
How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)
How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)TAUS - The Language Data Network
 

Destacado (20)

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
 
WEBINAR: TAUS Outlook 2013
WEBINAR: TAUS Outlook 2013WEBINAR: TAUS Outlook 2013
WEBINAR: TAUS Outlook 2013
 
TAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology GuidelinesTAUS Best Practices Error Typology Guidelines
TAUS Best Practices Error Typology Guidelines
 
TAUS Best Practices Adequacy/Fluency Guidelines
TAUS Best Practices Adequacy/Fluency GuidelinesTAUS Best Practices Adequacy/Fluency Guidelines
TAUS Best Practices Adequacy/Fluency Guidelines
 
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
TAUS USER CONFERENCE 2010, Machine translation in the imperfect world - Pract...
 
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agendaTAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
 
Summary of Rule-based Reordering Space in Statistical Machine Translation
Summary of Rule-based Reordering Space in Statistical Machine TranslationSummary of Rule-based Reordering Space in Statistical Machine Translation
Summary of Rule-based Reordering Space in Statistical Machine Translation
 
7. ebmt based on st sm
7. ebmt based on st sm7. ebmt based on st sm
7. ebmt based on st sm
 
A statistical approach to machine translation
A statistical approach to machine translationA statistical approach to machine translation
A statistical approach to machine translation
 
Towards OpenLogos Hybrid Machine Translation - Anabela Barreiro
Towards OpenLogos Hybrid Machine Translation - Anabela BarreiroTowards OpenLogos Hybrid Machine Translation - Anabela Barreiro
Towards OpenLogos Hybrid Machine Translation - Anabela Barreiro
 
TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013
 
Statistical machine translation in a few slides
Statistical machine translation in a few slidesStatistical machine translation in a few slides
Statistical machine translation in a few slides
 
Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approach
 
Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?
 
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
 
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
 
Mirai Translate - TAUS Tokyo 2015
Mirai Translate - TAUS Tokyo 2015Mirai Translate - TAUS Tokyo 2015
Mirai Translate - TAUS Tokyo 2015
 
How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)
How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)
How to keep post-editors engaged and prevent attrition. (Jose Sanchez, eBay)
 
TAUS Moses Roundtable, Prague, 11 September 2013
TAUS Moses Roundtable, Prague, 11 September 2013TAUS Moses Roundtable, Prague, 11 September 2013
TAUS Moses Roundtable, Prague, 11 September 2013
 

Similar a TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine

MEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docx
MEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docxMEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docx
MEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docxARIV4
 
Tokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdfTokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdfssuser2ae721
 
CONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docx
CONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docxCONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docx
CONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docxdonnajames55
 
Transcend_BeyondTXT_Promo
Transcend_BeyondTXT_PromoTranscend_BeyondTXT_Promo
Transcend_BeyondTXT_PromoJane Rudov
 
SDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated TranslationSDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated TranslationSDL Trados
 
What’s new in Rational collaborative lifecycle management 2011?
What’s new in Rational collaborative lifecycle management 2011?What’s new in Rational collaborative lifecycle management 2011?
What’s new in Rational collaborative lifecycle management 2011?IBM Danmark
 
03 software test-plan-template
03 software test-plan-template03 software test-plan-template
03 software test-plan-templateAndrei Hortúa
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)TAUS - The Language Data Network
 
Archana G_Resume
Archana G_ResumeArchana G_Resume
Archana G_Resumearchu3011
 
Archana kalapgar 19210184_ca684
Archana kalapgar 19210184_ca684Archana kalapgar 19210184_ca684
Archana kalapgar 19210184_ca684ArchanaKalapgar
 
Webinar: Does Your Data Center Need NVMe?
Webinar: Does Your Data Center Need NVMe?Webinar: Does Your Data Center Need NVMe?
Webinar: Does Your Data Center Need NVMe?Storage Switzerland
 
Intel® Xeon Phi™ processor (codenamed Knights Landing) applications Code and...
Intel® Xeon Phi™ processor (codenamed Knights Landing)applications Code and...Intel® Xeon Phi™ processor (codenamed Knights Landing)applications Code and...
Intel® Xeon Phi™ processor (codenamed Knights Landing) applications Code and...Ilson Schames
 
Brainware ITAM Review Tools Day
Brainware ITAM Review Tools Day Brainware ITAM Review Tools Day
Brainware ITAM Review Tools Day Martin Thompson
 
Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012
Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012
Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012UGIF
 

Similar a TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine (20)

MEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docx
MEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docxMEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docx
MEDICAL FACILITY ANALYSIS2MEDICAL FACILITY ANALYSIS16.docx
 
Tokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdfTokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdf
 
CONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docx
CONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docxCONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docx
CONSULTANT ANALYSIS FOR MEDICAL FACILITY2CONSULTANT ANALYSIS FO.docx
 
cxpbroch
cxpbrochcxpbroch
cxpbroch
 
Transcend_BeyondTXT_Promo
Transcend_BeyondTXT_PromoTranscend_BeyondTXT_Promo
Transcend_BeyondTXT_Promo
 
SDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated TranslationSDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated Translation
 
What’s new in Rational collaborative lifecycle management 2011?
What’s new in Rational collaborative lifecycle management 2011?What’s new in Rational collaborative lifecycle management 2011?
What’s new in Rational collaborative lifecycle management 2011?
 
Feasible
FeasibleFeasible
Feasible
 
03 software test-plan-template
03 software test-plan-template03 software test-plan-template
03 software test-plan-template
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
 
Archana G_Resume
Archana G_ResumeArchana G_Resume
Archana G_Resume
 
Ameya_Kasbekar_Resume
Ameya_Kasbekar_ResumeAmeya_Kasbekar_Resume
Ameya_Kasbekar_Resume
 
SANTOSH KUMAR M -FD
SANTOSH KUMAR M -FDSANTOSH KUMAR M -FD
SANTOSH KUMAR M -FD
 
Archana kalapgar 19210184_ca684
Archana kalapgar 19210184_ca684Archana kalapgar 19210184_ca684
Archana kalapgar 19210184_ca684
 
Webinar: Does Your Data Center Need NVMe?
Webinar: Does Your Data Center Need NVMe?Webinar: Does Your Data Center Need NVMe?
Webinar: Does Your Data Center Need NVMe?
 
Intel® Xeon Phi™ processor (codenamed Knights Landing) applications Code and...
Intel® Xeon Phi™ processor (codenamed Knights Landing)applications Code and...Intel® Xeon Phi™ processor (codenamed Knights Landing)applications Code and...
Intel® Xeon Phi™ processor (codenamed Knights Landing) applications Code and...
 
Brainware ITAM Review Tools Day
Brainware ITAM Review Tools Day Brainware ITAM Review Tools Day
Brainware ITAM Review Tools Day
 
LVTS Projects
LVTS ProjectsLVTS Projects
LVTS Projects
 
Ia rm001 -en-p
Ia rm001 -en-pIa rm001 -en-p
Ia rm001 -en-p
 
Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012
Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012
Ugif 10 2012 informix pssc-benchmark -l.revel_oct2012
 

Más de TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine

  • 1. TAUS USER CONFERENCE 2010 LANGUAGE BUSINESS INNOVATION 4 – 6 OCTOBER / PORTLAND (OR), USA TUESDAY 5 OCTOBER / 11.15 THE DEEP HYBRID MACHINE TRANSLATION ENGINE Olga Beregovaya, PROMT
  • 2. Company Profile
      • Experienced. Founded in 1991
      • International. Offices in US, Germany, Russia
      • Innovative. 150 employees, 80 of them are in R&D
      • Widely used. Over 120 million hits per month on our online translation sites
  • 3. Enterprise MT User Challenges
      Market need: translated content built of fluent and relevant sentences that preserve metadata information, branding, tone of voice and terminology.
      Source: This error occurs in SQL Partner products when code in a trigger cancels the operation using the SQL RAISE function, or if the SQLConnection.cancel or SQLStatement.cancel methods are called when a statement is executed using SQLStatement.execute or SQLStatement.next.
      Not-so-good Target: Este error se produce en SQL productos de socios cuando el código en un desencadenador se cancela la operación utilizando la función de SQL levantar, o si el o los métodos SQLConnection.cancel SQLStatement.cancel se llama cuando una declaración se ejecuta utilizando SQLStatement.execute o SQLStatement.next.
      Known challenges:
      • RBMT limitations: fluency, terminology, engine customization effort
      • SMT limitations: sentence structure, duplicates and omissions, over-normalization both at training and at run-time
  • 4. PROMT DeepHybrid Engine – Taking on the Challenge
      PROMT DeepHybrid – both approaches work side by side providing the best choice possible during each step of the translation process
      • Fluency: PROMT DeepHybrid approach increases the fluency of the final translation by letting the corpus make translation choices – both grammatical and lexical
      • Sentence structure: PROMT DeepHybrid preserves the syntactic accuracy and predictability of the RBMT engine output
      • Relevance: PROMT DeepHybrid combines terminology management capabilities of RBMT systems with SMT corpus-based terminology validation
      PROMT DeepHybrid also supports and enhances existing key product features:
      • Style integrity: PROMT’s Virtual Style Guide technology automates the preservation of tone-of-voice and corporate identity through automated rules selection
      • Extensive Metadata Support (Translation Anchors): the rule-based core engine takes on all heavy-duty metadata processing and preservation
  • 5. PROMT DeepHybrid Engine Flowchart
      [Flowchart; the diagram layout was lost in extraction and only its labels survive.]
      Asset groups, each split into PROMT assets and client assets: Dictionaries, Rules, Corpora
      Asset labels: general and domain dictionaries, client dictionaries and glossaries, translation and transfer rules, client dictionary preferences, parallel and monolingual corpora, client TMs
      Pipeline stages: Source, Branching Transfer, Translation Candidates (1, 2, ... X), Statistical Post-Editing, Post-edited Translation Candidates (1, 2, ... X), Language Model, Best Target Selection
  • 6. PROMT DeepHybrid at a Glance
      Branching Transfer
      PROMT Branching Transfer is a sequence of rule-based algorithmic steps enhanced by client-specific statistical input
      • Translation choices largely depend on the context; PROMT engine only makes the most apparent forced choices; otherwise, all probable instances are generated
      • PROMT translation model usually produces 4-12 candidates for a 10-15 word sentence after tree pruning techniques are applied
      • Each step during lexical analysis relies on terminology from client TMs in addition to baseline PROMT dictionaries
      • Each step during syntactic analysis relies on PROMT rule library enhanced by syntactic patterns mined from client TMs
      Statistical Post-editing
      Before being fed to the language model, the candidate translations undergo statistical post-editing based on the sub-sentential parsing of both MT output and client data
      Language Model:
      • General corpora – billions of words are available
      • In-domain corpora – pooling TDA data is helpful because of the thin domain space
      • Client corpora provide probability skew in favor of client-specific choices
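The slides do not say how the general, in-domain and client corpora are combined inside the language model. As a rough illustration of the "probability skew" idea only, the sketch below assumes simple linear interpolation of two toy unigram models. The function names, the weight `lam` and all probabilities are invented for this example and are not PROMT's implementation.

```python
import math

def interpolate(p_general, p_client, lam):
    """Linear interpolation of two unigram models (an assumed technique)."""
    vocab = set(p_general) | set(p_client)
    return {
        w: (1 - lam) * p_general.get(w, 0.0) + lam * p_client.get(w, 0.0)
        for w in vocab
    }

def perplexity(tokens, model, floor=1e-6):
    """Unigram perplexity; unseen tokens get a small floor probability."""
    log_sum = sum(math.log(max(model.get(t, 0.0), floor)) for t in tokens)
    return math.exp(-log_sum / len(tokens))

# Toy probabilities, invented for illustration.
general = {"presenta": 0.6, "incluye": 0.4}
client = {"presenta": 0.1, "incluye": 0.9}

for lam in (0.0, 0.8):
    model = interpolate(general, client, lam)
    scores = {w: perplexity([w], model) for w in ("presenta", "incluye")}
    print(lam, min(scores, key=scores.get))
```

With `lam = 0.0` the general model alone prefers "presenta". Raising the client weight to 0.8 shifts the lower perplexity to "incluye", which is the kind of skew toward client-specific choices the slide describes.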
  • 7. PROMT DeepHybrid Engine: Candidate Selection Examples
      Example 1: Syntactic choice
      Source: It is used for patient information, lab results, reports, images, and clinical data.
      RBMT translation: Es usado para información sobre los pacientes, resultados del laboratorio, informes, imágenes, y datos clínicos.
      Hybrid engine candidates:
        a) Es usado para información sobre los pacientes, resultados del laboratorio, informes, imágenes, y datos clínicos. ppl= 791.4319204909
        b) Se usa para información sobre los pacientes, resultados del laboratorio, informes, imágenes, y datos clínicos. ppl= 424.83820234214
        c) Está usado para información sobre los pacientes, resultados del laboratorio, informes, imágenes, y datos clínicos. ppl= 814.24328845084
      Hybrid Outcome: Se usa para información sobre los pacientes, resultados del laboratorio, informes, imágenes, y datos clínicos.
      Example 2: Lexical choice
      Source: The "Nehalem" system architecture features an integrated memory controller
      RBMT translation: La arquitectura del sistema "Nehalem" presenta un controlador de memoria integrado
      Hybrid engine candidates:
        a) La arquitectura del sistema "Nehalem" presenta un controlador de memoria integrado ppl= 288.17916810444
        b) La arquitectura del sistema "Nehalem" incluye un controlador de memoria integrado ppl= 234.86938828311
      Hybrid Outcome: La arquitectura del sistema "Nehalem" incluye un controlador de memoria integrado
      • Hybrid engine chooses the candidate with the lowest perplexity (ppl)
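The selection rule stated on this slide, picking the candidate with the lowest perplexity, can be sketched in a few lines. The candidate strings and ppl values below are copied from the slide. The function name `select_best` and the data layout are assumptions made for illustration, and the perplexities are taken as given rather than computed.

```python
def select_best(candidates):
    """Return the (text, ppl) pair with the lowest perplexity."""
    return min(candidates, key=lambda pair: pair[1])

TAIL = ("para información sobre los pacientes, resultados del laboratorio, "
        "informes, imágenes, y datos clínicos.")

# Example 1 (syntactic choice): the candidates differ only in the verb form.
example_1 = [
    ("Es usado " + TAIL, 791.4319204909),
    ("Se usa " + TAIL, 424.83820234214),
    ("Está usado " + TAIL, 814.24328845084),
]

# Example 2 (lexical choice): "presenta" vs. "incluye".
example_2 = [
    ('La arquitectura del sistema "Nehalem" presenta un controlador de memoria integrado',
     288.17916810444),
    ('La arquitectura del sistema "Nehalem" incluye un controlador de memoria integrado',
     234.86938828311),
]

for candidates in (example_1, example_2):
    text, ppl = select_best(candidates)
    print(f"{ppl:.2f}  {text}")
```

In both examples the selected candidate matches the "Hybrid Outcome" shown on the slide: the "Se usa" variant in Example 1 and the "incluye" variant in Example 2.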
  • 8. Statistical Post-Editing at runtime
      Example 1
      Source: The following options were included with this subscription:
      Pre post-editing: Las opciones siguientes se incluyeron con esta suscripción:
      After post-editing: Las siguientes opciones se han incluido con esta suscripción:
      Reference human translation: Las siguientes opciones se han incluido con su suscripción:
      Example 2
      Source: To meet financial service industry regulations, we need to confirm some of your personal information.
      Pre post-editing: Para encontrar normas de la industria del servicio financiero, tenemos que confirmar un poco de su información personal.
      After post-editing: Para cumplir las normas de la industria de servicios financieros, necesitamos confirmar parte de información personal.
      Reference human translation: Para cumplir las normas de la industria de servicios financieros, necesitamos confirmar información personal suya.
  • 9. PROMT 9.0 vs. PROMT DeepHybrid BLEU Scores (English-Spanish)
      Engine status       Sample 1          Sample 2          Sample 3
                          (~1,800 words)    (~2,000 words)    (~2,500 words)
      Out-of-the-box      31.80             26.74             29.02
      Customized RBMT     39.00             34.30             36.50
      PROMT DeepHybrid    46.20             41.02             43.65
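To make the size of the improvement easier to read, the short sketch below computes the absolute BLEU gain between engine configurations for each sample. The scores are copied from the slide. The dictionary layout, the row labels and the helper `gains` are assumptions made for this illustration.

```python
# BLEU scores from the slide, English-Spanish, samples 1-3.
bleu = {
    "Out-of-the-box": [31.80, 26.74, 29.02],
    "Customized RBMT": [39.00, 34.30, 36.50],
    "PROMT DeepHybrid": [46.20, 41.02, 43.65],
}

def gains(baseline, improved):
    """Absolute BLEU point difference per sample, rounded to 2 decimals."""
    return [round(new - old, 2) for old, new in zip(baseline, improved)]

customization_gain = gains(bleu["Out-of-the-box"], bleu["Customized RBMT"])
hybrid_gain = gains(bleu["Customized RBMT"], bleu["PROMT DeepHybrid"])

print("Customization over out-of-the-box:", customization_gain)
print("DeepHybrid over customized RBMT:  ", hybrid_gain)
```

On these three samples the DeepHybrid engine adds roughly 7 BLEU points on top of the customized RBMT engine, about the same as the gain customization gives over the out-of-the-box engine.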
  • 10. PROMT DeepHybrid – Bridging the Post-Editing Gap
      Post-editing throughput for PROMT customized translation is now reported to range between 4,000 and 8,000 words a day
      Average post-editing effort consists of:
      • Correcting sentence structure
      • Correcting part-of-speech errors
      • Correcting general grammar errors
      • Looking up terminology
      • Reordering meta-tags
      PROMT DeepHybrid technology addresses the above challenges, which will have an even greater impact on post-editors’ productivity
  • 11. PROMT DeepHybrid – Putting the Puzzle together
      Not-so-good target: Este error se produce en SQL productos de socios cuando el código en un desencadenador se cancela la operación utilizando la función de SQL levantar, o si el o los métodos SQLConnection.cancel SQLStatement.cancel se llama cuando una declaración se ejecuta utilizando SQLStatement.execute o SQLStatement.next.
      Good sentence: Este error ocurre en los productos SQL Partner cuando código en un activador cancela la operación llamando a la función SQL RAISE, o si los métodos SQLConnection.cancel o SQLStatement.cancel son llamados cuando la declaración es ejecutada usando SQLStatement.execute o SQLStatement.next.
      PROMT DeepHybrid – up to the challenge!