TAUS MACHINE TRANSLATION SHOWCASE


Google Translator Toolkit

14:20 – 14:40
Wednesday, 10 April 2013

Patcharin Areewong
Google
Google Confidential and Proprietary




                                  Machine Translation




Patcharin Areewong
Language Specialist • Singapore


Agenda




•  Introduction

•  Google Translate

•  Google Translator Toolkit

•  Demo


Translation is the key to universal access


            “Organize the world's information and make it
                 universally accessible and useful.”

kiana · привет · hallo · hello · こんにちは · 你好 · hola · مرحبا · !लो · bonjour · jambo · napaykullayki · olá · mhoroi


A translator’s story…




Translator
2,000 words per day × 250 days per year = 500,000 words per year


The Google story…


Google
1,000,000,000 new pages indexed per year*
400 words per page
400,000,000,000 words per year

400B / 0.5M = 800,000 translators
and that’s for only 1 additional language!



 * actually, it’s several billion pages but let’s round it down to 1 billion
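The estimate on this slide can be reproduced with a short sketch. The figures come from the slides; the function and parameter names are hypothetical, and the `extra_languages` parameter is an assumption added for illustration, since the slide only works the case of one additional language.

```python
# Figures taken from the slides; all names here are illustrative.
WORDS_PER_DAY = 2_000
WORKING_DAYS_PER_YEAR = 250
NEW_PAGES_PER_YEAR = 1_000_000_000  # rounded down, per the slide footnote
WORDS_PER_PAGE = 400


def translator_words_per_year(words_per_day=WORDS_PER_DAY,
                              days=WORKING_DAYS_PER_YEAR):
    """Annual output of one human translator."""
    return words_per_day * days


def translators_needed(pages=NEW_PAGES_PER_YEAR,
                       words_per_page=WORDS_PER_PAGE,
                       extra_languages=1):
    """Translators required to cover one year of newly indexed pages.

    extra_languages is an assumption for illustration; the slide only
    covers the case of 1 additional language.
    """
    total_words = pages * words_per_page * extra_languages
    capacity = translator_words_per_year()
    # Ceiling division, so a partial workload still counts as one translator.
    return -(-total_words // capacity)


if __name__ == "__main__":
    print(translator_words_per_year())  # 500000
    print(translators_needed())         # 800000
```

Under these assumptions the requirement scales linearly, so each further target language would add roughly another 800,000 translators.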


So much content! So little resource!


Organize the world’s information and make it universally accessible and useful.

8% of the world’s population speaks Arabic, but only 1% of the web is in Arabic.

>50% of Google searches come from outside the United States.
Localized search in more than 55 countries and 35 languages.
Google UI available in 117 languages across 157 international domains.

>99% of what people write, say, or generate never leaves the language in which it was created.




 http://www.commonsenseadvisory.com/Resources/FactsandFigures/tabid/1213/Default.aspx

Madar research, 2007


Machine translation is better than nothing



•  Handles overwhelming volume
•  Provides immediate turnaround
•  Serves users with no access to a human translator
•  Users are one click away from inaccessible content






Drive demand for human translation




Variable                Value
Customer demand         1 vs. 10
Revenue per customer    €10 vs. €100


An MT experiment can measure customer demand and profitability.
If the content proves valuable, it is worth the price of human
translation.


If you talk to a man in a language he understands, that goes to his head.
If you talk to him in his language, that goes to his heart.

                                                         – Nelson Mandela



Expand translation industry



Human translation expensive for content with uncertain value
•  If 10 readers, expensive to pay $10,000 for translation
•  If 100,000 readers, $10,000 translation worth the price
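The two cases above come down to cost per reader. A minimal sketch of that arithmetic, assuming the $10,000 price and the reader counts from the slide; the function name `cost_per_reader` is hypothetical:

```python
def cost_per_reader(translation_cost, readers):
    """Spread a fixed human-translation cost over the readers it reaches."""
    if readers <= 0:
        raise ValueError("readers must be positive")
    return translation_cost / readers


if __name__ == "__main__":
    # Reader counts and the $10,000 price are the slide's own examples.
    for readers in (10, 100_000):
        cost = cost_per_reader(10_000, readers)
        print(f"{readers:,} readers: ${cost:,.2f} per reader")
```

With 10 readers the translation costs $1,000 per reader; with 100,000 readers it costs $0.10 per reader. The slide does not say at what point the price becomes worth paying, so no threshold is assumed here.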



Monitor adoption, feedback
•  Human-translate content whose machine translation is inscrutable, offensive, or misleading

•  Human-translate high-traffic content: worth the price



 Bottom line:
 - Quick experiments
 - Low cost








Target users:      amateurs and advanced translators
Languages:         100,000+ pairs
MT languages:      88 (includes variants)
UI languages:      36
File types:        AdWords, HTML, TXT, PO, Android App, Chrome Extension, RTF, subtitles, Word, ODT
General features:  WYSIWYG editor, sharing, chat, spell check, publish to Google Docs
Advanced features: translation memory, glossary, Google dictionary, label, concordance, scoping, split/merge, repetition


Google: open translation ecosystem



How can you be big without being evil? We don’t trap end users. So if you don’t like Google, if for whatever reason we do a bad job for you, we make it easy for you to move to our competitor.
– Eric Schmidt

Data liberation!


 Google translation follows “data liberation” principle
 •  Support open standards, APIs


 Data stored in Translator Toolkit is your data – not Google’s
 •  Cloud storage for access, scale
 •  Documents, glossaries, and TMs for your eyes only unless you share
 •  You control addition, deletion of your translations


Demo Time!




           Try it today at
http://translate.google.com/toolkit




Questions?




