Joachim Daiber
Institute for Logic, Language and Computation
University of Amsterdam
On Using Syntactic Preordering Models to Delimit Morphosyntactic Search Space
Introduction Word order Morphology Conclusion
Introduction Motivation
Introduction
Project title: Exploiting hierarchical alignments for linguistically-informed
SMT models to meet the hybrid approaches that aim at compositional
translation
▶ ESR 10
▶ University of Amsterdam
▶ Supervisor: Prof. Khalil Sima'an
Motivation
▶ Current MT models work well if languages are structurally similar
▶ Difficulties with morphologically rich languages:
− freer word order
− more productive morphological inflections
− agreement over long distances
der Mann schlug Peter
the man punched Peter
der Mann schlug Peter

den Mann schlug Peter
Peter punched the man
Peter schlug den Mann
Source preordering Preordering spaces Evaluation
Part I: Word Order
Preordering source trees
[Figure: dependency tree over the source sentence "Peter escaped from the police." with the labels Root, Sb, AuxP, Adv and AuxA, shown with the German words Peter, entkam, der (case=dat) and Polizei (case=dat).]
▶ Source dependency trees are well suited for preordering:
− Lerner and Petrov (2013) present two classifier-based dep. tree preordering models
− Jehl et al. (2014) and de Gispert et al. (2015) preorder dep. trees via branch-and-bound search
▶ Lerner and Petrov (2013) preorder trees starting at the root
▶ Order all children (model 1) or left and right children (model 2)
[Figure: dependency tree over the source sentence "Peter escaped from the police." with the labels Root, Sb, AuxP, Adv and AuxA.]
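A minimal sketch of top-down preordering, in the spirit of ordering a head together with all of its children in one step (model 1). The Node class, the toy_score function and the example tree are hypothetical stand-ins: a real system would use a trained classifier instead of the hand-written scorer, and the labels here are only illustrative.

```python
from dataclasses import dataclass, field
from itertools import combinations, permutations
from typing import Callable, List, Sequence


@dataclass(eq=False)
class Node:
    """A dependency tree node (hypothetical structure for this sketch)."""
    word: str
    label: str
    position: int                      # index of the word in the source sentence
    children: List["Node"] = field(default_factory=list)


def preorder(node: Node, score: Callable[[Sequence[Node], Node], float]) -> List[str]:
    """Pick the best-scoring order of a head and its children, then recurse."""
    family = sorted(node.children + [node], key=lambda n: n.position)
    # Exhaustive search over permutations; only feasible for small families.
    best = max(permutations(family), key=lambda order: score(order, node))
    words: List[str] = []
    for member in best:
        if member is node:
            words.append(node.word)
        else:
            words.extend(preorder(member, score))
    return words


def toy_score(order: Sequence[Node], head: Node) -> float:
    """Assumed toy scorer: heads labelled 'Pred' go last, everything else stays in source order."""
    def inversions(nodes: Sequence[Node]) -> int:
        positions = [n.position for n in nodes]
        return sum(a > b for a, b in combinations(positions, 2))

    if head.label == "Pred":
        dependents = [n for n in order if n is not head]
        return list(order).index(head) - inversions(dependents)
    return -inversions(order)


# Hypothetical tree for "he has read the book".
the = Node("the", "Det", 3)
book = Node("book", "Obj", 4, [the])
tree = Node("read", "Pred", 2, [Node("he", "Sb", 0), Node("has", "Aux", 1), book])
reordered = preorder(tree, toy_score)
```

With this toy scorer the verb moves to the end of its family while every subtree stays contiguous, which is the locality constraint that tree-based preordering imposes.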
Generating a preordering space
▶ Both Lerner and Petrov (2013) and Jehl et al. (2014) make only
single-best predictions
▶ We want:
− ALL REASONABLE predictions instead of SINGLE BEST
− a more flexible model
Multiple predictions and a more flexible model
▶ Multiple predictions
− Mistakes in order decisions propagate
− Extract n-best decisions from the model to pass to later models
▶ Making the model more flexible
− Bad: order decisions are local to tree families
− Non-local features would help (e.g. LM)
→ integration via cube pruning
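A rough sketch of the lazy k-best combination at the core of cube pruning, assuming two n-best lists (for example, the orderings of two subtrees) that are combined and rescored with a non-local feature. The function names, the toy_lm scorer and the scores are all hypothetical. Because the non-local score can break the monotonicity of the grid, the search is approximate rather than exact.

```python
import heapq
from typing import Callable, List, Tuple

Hyp = Tuple[float, List[str]]  # (log score, words)


def cube_combine(left: List[Hyp], right: List[Hyp],
                 nonlocal_score: Callable[[List[str]], float],
                 k: int) -> List[Hyp]:
    """Combine two n-best lists (each sorted best-first) into up to k hypotheses."""
    if not left or not right:
        return []

    def make(i: int, j: int):
        words = left[i][1] + right[j][1]
        total = left[i][0] + right[j][0] + nonlocal_score(words)
        return (-total, i, j, words)   # negated because heapq is a min-heap

    heap = [make(0, 0)]
    seen = {(0, 0)}
    results: List[Hyp] = []
    while heap and len(results) < k:
        neg_total, i, j, words = heapq.heappop(heap)
        results.append((-neg_total, words))
        # Expand the two grid neighbours of the popped cell.
        for ni, nj in ((i + 1, j), (i, j + 1)):
            if ni < len(left) and nj < len(right) and (ni, nj) not in seen:
                seen.add((ni, nj))
                heapq.heappush(heap, make(ni, nj))
    return results


def toy_lm(words: List[str]) -> float:
    """Assumed stand-in for a language model: penalise sequences starting with 'police'."""
    return -4.0 if words[0] == "police" else 0.0


left = [(-0.5, ["the", "police"]), (-2.0, ["police", "the"])]
right = [(-0.25, ["escaped"]), (-1.5, ["fled"])]
top3 = cube_combine(left, right, toy_lm, k=3)
```

Only the cells next to already popped hypotheses are scored, so the non-local feature is evaluated for a small part of the full grid.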
Making the model more flexible
▶ Use standard log-linear model (Och and Ney, 2002)
$$\hat{s}' = \arg\max_{s'} \sum_i \lambda_i \log \phi_i(s')$$
▶ Where to get the weights?
− PRO: tuning as ranking (Hopkins and May, 2011)
− Scoring functions:
1. Kendall's τ coefficient
2. Simulate a word-level MT system, score by BLEU
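A small sketch of the two ingredients above: the log-linear score used to pick a preordering, and Kendall's τ as one possible scoring function for tuning. The feature names and values are hypothetical, and the τ here is the plain coefficient in [-1, 1] for orderings without ties, which may be scaled differently from the figures reported on the slides.

```python
import math
from itertools import combinations
from typing import Dict, List, Sequence


def kendall_tau(predicted: Sequence[int], reference: Sequence[int]) -> float:
    """Kendall's tau between two orderings of the same items (no ties)."""
    rank = {item: r for r, item in enumerate(reference)}
    ranks = [rank[item] for item in predicted]
    pairs = list(combinations(ranks, 2))
    if not pairs:
        return 1.0
    concordant = sum(a < b for a, b in pairs)
    # (concordant - discordant) / total, with discordant = total - concordant
    return (2 * concordant - len(pairs)) / len(pairs)


def loglinear_score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """sum_i lambda_i * log(phi_i) for one candidate; feature values must be positive."""
    return sum(weights[name] * math.log(value) for name, value in features.items())


def best_candidate(candidates: List[Dict[str, float]], weights: Dict[str, float]) -> int:
    """Index of the candidate with the highest log-linear score."""
    return max(range(len(candidates)),
               key=lambda i: loglinear_score(candidates[i], weights))


# Hypothetical feature values for two candidate preorderings.
candidates = [{"order": 0.5, "lm": 0.1}, {"order": 0.4, "lm": 0.5}]
with_lm = best_candidate(candidates, {"order": 1.0, "lm": 1.0})
without_lm = best_candidate(candidates, {"order": 1.0, "lm": 0.0})
```

In this toy setting the language-model feature changes which candidate wins, which is the effect the tuned weights λ control.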
Do non-local features help?
Model                   Kendall's τ   BLEU (ŝ′ → s′)
First-best −LM          92.16         68.1
First-best +LM (cube)   92.27         68.7
Quality of the preordering space
▶ Experiments with top 10 preordering outputs of this model
                  Distortion   BLEU    MTR     TER
Baseline          7            15.2    35.4    66.6
Oracle (k = 10)                17.26   37.97   62.64
Motivation Prediction on source trees Learning what to predict
Part II: Morphology
Morphology
▶ Word order is only one part of the problem for morphologically rich languages (MRLs)
▶ Many linguistic properties are not expressed via word order
▶ Three questions:
− Does knowing morphological target properties help?
− Can we predict these on source trees?
− Which properties should we predict?
Does knowing morphological target properties help?
▶ Perform morph. tagging of target side of translation
▶ Project morphological attributes via the alignments
Decoration   Morph. attributes     Tags   BLEU
None         -                     -      15.12
Gold         All attributes        846    15.96
Gold         Manual selection      77     15.86
Gold         Automatic selection   225    15.73
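A minimal sketch of the projection step, assuming word alignments are given as (source index, target index) pairs and target tags as one attribute dictionary per target token. The function name, the alignment and the tie-breaking rule (keep the first value seen for an attribute) are assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def project_attributes(n_source: int,
                       target_tags: List[Dict[str, str]],
                       alignment: List[Tuple[int, int]]) -> List[Dict[str, str]]:
    """Copy target-side morphological attributes onto the aligned source tokens."""
    projected: List[Dict[str, str]] = [dict() for _ in range(n_source)]
    for src, tgt in sorted(alignment):
        for attr, value in target_tags[tgt].items():
            # If several target tokens align to one source token,
            # keep the first value seen (an arbitrary choice in this sketch).
            projected[src].setdefault(attr, value)
    return projected


# Source: Peter escaped from the police / Target: Peter entkam der Polizei
target_tags = [{}, {}, {"case": "dat"}, {"case": "dat"}]
alignment = [(0, 0), (1, 1), (3, 2), (4, 3)]   # hypothetical alignment
decorated = project_attributes(5, target_tags, alignment)
```

Unaligned source tokens, such as "from" in this example, simply receive no attributes.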
Predicting target morphology on source trees
▶ Prediction based on dependency chains instead of linear chains
▶ Can take into account full syntactic context
[Figure: dependency tree over the source sentence "Peter escaped from the police." with the labels Root, Sb, AuxP, Adv and AuxA, shown with the German words Peter, entkam, der (case=dat) and Polizei (case=dat).]
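To illustrate what a dependency chain gives a predictor compared with a linear chain, the sketch below collects the path from the root down to a token. The head indices encode an assumed parse of the example sentence (with the preposition heading the noun); they are not taken from the slides.

```python
from typing import List


def dependency_chain(heads: List[int], token: int) -> List[int]:
    """Token indices on the path from the root down to `token`.

    heads[i] is the index of the head of token i, or -1 for the root.
    """
    chain = [token]
    seen = {token}
    while heads[chain[-1]] != -1:
        parent = heads[chain[-1]]
        if parent in seen:
            raise ValueError("cycle in head indices")
        seen.add(parent)
        chain.append(parent)
    return chain[::-1]


words = ["Peter", "escaped", "from", "the", "police"]
heads = [1, -1, 1, 4, 2]   # assumed parse: escaped is the root
chain_for_the = [words[i] for i in dependency_chain(heads, 3)]
```

A tagger that conditions on this chain sees the governing verb and preposition for "the", whereas a linear chain would only see its left neighbours.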
Learning what to predict
Idea: Only include an attribute if it leads to better lexical selection
Learning procedure (sketch):
1. Decorate the source with all attributes
2. Calculate the likelihood of a held-out set with a word-based system (IBM Model 1)
3. As long as the likelihood increases:
− Find the worst attribute by merging tags and recalculating the likelihood
− Remove the attribute, re-align
− Repeat
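The procedure above can be sketched as greedy backward elimination. The heldout_loglik callable is a hypothetical stand-in for "re-align and compute the held-out likelihood with IBM Model 1"; the toy likelihood below only exists so the loop can be run.

```python
from typing import Callable, FrozenSet, Iterable, Set


def select_attributes(all_attributes: Iterable[str],
                      heldout_loglik: Callable[[FrozenSet[str]], float]) -> Set[str]:
    """Drop attributes one at a time while the held-out likelihood improves."""
    current = frozenset(all_attributes)
    current_ll = heldout_loglik(current)
    while current:
        # Likelihood obtained after removing each remaining attribute.
        after_removal = {a: heldout_loglik(current - {a}) for a in sorted(current)}
        worst = max(after_removal, key=after_removal.get)
        if after_removal[worst] <= current_ll:
            break                      # no removal helps any more
        current = current - {worst}
        current_ll = after_removal[worst]
    return set(current)


def toy_loglik(attributes: FrozenSet[str]) -> float:
    """Assumed toy likelihood: 'case' and 'number' help, anything else hurts."""
    useful = {"case", "number"}
    return -10.0 + 2.0 * len(attributes & useful) - 1.0 * len(attributes - useful)


selected = select_attributes(["case", "number", "gender", "degree"], toy_loglik)
```

Each iteration needs one likelihood evaluation per remaining attribute, so with a real alignment model the cost of the selection is dominated by retraining.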
Learning what to predict (English–German)
Part of speech   Manual selection                      Automatic selection
noun             gender†, number, case                 gender, number, case
adj              gender†, number‡, case‡, declension   gender, number, case, synpos, degree
verb             number‡*, person‡*, tense*, mode*     -
Additionally only in automatic: part:negativeness, part:subpos, punc:type, num:type.
Learning what to predict
                Manual   Automatic   All
Training 50k    36m      45m         77m
Training 100k   58m      82m         2h51m
Training 200k   1h54m    3h5m        6h44m
Best F1         72.67    74.67       62.18
Conclusion
Our work so far:
Question 1: Can we make syntactic preordering models more flexible and generate a space of possible preorderings?
Question 2: Can we predict target morphology on the source?
Current and future work:
Question 3: Can we combine both ideas to exploit interactions?
Thank You!
Any questions?
References
de Gispert, A., Iglesias, G., and Byrne, W. (2015). Fast and accurate preordering for SMT using neural networks. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT 2015).
Hopkins, M. and May, J. (2011). Tuning as ranking. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 1352–1362, Edinburgh, Scotland, UK. Association for Computational Linguistics.
Jehl, L., de Gispert, A., Hopkins, M., and Byrne, B. (2014). Source-side preordering for translation using logistic regression and depth-first branch-and-bound search. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pages 239–248, Gothenburg, Sweden. Association for Computational Linguistics.
Lerner, U. and Petrov, S. (2013). Source-side classifier preordering for machine translation. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 513–523, Seattle, Washington, USA. Association for Computational Linguistics.
Och, F. J. and Ney, H. (2002). Discriminative training and maximum entropy models for statistical machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, pages 295–302, Stroudsburg, PA, USA. Association for Computational Linguistics.
19/19

Más contenido relacionado

La actualidad más candente

Deep Dependency Graph Conversion in English
Deep Dependency Graph Conversion in EnglishDeep Dependency Graph Conversion in English
Deep Dependency Graph Conversion in EnglishJinho Choi
 
A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...Ilia Karpov
 
A Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description LogicsA Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description LogicsJie Bao
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information RetrievalNik Spirin
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Vsevolod Dyomkin
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Rajnish Raj
 
Python Training Institutes In Noida / Greater Noida.
Python Training Institutes In Noida / Greater Noida.Python Training Institutes In Noida / Greater Noida.
Python Training Institutes In Noida / Greater Noida.Aditya Kaushik
 
Learning Probabilistic Relational Models
Learning Probabilistic Relational ModelsLearning Probabilistic Relational Models
Learning Probabilistic Relational ModelsUniversity of Nantes
 
Random Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian NetworksRandom Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian NetworksUniversity of Nantes
 
14. Michael Oakes (UoW) Natural Language Processing for Translation
14. Michael Oakes (UoW) Natural Language Processing for Translation14. Michael Oakes (UoW) Natural Language Processing for Translation
14. Michael Oakes (UoW) Natural Language Processing for TranslationRIILP
 
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...Chunyang Chen
 
Modular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and ExpressivityModular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and ExpressivityJie Bao
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational SemanticsMarina Santini
 
An exact approach to learning Probabilistic Relational Model
An exact approach to learning Probabilistic Relational ModelAn exact approach to learning Probabilistic Relational Model
An exact approach to learning Probabilistic Relational ModelUniversity of Nantes
 
Adapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology LanguageAdapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology LanguageJie Bao
 
Near Duplicate Document Detection: Mathematical Modeling and Algorithms
Near Duplicate Document Detection: Mathematical Modeling and AlgorithmsNear Duplicate Document Detection: Mathematical Modeling and Algorithms
Near Duplicate Document Detection: Mathematical Modeling and AlgorithmsLiwei Ren任力偉
 

La actualidad más candente (20)

Deep Dependency Graph Conversion in English
Deep Dependency Graph Conversion in EnglishDeep Dependency Graph Conversion in English
Deep Dependency Graph Conversion in English
 
A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...
 
A Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description LogicsA Distributed Tableau Algorithm for Package-based Description Logics
A Distributed Tableau Algorithm for Package-based Description Logics
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information Retrieval
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)
 
Parekh dfa
Parekh dfaParekh dfa
Parekh dfa
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
 
Python Training Institutes In Noida / Greater Noida.
Python Training Institutes In Noida / Greater Noida.Python Training Institutes In Noida / Greater Noida.
Python Training Institutes In Noida / Greater Noida.
 
Learning Probabilistic Relational Models
Learning Probabilistic Relational ModelsLearning Probabilistic Relational Models
Learning Probabilistic Relational Models
 
Random Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian NetworksRandom Generation of Relational Bayesian Networks
Random Generation of Relational Bayesian Networks
 
L3 v2
L3 v2L3 v2
L3 v2
 
14. Michael Oakes (UoW) Natural Language Processing for Translation
14. Michael Oakes (UoW) Natural Language Processing for Translation14. Michael Oakes (UoW) Natural Language Processing for Translation
14. Michael Oakes (UoW) Natural Language Processing for Translation
 
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
Unsupervised Software-Specific Morphological Forms Inference from Informal Di...
 
Modular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and ExpressivityModular Ontologies - A Formal Investigation of Semantics and Expressivity
Modular Ontologies - A Formal Investigation of Semantics and Expressivity
 
8 issues in pos tagging
8 issues in pos tagging8 issues in pos tagging
8 issues in pos tagging
 
Nlp
NlpNlp
Nlp
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational Semantics
 
An exact approach to learning Probabilistic Relational Model
An exact approach to learning Probabilistic Relational ModelAn exact approach to learning Probabilistic Relational Model
An exact approach to learning Probabilistic Relational Model
 
Adapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology LanguageAdapt OWL as a Modular Ontology Language
Adapt OWL as a Modular Ontology Language
 
Near Duplicate Document Detection: Mathematical Modeling and Algorithms
Near Duplicate Document Detection: Mathematical Modeling and AlgorithmsNear Duplicate Document Detection: Mathematical Modeling and Algorithms
Near Duplicate Document Detection: Mathematical Modeling and Algorithms
 

Similar a ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015

An Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextAn Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextFulvio Rotella
 
2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.pptmilkesa13
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondIsabelle Augenstein
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.pptbutest
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language ProcessingMichel Bruley
 
Closing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary LinguisticsClosing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary LinguisticsBaden Hughes
 
Artificial intelligence for Social Good
Artificial intelligence for Social GoodArtificial intelligence for Social Good
Artificial intelligence for Social GoodOana Tifrea-Marciuska
 
A Simple Explanation of XLNet
A Simple Explanation of XLNetA Simple Explanation of XLNet
A Simple Explanation of XLNetDomyoung Lee
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
NAACL2015 presentation
NAACL2015 presentationNAACL2015 presentation
NAACL2015 presentationHan Xu, PhD
 
Yves Peirsman - Deep Learning for NLP
Yves Peirsman - Deep Learning for NLPYves Peirsman - Deep Learning for NLP
Yves Peirsman - Deep Learning for NLPHendrik D'Oosterlinck
 
Latent Semantic Transliteration using Dirichlet Mixture
Latent Semantic Transliteration using Dirichlet MixtureLatent Semantic Transliteration using Dirichlet Mixture
Latent Semantic Transliteration using Dirichlet MixtureRakuten Group, Inc.
 
Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Julien PLU
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to HindiRajat Jain
 
Lecture 01: Machine Learning for Language Technology - Introduction
 Lecture 01: Machine Learning for Language Technology - Introduction Lecture 01: Machine Learning for Language Technology - Introduction
Lecture 01: Machine Learning for Language Technology - IntroductionMarina Santini
 
Natural language procssing
Natural language procssing Natural language procssing
Natural language procssing Rajnish Raj
 
ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5
ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5
ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5Daisuke BEKKI
 

Similar a ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015 (20)

An Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from TextAn Approach to Automated Learning of Conceptual Graphs from Text
An Approach to Automated Learning of Conceptual Graphs from Text
 
2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.ppt
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
Esa act
Esa actEsa act
Esa act
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
 
Closing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary LinguisticsClosing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary Linguistics
 
Artificial intelligence for Social Good
Artificial intelligence for Social GoodArtificial intelligence for Social Good
Artificial intelligence for Social Good
 
A Simple Explanation of XLNet
A Simple Explanation of XLNetA Simple Explanation of XLNet
A Simple Explanation of XLNet
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
NAACL2015 presentation
NAACL2015 presentationNAACL2015 presentation
NAACL2015 presentation
 
Yves Peirsman - Deep Learning for NLP
Yves Peirsman - Deep Learning for NLPYves Peirsman - Deep Learning for NLP
Yves Peirsman - Deep Learning for NLP
 
Latent Semantic Transliteration using Dirichlet Mixture
Latent Semantic Transliteration using Dirichlet MixtureLatent Semantic Transliteration using Dirichlet Mixture
Latent Semantic Transliteration using Dirichlet Mixture
 
Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?
 
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure SoulierHow to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
How to supervise a thesis in NLP in the ChatGPT era? By Laure Soulier
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
 
Lecture 01: Machine Learning for Language Technology - Introduction
 Lecture 01: Machine Learning for Language Technology - Introduction Lecture 01: Machine Learning for Language Technology - Introduction
Lecture 01: Machine Learning for Language Technology - Introduction
 
Natural language procssing
Natural language procssing Natural language procssing
Natural language procssing
 
ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5
ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5
ESSLLI2016 DTS Lecture Day 5-1: Introduction to day 5
 

Más de RIILP

Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD RIILP
 
Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic RIILP
 
Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones RIILP
 
Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones RIILP
 
Gianluca Giulinin - FAO
Gianluca Giulinin - FAO Gianluca Giulinin - FAO
Gianluca Giulinin - FAO RIILP
 
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic RIILP
 
Tony O'Dowd - KantanMT
Tony O'Dowd -  KantanMT Tony O'Dowd -  KantanMT
Tony O'Dowd - KantanMT RIILP
 
Santanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAARSantanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAARRIILP
 
Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU RIILP
 
Anna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMAAnna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMARIILP
 
Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD  Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD RIILP
 
Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW RIILP
 
Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA RIILP
 
Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU RIILP
 
Liling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAARLiling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAARRIILP
 
Sandra de luca - Acclaro
Sandra de luca - AcclaroSandra de luca - Acclaro
Sandra de luca - AcclaroRIILP
 
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015RIILP
 
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015RIILP
 
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015RIILP
 
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015RIILP
 

Más de RIILP (20)

Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD
 
Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic
 
Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones
 
Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones
 
Gianluca Giulinin - FAO
Gianluca Giulinin - FAO Gianluca Giulinin - FAO
Gianluca Giulinin - FAO
 
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
 
Tony O'Dowd - KantanMT
Tony O'Dowd -  KantanMT Tony O'Dowd -  KantanMT
Tony O'Dowd - KantanMT
 
Santanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAARSantanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAAR
 
Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU
 
Anna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMAAnna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMA
 
Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD  Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD
 
Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW
 
Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA
 
Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU
 
Liling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAARLiling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAAR
 
Sandra de luca - Acclaro
Sandra de luca - AcclaroSandra de luca - Acclaro
Sandra de luca - Acclaro
 
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
 
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
 
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
 
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
 

ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015

  • 1. Joachim Daiber, Institute for Logic, Language and Computation, University of Amsterdam. On Using Syntactic Preordering Models to Delimit Morphosyntactic Search Space.
  • 2. Introduction (talk outline: Introduction, Word order, Morphology, Conclusion)
      Project title: Exploiting hierarchical alignments for linguistically-informed SMT models to meet the hybrid approaches that aim at compositional translation
      ▶ ESR 10
      ▶ University of Amsterdam
      ▶ Supervisor: Prof. Khalil Sima'an
  • 4. Motivation
      ▶ Current MT models work well if languages are structurally similar
      ▶ Difficulties with morphologically rich languages:
          − freer word order
          − more productive morphological inflections
          − agreement over long distances
  • 5. Motivation
      Example 1:
          der Mann schlug Peter
          the man punched Peter
          der Mann schlug Peter
      Example 2:
          den Mann schlug Peter
          Peter punched the man
          Peter schlug den Mann
  • 7. Part I: Word Order (Source preordering, Preordering spaces, Evaluation)
  • 8. Preordering source trees
      [Figure: dependency tree of "Peter escaped from the police." with labels Root, Sb, AuxP, Adv, AuxA; German words below: Peter, entkam, der (case=dat), Polizei (case=dat)]
      ▶ Source dependency trees are well suited for preordering:
          − Lerner and Petrov (2013) present two classifier-based dep. tree preordering models
          − Jehl et al. (2014) and de Gispert et al. (2015) preorder dep. trees via branch-and-bound search
  • 9. Preordering source trees
      ▶ Lerner and Petrov (2013) preorder trees starting at the root
      ▶ Order all children (model 1) or left and right children (model 2)
      [Figure: dependency tree of "Peter escaped from the police." with labels Root, Sb, AuxP, Adv, AuxA]
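The top-down traversal described on this slide can be sketched as follows. This is a minimal illustration, not Lerner and Petrov's implementation: the `Node` class, the `order_family` callback (a stand-in for a trained classifier), the toy ordering functions, and the attachment of words in the example tree are all assumptions made for the sketch.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Node:
    word: str
    position: int  # index of the word in the source sentence
    children: List["Node"] = field(default_factory=list)


# Hypothetical stand-in for a trained ordering model: given a head and its
# children, return the whole family (head included) in the predicted order.
OrderFn = Callable[[Node, List[Node]], List[Node]]


def source_order(head: Node, children: List[Node]) -> List[Node]:
    """Identity 'model': keep the family in its original source order."""
    return sorted([head] + children, key=lambda n: n.position)


def root_head_last(head: Node, children: List[Node]) -> List[Node]:
    """Toy 'model': move only the main verb behind its children."""
    if head.word == "escaped":
        return sorted(children, key=lambda n: n.position) + [head]
    return source_order(head, children)


def preorder(node: Node, order_family: OrderFn) -> List[str]:
    """Linearize the tree top-down, reordering one family at a time."""
    words: List[str] = []
    for member in order_family(node, node.children):
        if member is node:
            words.append(node.word)
        else:
            words.extend(preorder(member, order_family))
    return words


# "Peter escaped from the police" (the attachments below are assumed).
the = Node("the", 3)
police = Node("police", 4, [the])
from_ = Node("from", 2, [police])
peter = Node("Peter", 0)
tree = Node("escaped", 1, [peter, from_])

print(preorder(tree, source_order))
print(preorder(tree, root_head_last))
```

Because each decision only sees one head and its children, a wrong choice near the root moves a whole subtree, which is one reason a single-best prediction can be brittle.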
  • 13. Generating a preordering space
      ▶ Both Lerner and Petrov (2013) and Jehl et al. (2014) make only single-best predictions
      ▶ We want:
          − ALL REASONABLE predictions instead of SINGLE BEST
          − a more flexible model
  • 14. Multiple predictions and more flexible model
      ▶ Multiple predictions
          − Mistakes in order decisions propagate
          − Extract n-best decisions from the model to pass to later models
      ▶ Making the model more flexible
          − Bad: order decisions are local to tree families
          − Non-local features would help (e.g. LM) → integration via cube pruning
  • 15. Making the model more flexible
      ▶ Use standard log-linear model (Och and Ney, 2002)
          $\hat{s}' = \arg\max_{s'} \sum_i \lambda_i \log \phi_i(s')$
      ▶ Where to get the weights?
          − PRO: tuning as ranking (Hopkins and May, 2011)
          − Scoring functions:
              1. Kendall's τ coefficient
              2. Simulate word level MT system, score by BLEU
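To make the log-linear selection and the first scoring function concrete, here is a small sketch. The feature functions, their values, and the weights are invented for illustration; only the form of the score (a weighted sum of log feature values, maximized over candidate orders) and the definition of Kendall's τ over token pairs are standard.

```python
import math
from itertools import combinations
from typing import Callable, Dict, Sequence, Tuple

Order = Tuple[int, ...]


def loglinear_score(cand: Order,
                    features: Dict[str, Callable[[Order], float]],
                    weights: Dict[str, float]) -> float:
    """Sum over i of lambda_i * log(phi_i(s')); features must be positive."""
    return sum(weights[name] * math.log(phi(cand))
               for name, phi in features.items())


def best_candidate(candidates: Sequence[Order], features, weights) -> Order:
    """The arg max over the candidate preorderings."""
    return max(candidates, key=lambda c: loglinear_score(c, features, weights))


def kendall_tau(predicted: Sequence[int], reference: Sequence[int]) -> float:
    """(concordant - discordant) / total pairs, for two orders of the same tokens."""
    rank = {tok: i for i, tok in enumerate(reference)}
    ranks = [rank[tok] for tok in predicted]
    pairs = list(combinations(range(len(ranks)), 2))
    if not pairs:
        return 1.0
    concordant = sum(1 for i, j in pairs if ranks[i] < ranks[j])
    return (2 * concordant - len(pairs)) / len(pairs)


# Hypothetical feature values for two candidate orders of three tokens.
order_model = {(0, 1, 2): 0.6, (0, 2, 1): 0.4}
language_model = {(0, 1, 2): 0.2, (0, 2, 1): 0.5}
features = {"order": order_model.__getitem__, "lm": language_model.__getitem__}
candidates = list(order_model)

print(best_candidate(candidates, features, {"order": 1.0, "lm": 0.0}))
print(best_candidate(candidates, features, {"order": 1.0, "lm": 1.0}))
print(kendall_tau([1, 0, 2], [0, 1, 2]))
```

With the language-model weight at zero the local order model decides alone; giving the non-local feature weight flips the choice in this toy setting, which is the kind of interaction the tuned weights are meant to capture.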
  • 16. Do non-local features help?
      Model                    Kendall's τ   BLEU (ŝ′ → s′)
      First-best −LM           92.16         68.1
      First-best +LM (cube)    92.27         68.7
  • 17. Quality of the preordering space
      ▶ Experiments with top 10 preordering outputs of this model
                           Distortion   BLEU    MTR     TER
      Baseline             7            15.2    35.4    66.6
      Oracle (k = 10)                   17.26   37.97   62.64
  • 18. Part II: Morphology (Motivation, Prediction on source trees, Learning what to predict)
  • 19. Morphology
      ▶ Word order is only one part of the problem for MRLs
      ▶ Many linguistic properties are not expressed via word order
      ▶ Three questions:
          − Does knowing morphological target properties help?
          − Can we predict these on source trees?
          − Which properties should we predict?
  • 20. Does knowing morphological target properties help?
      ▶ Perform morph. tagging of target side of translation
      ▶ Project morphological attributes via the alignments
      Decoration   Morph. attributes     Tags   BLEU
      None         -                     -      15.12
      Gold         All attributes        846    15.96
      Gold         Manual selection      77     15.86
      Gold         Automatic selection   225    15.73
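The projection step can be illustrated with a short sketch. The function name, the data layout, and the rule that the first aligned target token wins when a source word has several alignment links are assumptions for illustration; the word alignment itself is also assumed. The sentence pair and the case=dat tags follow the tree figure used elsewhere in the talk.

```python
from typing import Dict, Iterable, List, Optional, Set, Tuple


def project_attributes(
    n_source: int,
    target_attrs: List[Dict[str, str]],
    alignment: Iterable[Tuple[int, int]],
    keep: Optional[Set[str]] = None,
) -> List[Dict[str, str]]:
    """Copy target-side morphological attributes onto aligned source words.

    alignment holds (source_index, target_index) links. If a source word has
    several links, the lowest target index wins (an assumption of this sketch).
    keep optionally restricts projection to a chosen set of attribute names.
    """
    projected: List[Dict[str, str]] = [{} for _ in range(n_source)]
    for src, tgt in sorted(alignment):
        for name, value in target_attrs[tgt].items():
            if keep is None or name in keep:
                projected[src].setdefault(name, value)
    return projected


source = ["Peter", "escaped", "from", "the", "police"]
# Target: Peter entkam der Polizei, tagged as in the tree figure.
target_attrs = [{}, {}, {"case": "dat"}, {"case": "dat"}]
alignment = [(0, 0), (1, 1), (3, 2), (4, 3)]  # assumed word alignment

decorated = project_attributes(len(source), target_attrs, alignment)
for word, attrs in zip(source, decorated):
    print(word, attrs)
```

The `keep` argument mirrors the difference between decorating with all attributes and decorating with a manual or automatic selection.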
  • 21. Predicting target morphology on source trees
      ▶ Prediction based on dependency chains instead of linear chains
      ▶ Can take into account full syntactic context
      [Figure: dependency tree of "Peter escaped from the police." with labels Root, Sb, AuxP, Adv, AuxA; German words below: Peter, entkam, der (case=dat), Polizei (case=dat)]
  • 22. Learning what to predict
      Idea: Only include an attribute if it leads to better lexical selection
      Learning procedure (sketch):
          1. Decorate the source with all attributes
          2. Calculate the likelihood of a held-out set with a word-based system (IBM Model 1)
          3. As long as the likelihood increases:
              − Find the worst attribute by merging tags and recalculating the likelihood
              − Remove the attribute, re-align
              − Repeat
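The procedure on this slide is a greedy backward elimination, which can be sketched as below. The `heldout_likelihood` callback is a hypothetical stand-in for the expensive part (merging tags, re-aligning, and scoring the held-out set with IBM Model 1); the toy scoring function and its numbers are invented purely so the loop can be run.

```python
from typing import Callable, FrozenSet, Iterable, Tuple

Likelihood = Callable[[FrozenSet[str]], float]


def select_attributes(all_attributes: Iterable[str],
                      heldout_likelihood: Likelihood) -> Tuple[FrozenSet[str], float]:
    """Drop one attribute at a time while the held-out likelihood improves."""
    current = frozenset(all_attributes)
    best = heldout_likelihood(current)
    while current:
        # The "worst" attribute is the one whose removal helps the most.
        trials = [(current - {a}, heldout_likelihood(current - {a}))
                  for a in sorted(current)]
        candidate, score = max(trials, key=lambda t: t[1])
        if score <= best:  # no removal increases the likelihood: stop
            break
        current, best = candidate, score
    return current, best


# Toy stand-in: each attribute adds some benefit, every attribute costs 1.0
# (mimicking sparsity). These numbers are hypothetical, not from the talk.
benefit = {"case": 3.0, "number": 2.0, "gender": 1.5}


def toy_likelihood(attrs: FrozenSet[str]) -> float:
    return sum(benefit.get(a, 0.0) for a in attrs) - 1.0 * len(attrs)


selected, score = select_attributes(
    ["case", "number", "gender", "tense", "mode"], toy_likelihood)
print(sorted(selected), score)
```

Each iteration evaluates every remaining attribute once, so the number of likelihood evaluations grows quadratically with the size of the attribute set.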
  • 23. Learning what to predict (English–German)
      Part of speech   Manual selection                      Automatic selection
      noun             gender†, number, case                 gender, number, case
      adj              gender†, number‡, case‡, declension   gender, number, case, synpos, degree
      verb             number‡*, person‡*, tense*, mode*     -
      Additionally only in automatic: part:negativeness, part:subpos, punc:type, num:type.
  • 24. Learning what to predict
                       Manual   Automatic   All
      Training 50k     36m      45m         77m
      Training 100k    58m      82m         2h51m
      Training 200k    1h54m    3h5m        6h44m
      Best F1          72.67    74.67       62.18
  • 25. Conclusion
      Our work so far:
          Question 1: Can we make syntactic preordering models more flexible and generate a space of possible preorderings?
          Question 2: Can we predict target morphology on the source?
      Current and future work:
          Question 3: Can we combine both ideas to exploit interactions?
  • 27. References
      de Gispert, A., Iglesias, G., and Byrne, W. (2015). Fast and accurate preordering for SMT using neural networks. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT 2015).
      Hopkins, M. and May, J. (2011). Tuning as ranking. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 1352–1362, Edinburgh, Scotland, UK. Association for Computational Linguistics.
      Jehl, L., de Gispert, A., Hopkins, M., and Byrne, B. (2014). Source-side preordering for translation using logistic regression and depth-first branch-and-bound search. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pages 239–248, Gothenburg, Sweden. Association for Computational Linguistics.
      Lerner, U. and Petrov, S. (2013). Source-side classifier preordering for machine translation. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 513–523, Seattle, Washington, USA. Association for Computational Linguistics.
      Och, F. J. and Ney, H. (2002). Discriminative training and maximum entropy models for statistical machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, pages 295–302, Stroudsburg, PA, USA. Association for Computational Linguistics.