SlideShare una empresa de Scribd logo
1 de 11
Descargar para leer sin conexión
Mul$-­‐Target	
  Machine	
  Transla$on	
  with	
  
Mul$-­‐Synchronous	
  Context-­‐free	
  Grammars	
  
	
  
Graham	
  Neubig	
  and	
  Philip	
  Arthur	
  and	
  Kevin	
  Duh	
Presenter:Shin	
  Kanouchi	
  
NAACL_Reading2015@KomachiLab,	
  TMU	
1
Mo#va#on	
  	
•  When	
  transla$ng	
  into	
  language	
  T1,	
  equivalent	
  
transla$ons	
  into	
  a	
  second	
  language	
  T2	
  can	
  help	
  	
  
	
  
	
  
•  T1	
  has	
  a	
  weak	
  language	
  model	
  
•  T2	
  has	
  a	
  strong	
  language	
  model	
  
can	
  we	
  use	
  a	
  T2	
  in	
  to	
  improve	
  results?	
  	
  
2
Overall	
  view	
•  Mo$va$on	
  
•  Propose	
  mul*-­‐synchronous	
  context-­‐free	
  
grammars	
  (MSCFGs)	
  
•  How	
  to	
  Learning	
  MSCFGs	
  	
  
•  How	
  to	
  perform	
  search	
  (Decoding)	
  
– including	
  calcula$on	
  of	
  LM	
  probabili$es	
  over	
  
mul$ple	
  target	
  language	
  strings	
  	
  
•  Experiment	
  
– gains	
  of	
  up	
  to	
  0.8-­‐1.5	
  BLEU	
  points	
  	
  
3
Proposed	
  Framework	
•  Build	
  on	
  the	
  well-­‐known	
  synchronous	
  context-­‐
free	
  grammars	
  (SCFG)	
  
•  Propose	
  mul*-­‐synchronous	
  context-­‐	
  free	
  
grammars	
  (MSCFGs),	
  with	
  mul$ple	
  targets	
  	
  
4
How	
  to	
  learning	
  MSCFGs	
  	
•  Learn	
  from	
  tri-­‐lingual	
  parallel	
  data	
  	
  
1.  Alignment	
  
•  alignments	
  for	
  each	
  sentence	
  automa$cally	
  
•  IBM	
  models	
  implemented	
  by	
  GIZA++	
  (Och	
  and	
  Ney,	
  2003)	
  	
  
2.  Phrase	
  Extrac$on	
  
3.  Calculate	
  Features	
  
Source:	
  	
T2:	
  	
T1:	
  	
5	
independent
How	
  to	
  learning	
  MSCFGs	
  	
•  Learn	
  from	
  tri-­‐lingual	
  parallel	
  data	
  	
  
1.  Alignment	
  
2.  Phrase	
  Extrac$on	
  
•  phrase-­‐extract	
  algorithm	
  of	
  Och	
  (2002)	
  	
  
•  Source	
  	
  →	
  T1	
  
•  a	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  →	
  了 
•  ra$fié	
  	
  	
  	
  →	
  批准 
•  a	
  ra$fié	
  →	
  批准 了 
	
  
3.  Calculate	
  Features	
  
•  Source	
  →	
  T2	
  
•  X	
  
•  X	
  
•  a	
  ra$fié	
  →	
  ra$fied	
  	
  
→	
  a	
  ra$fié	
  →	
  批准 了 |	
  ra$fied	
  	
  
6
3.	
  Calculate	
  Features	
  (13	
  Features)	
  
•  In	
  standard	
  SCFGs	
  
–  P	
  (γ|α1)	
  and	
  P	
  (α1|γ)	
  
•  log	
  forward	
  and	
  backward	
  transla$on	
  probabili$es	
  
–  Plex(γ|α1)	
  and	
  Plex(α1|γ)	
  
•  log	
  forward	
  and	
  backward	
  lexical	
  transla$on	
  probabili$es	
  
–  a	
  word	
  penalty	
  coun$ng	
  the	
  non-­‐terminals	
  in	
  α1,	
  	
  
–  a	
  constant	
  phrase	
  penalty	
  of	
  1.	
  	
  
•  In	
  MSCFGs	
  
–  P	
  (γ|α2)	
  and	
  P	
  (α2|γ)	
  
–  Plex(γ|α2)	
  and	
  Plex(α2|γ)	
  
–  word	
  penalty	
  for	
  α2	
  
•  	
  In	
  addi$on	
  
–  P	
  (γ|α1,	
  α2)	
  and	
  P	
  (α1,	
  α2|γ)	
  
	
   7
Decoding	
(a)	
  one	
  LM	
  (only	
  T1)	
  
(b)	
  joint	
  search	
  method,	
  is	
  based	
  on	
  consecu$vely	
  expanding	
  the	
  
LM	
  states	
  of	
  both	
  T1	
  and	
  T2	
  	
  
(c)	
  sequen*al	
  search	
  method,	
  first	
  expands	
  the	
  state	
  space	
  of	
  T1,	
  
then	
  later	
  expands	
  the	
  search	
  space	
  of	
  T2.	
  	
  
	
8
Experiments	
  	
•  Mul$	
  UN	
  Corpus:	
  
–  Parallel,	
  T1	
  LM	
  data:	
  100,000	
  Sentences	
  	
  
–  T2	
  LM	
  data:	
  4,000,000	
  Sentences	
  
S:	
  en	
  T1,	
  T2:	
  ar,	
  es,	
  fr,	
  ru,	
  zh	
  (all	
  combina$ons)	
  
•  Decoder:	
  	
  
–  Travatar	
  (Neubig,	
  2013)	
  	
  
•  Baseline:	
  
–  A	
  standard	
  SCFG	
  grammar	
  with	
  only	
  the	
  source	
  and	
  T1	
  	
  
•  Proposed:	
  
–  The	
  full	
  MSCFG	
  model	
  with	
  the	
  T2	
  LM	
  	
  
	
   9
Result	
  1	
•  T2	
  =	
  Spanish	
  	
  (best	
  results)	
  
•  Par$cularly	
  effec$ve	
  in	
  similar	
  languages	
  	
  	
  
BLEU	
10
Result	
  2	
•  BLEU	
  scores	
  for	
  different	
  T1	
  LM	
  sizes	
  without	
  (-­‐LM2)	
  or	
  
with	
  (+LM2)	
  an	
  LM	
  for	
  the	
  second	
  target.	
  	
  
11

Más contenido relacionado

La actualidad más candente

Access to non local names
Access to non local namesAccess to non local names
Access to non local names
Varsha Kumar
 
A software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasksA software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasks
RuleML
 
Merghani-SACNAS Poster
Merghani-SACNAS PosterMerghani-SACNAS Poster
Merghani-SACNAS Poster
Taha Merghani
 
Pauls klein 2011-lm_paper(3)
Pauls klein 2011-lm_paper(3)Pauls klein 2011-lm_paper(3)
Pauls klein 2011-lm_paper(3)
Red Over
 
Learning R via Python…or the other way around
Learning R via Python…or the other way aroundLearning R via Python…or the other way around
Learning R via Python…or the other way around
Sid Xing
 

La actualidad más candente (18)

18103010 algorithm complexity (iterative)
18103010 algorithm complexity (iterative)18103010 algorithm complexity (iterative)
18103010 algorithm complexity (iterative)
 
AA ppt9107
AA ppt9107AA ppt9107
AA ppt9107
 
XLnet RoBERTa Reformer
XLnet RoBERTa ReformerXLnet RoBERTa Reformer
XLnet RoBERTa Reformer
 
Getting the Most out of Transition-based Dependency Parsing
Getting the Most out of Transition-based Dependency ParsingGetting the Most out of Transition-based Dependency Parsing
Getting the Most out of Transition-based Dependency Parsing
 
Access to non local names
Access to non local namesAccess to non local names
Access to non local names
 
Marek Rei - 2017 - Semi-supervised Multitask Learning for Sequence Labeling
Marek Rei - 2017 - Semi-supervised Multitask Learning for Sequence LabelingMarek Rei - 2017 - Semi-supervised Multitask Learning for Sequence Labeling
Marek Rei - 2017 - Semi-supervised Multitask Learning for Sequence Labeling
 
Ngrams smoothing
Ngrams smoothingNgrams smoothing
Ngrams smoothing
 
[Question Paper] Network Security (75-25 Pattern) [November / 2015]
[Question Paper] Network Security (75-25 Pattern) [November / 2015][Question Paper] Network Security (75-25 Pattern) [November / 2015]
[Question Paper] Network Security (75-25 Pattern) [November / 2015]
 
A software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasksA software agent controlling 2 robot arms in co-operating concurrent tasks
A software agent controlling 2 robot arms in co-operating concurrent tasks
 
1548 PROJECT DEMO
1548 PROJECT DEMO1548 PROJECT DEMO
1548 PROJECT DEMO
 
Merghani-SACNAS Poster
Merghani-SACNAS PosterMerghani-SACNAS Poster
Merghani-SACNAS Poster
 
Pauls klein 2011-lm_paper(3)
Pauls klein 2011-lm_paper(3)Pauls klein 2011-lm_paper(3)
Pauls klein 2011-lm_paper(3)
 
Algorithm Complexity & Big-O Analysis
Algorithm Complexity & Big-O AnalysisAlgorithm Complexity & Big-O Analysis
Algorithm Complexity & Big-O Analysis
 
[Question Paper] Network Security (75:25 Pattern) [November / 2015]
[Question Paper] Network Security (75:25 Pattern) [November / 2015][Question Paper] Network Security (75:25 Pattern) [November / 2015]
[Question Paper] Network Security (75:25 Pattern) [November / 2015]
 
Speech recognition: Survey
Speech recognition: SurveySpeech recognition: Survey
Speech recognition: Survey
 
Sprint 6
Sprint 6Sprint 6
Sprint 6
 
Learning R via Python…or the other way around
Learning R via Python…or the other way aroundLearning R via Python…or the other way around
Learning R via Python…or the other way around
 
[Question Paper] Network Security (Revised Syllabus) [October / 2012]
[Question Paper] Network Security (Revised Syllabus) [October / 2012][Question Paper] Network Security (Revised Syllabus) [October / 2012]
[Question Paper] Network Security (Revised Syllabus) [October / 2012]
 

Destacado (6)

E sports
E sportsE sports
E sports
 
Verification Conditions for Single-Assignment Programs
Verification Conditions for Single-Assignment ProgramsVerification Conditions for Single-Assignment Programs
Verification Conditions for Single-Assignment Programs
 
Optimization technical introduction version1.011
Optimization technical introduction version1.011Optimization technical introduction version1.011
Optimization technical introduction version1.011
 
compiler ppt on symbol table
 compiler ppt on symbol table compiler ppt on symbol table
compiler ppt on symbol table
 
Symbol table design (Compiler Construction)
Symbol table design (Compiler Construction)Symbol table design (Compiler Construction)
Symbol table design (Compiler Construction)
 
Embedded System Tools ppt
Embedded System Tools  pptEmbedded System Tools  ppt
Embedded System Tools ppt
 

Similar a Multi-Target Machine Translation with Multi-Synchronous Context-free Grammars @NAACL読み会_KomachiLab

CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
Lifeng (Aaron) Han
 
Logic Programming and ILP
Logic Programming and ILPLogic Programming and ILP
Logic Programming and ILP
Pierre de Lacaze
 
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
lauratoni4
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
Lifeng (Aaron) Han
 
The L2F Spoken Web Search system for Mediaeval 2012
The L2F Spoken Web Search system for Mediaeval 2012The L2F Spoken Web Search system for Mediaeval 2012
The L2F Spoken Web Search system for Mediaeval 2012
MediaEval2012
 
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Lviv Startup Club
 

Similar a Multi-Target Machine Translation with Multi-Synchronous Context-free Grammars @NAACL読み会_KomachiLab (20)

Language models
Language modelsLanguage models
Language models
 
Reference Scope Identification in Citing Sentences
Reference Scope Identification in Citing SentencesReference Scope Identification in Citing Sentences
Reference Scope Identification in Citing Sentences
 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
 
NS-CUK Seminar: H.B.Kim, Review on "subgraph2vec: Learning Distributed Repre...
NS-CUK Seminar: H.B.Kim,  Review on "subgraph2vec: Learning Distributed Repre...NS-CUK Seminar: H.B.Kim,  Review on "subgraph2vec: Learning Distributed Repre...
NS-CUK Seminar: H.B.Kim, Review on "subgraph2vec: Learning Distributed Repre...
 
Introduction on Prolog - Programming in Logic
Introduction on Prolog - Programming in LogicIntroduction on Prolog - Programming in Logic
Introduction on Prolog - Programming in Logic
 
Logic Programming and ILP
Logic Programming and ILPLogic Programming and ILP
Logic Programming and ILP
 
Towards OpenLogos Hybrid Machine Translation - Anabela Barreiro
Towards OpenLogos Hybrid Machine Translation - Anabela BarreiroTowards OpenLogos Hybrid Machine Translation - Anabela Barreiro
Towards OpenLogos Hybrid Machine Translation - Anabela Barreiro
 
20230419-LLaMA-Adapter_ Efficient Fine-tuning of Language Models with Zero-in...
20230419-LLaMA-Adapter_ Efficient Fine-tuning of Language Models with Zero-in...20230419-LLaMA-Adapter_ Efficient Fine-tuning of Language Models with Zero-in...
20230419-LLaMA-Adapter_ Efficient Fine-tuning of Language Models with Zero-in...
 
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
 
Lexically constrained decoding for sequence generation using grid beam search
Lexically constrained decoding for sequence generation using grid beam searchLexically constrained decoding for sequence generation using grid beam search
Lexically constrained decoding for sequence generation using grid beam search
 
Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
Transformer Seq2Sqe Models: Concepts, Trends & Limitations (DLI)
 
The L2F Spoken Web Search system for Mediaeval 2012
The L2F Spoken Web Search system for Mediaeval 2012The L2F Spoken Web Search system for Mediaeval 2012
The L2F Spoken Web Search system for Mediaeval 2012
 
N20181217
N20181217N20181217
N20181217
 
Latent Semantic Transliteration using Dirichlet Mixture
Latent Semantic Transliteration using Dirichlet MixtureLatent Semantic Transliteration using Dirichlet Mixture
Latent Semantic Transliteration using Dirichlet Mixture
 
Energy-Based Models with Applications to Speech and Language Processing
Energy-Based Models with Applications to Speech and Language ProcessingEnergy-Based Models with Applications to Speech and Language Processing
Energy-Based Models with Applications to Speech and Language Processing
 
Transfer Learning in NLP: A Survey
Transfer Learning in NLP: A SurveyTransfer Learning in NLP: A Survey
Transfer Learning in NLP: A Survey
 
Esa act
Esa actEsa act
Esa act
 
[slide] A Compare-Aggregate Model with Latent Clustering for Answer Selection
[slide] A Compare-Aggregate Model with Latent Clustering for Answer Selection[slide] A Compare-Aggregate Model with Latent Clustering for Answer Selection
[slide] A Compare-Aggregate Model with Latent Clustering for Answer Selection
 
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
 

Último

Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
Bhagirath Gogikar
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET
 

Último (20)

SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 

Multi-Target Machine Translation with Multi-Synchronous Context-free Grammars @NAACL読み会_KomachiLab

  • 1. Mul$-­‐Target  Machine  Transla$on  with   Mul$-­‐Synchronous  Context-­‐free  Grammars     Graham  Neubig  and  Philip  Arthur  and  Kevin  Duh Presenter:Shin  Kanouchi   NAACL_Reading2015@KomachiLab,  TMU 1
  • 2. Mo#va#on   •  When  transla$ng  into  language  T1,  equivalent   transla$ons  into  a  second  language  T2  can  help         •  T1  has  a  weak  language  model   •  T2  has  a  strong  language  model   can  we  use  a  T2  in  to  improve  results?     2
  • 3. Overall  view •  Mo$va$on   •  Propose  mul*-­‐synchronous  context-­‐free   grammars  (MSCFGs)   •  How  to  Learning  MSCFGs     •  How  to  perform  search  (Decoding)   – including  calcula$on  of  LM  probabili$es  over   mul$ple  target  language  strings     •  Experiment   – gains  of  up  to  0.8-­‐1.5  BLEU  points     3
  • 4. Proposed  Framework •  Build  on  the  well-­‐known  synchronous  context-­‐ free  grammars  (SCFG)   •  Propose  mul*-­‐synchronous  context-­‐  free   grammars  (MSCFGs),  with  mul$ple  targets     4
  • 5. How  to  learning  MSCFGs   •  Learn  from  tri-­‐lingual  parallel  data     1.  Alignment   •  alignments  for  each  sentence  automa$cally   •  IBM  models  implemented  by  GIZA++  (Och  and  Ney,  2003)     2.  Phrase  Extrac$on   3.  Calculate  Features   Source:   T2:   T1:   5 independent
  • 6. How  to  learning  MSCFGs   •  Learn  from  tri-­‐lingual  parallel  data     1.  Alignment   2.  Phrase  Extrac$on   •  phrase-­‐extract  algorithm  of  Och  (2002)     •  Source    →  T1   •  a                        →  了 •  ra$fié        →  批准 •  a  ra$fié  →  批准 了   3.  Calculate  Features   •  Source  →  T2   •  X   •  X   •  a  ra$fié  →  ra$fied     →  a  ra$fié  →  批准 了 |  ra$fied     6
  • 7. 3.  Calculate  Features  (13  Features)   •  In  standard  SCFGs   –  P  (γ|α1)  and  P  (α1|γ)   •  log  forward  and  backward  transla$on  probabili$es   –  Plex(γ|α1)  and  Plex(α1|γ)   •  log  forward  and  backward  lexical  transla$on  probabili$es   –  a  word  penalty  coun$ng  the  non-­‐terminals  in  α1,     –  a  constant  phrase  penalty  of  1.     •  In  MSCFGs   –  P  (γ|α2)  and  P  (α2|γ)   –  Plex(γ|α2)  and  Plex(α2|γ)   –  word  penalty  for  α2   •   In  addi$on   –  P  (γ|α1,  α2)  and  P  (α1,  α2|γ)     7
  • 8. Decoding (a)  one  LM  (only  T1)   (b)  joint  search  method,  is  based  on  consecu$vely  expanding  the   LM  states  of  both  T1  and  T2     (c)  sequen*al  search  method,  first  expands  the  state  space  of  T1,   then  later  expands  the  search  space  of  T2.     8
  • 9. Experiments   •  Mul$  UN  Corpus:   –  Parallel,  T1  LM  data:  100,000  Sentences     –  T2  LM  data:  4,000,000  Sentences   S:  en  T1,  T2:  ar,  es,  fr,  ru,  zh  (all  combina$ons)   •  Decoder:     –  Travatar  (Neubig,  2013)     •  Baseline:   –  A  standard  SCFG  grammar  with  only  the  source  and  T1     •  Proposed:   –  The  full  MSCFG  model  with  the  T2  LM       9
  • 10. Result  1 •  T2  =  Spanish    (best  results)   •  Par$cularly  effec$ve  in  similar  languages       BLEU 10
  • 11. Result  2 •  BLEU  scores  for  different  T1  LM  sizes  without  (-­‐LM2)  or   with  (+LM2)  an  LM  for  the  second  target.     11