SlideShare una empresa de Scribd logo
1 de 16
Descargar para leer sin conexión
c
Parallel Construction: A Parallel Corpus Method for
Automatic Question Generation in Non-English
Languages
Benny G. Johnson, Jeffrey S. Dittel, Rachel Van Campenhout,
Rodrigo Bistolfi, Aida Maeda, and Bill Jerome
VitalSource Technologies, Research and Development
AIED2022 iTextbooks
c
Problem
• English is the dominant language in automatic question
generation (AQG) research.
• NLP tools needed for AQG are often under-resourced in
non-English languages.
It would be desirable to leverage the research and existing
AQG systems in English for other languages.
c
Automatically Generated Questions
c
Automatic Question
Generation
Research has found no
difference in how
students use AI-
generated versus human-
authored questions.
Van Campenhout, R., Brown, N., Jerome, B., Dittel, J. S., & Johnson, B. G. (2021).
Toward Effective Courseware at Scale: Investigating Automatically Generated
Questions as Formative Practice. Learning at Scale. pp. 295–298.
https://doi.org/10.1145/3430895.3460162
Van Campenhout, R., Dittel, J. S., Jerome, B., & Johnson, B. G. (2021). Transforming
textbooks into learning by doing environments: an evaluation of textbook-based
automatic question generation. In: Third Workshop on Intelligent Textbooks at the
22nd International Conference on Artificial Intelligence in Education. CEUR Workshop
Proceedings, ISSN 1613-0073, pp. 1–12. Retrieved from: http://ceur-ws.org/Vol-
2895/paper06.pdf
Johnson, B. G., Dittel, J. S., Van Campenhout, R., & Jerome, B. (2022). Discrimination of
automatically generated questions used as formative practice. Proceedings of the
Ninth ACM Conference on Learning@Scale (pp. 325-329).
https://doi.org/10.1145/3491140.3528323
c
Method
Parallel construction uses machine translation (MT) and a
parallel corpus approach.
1. Translate the textbook to English using MT, e.g.,
Google Translate.
2. Align the sentences and words in the parallel corpus.
3. Perform English AQG exactly as usual.
4. For each QG step in English, perform the equivalent
manipulation directly on the original text using the
alignment.
c
Questions
Why not simply implement AQG directly in Spanish?
This can be done, but it’s much more work. In our case, the
English AQG system had already been developed, validated, and
tested. Parallel construction enables its reuse for other
languages too.
c
Questions
Why not simply use MT to translate the textbook to English, do
AQG, and then translate the questions back to the original
language?
There is still a large gap in quality between MT and human
translation. The errors and noise in MT make this approach
insufficient for educational applications.
c
Method
Source language questions are kept up to date in parallel
with the English questions being generated, hence parallel
construction.
Advantages:
• All AQG decisions are made by the English system.
• The linguistic quality of the source text is preserved.
• Much less development work than direct AQG.
c
Example
Cloze matching question, Spanish-language macroeconomics
textbook.
Step 1: English system selects sentence for question creation.
However, during the 1980s many borrowing LDCs were unable to cope with the burden
of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a
consequence, their economic growth. countries experienced a serious decline.
c
Example
Cloze matching question, Spanish-language macroeconomics
textbook.
Step 1: English system selects sentence for question creation.
However, during the 1980s many borrowing LDCs were unable to cope with the burden
of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a
consequence, their economic growth. countries experienced a serious decline.
Corresponding Spanish sentence retrieved using alignment.
Sin embargo, durante la década de 1980 muchos PMD prestatarios no pudieron hacer
frente a la carga de su deuda exterior –situación que se conoce con el nombre de crisis
de la deuda de los PMD– y, quizá como consecuencia, el crecimiento económico de
estos países experimentó una grave disminución.
c
Example
Step 2: English system selects answer words.
borrowing, crisis, decline
Corresponding Spanish words retrieved using alignment.
prestatarios, crisis, disminución
c
Example
Step 3: Final question in English.
However, during the 1980s many ______ LDCs were unable to cope with the burden of
their foreign debt - a situation known as the LDC debt ______ - and, perhaps as a
consequence, their economic growth. countries experienced a serious ______.
Choices: borrowing, crisis, decline
Final question in Spanish.
Sin embargo, durante la década de 1980 muchos PMD ______ no pudieron hacer frente
a la carga de su deuda exterior –situación que se conoce con el nombre de ______ de la
deuda de los PMD– y, quizá como consecuencia, el crecimiento económico de estos
países experimentó una grave ______.
Opciones: crisis, disminución, prestatarios
c
Example
The translated English sentence is noisy.
However, during the 1980s many borrowing LDCs were unable to cope with the burden
of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a
consequence, their economic growth. countries experienced a serious decline.
The back-translated Spanish question is unacceptable.
Sin embargo, durante la década de 1980, muchos PMA ______ no pudieron
hacer frente a la carga de su deuda externa, una situación conocida como la
______ de la deuda de los PMA, y, tal vez, como consecuencia, su crecimiento
económico. Los países experimentaron un grave ______.
Opciones: crisis, declive, prestatarios
c
Example
The translated English sentence is noisy.
However, during the 1980s many borrowing LDCs were unable to cope with the burden
of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a
consequence, their economic growth. countries experienced a serious decline.
The back-translated Spanish question is unacceptable.
Sin embargo, durante la década de 1980, muchos PMA ______ no pudieron
hacer frente a la carga de su deuda externa, una situación conocida como la
______ de la deuda de los PMA, y, tal vez, como consecuencia, su crecimiento
económico. Los países experimentaron un grave ______.
Opciones: crisis, declive, prestatarios
PMA = países menos avanzados
PMD = países menos desarrollados
c
Example
The parallel construction Spanish question is correct.
Sin embargo, durante la década de 1980 muchos PMD ______ no pudieron hacer frente
a la carga de su deuda exterior –situación que se conoce con el nombre de ______ de la
deuda de los PMD– y, quizá como consecuencia, el crecimiento económico de estos
países experimentó una grave ______.
Opciones: crisis, disminución, prestatarios
c
Thank You!
For questions or comments, please email:
Benny Johnson, benny.johnson@vitalsource.com

Más contenido relacionado

Similar a Parallel Construction: A Parallel Corpus Approach for Automatic Question Generation in Non-English Languages

Lesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docx
Lesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docxLesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docx
Lesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docxsmile790243
 
Best Practices When Localizing And Translating Marketing Materials
Best Practices When Localizing And Translating Marketing MaterialsBest Practices When Localizing And Translating Marketing Materials
Best Practices When Localizing And Translating Marketing MaterialsChris Raulf
 
Managerial economics solved papers
Managerial economics solved papersManagerial economics solved papers
Managerial economics solved papersyunus khan
 
Examen de la nueva selectividad de Lengua Extranjera
Examen de la nueva selectividad de Lengua Extranjera Examen de la nueva selectividad de Lengua Extranjera
Examen de la nueva selectividad de Lengua Extranjera 20minutos
 
1.2.3.306.uma.dhara.maulik.bus.eng
1.2.3.306.uma.dhara.maulik.bus.eng1.2.3.306.uma.dhara.maulik.bus.eng
1.2.3.306.uma.dhara.maulik.bus.engmaulikbhatt
 
The Demographic Dimension of Portugals Crisis
The Demographic Dimension of Portugals CrisisThe Demographic Dimension of Portugals Crisis
The Demographic Dimension of Portugals CrisisEdward Hugh
 
Great Ideas For A Essay How To Write A Hook Cu
Great Ideas For A Essay How To Write A Hook  CuGreat Ideas For A Essay How To Write A Hook  Cu
Great Ideas For A Essay How To Write A Hook CuMary Price
 
Advanced Andrew
Advanced AndrewAdvanced Andrew
Advanced Andrewcyutafl
 
Highlights from Saving the American Dream
Highlights from Saving the American DreamHighlights from Saving the American Dream
Highlights from Saving the American DreamThe Heritage Foundation
 
English Language Test Prep Radio - Travel and Leisure Collocations
English Language Test Prep Radio - Travel and Leisure CollocationsEnglish Language Test Prep Radio - Travel and Leisure Collocations
English Language Test Prep Radio - Travel and Leisure CollocationsWinn Trivette II
 
Econ 202 Principles of Microeconomics Spring 2022 Ins
Econ 202 Principles of Microeconomics Spring 2022 InsEcon 202 Principles of Microeconomics Spring 2022 Ins
Econ 202 Principles of Microeconomics Spring 2022 InsEvonCanales257
 
blanchard20190305ppt.pdf
blanchard20190305ppt.pdfblanchard20190305ppt.pdf
blanchard20190305ppt.pdfmervin48
 
Week 5 organization
Week 5 organizationWeek 5 organization
Week 5 organizationAmy Hayashi
 
How To Format Essays - Ocean County College
How To Format Essays - Ocean County CollegeHow To Format Essays - Ocean County College
How To Format Essays - Ocean County CollegeTodd Turner
 
The Best White Pens For Writing On Black Paper
The Best White Pens For Writing On Black PaperThe Best White Pens For Writing On Black Paper
The Best White Pens For Writing On Black PaperKristen Flores
 

Similar a Parallel Construction: A Parallel Corpus Approach for Automatic Question Generation in Non-English Languages (20)

Response to Prime Minister Kenny Anthony June 10 2014 speech
Response to Prime Minister Kenny Anthony June 10 2014 speechResponse to Prime Minister Kenny Anthony June 10 2014 speech
Response to Prime Minister Kenny Anthony June 10 2014 speech
 
Lesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docx
Lesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docxLesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docx
Lesson 2.8IntroductionCourse ObjectivesThis lesson will addr.docx
 
cbsc2bhattdhara
cbsc2bhattdharacbsc2bhattdhara
cbsc2bhattdhara
 
Best Practices When Localizing And Translating Marketing Materials
Best Practices When Localizing And Translating Marketing MaterialsBest Practices When Localizing And Translating Marketing Materials
Best Practices When Localizing And Translating Marketing Materials
 
Managerial economics solved papers
Managerial economics solved papersManagerial economics solved papers
Managerial economics solved papers
 
Examen de la nueva selectividad de Lengua Extranjera
Examen de la nueva selectividad de Lengua Extranjera Examen de la nueva selectividad de Lengua Extranjera
Examen de la nueva selectividad de Lengua Extranjera
 
1.2.3.306.uma.dhara.maulik.bus.eng
1.2.3.306.uma.dhara.maulik.bus.eng1.2.3.306.uma.dhara.maulik.bus.eng
1.2.3.306.uma.dhara.maulik.bus.eng
 
The Demographic Dimension of Portugals Crisis
The Demographic Dimension of Portugals CrisisThe Demographic Dimension of Portugals Crisis
The Demographic Dimension of Portugals Crisis
 
Rīga presentation 2014
Rīga presentation 2014Rīga presentation 2014
Rīga presentation 2014
 
Great Ideas For A Essay How To Write A Hook Cu
Great Ideas For A Essay How To Write A Hook  CuGreat Ideas For A Essay How To Write A Hook  Cu
Great Ideas For A Essay How To Write A Hook Cu
 
Advanced Andrew
Advanced AndrewAdvanced Andrew
Advanced Andrew
 
Highlights from Saving the American Dream
Highlights from Saving the American DreamHighlights from Saving the American Dream
Highlights from Saving the American Dream
 
English Language Test Prep Radio - Travel and Leisure Collocations
English Language Test Prep Radio - Travel and Leisure CollocationsEnglish Language Test Prep Radio - Travel and Leisure Collocations
English Language Test Prep Radio - Travel and Leisure Collocations
 
2003 -May -MAT
2003 -May -MAT2003 -May -MAT
2003 -May -MAT
 
Econ 202 Principles of Microeconomics Spring 2022 Ins
Econ 202 Principles of Microeconomics Spring 2022 InsEcon 202 Principles of Microeconomics Spring 2022 Ins
Econ 202 Principles of Microeconomics Spring 2022 Ins
 
blanchard20190305ppt.pdf
blanchard20190305ppt.pdfblanchard20190305ppt.pdf
blanchard20190305ppt.pdf
 
Week 5 organization
Week 5 organizationWeek 5 organization
Week 5 organization
 
How To Format Essays - Ocean County College
How To Format Essays - Ocean County CollegeHow To Format Essays - Ocean County College
How To Format Essays - Ocean County College
 
The Best White Pens For Writing On Black Paper
The Best White Pens For Writing On Black PaperThe Best White Pens For Writing On Black Paper
The Best White Pens For Writing On Black Paper
 
Journalism in Exponential Times by Randy Smith
Journalism in Exponential Times by Randy SmithJournalism in Exponential Times by Randy Smith
Journalism in Exponential Times by Randy Smith
 

Más de Sergey Sosnovsky

Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...Sergey Sosnovsky
 
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...Sergey Sosnovsky
 
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...Sergey Sosnovsky
 
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...Sergey Sosnovsky
 
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...Sergey Sosnovsky
 
Creating Session Data from eTextbook Event Streams
Creating Session Data from eTextbook Event StreamsCreating Session Data from eTextbook Event Streams
Creating Session Data from eTextbook Event StreamsSergey Sosnovsky
 
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...Sergey Sosnovsky
 
Interactions of reading and assessment activities
Interactions of reading and assessment activitiesInteractions of reading and assessment activities
Interactions of reading and assessment activitiesSergey Sosnovsky
 
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for EducationYAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for EducationSergey Sosnovsky
 
Automatic Question Generation for Evidence-based Online Courseware Engineering
Automatic Question Generation for Evidence-based Online Courseware EngineeringAutomatic Question Generation for Evidence-based Online Courseware Engineering
Automatic Question Generation for Evidence-based Online Courseware EngineeringSergey Sosnovsky
 
Reading Comprehension Quiz Generation using Generative Pre-trained Transformers
Reading Comprehension Quiz Generation using Generative Pre-trained TransformersReading Comprehension Quiz Generation using Generative Pre-trained Transformers
Reading Comprehension Quiz Generation using Generative Pre-trained TransformersSergey Sosnovsky
 
Mathematical Language Processing via Tree Embeddings
Mathematical Language Processing via Tree EmbeddingsMathematical Language Processing via Tree Embeddings
Mathematical Language Processing via Tree EmbeddingsSergey Sosnovsky
 
Contextual Definition Generation
Contextual Definition GenerationContextual Definition Generation
Contextual Definition GenerationSergey Sosnovsky
 
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...Sergey Sosnovsky
 
Generation of Assessment Questions from Textbooks Enriched with Knowledge Models
Generation of Assessment Questions from Textbooks Enriched with Knowledge ModelsGeneration of Assessment Questions from Textbooks Enriched with Knowledge Models
Generation of Assessment Questions from Textbooks Enriched with Knowledge ModelsSergey Sosnovsky
 
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...Sergey Sosnovsky
 
Dental TutorBot: Exploitation of Dental Textbooks for Automated Learning
Dental TutorBot: Exploitation of Dental Textbooks for Automated LearningDental TutorBot: Exploitation of Dental Textbooks for Automated Learning
Dental TutorBot: Exploitation of Dental Textbooks for Automated LearningSergey Sosnovsky
 
Using Programmed Instruction to Help Students Engage with eTextbook Content
Using Programmed Instruction to Help Students Engage with eTextbook Content Using Programmed Instruction to Help Students Engage with eTextbook Content
Using Programmed Instruction to Help Students Engage with eTextbook Content Sergey Sosnovsky
 
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...Sergey Sosnovsky
 

Más de Sergey Sosnovsky (20)

Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
Harnessing Textbooks for High-Quality Labeled Data: An Approach to Automatic ...
 
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
Toward Eliminating Hallucinations: GPT-based Explanatory AI for Intelligent T...
 
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
Layout- and Activity-based Textbook Modeling for Automatic PDF Textbook Extra...
 
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
Exploring the Content Ecosystem of the First Open-source Adaptive Tutor and i...
 
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
Advancing Intelligent Textbooks with Automatically Generated Practice: A Larg...
 
Creating Session Data from eTextbook Event Streams
Creating Session Data from eTextbook Event StreamsCreating Session Data from eTextbook Event Streams
Creating Session Data from eTextbook Event Streams
 
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
Augmenting Digital Textbooks with Reusable Smart Learning Content: Solutions ...
 
Interactions of reading and assessment activities
Interactions of reading and assessment activitiesInteractions of reading and assessment activities
Interactions of reading and assessment activities
 
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for EducationYAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
YAI4Edu: an Explanatory AI to Generate Interactive e-Books for Education
 
Automatic Question Generation for Evidence-based Online Courseware Engineering
Automatic Question Generation for Evidence-based Online Courseware EngineeringAutomatic Question Generation for Evidence-based Online Courseware Engineering
Automatic Question Generation for Evidence-based Online Courseware Engineering
 
Reading Comprehension Quiz Generation using Generative Pre-trained Transformers
Reading Comprehension Quiz Generation using Generative Pre-trained TransformersReading Comprehension Quiz Generation using Generative Pre-trained Transformers
Reading Comprehension Quiz Generation using Generative Pre-trained Transformers
 
Mathematical Language Processing via Tree Embeddings
Mathematical Language Processing via Tree EmbeddingsMathematical Language Processing via Tree Embeddings
Mathematical Language Processing via Tree Embeddings
 
Contextual Definition Generation
Contextual Definition GenerationContextual Definition Generation
Contextual Definition Generation
 
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
Transforming Textbooks into Learning by Doing Environments: An Evaluation of ...
 
Generation of Assessment Questions from Textbooks Enriched with Knowledge Models
Generation of Assessment Questions from Textbooks Enriched with Knowledge ModelsGeneration of Assessment Questions from Textbooks Enriched with Knowledge Models
Generation of Assessment Questions from Textbooks Enriched with Knowledge Models
 
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
Using Semantics of Textbook Highlights to Predict Student Comprehension and K...
 
Dental TutorBot: Exploitation of Dental Textbooks for Automated Learning
Dental TutorBot: Exploitation of Dental Textbooks for Automated LearningDental TutorBot: Exploitation of Dental Textbooks for Automated Learning
Dental TutorBot: Exploitation of Dental Textbooks for Automated Learning
 
What's in a textbook
What's in a textbookWhat's in a textbook
What's in a textbook
 
Using Programmed Instruction to Help Students Engage with eTextbook Content
Using Programmed Instruction to Help Students Engage with eTextbook Content Using Programmed Instruction to Help Students Engage with eTextbook Content
Using Programmed Instruction to Help Students Engage with eTextbook Content
 
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
Adding Intelligence to a Textbook for Human Anatomy with a Causal Concept Map...
 

Último

Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 

Último (20)

Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorption
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 

Parallel Construction: A Parallel Corpus Approach for Automatic Question Generation in Non-English Languages

  • 1. c Parallel Construction: A Parallel Corpus Method for Automatic Question Generation in Non-English Languages Benny G. Johnson, Jeffrey S. Dittel, Rachel Van Campenhout, Rodrigo Bistolfi, Aida Maeda, and Bill Jerome VitalSource Technologies, Research and Development AIED2022 iTextbooks
  • 2. c Problem • English is the dominant language in automatic question generation (AQG) research. • NLP tools needed for AQG are often under-resourced in non-English languages. It would be desirable to leverage the research and existing AQG systems in English for other languages.
  • 4. c Automatic Question Generation Research has found no difference in how students use AI- generated versus human- authored questions. Van Campenhout, R., Brown, N., Jerome, B., Dittel, J. S., & Johnson, B. G. (2021). Toward Effective Courseware at Scale: Investigating Automatically Generated Questions as Formative Practice. Learning at Scale. pp. 295–298. https://doi.org/10.1145/3430895.3460162 Van Campenhout, R., Dittel, J. S., Jerome, B., & Johnson, B. G. (2021). Transforming textbooks into learning by doing environments: an evaluation of textbook-based automatic question generation. In: Third Workshop on Intelligent Textbooks at the 22nd International Conference on Artificial Intelligence in Education. CEUR Workshop Proceedings, ISSN 1613-0073, pp. 1–12. Retrieved from: http://ceur-ws.org/Vol- 2895/paper06.pdf Johnson, B. G., Dittel, J. S., Van Campenhout, R., & Jerome, B. (2022). Discrimination of automatically generated questions used as formative practice. Proceedings of the Ninth ACM Conference on Learning@Scale (pp. 325-329). https://doi.org/10.1145/3491140.3528323
  • 5. c Method Parallel construction uses machine translation (MT) and a parallel corpus approach. 1. Translate the textbook to English using MT, e.g., Google Translate. 2. Align the sentences and words in the parallel corpus. 3. Perform English AQG exactly as usual. 4. For each QG step in English, perform the equivalent manipulation directly on the original text using the alignment.
  • 6. c Questions Why not simply implement AQG directly in Spanish? This can be done, but it’s much more work. In our case, the English AQG system had already been developed, validated, and tested. Parallel construction enables its reuse for other languages too.
  • 7. c Questions Why not simply use MT to translate the textbook to English, do AQG, and then translate the questions back to the original language? There is still a large gap in quality between MT and human translation. The errors and noise in MT make this approach insufficient for educational applications.
  • 8. c Method Source language questions are kept up to date in parallel with the English questions being generated, hence parallel construction. Advantages: • All AQG decisions are made by the English system. • The linguistic quality of the source text is preserved. • Much less development work than direct AQG.
  • 9. c Example Cloze matching question, Spanish-language macroeconomics textbook. Step 1: English system selects sentence for question creation. However, during the 1980s many borrowing LDCs were unable to cope with the burden of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a consequence, their economic growth. countries experienced a serious decline.
  • 10. c Example Cloze matching question, Spanish-language macroeconomics textbook. Step 1: English system selects sentence for question creation. However, during the 1980s many borrowing LDCs were unable to cope with the burden of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a consequence, their economic growth. countries experienced a serious decline. Corresponding Spanish sentence retrieved using alignment. Sin embargo, durante la década de 1980 muchos PMD prestatarios no pudieron hacer frente a la carga de su deuda exterior –situación que se conoce con el nombre de crisis de la deuda de los PMD– y, quizá como consecuencia, el crecimiento económico de estos países experimentó una grave disminución.
  • 11. c Example Step 2: English system selects answer words. borrowing, crisis, decline Corresponding Spanish words retrieved using alignment. prestatarios, crisis, disminución
  • 12. c Example Step 3: Final question in English. However, during the 1980s many ______ LDCs were unable to cope with the burden of their foreign debt - a situation known as the LDC debt ______ - and, perhaps as a consequence, their economic growth. countries experienced a serious ______. Choices: borrowing, crisis, decline Final question in Spanish. Sin embargo, durante la década de 1980 muchos PMD ______ no pudieron hacer frente a la carga de su deuda exterior –situación que se conoce con el nombre de ______ de la deuda de los PMD– y, quizá como consecuencia, el crecimiento económico de estos países experimentó una grave ______. Opciones: crisis, disminución, prestatarios
  • 13. c Example The translated English sentence is noisy. However, during the 1980s many borrowing LDCs were unable to cope with the burden of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a consequence, their economic growth. countries experienced a serious decline. The back-translated Spanish question is unacceptable. Sin embargo, durante la década de 1980, muchos PMA ______ no pudieron hacer frente a la carga de su deuda externa, una situación conocida como la ______ de la deuda de los PMA, y, tal vez, como consecuencia, su crecimiento económico. Los países experimentaron un grave ______. Opciones: crisis, declive, prestatarios
  • 14. c Example The translated English sentence is noisy. However, during the 1980s many borrowing LDCs were unable to cope with the burden of their foreign debt - a situation known as the LDC debt crisis - and, perhaps as a consequence, their economic growth. countries experienced a serious decline. The back-translated Spanish question is unacceptable. Sin embargo, durante la década de 1980, muchos PMA ______ no pudieron hacer frente a la carga de su deuda externa, una situación conocida como la ______ de la deuda de los PMA, y, tal vez, como consecuencia, su crecimiento económico. Los países experimentaron un grave ______. Opciones: crisis, declive, prestatarios PMA = países menos avanzados PMD = países menos desarrollados
  • 15. c Example The parallel construction Spanish question is correct. Sin embargo, durante la década de 1980 muchos PMD ______ no pudieron hacer frente a la carga de su deuda exterior –situación que se conoce con el nombre de ______ de la deuda de los PMD– y, quizá como consecuencia, el crecimiento económico de estos países experimentó una grave ______. Opciones: crisis, disminución, prestatarios
  • 16. c Thank You! For questions or comments, please email: Benny Johnson, benny.johnson@vitalsource.com