SlideShare una empresa de Scribd logo
1 de 27
Descargar para leer sin conexión
Attentional Parallel RNNs for
Generating Punctuation in
Transcribed Speech
Alp Öktem, Mireia Farrús, Leo Wanner
E-mail: alp.oktem@upf.edu
Other works: https://www.researchgate.net/profile/Alp_Oktem
Github: https://github.com/alpoktem
Contents
1) Motivation
2) Punctuating spoken text
3) Approaches
a) Related Work
b) Our approach
4) Proposed model
5) Data and experimental setup
6) Results
7) Contributions
Motivation
...
so under that basis we put it out and said
look we're skeptical about this thing we
don't know but what can we do the
material looks good it feels right but we
just can't verify it and we then got a letter
just this week from the company who
wrote it wanting to track down the source
saying hey we want to track down the
source and we were like oh tell us more
what document is it precisely you're
talking about can you show that you had
legal authority over that document is it
really yours
...
ASR
Motivation
ASR
...
So under that basis, we put it out and
said, "Look, we're skeptical about this
thing. We don't know, but what can we
do? The material looks good, it feels
right, but we just can't verify it." And we
then got a letter just this week from the
company who wrote it, wanting to track
down the source saying, "Hey, we want
to track down the source." And we were
like, "Oh, tell us more. What document is
it, precisely, you're talking about? Can
you show that you had legal authority
over that document? Is it really yours?
...
Why punctuation?
Punctuation serves for:
● For human readability,
● To aid interpretation,
● For machine processing:
○ Parsing
○ Machine translation
Motivation
RESEARCH QUESTIONS
1. How to approach the problem of unpunctuated ASR output?
2. Which linguistic phenomena affect the placement of
punctuation marks in spoken text?
Contents
1) Motivation
2) Punctuating spoken text
3) Approaches
a) Related Work
b) Our approach
4) Proposed model
5) Data and experimental setup
6) Results
7) Contributions
Punctuating Spoken Text
What signals punctuation in speech?
1) Syntax/Orthography:
Usage of commas, which are required e.g. in seperating clauses, depend a lot on
syntax.
Today, I am giving a talk.
Punctuating Spoken Text
What signals punctuation in speech?
2) Prosody:
Contents
1) Motivation
2) Punctuating spoken text
3) Approaches
a) Related Work
b) Our approach
4) Proposed model
5) Data and experimental setup
6) Results
7) Contributions
Related Work
❖ Data-driven models → Trainable on any language
❖ Recurrent Neural Networks (RNN) employed on two kinds of data:
Written Data
Features: Lexical, POS
Written+Spoken Data
Features: Lexical, pause
durations
Training in two stages
(Ballesteros et al., 2016)
Many prosodic features contributing to punctuation usage
are neglected!
(Tilk et al., 2016)
Our Approach
❖ Process lexical and prosodic information in parallel.
❖ Train a model solely from spoken data
❖ Test various acoustic features contributing to prosody:
➢ Pause durations
➢ Fundemental frequency (f0)
➢ Intensity
Contents
1) Motivation
2) Punctuating spoken text
3) Approaches
a) Related Work
b) Our approach
4) Proposed model
5) Data and experimental setup
6) Results
7) Contributions
Proposed Model
Proposed Model
Bidirectional
Proposed Model
(Bahdanau et al.)
Contents
1) Motivation
2) Punctuating spoken text
3) Approaches
a) Related Work
b) Our approach
4) Proposed model
5) Data and experimental setup
6) Results
7) Contributions
Data
❖ 1046 TED Talks
❖ 884 English speakers
❖ 156034 sentences
❖ Manual transcription available
https://www.ted.com/talks
Acoustic/Prosodic Features
0,12 s
pause
duration
mean f0
mean
intensity
range
features
(max - min)
Measurements in semitones relative to speaker mean
Experimental Setup
❖ Reduced punctuation set
❖ 50 words per training sample
❖ 59811 samples
❖ %70-%15-%15: Training,
testing, validation
❖ Word vocabulary: 13830
❖ Implementation using Theano
no
punctuation
Contents
1) Motivation
2) Punctuating spoken text
3) Approaches
a) Related Work
b) Our approach
4) Proposed model
5) Data and experimental setup
6) Results
7) Contributions
Overall Results
Results for Each Punctuation Mark
Results from Testing Set
julian _ welcome . it's _ been _ reported _ that _ wikileaks _ your _ baby _ has _ in _
the _ last _ few _ years _ has _ released _ more _ classified _ documents _ than _ the
_ rest _ of _ the _ world's _ media _ combined . can _ that _ possibly _ be _ true ?
yeah , can _ it _ possibly _ be _ true ? it's _ a _ worry . isn't _ it _ that _ the _ rest _ of _
the _ world's _ media _ is _ doing _ such _ a _ bad _ job _ that _ a _ little _ group _ of
_ activists _ is _ able _ to _ release _ more _ of _ that _ type _ of _ information _ than _
the _ rest _ of _ the _ world _ press _ combined . how _ does _ it _ work ? how _ do _
people _ release _ the _ documents ?
who _ was _ the _ richest _ man ? still _ is _ the _ richest _ man _ in _ kenya .
when _ we _ released _ that _ report , we _ did _ so _ three _ days _ after _ the _ new
_ president _ kibaki _ had _ decided _ to _ pal _ up _ with _ the _ man _ that _ he _
was _ going _ to _ clean _ out , daniel _ arap _ moi .
Contents
1) Motivation
2) Punctuating spoken text
3) Approaches
a) Related Work
b) Our approach
4) Proposed model
5) Data and experimental setup
6) Results
7) Contributions
Contributions
❖ A study on the effect of various acoustic features on
punctuating spoken text.
❖ A model that is able to...
➢ process lexical/prosodic features in parallel
➢ integrate any aligned feature
❖ Training solely on spoken data
❖ Improvement compared to baseline (+%9,1 in terms of
F1
-score)
Source code available at:
https://github.com/alpoktem/punkProse
Results

Más contenido relacionado

La actualidad más candente

Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processingMinh Pham
 
Natural lanaguage processing
Natural lanaguage processingNatural lanaguage processing
Natural lanaguage processinggulshan kumar
 
Using Stanza NLP and TensorFlow to create a summary of a book
Using Stanza NLP and TensorFlow to create a summary of a bookUsing Stanza NLP and TensorFlow to create a summary of a book
Using Stanza NLP and TensorFlow to create a summary of a bookOlusola Amusan
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Mustafa Jarrar
 
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf EremyanDataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyanrudolf eremyan
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingDavid Rostcheck
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingPranav Gupta
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: ParsingRushdi Shams
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingIla Group
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language ProcessingMichel Bruley
 
Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...
Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...
Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...Antonio Toral
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyMarina Santini
 
NLP & Machine Learning - An Introductory Talk
NLP & Machine Learning - An Introductory Talk NLP & Machine Learning - An Introductory Talk
NLP & Machine Learning - An Introductory Talk Vijay Ganti
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introductionRobert Lujo
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games ResearchJose Zagal
 

La actualidad más candente (20)

Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processing
 
Natural lanaguage processing
Natural lanaguage processingNatural lanaguage processing
Natural lanaguage processing
 
Using Stanza NLP and TensorFlow to create a summary of a book
Using Stanza NLP and TensorFlow to create a summary of a bookUsing Stanza NLP and TensorFlow to create a summary of a book
Using Stanza NLP and TensorFlow to create a summary of a book
 
Tips and Tools for NMT
Tips and Tools for NMTTips and Tools for NMT
Tips and Tools for NMT
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing
 
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf EremyanDataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
DataFest 2017. Introduction to Natural Language Processing by Rudolf Eremyan
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Intro to nlp
Intro to nlpIntro to nlp
Intro to nlp
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
 
NLP Project Full Cycle
NLP Project Full CycleNLP Project Full Cycle
NLP Project Full Cycle
 
Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...
Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...
Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Mach...
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language Technology
 
NLP
NLPNLP
NLP
 
NLP & Machine Learning - An Introductory Talk
NLP & Machine Learning - An Introductory Talk NLP & Machine Learning - An Introductory Talk
NLP & Machine Learning - An Introductory Talk
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introduction
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
 

Similar a SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation in Transcribed Speech

Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introductionananth
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...alessio_ferrari
 
An-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-ProcessingAn-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-ProcessingTheodore J. LaGrow
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Daniel Adenew
 
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.Lifeng (Aaron) Han
 
French machine reading for question answering
French machine reading for question answeringFrench machine reading for question answering
French machine reading for question answeringAli Kabbadj
 
Natural language processing: feature extraction
Natural language processing: feature extractionNatural language processing: feature extraction
Natural language processing: feature extractionGabriel Hamilton
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
NLP Meetup 2023
NLP Meetup 2023NLP Meetup 2023
NLP Meetup 2023GabiMaeztu
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translationMarcis Pinnis
 
Introduction to Text Mining and Topic Modelling
Introduction to Text Mining and Topic ModellingIntroduction to Text Mining and Topic Modelling
Introduction to Text Mining and Topic ModellingDavid Paule
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1Sara Hooker
 
A decision tree based word sense disambiguation system in manipuri language
A decision tree based word sense disambiguation system in manipuri languageA decision tree based word sense disambiguation system in manipuri language
A decision tree based word sense disambiguation system in manipuri languageacijjournal
 
VOC real world enterprise needs
VOC real world enterprise needsVOC real world enterprise needs
VOC real world enterprise needsIvan Berlocher
 
Problem-based Learning & Resource-based Learning two complementary approac...
Problem-based Learning & Resource-based Learning  two complementary approac...Problem-based Learning & Resource-based Learning  two complementary approac...
Problem-based Learning & Resource-based Learning two complementary approac...Wilco te Winkel
 
NLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful inNLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful inKumari Naveen
 

Similar a SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation in Transcribed Speech (20)

Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introduction
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...
 
N01741100102
N01741100102N01741100102
N01741100102
 
An-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-ProcessingAn-Exploration-of-scientific-literature-using-Natural-Language-Processing
An-Exploration-of-scientific-literature-using-Natural-Language-Processing
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...
 
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
 
French machine reading for question answering
French machine reading for question answeringFrench machine reading for question answering
French machine reading for question answering
 
Parafraseo-Chenggang.pdf
Parafraseo-Chenggang.pdfParafraseo-Chenggang.pdf
Parafraseo-Chenggang.pdf
 
Natural language processing: feature extraction
Natural language processing: feature extractionNatural language processing: feature extraction
Natural language processing: feature extraction
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
NLP Meetup 2023
NLP Meetup 2023NLP Meetup 2023
NLP Meetup 2023
 
1910 HCLT
1910 HCLT1910 HCLT
1910 HCLT
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translation
 
SCTUR: A Sentiment Classification Technique for URDU
SCTUR: A Sentiment Classification Technique for URDUSCTUR: A Sentiment Classification Technique for URDU
SCTUR: A Sentiment Classification Technique for URDU
 
Introduction to Text Mining and Topic Modelling
Introduction to Text Mining and Topic ModellingIntroduction to Text Mining and Topic Modelling
Introduction to Text Mining and Topic Modelling
 
Module 8: Natural language processing Pt 1
Module 8:  Natural language processing Pt 1Module 8:  Natural language processing Pt 1
Module 8: Natural language processing Pt 1
 
A decision tree based word sense disambiguation system in manipuri language
A decision tree based word sense disambiguation system in manipuri languageA decision tree based word sense disambiguation system in manipuri language
A decision tree based word sense disambiguation system in manipuri language
 
VOC real world enterprise needs
VOC real world enterprise needsVOC real world enterprise needs
VOC real world enterprise needs
 
Problem-based Learning & Resource-based Learning two complementary approac...
Problem-based Learning & Resource-based Learning  two complementary approac...Problem-based Learning & Resource-based Learning  two complementary approac...
Problem-based Learning & Resource-based Learning two complementary approac...
 
NLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful inNLP Tasks and Applications.ppt useful in
NLP Tasks and Applications.ppt useful in
 

Último

Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...Lokesh Kothari
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsNurulAfiqah307317
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 

Último (20)

Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 

SLSP 2017 presentation - Attentional Parallel RNNs for Generating Punctuation in Transcribed Speech

  • 1. Attentional Parallel RNNs for Generating Punctuation in Transcribed Speech Alp Öktem, Mireia Farrús, Leo Wanner E-mail: alp.oktem@upf.edu Other works: https://www.researchgate.net/profile/Alp_Oktem Github: https://github.com/alpoktem
  • 2. Contents 1) Motivation 2) Punctuating spoken text 3) Approaches a) Related Work b) Our approach 4) Proposed model 5) Data and experimental setup 6) Results 7) Contributions
  • 3. Motivation ... so under that basis we put it out and said look we're skeptical about this thing we don't know but what can we do the material looks good it feels right but we just can't verify it and we then got a letter just this week from the company who wrote it wanting to track down the source saying hey we want to track down the source and we were like oh tell us more what document is it precisely you're talking about can you show that you had legal authority over that document is it really yours ... ASR
  • 4. Motivation ASR ... So under that basis, we put it out and said, "Look, we're skeptical about this thing. We don't know, but what can we do? The material looks good, it feels right, but we just can't verify it." And we then got a letter just this week from the company who wrote it, wanting to track down the source saying, "Hey, we want to track down the source." And we were like, "Oh, tell us more. What document is it, precisely, you're talking about? Can you show that you had legal authority over that document? Is it really yours? ...
  • 5. Why punctuation? Punctuation serves for: ● For human readability, ● To aid interpretation, ● For machine processing: ○ Parsing ○ Machine translation
  • 6. Motivation RESEARCH QUESTIONS 1. How to approach the problem of unpunctuated ASR output? 2. Which linguistic phenomena affect the placement of punctuation marks in spoken text?
  • 7. Contents 1) Motivation 2) Punctuating spoken text 3) Approaches a) Related Work b) Our approach 4) Proposed model 5) Data and experimental setup 6) Results 7) Contributions
  • 8. Punctuating Spoken Text What signals punctuation in speech? 1) Syntax/Orthography: Usage of commas, which are required e.g. in seperating clauses, depend a lot on syntax. Today, I am giving a talk.
  • 9. Punctuating Spoken Text What signals punctuation in speech? 2) Prosody:
  • 10. Contents 1) Motivation 2) Punctuating spoken text 3) Approaches a) Related Work b) Our approach 4) Proposed model 5) Data and experimental setup 6) Results 7) Contributions
  • 11. Related Work ❖ Data-driven models → Trainable on any language ❖ Recurrent Neural Networks (RNN) employed on two kinds of data: Written Data Features: Lexical, POS Written+Spoken Data Features: Lexical, pause durations Training in two stages (Ballesteros et al., 2016) Many prosodic features contributing to punctuation usage are neglected! (Tilk et al., 2016)
  • 12. Our Approach ❖ Process lexical and prosodic information in parallel. ❖ Train a model solely from spoken data ❖ Test various acoustic features contributing to prosody: ➢ Pause durations ➢ Fundemental frequency (f0) ➢ Intensity
  • 13. Contents 1) Motivation 2) Punctuating spoken text 3) Approaches a) Related Work b) Our approach 4) Proposed model 5) Data and experimental setup 6) Results 7) Contributions
  • 17. Contents 1) Motivation 2) Punctuating spoken text 3) Approaches a) Related Work b) Our approach 4) Proposed model 5) Data and experimental setup 6) Results 7) Contributions
  • 18. Data ❖ 1046 TED Talks ❖ 884 English speakers ❖ 156034 sentences ❖ Manual transcription available https://www.ted.com/talks
  • 19. Acoustic/Prosodic Features 0,12 s pause duration mean f0 mean intensity range features (max - min) Measurements in semitones relative to speaker mean
  • 20. Experimental Setup ❖ Reduced punctuation set ❖ 50 words per training sample ❖ 59811 samples ❖ %70-%15-%15: Training, testing, validation ❖ Word vocabulary: 13830 ❖ Implementation using Theano no punctuation
  • 21. Contents 1) Motivation 2) Punctuating spoken text 3) Approaches a) Related Work b) Our approach 4) Proposed model 5) Data and experimental setup 6) Results 7) Contributions
  • 23. Results for Each Punctuation Mark
  • 24. Results from Testing Set julian _ welcome . it's _ been _ reported _ that _ wikileaks _ your _ baby _ has _ in _ the _ last _ few _ years _ has _ released _ more _ classified _ documents _ than _ the _ rest _ of _ the _ world's _ media _ combined . can _ that _ possibly _ be _ true ? yeah , can _ it _ possibly _ be _ true ? it's _ a _ worry . isn't _ it _ that _ the _ rest _ of _ the _ world's _ media _ is _ doing _ such _ a _ bad _ job _ that _ a _ little _ group _ of _ activists _ is _ able _ to _ release _ more _ of _ that _ type _ of _ information _ than _ the _ rest _ of _ the _ world _ press _ combined . how _ does _ it _ work ? how _ do _ people _ release _ the _ documents ? who _ was _ the _ richest _ man ? still _ is _ the _ richest _ man _ in _ kenya . when _ we _ released _ that _ report , we _ did _ so _ three _ days _ after _ the _ new _ president _ kibaki _ had _ decided _ to _ pal _ up _ with _ the _ man _ that _ he _ was _ going _ to _ clean _ out , daniel _ arap _ moi .
  • 25. Contents 1) Motivation 2) Punctuating spoken text 3) Approaches a) Related Work b) Our approach 4) Proposed model 5) Data and experimental setup 6) Results 7) Contributions
  • 26. Contributions ❖ A study on the effect of various acoustic features on punctuating spoken text. ❖ A model that is able to... ➢ process lexical/prosodic features in parallel ➢ integrate any aligned feature ❖ Training solely on spoken data ❖ Improvement compared to baseline (+%9,1 in terms of F1 -score) Source code available at: https://github.com/alpoktem/punkProse