SlideShare a Scribd company logo
1 of 34
AFLAT 2010 – Malta Copyright JC Roux
Introduction
Aims
Multidisciplinary nature of our activities
Interaction between linguists and engineers / computer scientists “Every time I fire a linguist, the performance of our speech recognition system goes up.”
Striking a balance
Factors
Linguistic knowledge  Demonstrate how the development of technologies has an impact on the type of linguistic knowledge needed for particular applications Focus :  Two approaches towards speech synthesis 	Unit selection / Concatenative synthesis (Rule driven) 	HMM-based generation synthesis (Data driven)
Unit selection / Concatenative synthesis Widely accepted and approved technique Festival Speech Synthesis System (Black et al. 1998) FestVox (Black & Lenzo, 2000) Rule based :  e.g. prosodic models based on detailed data on duration, stress, intonation -  major languages of the world  Normally excellent (domain specific) quality of speech
Concatenative synthesis in a tone language Consider the following two words in isiXhosa where tonal movement takes place when the diminutive suffix /-ana/ is invoked: úmfúndìsì (teacher) úmfúndìsì + ana   > úmfùndísànà      HL > LH   In order to generate a speech version of example (2) consider the following (oversimplified) description of the process:
Language Component TEXT INPUT umfundisana MORPHOLOGICAL COMPONENT  u +mu +fund + is + ana # ú + mù+ fúnd + ìs + ana# LEXICON / TONAL ASSIGNMENT GRAPHEME PHONEME CONVERSION #u +mu +fund + is + ana #  NORMALISATION  # ú + m_+ fúnd + ìs + ana # # ú + m+ fúnd + ìs + ana # PHONOLOGICAL RULES V -> Ø / m_+ [ V root Nas-> [+ syllabic] / # V__ + C PHONETIC FORM [úmfundísa:na] PROSODIC RULES Penultimate vowel lengthening Tonal assignment HL > LH # ú + m+ fúnd + ìs + a:na # # ú + m+ fund + ís+ a:na #
Technical Component : Speech Generation SPEECH DATABASE of PRE-RECORDED UNITS: è bèsètèlèbé.., ù bùsùtùlùbú.., fúfùfáfà… (diphones, triphones………) UNIT SELECTION PROCESS Selection of applicable units from the speech database to match the required phonetic form  SETS OF POTENTIAL CANDIDATES ú        ndísa:        m      fu        na [   ]    [     ]     [    ]      [    ]   [    ]    [     ] [   ]    [     ]     [    ]      [    ]   [    ]    [     ] CONCATENATION ú   m   fundísa:    na SMOOTHING OF JUNCTURES SPOKEN FORM  [úmfundísa:na]
Availability of resources for rule based synthesis (1) Morphology? UNISA morphology project for Southern African languages of the Bantu group – great progress -  implementation level Phonology ? Various studies on aspects of the phonologies of African languages within different theories – theoretical models per se have limited implementation value Phonetics? Mainly impressionistic descriptions, very little quantitative studies in formats that are suitable for speech technology applications Experimental phonetic data mainly represent laboratory based speech – read speech -  lacking real world authenticity
Availability of resources for rule based synthesis (2) Prosody - Intonation? Pronunciation dictionaries Mapping: orthography to pronunciation > manual / G2P; Progress with the Lwazi project of Meraka, but  problem remains – basic tonal representation Zerbian & Barnard (2008): Phonetics of intonation in Southern African Bantu languages  Emphasises the need for quantitative acoustic data on intonational processes.
Nature of tonal data in Southern Bantu   Impressionistic descriptions (Interpretations of researcher) Examples of inconsistencies: isiXhosa  – the same speaker – three different  researchers (1969, 1973 & 1992) with different  tone marking for same items ! [Detailed discussions in Roux, 1991, 1995 (a) (b), 2001, 2003]
Questions   Which rules are to be used for implementation in a rule-based speech synthesis system? Alternatively:   When will reliable (tonal) rules be available in a useable format for speech synthesis, particularly in African tone languages? But, technologies change and give rise to new approaches to linguistic data – and the definition of linguistic knowledge
HMM-based Speech Synthesis System (HTS) 2002 First version released following pioneering work of Keiichi Tokudahttp://www.sp.nitech.ac.jp/~tokuda/
Technical detail on HMM synthesis for less resourced languages    Described in Roux, JC &  Visagie, AS. 2007.   Data-driven approach to rapid prototyping Xhosa speech synthesis, Proceedings of the 6th ISCA Speech 	Synthesis Workshop, Bonn, Germany, pp 143-147 Maia, R, Zen, H, Tokuda, K, Kitamura, T & Resende, FGV.  2003.  Towards the development of a Brazilian Portuguese Text-to-Speech system based on HMM, Eurospeech, Geneva, pp 2465-68
Types of knowledge involved  (1)
Types of knowledge involved  (2)
Types of knowledge involved  (3)    Point is: Fine grained phonetic data / tonological rules are not necessarily required for generating plausible intelligible speech – as long as the carriers of those information are present in the text data (and corresponding speech recordings)  used for training the system
Characteristics of text-to-speech (TTS) systems
Examples  isiXhosa ,[object Object]
no tonal information included
3 339 wordsSA English ,[object Object],[object Object]
Kubonisa (uku…)
English
Application of HTS  for TTS  development  in under resourced languages
African language TTS embedded on mobile devices ISIXHOSA  TEXT-TO-SPEECH  CONVERTER [Embedded into the system] [Also English TTS] ISIXHOSA  TEXTS   [English texts] Job training material / manuals Literacy and numeracy training material Terminologies  / definitions Multilingual Communication Phrases in different  environments, etc [REFERENCE TOOL] SPEECH :   ISIXHOSA  / ENGLISH
Implications for (phonetic) knowledge development
Pragmatic considerations: sustainability in African context?
In conclusion
References Black, A. 2006.  Multilingual speech synthesis.  In Schultz T & Kirchhoff, K (eds) Multilingual Speech Processing. Amsterdam; ELSEVIER.  Louw, JA, Davel, M & Barnard, E. 2005. A general-purpose isiZulu speech synthesiser. South African Journal of African Languages, 25(2): 92-100.  Roux, JC. 1991. On the integration of phonetics and phonology, South African Journal of Linguistics, vol 11:34-52 Roux, JC. 1995a. Prosodic data and phonological analyses in Zulu and Xhosa. South African Journal of African Languages, 15.1:19-28 Roux, JC. 1995b. On the perception and production of tone in Xhosa. South African Journal of African Languages, 15.4: 196-203
References (2) Roux, JC. 2001. Comments on “Zulu tonology and its relationship to other Nguni languages” by Cassimjee and Kisseberth.  Proceedings of the Symposium Cross-Linguistic Studies of Tonal Phenomena. Ed. S Kaji. Tokyo. pp. 361-367 Roux, JC. 2003. On the perception and description of tone in the Sotho and Nguni languages.  Proceedings of the Symposium Cross-Linguistic Studies of Tonal Phenomena: historical development, phonetics of tone and descriptive studies. Ed. S Kaji. Tokyo. pp. 155-176 Zerbian, S & Barnard, E. 2008. Phonetics of intonation in South African Bantu languages. Southern African Linguistics and Applied Language Studies, 26(2): 235-254

More Related Content

What's hot

Corpus Analysis in Corpus linguistics
Corpus Analysis in Corpus linguistics Corpus Analysis in Corpus linguistics
Corpus Analysis in Corpus linguistics Umm-e-Rooman Yaqoob
 
Corpus linguistics
Corpus linguisticsCorpus linguistics
Corpus linguisticsRaul Vargas
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageijnlc
 
Role of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesRole of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesDr. Amit Kumar Jha
 
voice recognition
voice recognition voice recognition
voice recognition Hemant Jain
 
Corpus study design
Corpus study designCorpus study design
Corpus study designbikashtaly
 
Introduction to Language and Linguistics 002: Introduction to Phonetics
Introduction to Language and Linguistics 002: Introduction to PhoneticsIntroduction to Language and Linguistics 002: Introduction to Phonetics
Introduction to Language and Linguistics 002: Introduction to PhoneticsMeagan Louie
 
The Ekegusii Determiner Phrase Analysis in the Minimalist Program
The Ekegusii Determiner Phrase Analysis in the Minimalist ProgramThe Ekegusii Determiner Phrase Analysis in the Minimalist Program
The Ekegusii Determiner Phrase Analysis in the Minimalist ProgramBasweti Nobert
 
Corpus Linguistics for Language Teaching and Learning
Corpus Linguistics for Language Teaching and LearningCorpus Linguistics for Language Teaching and Learning
Corpus Linguistics for Language Teaching and LearningMartin Wynne
 
A Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical AnalyzerA Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical AnalyzerWaqas Tariq
 
Introduction to Linguistic Subfields
Introduction to Linguistic SubfieldsIntroduction to Linguistic Subfields
Introduction to Linguistic SubfieldsMeagan Louie
 
Corpus linguistics the basics
Corpus linguistics the basicsCorpus linguistics the basics
Corpus linguistics the basicsJorge Baptista
 
Introduction to Language and Linguistics 001: What do you know when you know ...
Introduction to Language and Linguistics 001: What do you know when you know ...Introduction to Language and Linguistics 001: What do you know when you know ...
Introduction to Language and Linguistics 001: What do you know when you know ...Meagan Louie
 

What's hot (19)

Corpus Analysis in Corpus linguistics
Corpus Analysis in Corpus linguistics Corpus Analysis in Corpus linguistics
Corpus Analysis in Corpus linguistics
 
Corpus linguistics
Corpus linguisticsCorpus linguistics
Corpus linguistics
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala language
 
Role of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languagesRole of language engineering to preserve endangered languages
Role of language engineering to preserve endangered languages
 
voice recognition
voice recognition voice recognition
voice recognition
 
Corpus study design
Corpus study designCorpus study design
Corpus study design
 
The corpus research method
The corpus research methodThe corpus research method
The corpus research method
 
Corpus linguistics
Corpus linguisticsCorpus linguistics
Corpus linguistics
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
Introduction to Language and Linguistics 002: Introduction to Phonetics
Introduction to Language and Linguistics 002: Introduction to PhoneticsIntroduction to Language and Linguistics 002: Introduction to Phonetics
Introduction to Language and Linguistics 002: Introduction to Phonetics
 
The Ekegusii Determiner Phrase Analysis in the Minimalist Program
The Ekegusii Determiner Phrase Analysis in the Minimalist ProgramThe Ekegusii Determiner Phrase Analysis in the Minimalist Program
The Ekegusii Determiner Phrase Analysis in the Minimalist Program
 
Corpus Linguistics for Language Teaching and Learning
Corpus Linguistics for Language Teaching and LearningCorpus Linguistics for Language Teaching and Learning
Corpus Linguistics for Language Teaching and Learning
 
ROSP-20201-Syllabus
ROSP-20201-SyllabusROSP-20201-Syllabus
ROSP-20201-Syllabus
 
Corpus linguistics
Corpus linguisticsCorpus linguistics
Corpus linguistics
 
Corpora and its use in elt
Corpora and its use in eltCorpora and its use in elt
Corpora and its use in elt
 
A Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical AnalyzerA Computational Model of Yoruba Morphology Lexical Analyzer
A Computational Model of Yoruba Morphology Lexical Analyzer
 
Introduction to Linguistic Subfields
Introduction to Linguistic SubfieldsIntroduction to Linguistic Subfields
Introduction to Linguistic Subfields
 
Corpus linguistics the basics
Corpus linguistics the basicsCorpus linguistics the basics
Corpus linguistics the basics
 
Introduction to Language and Linguistics 001: What do you know when you know ...
Introduction to Language and Linguistics 001: What do you know when you know ...Introduction to Language and Linguistics 001: What do you know when you know ...
Introduction to Language and Linguistics 001: What do you know when you know ...
 

Viewers also liked

Power Point - Acquiring Knowledge for Second Language Use
Power Point - Acquiring Knowledge for Second Language UsePower Point - Acquiring Knowledge for Second Language Use
Power Point - Acquiring Knowledge for Second Language UseIndra Kurniawan
 
Pautas verano 2012 infantil
Pautas verano 2012 infantilPautas verano 2012 infantil
Pautas verano 2012 infantilmirkonicolas
 
GoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web ScaleGoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web ScaleMartin Hepp
 
Hands on Data Communications, Networking & TCP/IP Troubleshooting
Hands on Data Communications, Networking & TCP/IP TroubleshootingHands on Data Communications, Networking & TCP/IP Troubleshooting
Hands on Data Communications, Networking & TCP/IP TroubleshootingLiving Online
 
The Future of Retail: Recasting the Retail Store in Today's Omni-channel World
The Future of Retail: Recasting the Retail Store in Today's Omni-channel WorldThe Future of Retail: Recasting the Retail Store in Today's Omni-channel World
The Future of Retail: Recasting the Retail Store in Today's Omni-channel WorldDeborah Weinswig
 
CTBL-Watch - Issue 32 - August 2016
CTBL-Watch - Issue 32 - August 2016CTBL-Watch - Issue 32 - August 2016
CTBL-Watch - Issue 32 - August 2016Rachel Bennett
 
Trabajo individual nº 2 benites dominguez nocturno computacion e informatica
Trabajo individual nº 2 benites dominguez nocturno computacion e informaticaTrabajo individual nº 2 benites dominguez nocturno computacion e informatica
Trabajo individual nº 2 benites dominguez nocturno computacion e informaticaALISSA SAC
 
Bonded abrasive tools catergory
Bonded abrasive tools catergoryBonded abrasive tools catergory
Bonded abrasive tools catergoryBrian Zhao
 
Connotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_ppt
Connotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_pptConnotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_ppt
Connotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_pptCarlos Lopes
 
Sirris manufacturingday2011 future-of-mes-sirris
Sirris manufacturingday2011 future-of-mes-sirrisSirris manufacturingday2011 future-of-mes-sirris
Sirris manufacturingday2011 future-of-mes-sirrisSirris
 
ATMC TPS case study_A10-CS-80178-EN_0316 FINAL
ATMC TPS case study_A10-CS-80178-EN_0316 FINALATMC TPS case study_A10-CS-80178-EN_0316 FINAL
ATMC TPS case study_A10-CS-80178-EN_0316 FINALPete Kowalsky
 
Melissa Mathena Resume (1)
Melissa Mathena Resume (1)Melissa Mathena Resume (1)
Melissa Mathena Resume (1)Melissa Mathena
 
Manual Relaciones Públicas 2013
Manual Relaciones Públicas 2013Manual Relaciones Públicas 2013
Manual Relaciones Públicas 2013Rotary 4320
 
Tipos De Tiempo Carac TeríSticos En EspañA
Tipos De Tiempo Carac TeríSticos En EspañATipos De Tiempo Carac TeríSticos En EspañA
Tipos De Tiempo Carac TeríSticos En EspañAsalvamenor
 

Viewers also liked (20)

Power Point - Acquiring Knowledge for Second Language Use
Power Point - Acquiring Knowledge for Second Language UsePower Point - Acquiring Knowledge for Second Language Use
Power Point - Acquiring Knowledge for Second Language Use
 
Pautas verano 2012 infantil
Pautas verano 2012 infantilPautas verano 2012 infantil
Pautas verano 2012 infantil
 
Haider Ruf Juni 2013
Haider Ruf Juni 2013Haider Ruf Juni 2013
Haider Ruf Juni 2013
 
Impact delta works pilarczyk
Impact delta works  pilarczykImpact delta works  pilarczyk
Impact delta works pilarczyk
 
GoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web ScaleGoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
 
Buscar en internet 1
Buscar en internet 1Buscar en internet 1
Buscar en internet 1
 
Hands on Data Communications, Networking & TCP/IP Troubleshooting
Hands on Data Communications, Networking & TCP/IP TroubleshootingHands on Data Communications, Networking & TCP/IP Troubleshooting
Hands on Data Communications, Networking & TCP/IP Troubleshooting
 
The Future of Retail: Recasting the Retail Store in Today's Omni-channel World
The Future of Retail: Recasting the Retail Store in Today's Omni-channel WorldThe Future of Retail: Recasting the Retail Store in Today's Omni-channel World
The Future of Retail: Recasting the Retail Store in Today's Omni-channel World
 
CTBL-Watch - Issue 32 - August 2016
CTBL-Watch - Issue 32 - August 2016CTBL-Watch - Issue 32 - August 2016
CTBL-Watch - Issue 32 - August 2016
 
Trabajo individual nº 2 benites dominguez nocturno computacion e informatica
Trabajo individual nº 2 benites dominguez nocturno computacion e informaticaTrabajo individual nº 2 benites dominguez nocturno computacion e informatica
Trabajo individual nº 2 benites dominguez nocturno computacion e informatica
 
Bonded abrasive tools catergory
Bonded abrasive tools catergoryBonded abrasive tools catergory
Bonded abrasive tools catergory
 
Connotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_ppt
Connotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_pptConnotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_ppt
Connotea and CiteUlike | Milk Group | 3 EHAIL 2010 | Carlos Lopes_ppt
 
Sirris manufacturingday2011 future-of-mes-sirris
Sirris manufacturingday2011 future-of-mes-sirrisSirris manufacturingday2011 future-of-mes-sirris
Sirris manufacturingday2011 future-of-mes-sirris
 
ATMC TPS case study_A10-CS-80178-EN_0316 FINAL
ATMC TPS case study_A10-CS-80178-EN_0316 FINALATMC TPS case study_A10-CS-80178-EN_0316 FINAL
ATMC TPS case study_A10-CS-80178-EN_0316 FINAL
 
Melissa Mathena Resume (1)
Melissa Mathena Resume (1)Melissa Mathena Resume (1)
Melissa Mathena Resume (1)
 
Visual Thinking
Visual ThinkingVisual Thinking
Visual Thinking
 
Informe Final del Presupuesto Participativo 2009
Informe Final del Presupuesto Participativo 2009Informe Final del Presupuesto Participativo 2009
Informe Final del Presupuesto Participativo 2009
 
Manual Relaciones Públicas 2013
Manual Relaciones Públicas 2013Manual Relaciones Públicas 2013
Manual Relaciones Públicas 2013
 
09. el verbo
09. el verbo09. el verbo
09. el verbo
 
Tipos De Tiempo Carac TeríSticos En EspañA
Tipos De Tiempo Carac TeríSticos En EspañATipos De Tiempo Carac TeríSticos En EspañA
Tipos De Tiempo Carac TeríSticos En EspañA
 

Similar to Do we need linguistic knowledge for speech technology applications in African languages?

Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silencepaperpublications3
 
Guia Didactica Uno
Guia Didactica UnoGuia Didactica Uno
Guia Didactica UnoUNEFM
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speechBilgin Aksoy
 
thesis(overview)
thesis(overview)thesis(overview)
thesis(overview)Michael D.
 
B047006011
B047006011B047006011
B047006011inventy
 
The main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-systemThe main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-systemCemal Ardil
 
DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...
DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...
DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...indexPub
 
Machine transliteration survey
Machine transliteration surveyMachine transliteration survey
Machine transliteration surveyunyil96
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...kevig
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...ijnlc
 
A new framework based on KNN and DT for speech identification through emphat...
A new framework based on KNN and DT for speech  identification through emphat...A new framework based on KNN and DT for speech  identification through emphat...
A new framework based on KNN and DT for speech identification through emphat...nooriasukmaningtyas
 
Introduction to English phonology and phonetics pptx
Introduction to English phonology and phonetics pptxIntroduction to English phonology and phonetics pptx
Introduction to English phonology and phonetics pptxAttallah Alanazi
 
LINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCE
LINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCELINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCE
LINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCEArgie Ramos
 
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...iosrjce
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesissipij
 

Similar to Do we need linguistic knowledge for speech technology applications in African languages? (20)

Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
Guia Didactica Uno
Guia Didactica UnoGuia Didactica Uno
Guia Didactica Uno
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 
G1803013542
G1803013542G1803013542
G1803013542
 
thesis(overview)
thesis(overview)thesis(overview)
thesis(overview)
 
B047006011
B047006011B047006011
B047006011
 
The main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-systemThe main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-system
 
Ey4301913917
Ey4301913917Ey4301913917
Ey4301913917
 
Speech-Recognition.pptx
Speech-Recognition.pptxSpeech-Recognition.pptx
Speech-Recognition.pptx
 
DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...
DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...
DIALECTAL VARIABILITY IN SPOKEN LANGUAGE: A COMPREHENSIVE SURVEY OF MODERN TE...
 
Machine transliteration survey
Machine transliteration surveyMachine transliteration survey
Machine transliteration survey
 
551 466-472
551 466-472551 466-472
551 466-472
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
CONSTRUCTION OF ENGLISH-BODO PARALLEL TEXT CORPUS FOR STATISTICAL MACHINE TRA...
 
A new framework based on KNN and DT for speech identification through emphat...
A new framework based on KNN and DT for speech  identification through emphat...A new framework based on KNN and DT for speech  identification through emphat...
A new framework based on KNN and DT for speech identification through emphat...
 
Introduction to English phonology and phonetics pptx
Introduction to English phonology and phonetics pptxIntroduction to English phonology and phonetics pptx
Introduction to English phonology and phonetics pptx
 
LINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCE
LINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCELINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCE
LINGUISTIC - DISCIPLINE AND IDEAS IN THE SOCIAL SCIENCE
 
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
 
FYPReport
FYPReportFYPReport
FYPReport
 
High Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech SynthesisHigh Quality Arabic Concatenative Speech Synthesis
High Quality Arabic Concatenative Speech Synthesis
 

More from Guy De Pauw

Technological Tools for Dictionary and Corpora Building for Minority Language...
Technological Tools for Dictionary and Corpora Building for Minority Language...Technological Tools for Dictionary and Corpora Building for Minority Language...
Technological Tools for Dictionary and Corpora Building for Minority Language...Guy De Pauw
 
Semi-automated extraction of morphological grammars for Nguni with special re...
Semi-automated extraction of morphological grammars for Nguni with special re...Semi-automated extraction of morphological grammars for Nguni with special re...
Semi-automated extraction of morphological grammars for Nguni with special re...Guy De Pauw
 
Resource-Light Bantu Part-of-Speech Tagging
Resource-Light Bantu Part-of-Speech TaggingResource-Light Bantu Part-of-Speech Tagging
Resource-Light Bantu Part-of-Speech TaggingGuy De Pauw
 
Natural Language Processing for Amazigh Language
Natural Language Processing for Amazigh LanguageNatural Language Processing for Amazigh Language
Natural Language Processing for Amazigh LanguageGuy De Pauw
 
POS Annotated 50m Corpus of Tajik Language
POS Annotated 50m Corpus of Tajik LanguagePOS Annotated 50m Corpus of Tajik Language
POS Annotated 50m Corpus of Tajik LanguageGuy De Pauw
 
The Tagged Icelandic Corpus (MÍM)
The Tagged Icelandic Corpus (MÍM)The Tagged Icelandic Corpus (MÍM)
The Tagged Icelandic Corpus (MÍM)Guy De Pauw
 
Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...
Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...
Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...Guy De Pauw
 
Tagging and Verifying an Amharic News Corpus
Tagging and Verifying an Amharic News CorpusTagging and Verifying an Amharic News Corpus
Tagging and Verifying an Amharic News CorpusGuy De Pauw
 
A Corpus of Santome
A Corpus of SantomeA Corpus of Santome
A Corpus of SantomeGuy De Pauw
 
Automatic Structuring and Correction Suggestion System for Hungarian Clinical...
Automatic Structuring and Correction Suggestion System for Hungarian Clinical...Automatic Structuring and Correction Suggestion System for Hungarian Clinical...
Automatic Structuring and Correction Suggestion System for Hungarian Clinical...Guy De Pauw
 
Compiling Apertium Dictionaries with HFST
Compiling Apertium Dictionaries with HFSTCompiling Apertium Dictionaries with HFST
Compiling Apertium Dictionaries with HFSTGuy De Pauw
 
The Database of Modern Icelandic Inflection
The Database of Modern Icelandic InflectionThe Database of Modern Icelandic Inflection
The Database of Modern Icelandic InflectionGuy De Pauw
 
Learning Morphological Rules for Amharic Verbs Using Inductive Logic Programming
Learning Morphological Rules for Amharic Verbs Using Inductive Logic ProgrammingLearning Morphological Rules for Amharic Verbs Using Inductive Logic Programming
Learning Morphological Rules for Amharic Verbs Using Inductive Logic ProgrammingGuy De Pauw
 
Issues in Designing a Corpus of Spoken Irish
Issues in Designing a Corpus of Spoken IrishIssues in Designing a Corpus of Spoken Irish
Issues in Designing a Corpus of Spoken IrishGuy De Pauw
 
How to build language technology resources for the next 100 years
How to build language technology resources for the next 100 yearsHow to build language technology resources for the next 100 years
How to build language technology resources for the next 100 yearsGuy De Pauw
 
Towards Standardizing Evaluation Test Sets for Compound Analysers
Towards Standardizing Evaluation Test Sets for Compound AnalysersTowards Standardizing Evaluation Test Sets for Compound Analysers
Towards Standardizing Evaluation Test Sets for Compound AnalysersGuy De Pauw
 
The PALDO Concept - New Paradigms for African Language Resource Development
The PALDO Concept - New Paradigms for African Language Resource DevelopmentThe PALDO Concept - New Paradigms for African Language Resource Development
The PALDO Concept - New Paradigms for African Language Resource DevelopmentGuy De Pauw
 
A System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá CharactersA System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá CharactersGuy De Pauw
 
IFE-MT: An English-to-Yorùbá Machine Translation System
IFE-MT: An English-to-Yorùbá Machine Translation SystemIFE-MT: An English-to-Yorùbá Machine Translation System
IFE-MT: An English-to-Yorùbá Machine Translation SystemGuy De Pauw
 
A Number to Yorùbá Text Transcription System
A Number to Yorùbá Text Transcription SystemA Number to Yorùbá Text Transcription System
A Number to Yorùbá Text Transcription SystemGuy De Pauw
 

More from Guy De Pauw (20)

Technological Tools for Dictionary and Corpora Building for Minority Language...
Technological Tools for Dictionary and Corpora Building for Minority Language...Technological Tools for Dictionary and Corpora Building for Minority Language...
Technological Tools for Dictionary and Corpora Building for Minority Language...
 
Semi-automated extraction of morphological grammars for Nguni with special re...
Semi-automated extraction of morphological grammars for Nguni with special re...Semi-automated extraction of morphological grammars for Nguni with special re...
Semi-automated extraction of morphological grammars for Nguni with special re...
 
Resource-Light Bantu Part-of-Speech Tagging
Resource-Light Bantu Part-of-Speech TaggingResource-Light Bantu Part-of-Speech Tagging
Resource-Light Bantu Part-of-Speech Tagging
 
Natural Language Processing for Amazigh Language
Natural Language Processing for Amazigh LanguageNatural Language Processing for Amazigh Language
Natural Language Processing for Amazigh Language
 
POS Annotated 50m Corpus of Tajik Language
POS Annotated 50m Corpus of Tajik LanguagePOS Annotated 50m Corpus of Tajik Language
POS Annotated 50m Corpus of Tajik Language
 
The Tagged Icelandic Corpus (MÍM)
The Tagged Icelandic Corpus (MÍM)The Tagged Icelandic Corpus (MÍM)
The Tagged Icelandic Corpus (MÍM)
 
Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...
Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...
Describing Morphologically Rich Languages Using Metagrammars a Look at Verbs ...
 
Tagging and Verifying an Amharic News Corpus
Tagging and Verifying an Amharic News CorpusTagging and Verifying an Amharic News Corpus
Tagging and Verifying an Amharic News Corpus
 
A Corpus of Santome
A Corpus of SantomeA Corpus of Santome
A Corpus of Santome
 
Automatic Structuring and Correction Suggestion System for Hungarian Clinical...
Automatic Structuring and Correction Suggestion System for Hungarian Clinical...Automatic Structuring and Correction Suggestion System for Hungarian Clinical...
Automatic Structuring and Correction Suggestion System for Hungarian Clinical...
 
Compiling Apertium Dictionaries with HFST
Compiling Apertium Dictionaries with HFSTCompiling Apertium Dictionaries with HFST
Compiling Apertium Dictionaries with HFST
 
The Database of Modern Icelandic Inflection
The Database of Modern Icelandic InflectionThe Database of Modern Icelandic Inflection
The Database of Modern Icelandic Inflection
 
Learning Morphological Rules for Amharic Verbs Using Inductive Logic Programming
Learning Morphological Rules for Amharic Verbs Using Inductive Logic ProgrammingLearning Morphological Rules for Amharic Verbs Using Inductive Logic Programming
Learning Morphological Rules for Amharic Verbs Using Inductive Logic Programming
 
Issues in Designing a Corpus of Spoken Irish
Issues in Designing a Corpus of Spoken IrishIssues in Designing a Corpus of Spoken Irish
Issues in Designing a Corpus of Spoken Irish
 
How to build language technology resources for the next 100 years
How to build language technology resources for the next 100 yearsHow to build language technology resources for the next 100 years
How to build language technology resources for the next 100 years
 
Towards Standardizing Evaluation Test Sets for Compound Analysers
Towards Standardizing Evaluation Test Sets for Compound AnalysersTowards Standardizing Evaluation Test Sets for Compound Analysers
Towards Standardizing Evaluation Test Sets for Compound Analysers
 
The PALDO Concept - New Paradigms for African Language Resource Development
The PALDO Concept - New Paradigms for African Language Resource DevelopmentThe PALDO Concept - New Paradigms for African Language Resource Development
The PALDO Concept - New Paradigms for African Language Resource Development
 
A System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá CharactersA System for the Recognition of Handwritten Yorùbá Characters
A System for the Recognition of Handwritten Yorùbá Characters
 
IFE-MT: An English-to-Yorùbá Machine Translation System
IFE-MT: An English-to-Yorùbá Machine Translation SystemIFE-MT: An English-to-Yorùbá Machine Translation System
IFE-MT: An English-to-Yorùbá Machine Translation System
 
A Number to Yorùbá Text Transcription System
A Number to Yorùbá Text Transcription SystemA Number to Yorùbá Text Transcription System
A Number to Yorùbá Text Transcription System
 

Recently uploaded

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 

Recently uploaded (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 

Do we need linguistic knowledge for speech technology applications in African languages?

  • 1. AFLAT 2010 – Malta Copyright JC Roux
  • 5. Interaction between linguists and engineers / computer scientists “Every time I fire a linguist, the performance of our speech recognition system goes up.”
  • 8. Linguistic knowledge Demonstrate how the development of technologies has an impact on the type of linguistic knowledge needed for particular applications Focus : Two approaches towards speech synthesis Unit selection / Concatenative synthesis (Rule driven) HMM-based generation synthesis (Data driven)
  • 9. Unit selection / Concatenative synthesis Widely accepted and approved technique Festival Speech Synthesis System (Black et al. 1998) FestVox (Black & Lenzo, 2000) Rule based : e.g. prosodic models based on detailed data on duration, stress, intonation - major languages of the world Normally excellent (domain specific) quality of speech
  • 10. Concatenative synthesis in a tone language Consider the following two words in isiXhosa where tonal movement takes place when the diminutive suffix /-ana/ is invoked: úmfúndìsì (teacher) úmfúndìsì + ana > úmfùndísànà HL > LH In order to generate a speech version of example (2) consider the following (oversimplified) description of the process:
  • 11. Language Component TEXT INPUT umfundisana MORPHOLOGICAL COMPONENT u +mu +fund + is + ana # ú + mù+ fúnd + ìs + ana# LEXICON / TONAL ASSIGNMENT GRAPHEME PHONEME CONVERSION #u +mu +fund + is + ana # NORMALISATION # ú + m_+ fúnd + ìs + ana # # ú + m+ fúnd + ìs + ana # PHONOLOGICAL RULES V -> Ø / m_+ [ V root Nas-> [+ syllabic] / # V__ + C PHONETIC FORM [úmfundísa:na] PROSODIC RULES Penultimate vowel lengthening Tonal assignment HL > LH # ú + m+ fúnd + ìs + a:na # # ú + m+ fund + ís+ a:na #
  • 12. Technical Component : Speech Generation SPEECH DATABASE of PRE-RECORDED UNITS: è bèsètèlèbé.., ù bùsùtùlùbú.., fúfùfáfà… (diphones, triphones………) UNIT SELECTION PROCESS Selection of applicable units from the speech database to match the required phonetic form SETS OF POTENTIAL CANDIDATES ú ndísa: m fu na [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] [ ] CONCATENATION ú m fundísa: na SMOOTHING OF JUNCTURES SPOKEN FORM [úmfundísa:na]
  • 13. Availability of resources for rule based synthesis (1) Morphology? UNISA morphology project for Southern African languages of the Bantu group – great progress - implementation level Phonology ? Various studies on aspects of the phonologies of African languages within different theories – theoretical models per se have limited implementation value Phonetics? Mainly impressionistic descriptions, very little quantitative studies in formats that are suitable for speech technology applications Experimental phonetic data mainly represent laboratory based speech – read speech - lacking real world authenticity
  • 14. Availability of resources for rule based synthesis (2) Prosody - Intonation? Pronunciation dictionaries Mapping: orthography to pronunciation > manual / G2P; Progress with the Lwazi project of Meraka, but problem remains – basic tonal representation Zerbian & Barnard (2008): Phonetics of intonation in Southern African Bantu languages Emphasises the need for quantitative acoustic data on intonational processes.
  • 15. Nature of tonal data in Southern Bantu Impressionistic descriptions (Interpretations of researcher) Examples of inconsistencies: isiXhosa – the same speaker – three different researchers (1969, 1973 & 1992) with different tone marking for same items ! [Detailed discussions in Roux, 1991, 1995 (a) (b), 2001, 2003]
  • 16. Questions Which rules are to be used for implementation in a rule-based speech synthesis system? Alternatively: When will reliable (tonal) rules be available in a useable format for speech synthesis, particularly in African tone languages? But, technologies change and give rise to new approaches to linguistic data – and the definition of linguistic knowledge
  • 17. HMM-based Speech Synthesis System (HTS) 2002 First version released following pioneering work of Keiichi Tokudahttp://www.sp.nitech.ac.jp/~tokuda/
  • 18. Technical detail on HMM synthesis for less resourced languages Described in Roux, JC & Visagie, AS. 2007. Data-driven approach to rapid prototyping Xhosa speech synthesis, Proceedings of the 6th ISCA Speech Synthesis Workshop, Bonn, Germany, pp 143-147 Maia, R, Zen, H, Tokuda, K, Kitamura, T & Resende, FGV. 2003. Towards the development of a Brazilian Portuguese Text-to-Speech system based on HMM, Eurospeech, Geneva, pp 2465-68
  • 19. Types of knowledge involved (1)
  • 20. Types of knowledge involved (2)
  • 21. Types of knowledge involved (3) Point is: Fine grained phonetic data / tonological rules are not necessarily required for generating plausible intelligible speech – as long as the carriers of those information are present in the text data (and corresponding speech recordings) used for training the system
  • 23.
  • 25.
  • 28. Application of HTS for TTS development in under resourced languages
  • 29. African language TTS embedded on mobile devices ISIXHOSA TEXT-TO-SPEECH CONVERTER [Embedded into the system] [Also English TTS] ISIXHOSA TEXTS [English texts] Job training material / manuals Literacy and numeracy training material Terminologies / definitions Multilingual Communication Phrases in different environments, etc [REFERENCE TOOL] SPEECH : ISIXHOSA / ENGLISH
  • 30. Implications for (phonetic) knowledge development
  • 33. References Black, A. 2006. Multilingual speech synthesis. In Schultz T & Kirchhoff, K (eds) Multilingual Speech Processing. Amsterdam; ELSEVIER. Louw, JA, Davel, M & Barnard, E. 2005. A general-purpose isiZulu speech synthesiser. South African Journal of African Languages, 25(2): 92-100. Roux, JC. 1991. On the integration of phonetics and phonology, South African Journal of Linguistics, vol 11:34-52 Roux, JC. 1995a. Prosodic data and phonological analyses in Zulu and Xhosa. South African Journal of African Languages, 15.1:19-28 Roux, JC. 1995b. On the perception and production of tone in Xhosa. South African Journal of African Languages, 15.4: 196-203
  • 34. References (2) Roux, JC. 2001. Comments on “Zulu tonology and its relationship to other Nguni languages” by Cassimjee and Kisseberth. Proceedings of the Symposium Cross-Linguistic Studies of Tonal Phenomena. Ed. S Kaji. Tokyo. pp. 361-367 Roux, JC. 2003. On the perception and description of tone in the Sotho and Nguni languages. Proceedings of the Symposium Cross-Linguistic Studies of Tonal Phenomena: historical development, phonetics of tone and descriptive studies. Ed. S Kaji. Tokyo. pp. 155-176 Zerbian, S & Barnard, E. 2008. Phonetics of intonation in South African Bantu languages. Southern African Linguistics and Applied Language Studies, 26(2): 235-254
  • 35.
  • 37. Why mobile platform? “Africa has become the fastest growing mobile market in the world with mobile penetration in the region ranging from 100% to 30%.” http://whiteafrican.com/2008/08/01/2007-african-mobile-phone-statistics
  • 38.
  • 39. Why speech output? Speech is a natural way of communication - hence, in this context, it provides access to information in a language of choice to non-literates as well a literates. Acutely aware of the low literacy rate in many parts of Africa.