SlideShare una empresa de Scribd logo
1 de 5
Descargar para leer sin conexión
Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications
ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450

RESEARCH ARTICLE

www.ijera.com

OPEN ACCESS

Automatic Syllabification Rules for ASSAMESE Language
Laba Kr. Thakuria1, Prof. P.H. Talukdar2
Department of Instrumentation & USIC, Gauhati University Guwahati, India
Department of Instrumentation & USIC, Gauhati University Guwahati, India

Abstract
For unit selection based text-to-speech system, syllabification acts as a backbone. Based on different structures
of different languages syllabification rules are also varies. The purpose of this study is to examine and analyse
the syllabification rules for Assamese language. Imparting education and training, preferably, in the
local/regional language is urgently necessary in today’s context in order to maintain social harmony and
homogeneity. Language heterogeneity is a global problem in bringing all the benefits of Information
Technology (IT) to our doorsteps. Syllabification rules are implemented into an algorithm which later can be
integrated into a text-to-speech system. The analysis of these rules has been taken using 10000 phonetically rich
words which reports to produce a comparable result of 99% accuracy as compared to manual syllabification.
Keywords: Assamese language, diphthongs, phonemes, syllable, text-to-speech.

I. Introduction
Syllable is a unit of sound which is larger
than phoneme and smaller than a word [5].
Syllabification algorithms are mainly used in text-tospeech (TTS) systems in producing natural sounding
speech, and in speech recognizers in detecting out-ofvocabulary words[4]. Syllable forms as a gap
between a phonemes and words [1]. Various attempts
have been made to define a syllable earlier.
According to phonetics it is defined with respect to
its articulation whereas in phonology it is simply
termed as a sequence of phonemes [3].When a word
is broken down into its syllables the process is called
syllabification. As we humans also syllabify a word,
as far as possible before speaking if possible and
phonemic segmentation if not, text-to-speech systems
usually use syllable based approach as a basic unit
[6]. A syllable based text-to-speech system performs
always better than a phoneme level approach in terms
of naturalness and easy in boundary analysis.In this
study there is an honest endeavour to syllabify
ASSAMESE language as well as implemented an
algorithm which further automatically divides words
into its constituent syllables.It is the aim of the
proposed paper to study the phonetic variations of
Assamese language to develop syllabification rules
for Assamese language. the algorithm was tested
using 5000 phonetically rich words. The degree of
accuracy of the algorithm is 99%.
The research paper is organized as
Assamese Language and its Phonological structure. It
also describes the Family tree of the Assamese
Language. Then it describes the Syllable structure
and its algorithm with rules. This section also
includes its working methodology.
www.ijera.com

II. Assamese Language And Its
Phonological Structure
Assamese is an Indo-Aryan language spoken
by the Assamese people in general. The mixed Aryan
culture and the mongoloid culture gave birth to a new
culture. So, every community from this region always
exhibits their indigenous culture with diversity. It is
the link language for the people living in Assam and
adjoining states of Arunachal Pradesh, Meghalaya,
Nagaland etc. This language has come from Sanskrit
as its offshoot, through different stages of
development.

Fig-1: Proto Indo-European Family Tree
The ASSAMESE phonemic inventory
consists of eight vowels, ten diphthongs, twenty-one
consonants and two semi vowels [3]. The
ASSAMESE vowels and consonants are shown in the
tables below.

446 | P a g e
Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications
ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450
Table 1: Vowels in ASSAMESE.
Front Central
i
Close
eê
Half-close
ɛ
Half-open

Back
uû
oô
ɔ

A

Open

The vowel sounds in Assamese Language
occur in all the three positions, namely word initially,
medially, and finally. Examples are shown below.
Table 2: Examples of Vowel Sounds
Monophthongs

/ɛ /

Initially Medially
Finally
/aaitaa/
/i/ ‘he’ ‘grandmother’ /xi/ ‘he’
/deutaa/
/ek/ ‘one ’
‘father
/de/ ‘give’
/bɛ l/ ‘kind of /kɛ nɛ /
/ɛ taa/ ‘one piece’ Fruit’
‘how’

/a/
/u/
/û/

/aam/ ‘mango’
/ur/ ‘fly’
/ ûkani/ ‘leech’

/o/

/osar/ ‘near’
/ ɔ kanmaan/
‘little’

/i/
/e/

/ɔ /

/bal/ ‘child’
/phul/‘flower’
/bûl/ ‘color’
/gondha/
‘smell’
/bɔ l/
‘strength’

/kalaa/
‘black’
/saku/ ‘eye’

/lo/ ‘iron’
/lɔ /e ‘take’

There are ten diphthongs in Assamese Language as
follows:
Table 3: Diphthongs in Assamese Language
Diphthongs
/ai/
/ei/
/ oi /

/ ɔ i/
/ui/
/iu/
/ou/
/ au/
/eu/
/ua/

Initially Medially
/aai/ ‘mother’ /kaait/ ‘thorn’
/eitu/ ‘this ’
/xeitu/ ‘that’
/oi/
‘interjection’
/ghoiniiyek/ ‘wife’

Finally
/paai/ ‘to
reach’
/dei/ ‘okay’
/ekaanabboi/
‘ninety one’

/ ɔ inya /
‘somebody else’ /p ɔ itaa / ‘cockroch’ /h ɔ i / ‘yes’
/ui/ ‘white ant’
/dui/ ‘two’
/ ouxadh /
‘medicine’
/ aaloukik / ‘spiritual’
/ auxi/ ‘no moon /paautaa/ ‘one who
day’
get’
/deuri/’personal title’
/uaar/ ‘cover’ /guaal/ ‘milkman’

www.ijera.com

/mou/
‘honey’
/xaau/
‘curse’
/deu/ ‘priest’

www.ijera.com

There are twenty-three consonant sounds
including two semi-vowels in Assamese Language.
Table 4: Consonants in ASSAMESE
Nature
of
Alveol
Vel
Articulat
Bi-labial ar
Palatal ar
ion
p
b t
d
K
g
Plosive

ph

bʱ

Fricative
Nasal
Lateral
Rolled
Semi Vowel

M

d̪
th ʱ
s
z
n
l
r

kh
x

ɡ
ʱ

ŋ
l

w

Glottal

j

III. Syllabification
Syllables are considered as the set of
smallest speech sound in a language that
distinguishes one word from another. The key
objective of this study is to set the syllabification
rules for ASSAMESE language using an algorithm
which syllabify a word. In syllabification the location
of some syllables in word plays a great role. Syllable
shows different behaviour in a form of articulation
features and pronunciation if it occurs in word initial,
final or intermediary position[7]. In this research
work the syllabification is actually achieved by
looking at different patterns of syllables meaning that
we can experiment with monosyllabic, disyllabic,
trisyllabic and polysyllabic.
Thus, the purpose of this work is to formulate an
algorithm for dividing Assamese words into
syllables.

IV. Methodology
The methodologies followed in the present
study are:
a) Examine Assamese syllables structures from
linguistic literature.[6]
b) Group discussions with linguists.
c) Dialect variations.
d) Study of previous works performed [2].

V. Subjects
Data was obtained from the Assamese
Dictionary (Hemkosha) and also from the native
speakers of the respective language and was
syllabified by five different speakers between 20 and
25 years of age participated in the study. The entire
test was done on two male speakers and three female
speakers respectively. On the basis of these
experiments almost all the syllable templates
available in Assamese were found. About 5000
447 | P a g e

ɦ
Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications
ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450
monosyllabic, disyllabic and polysyllabic words were
experimented to find out the syllable structure in
Assamese. For the words of Assamese having
different structure see the Appendix A.

iii.

iv.

VI. Procedure
The entire five subjects were asked to record
the words of Assamese language, which were
selected for contain almost all the syllable templates.
The recording was done in a quiet room with a noise
cancelling microphone using the recording facilities
of a typical multimedia computer system. The
boundaries of syllable were marked according to the
pronunciation of the team. The voice was recorded
and processed on Audacity and Cool Edit Pro. By
observing formants through spectrogram, syllable
structure and boundary were evaluated. The
equipment used in recording was a microphone with
a frequency response of 48000 Hz with a 16 bit
sound card and high quality speakers.

VII.

Result

The recording was done on 5000 distinct words
extracted from a Assamese corpus and then compared
with manual syllabification of the same words to
measure accuracy. Heterogeneous nature of texts
obtained from the News Paper, Feature Articles Text
books, Radio news, short stories etc[2]. A list of
distinct words of 5000 most frequently occurring
words chosen for testing the algorithm. The 5000
words results some 20,655 syllables .The algorithm
achieves an overall accuracy of 98.05% when
compared with the same words manually syllabified
by an expert.
A.

Syllable Structure
Syllable structure of ASSAMESE phonemes
of vowels (V) and consonants(C) are as follows:1. V
2. VC
3. CV
4. CVC
5. CVV
6. CCVC
7. CCV
8. VCC
B.

Syllabification rules
An assumption is taken in this rule that
diphthongs are considered as a single vowel unit. The
syllabification rules developed after study are given
below:i.
The syllable boundary of a word having single
vowel is at the end of the word. (V, CV)
ii.
For a word having CVCC* structure, then
syllable boundary is marked after first vowel.
ie CV/C, CV/CVV, CV/CV,CV/CVC etc
www.ijera.com

v.

vi.

vii.

viii.

ix.

www.ijera.com

For a word having VCV* structure , then
syllable boundary is marked after SECOND
vowel. ie VCV/, VCV/C, VCV/CV,
VCV/CVCetc
For a word having CVV* structure , then
syllable boundary is marked after SECOND
vowel. ie
CVV/,CVV/C,CVV/CV,CVV/CVC,CVV/V
etc
For a word having VCC* structure , then
syllable boundary is marked after second
CONSONANT. ie VCC/VCV, VCC/V,
VCC/VC etc
For a word having CCV structure , then NO
syllable boundary is marked.
SYLLABIFICATION is only makred after
completion of the word. ie CCV/
For a word having CCVC* structure , then
syllable boundary is marked after first vowel.
ie CCV/C, CCV/CV, CCV/CVC,CCV/CVV
etc
For a word having CVCV structure , then
syllable boundary is marked after first vowel.
ie CV/CV, CV/CVV etc
For a word having CVVCX structure , then
CHECK X
A) If x=vowel(V) ie CVVCV, boundary will
found after second vowel
B) If x=consonant(C) ie CVVCC , boundary
will found between the two vowels.

C.

Implementation
All the above rules were implemented as a
recursive function in a C++ environment which
checks the presence of syllables in the sub word till
the word boundary is reached. The words are
processed from left to right.

VIII.

Algorithm

In this section, the Bodo syllabification rules
identified in the section (4.2) are presented in the
form of a formal algorithm. The function syllabify()
accepts an array of phonemes generated, along with a
variable called current_index which is used to
determine the position of the given array currently
being processed by the algorithm. Initially the
current_index variable will be initialized to 0. The
syllabify() functionis called recursively until all
phonemes in the array are processed.
The
function
mark_syllable_boundary(position) will mark the
syllable boundaries of an accepted array of
phonemes. The other functions used within the
syllabify () function are described below.
i) total_vowels(phonemes): accepts an array of
phonemes and returns the number of vowels
contained in that array.
448 | P a g e
Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications
ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450
ii) is_vowel(phoneme): accepts a phoneme and
returns true if the given phoneme is a vowel.
iii) total_vowels_bet_cons (): accepts a word with
current position and returns the total number of
vowels till the next consonant from current
position. It also returns the length of the total
vowels, length of the first vowel and length of
the first consonant, after current position, as
reference.
iv) total_cons_bet_vowels () : accepts a word with
Current position and returns the total number of
Consonants till the next vowel from current
position. It also returns the length of the first
consonants, after current position, as reference.
v) get_syllables () : accepts a word with current
position, syllable boundary and returns the
syllable.

[3].
[4].

[5].

[6].

IX. Conclusion
The above algorithm was tested on 5000
distinct Assamese Words, collected from Assamese
corpus of dictionary (Hemkosha), short stories and
news and compared it with manual syllabification we
get the result of about 20,665 syllables with accuracy
of 99%.This paper is presented an algorithm for
dividing Assamese words into syllables. The
algorithm is based on grammatical rules proposed in
section (4.2) and does not require high computational
cost. Syllabification is an important component of
many speech and language processing system[4], and
it is expected that this algorithm is going to be a
significant contribution to the researchers working on
various aspects of the Assamese language.

X. Acknowledgement
We would like to express my sincere
gratitude and heartfelt thanks to my guide Prof. Pran
Hari
Talukdar,
Professor,
Department
of
Instrumentation and USIC, Gauhati University. we
would not have been able to complete the research
work and shape it in the form of the research paper
without his consistent advice, and never ending
enthusiasm, positivity, encouragement, support and
understanding. we are very fortunate for having an
opportunity to work with him from which I benefited
enormously. we are also grateful to Gauhati
University for giving me to access the data from
GU_Galo_Adi corpus.

[7].
[8].
[9].
[10].
[11].

[12].

[13].
[14].

[15].

www.ijera.com

Rules for Bodo Language, International
Journal Of Computational Engineering
Research, vol. 2 issue. 6, pp 110-114,
October 2012.
Banikanta Kakati. Assamese, its formation
and development.
Ruvan Weerasinghe, Asanka Wasala and
Kumudu Gamage. A Rule Based
Syllabification Algorithm for Sinhala
Parminder Singh, Gurpreet Singh Lehal.
Syllables Selection for the Development of
Speech Database for Punjabi TTS
System, IJCSI International Journal of
Computer Science Issues, vol. 7, Issue 6,
November 2010.
Nungleppam Gopil Singh, Purnendu
Acharjee, Prof. P. H. Talukdar. An
Improved
Automatic
Syllabification
Rules for BODO Language. International
Journal of Computing, Communications and
Networking, vol. 1, No.3, NovemberDecember 2012.
Jehangir zaman khan. Syllabification rules
in pashto
Heriberto Cuay´ahuitl. A Syllabification
Algorithm for Spanish
Upendra Nath Goswami. An Introduction
to Assamese
Hemchandra Barua. HEMKOSHA
Couto I., Neto N., Tadaiesky, V. Klautau, A.
Maia, R.2010. An open source HMMbased text-to-speech system for Brazilian
Portuguese, in Proc. 7th International
Telecommunications Symposium Manaus.
Kevin Hock, Julian Smart, Stefan Csomor.
Cross-Platform GUI Programming with
Wxwidgets, Pearson edition, 2006.
Juliette
Blevins,The
syllable
in
phonological theory,1995
George Kiraz and Bernd M¨obius.
Multilingual
syllabification
using
weighted finite-state transducers, in Proc.
of the 3rd Workshop on Speech Synthesis,
1998.
Robert Damper. Learning about speech
from data: Beyond NETtalk, in DataDriven Techniques in Speech Synthesis, pp
1–25, 2001

References
[1].

[2].

ChandanSarma,
U.Sharma,
C.K.Nath,
S.Kalita, P.H.Talukdar. Selection of Units
and Development of Speech Database for
Natural Sounding Bodo TTS System,
CISP Guwahati, March 2012.
Jyotismita Talukdar, Chandan Sarma, Prof.
P.H Talukdar. Automatic Syllabification

www.ijera.com

449 | P a g e
Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications
ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450

www.ijera.com

Appendix A: Word List
Table 5 : Word list
Assamese

Structure

Type

এই

ei

V

monosyllabic

এক

ek

Vc

monosyllabic

এ

e

V

monosyllabic

ঊধ্

uudh

Vc

monosyllabic

ঊণ্

uun

Vc

monosyllabic

ঊস্

uux

Vc

monosyllabic

ককছিল

koisil

cv/cvc

Disyllabic

ককছি

koise

cv/cv

Disyllabic

কৈছিল

goisil

cv/cvc

Disyllabic

কৈছিল

thoisil

cv/cvc

Disyllabic

কৈছি

thoise

cv/cv

Disyllabic

দূৰৈৈ

duuroir

cv/cvc

Disyllabic

কৈছি

hoise

cv/cv

Disyllabic

এৰৈৱা

ekhoiwaa

v/cv/cv

Trisyllabic

ককছিলা

koisilaa

cv/cv/cv

Trisyllabic

ককছিছল

koisile

cv/cv/cv

Trisyllabic

কৈণীছেক
ত াৰলছক

ghoiniiyek
toloike

cv/cv/cvc
cv/cv/cv

Trisyllabic
Trisyllabic

কদৱছ াৈ

doiwajog

cv/cv/cvc

Trisyllabic

এপৈৰলছক
ত ছ োৰলছক

www.ijera.com

Roman

eparaloike
tetiyaaloike

v/cv/cv/cv/cv
cv/cv/cv/cv/cv

polysyllabic
polysyllabic

450 | P a g e

Más contenido relacionado

La actualidad más candente

Morpheme Based Myanmar Word Segmenter
Morpheme Based Myanmar Word SegmenterMorpheme Based Myanmar Word Segmenter
Morpheme Based Myanmar Word Segmenterijtsrd
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerijnlc
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...sipij
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai
'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai 'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai
'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai Dyslexia International
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silencepaperpublications3
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...kevig
 
Phonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech SystemsPhonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech Systemspaperpublications3
 
Designing A Rule Based Stemming Algorithm for Kambaata Language Text
Designing A Rule Based Stemming Algorithm for Kambaata Language TextDesigning A Rule Based Stemming Algorithm for Kambaata Language Text
Designing A Rule Based Stemming Algorithm for Kambaata Language TextCSCJournals
 
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσαςβ2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσαςPETROS MANZIERIS
 
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσαςβ2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσαςPetros Manzieris
 
OPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMER
OPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMEROPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMER
OPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMERijnlc
 
Anaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer methodAnaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer methodijcsa
 
Word sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy wordsWord sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy wordsijnlc
 

La actualidad más candente (16)

Morpheme Based Myanmar Word Segmenter
Morpheme Based Myanmar Word SegmenterMorpheme Based Myanmar Word Segmenter
Morpheme Based Myanmar Word Segmenter
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzer
 
Ey4301913917
Ey4301913917Ey4301913917
Ey4301913917
 
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
Sipij040305SPEECH EVALUATION WITH SPECIAL FOCUS ON CHILDREN SUFFERING FROM AP...
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai
'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai 'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai
'Understanding Chinese Developmental Dyslexia...' By Professor Alice Lai
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
 
Phonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech SystemsPhonetic Recognition In Words For Persian Text To Speech Systems
Phonetic Recognition In Words For Persian Text To Speech Systems
 
Designing A Rule Based Stemming Algorithm for Kambaata Language Text
Designing A Rule Based Stemming Algorithm for Kambaata Language TextDesigning A Rule Based Stemming Algorithm for Kambaata Language Text
Designing A Rule Based Stemming Algorithm for Kambaata Language Text
 
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσαςβ2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
 
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσαςβ2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
β2 πινακες δεξιοτητων και αξιολογησεων εξετασεων αγγλικης γλωσσας
 
Aw32322326
Aw32322326Aw32322326
Aw32322326
 
OPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMER
OPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMEROPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMER
OPTIMIZE THE LEARNING RATE OF NEURAL ARCHITECTURE IN MYANMAR STEMMER
 
Anaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer methodAnaphora resolution in hindi language using gazetteer method
Anaphora resolution in hindi language using gazetteer method
 
Word sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy wordsWord sense disambiguation using wsd specific wordnet of polysemy words
Word sense disambiguation using wsd specific wordnet of polysemy words
 

Destacado (20)

Bj4201407412
Bj4201407412Bj4201407412
Bj4201407412
 
Br4201458461
Br4201458461Br4201458461
Br4201458461
 
Av4103298302
Av4103298302Av4103298302
Av4103298302
 
Ba4102382385
Ba4102382385Ba4102382385
Ba4102382385
 
Au4102342349
Au4102342349Au4102342349
Au4102342349
 
Bf4102414417
Bf4102414417Bf4102414417
Bf4102414417
 
Az4102375381
Az4102375381Az4102375381
Az4102375381
 
Hr3613551360
Hr3613551360Hr3613551360
Hr3613551360
 
Go3511621168
Go3511621168Go3511621168
Go3511621168
 
diploma_final
diploma_finaldiploma_final
diploma_final
 
Green Lantern Info
Green Lantern InfoGreen Lantern Info
Green Lantern Info
 
Degree
DegreeDegree
Degree
 
umperville
umpervilleumperville
umperville
 
letter of reccomendation
letter of reccomendationletter of reccomendation
letter of reccomendation
 
Diario Resumen 20150122
Diario Resumen 20150122Diario Resumen 20150122
Diario Resumen 20150122
 
ACROPOLIS
ACROPOLISACROPOLIS
ACROPOLIS
 
Scanned from Baku RO_ EBRD-project management
Scanned from Baku RO_ EBRD-project managementScanned from Baku RO_ EBRD-project management
Scanned from Baku RO_ EBRD-project management
 
seo1
seo1seo1
seo1
 
Management Course
Management CourseManagement Course
Management Course
 
IEEE CERTIFICATE
IEEE CERTIFICATEIEEE CERTIFICATE
IEEE CERTIFICATE
 

Similar a Bp4201446450

MYANMAR WORDS SORTING
MYANMAR WORDS SORTING MYANMAR WORDS SORTING
MYANMAR WORDS SORTING kevig
 
Efficiency of the energy contained in modulators in the Arabic vowels recogni...
Efficiency of the energy contained in modulators in the Arabic vowels recogni...Efficiency of the energy contained in modulators in the Arabic vowels recogni...
Efficiency of the energy contained in modulators in the Arabic vowels recogni...IJECEIAES
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silencepaperpublications3
 
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...IOSR Journals
 
A Word Stemming Algorithm for Hausa Language
A Word Stemming Algorithm for Hausa LanguageA Word Stemming Algorithm for Hausa Language
A Word Stemming Algorithm for Hausa Languageiosrjce
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...ijma
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...ijma
 
Automatic Phonetization-based Statistical Linguistic Study of Standard Arabic
Automatic Phonetization-based Statistical Linguistic Study of Standard ArabicAutomatic Phonetization-based Statistical Linguistic Study of Standard Arabic
Automatic Phonetization-based Statistical Linguistic Study of Standard ArabicCSCJournals
 
B047006011
B047006011B047006011
B047006011inventy
 
B047006011
B047006011B047006011
B047006011inventy
 
Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...
Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...
Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...IJERA Editor
 
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...iosrjce
 
An OT Account of Phonological Alignment and Epenthesis in Aligarh Urdu
An OT Account of Phonological Alignment and Epenthesis in Aligarh UrduAn OT Account of Phonological Alignment and Epenthesis in Aligarh Urdu
An OT Account of Phonological Alignment and Epenthesis in Aligarh Urduijtsrd
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionaryiosrjce
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
Ijarcet vol-3-issue-3-623-625 (1)
Ijarcet vol-3-issue-3-623-625 (1)Ijarcet vol-3-issue-3-623-625 (1)
Ijarcet vol-3-issue-3-623-625 (1)Dhabal Sethi
 
Using automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityUsing automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityijaia
 

Similar a Bp4201446450 (20)

MYANMAR WORDS SORTING
MYANMAR WORDS SORTING MYANMAR WORDS SORTING
MYANMAR WORDS SORTING
 
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
 
Efficiency of the energy contained in modulators in the Arabic vowels recogni...
Efficiency of the energy contained in modulators in the Arabic vowels recogni...Efficiency of the energy contained in modulators in the Arabic vowels recogni...
Efficiency of the energy contained in modulators in the Arabic vowels recogni...
 
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On SilenceSegmentation Words for Speech Synthesis in Persian Language Based On Silence
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
 
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...Substitution Error Analysis for Improving the Word Accuracy in  Telugu Langua...
Substitution Error Analysis for Improving the Word Accuracy in Telugu Langua...
 
A Word Stemming Algorithm for Hausa Language
A Word Stemming Algorithm for Hausa LanguageA Word Stemming Algorithm for Hausa Language
A Word Stemming Algorithm for Hausa Language
 
D017362531
D017362531D017362531
D017362531
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
 
Automatic Phonetization-based Statistical Linguistic Study of Standard Arabic
Automatic Phonetization-based Statistical Linguistic Study of Standard ArabicAutomatic Phonetization-based Statistical Linguistic Study of Standard Arabic
Automatic Phonetization-based Statistical Linguistic Study of Standard Arabic
 
B0340710
B0340710B0340710
B0340710
 
B047006011
B047006011B047006011
B047006011
 
B047006011
B047006011B047006011
B047006011
 
Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...
Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...
Duration for Classification and Regression Treefor Marathi Textto- Speech Syn...
 
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
Acoustic Duration of Arabic Utterances Spoken by the Indonesian Learners of A...
 
An OT Account of Phonological Alignment and Epenthesis in Aligarh Urdu
An OT Account of Phonological Alignment and Epenthesis in Aligarh UrduAn OT Account of Phonological Alignment and Epenthesis in Aligarh Urdu
An OT Account of Phonological Alignment and Epenthesis in Aligarh Urdu
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionary
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
Ijarcet vol-3-issue-3-623-625 (1)
Ijarcet vol-3-issue-3-623-625 (1)Ijarcet vol-3-issue-3-623-625 (1)
Ijarcet vol-3-issue-3-623-625 (1)
 
Using automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivityUsing automated lexical resources in arabic sentence subjectivity
Using automated lexical resources in arabic sentence subjectivity
 

Último

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 

Último (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 

Bp4201446450

  • 1. Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450 RESEARCH ARTICLE www.ijera.com OPEN ACCESS Automatic Syllabification Rules for ASSAMESE Language Laba Kr. Thakuria1, Prof. P.H. Talukdar2 Department of Instrumentation & USIC, Gauhati University Guwahati, India Department of Instrumentation & USIC, Gauhati University Guwahati, India Abstract For unit selection based text-to-speech system, syllabification acts as a backbone. Based on different structures of different languages syllabification rules are also varies. The purpose of this study is to examine and analyse the syllabification rules for Assamese language. Imparting education and training, preferably, in the local/regional language is urgently necessary in today’s context in order to maintain social harmony and homogeneity. Language heterogeneity is a global problem in bringing all the benefits of Information Technology (IT) to our doorsteps. Syllabification rules are implemented into an algorithm which later can be integrated into a text-to-speech system. The analysis of these rules has been taken using 10000 phonetically rich words which reports to produce a comparable result of 99% accuracy as compared to manual syllabification. Keywords: Assamese language, diphthongs, phonemes, syllable, text-to-speech. I. Introduction Syllable is a unit of sound which is larger than phoneme and smaller than a word [5]. Syllabification algorithms are mainly used in text-tospeech (TTS) systems in producing natural sounding speech, and in speech recognizers in detecting out-ofvocabulary words[4]. Syllable forms as a gap between a phonemes and words [1]. Various attempts have been made to define a syllable earlier. According to phonetics it is defined with respect to its articulation whereas in phonology it is simply termed as a sequence of phonemes [3].When a word is broken down into its syllables the process is called syllabification. As we humans also syllabify a word, as far as possible before speaking if possible and phonemic segmentation if not, text-to-speech systems usually use syllable based approach as a basic unit [6]. A syllable based text-to-speech system performs always better than a phoneme level approach in terms of naturalness and easy in boundary analysis.In this study there is an honest endeavour to syllabify ASSAMESE language as well as implemented an algorithm which further automatically divides words into its constituent syllables.It is the aim of the proposed paper to study the phonetic variations of Assamese language to develop syllabification rules for Assamese language. the algorithm was tested using 5000 phonetically rich words. The degree of accuracy of the algorithm is 99%. The research paper is organized as Assamese Language and its Phonological structure. It also describes the Family tree of the Assamese Language. Then it describes the Syllable structure and its algorithm with rules. This section also includes its working methodology. www.ijera.com II. Assamese Language And Its Phonological Structure Assamese is an Indo-Aryan language spoken by the Assamese people in general. The mixed Aryan culture and the mongoloid culture gave birth to a new culture. So, every community from this region always exhibits their indigenous culture with diversity. It is the link language for the people living in Assam and adjoining states of Arunachal Pradesh, Meghalaya, Nagaland etc. This language has come from Sanskrit as its offshoot, through different stages of development. Fig-1: Proto Indo-European Family Tree The ASSAMESE phonemic inventory consists of eight vowels, ten diphthongs, twenty-one consonants and two semi vowels [3]. The ASSAMESE vowels and consonants are shown in the tables below. 446 | P a g e
  • 2. Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450 Table 1: Vowels in ASSAMESE. Front Central i Close eê Half-close ɛ Half-open Back uû oô ɔ A Open The vowel sounds in Assamese Language occur in all the three positions, namely word initially, medially, and finally. Examples are shown below. Table 2: Examples of Vowel Sounds Monophthongs /ɛ / Initially Medially Finally /aaitaa/ /i/ ‘he’ ‘grandmother’ /xi/ ‘he’ /deutaa/ /ek/ ‘one ’ ‘father /de/ ‘give’ /bɛ l/ ‘kind of /kɛ nɛ / /ɛ taa/ ‘one piece’ Fruit’ ‘how’ /a/ /u/ /û/ /aam/ ‘mango’ /ur/ ‘fly’ / ûkani/ ‘leech’ /o/ /osar/ ‘near’ / ɔ kanmaan/ ‘little’ /i/ /e/ /ɔ / /bal/ ‘child’ /phul/‘flower’ /bûl/ ‘color’ /gondha/ ‘smell’ /bɔ l/ ‘strength’ /kalaa/ ‘black’ /saku/ ‘eye’ /lo/ ‘iron’ /lɔ /e ‘take’ There are ten diphthongs in Assamese Language as follows: Table 3: Diphthongs in Assamese Language Diphthongs /ai/ /ei/ / oi / / ɔ i/ /ui/ /iu/ /ou/ / au/ /eu/ /ua/ Initially Medially /aai/ ‘mother’ /kaait/ ‘thorn’ /eitu/ ‘this ’ /xeitu/ ‘that’ /oi/ ‘interjection’ /ghoiniiyek/ ‘wife’ Finally /paai/ ‘to reach’ /dei/ ‘okay’ /ekaanabboi/ ‘ninety one’ / ɔ inya / ‘somebody else’ /p ɔ itaa / ‘cockroch’ /h ɔ i / ‘yes’ /ui/ ‘white ant’ /dui/ ‘two’ / ouxadh / ‘medicine’ / aaloukik / ‘spiritual’ / auxi/ ‘no moon /paautaa/ ‘one who day’ get’ /deuri/’personal title’ /uaar/ ‘cover’ /guaal/ ‘milkman’ www.ijera.com /mou/ ‘honey’ /xaau/ ‘curse’ /deu/ ‘priest’ www.ijera.com There are twenty-three consonant sounds including two semi-vowels in Assamese Language. Table 4: Consonants in ASSAMESE Nature of Alveol Vel Articulat Bi-labial ar Palatal ar ion p b t d K g Plosive ph bʱ Fricative Nasal Lateral Rolled Semi Vowel M d̪ th ʱ s z n l r kh x ɡ ʱ ŋ l w Glottal j III. Syllabification Syllables are considered as the set of smallest speech sound in a language that distinguishes one word from another. The key objective of this study is to set the syllabification rules for ASSAMESE language using an algorithm which syllabify a word. In syllabification the location of some syllables in word plays a great role. Syllable shows different behaviour in a form of articulation features and pronunciation if it occurs in word initial, final or intermediary position[7]. In this research work the syllabification is actually achieved by looking at different patterns of syllables meaning that we can experiment with monosyllabic, disyllabic, trisyllabic and polysyllabic. Thus, the purpose of this work is to formulate an algorithm for dividing Assamese words into syllables. IV. Methodology The methodologies followed in the present study are: a) Examine Assamese syllables structures from linguistic literature.[6] b) Group discussions with linguists. c) Dialect variations. d) Study of previous works performed [2]. V. Subjects Data was obtained from the Assamese Dictionary (Hemkosha) and also from the native speakers of the respective language and was syllabified by five different speakers between 20 and 25 years of age participated in the study. The entire test was done on two male speakers and three female speakers respectively. On the basis of these experiments almost all the syllable templates available in Assamese were found. About 5000 447 | P a g e ɦ
  • 3. Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450 monosyllabic, disyllabic and polysyllabic words were experimented to find out the syllable structure in Assamese. For the words of Assamese having different structure see the Appendix A. iii. iv. VI. Procedure The entire five subjects were asked to record the words of Assamese language, which were selected for contain almost all the syllable templates. The recording was done in a quiet room with a noise cancelling microphone using the recording facilities of a typical multimedia computer system. The boundaries of syllable were marked according to the pronunciation of the team. The voice was recorded and processed on Audacity and Cool Edit Pro. By observing formants through spectrogram, syllable structure and boundary were evaluated. The equipment used in recording was a microphone with a frequency response of 48000 Hz with a 16 bit sound card and high quality speakers. VII. Result The recording was done on 5000 distinct words extracted from a Assamese corpus and then compared with manual syllabification of the same words to measure accuracy. Heterogeneous nature of texts obtained from the News Paper, Feature Articles Text books, Radio news, short stories etc[2]. A list of distinct words of 5000 most frequently occurring words chosen for testing the algorithm. The 5000 words results some 20,655 syllables .The algorithm achieves an overall accuracy of 98.05% when compared with the same words manually syllabified by an expert. A. Syllable Structure Syllable structure of ASSAMESE phonemes of vowels (V) and consonants(C) are as follows:1. V 2. VC 3. CV 4. CVC 5. CVV 6. CCVC 7. CCV 8. VCC B. Syllabification rules An assumption is taken in this rule that diphthongs are considered as a single vowel unit. The syllabification rules developed after study are given below:i. The syllable boundary of a word having single vowel is at the end of the word. (V, CV) ii. For a word having CVCC* structure, then syllable boundary is marked after first vowel. ie CV/C, CV/CVV, CV/CV,CV/CVC etc www.ijera.com v. vi. vii. viii. ix. www.ijera.com For a word having VCV* structure , then syllable boundary is marked after SECOND vowel. ie VCV/, VCV/C, VCV/CV, VCV/CVCetc For a word having CVV* structure , then syllable boundary is marked after SECOND vowel. ie CVV/,CVV/C,CVV/CV,CVV/CVC,CVV/V etc For a word having VCC* structure , then syllable boundary is marked after second CONSONANT. ie VCC/VCV, VCC/V, VCC/VC etc For a word having CCV structure , then NO syllable boundary is marked. SYLLABIFICATION is only makred after completion of the word. ie CCV/ For a word having CCVC* structure , then syllable boundary is marked after first vowel. ie CCV/C, CCV/CV, CCV/CVC,CCV/CVV etc For a word having CVCV structure , then syllable boundary is marked after first vowel. ie CV/CV, CV/CVV etc For a word having CVVCX structure , then CHECK X A) If x=vowel(V) ie CVVCV, boundary will found after second vowel B) If x=consonant(C) ie CVVCC , boundary will found between the two vowels. C. Implementation All the above rules were implemented as a recursive function in a C++ environment which checks the presence of syllables in the sub word till the word boundary is reached. The words are processed from left to right. VIII. Algorithm In this section, the Bodo syllabification rules identified in the section (4.2) are presented in the form of a formal algorithm. The function syllabify() accepts an array of phonemes generated, along with a variable called current_index which is used to determine the position of the given array currently being processed by the algorithm. Initially the current_index variable will be initialized to 0. The syllabify() functionis called recursively until all phonemes in the array are processed. The function mark_syllable_boundary(position) will mark the syllable boundaries of an accepted array of phonemes. The other functions used within the syllabify () function are described below. i) total_vowels(phonemes): accepts an array of phonemes and returns the number of vowels contained in that array. 448 | P a g e
  • 4. Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450 ii) is_vowel(phoneme): accepts a phoneme and returns true if the given phoneme is a vowel. iii) total_vowels_bet_cons (): accepts a word with current position and returns the total number of vowels till the next consonant from current position. It also returns the length of the total vowels, length of the first vowel and length of the first consonant, after current position, as reference. iv) total_cons_bet_vowels () : accepts a word with Current position and returns the total number of Consonants till the next vowel from current position. It also returns the length of the first consonants, after current position, as reference. v) get_syllables () : accepts a word with current position, syllable boundary and returns the syllable. [3]. [4]. [5]. [6]. IX. Conclusion The above algorithm was tested on 5000 distinct Assamese Words, collected from Assamese corpus of dictionary (Hemkosha), short stories and news and compared it with manual syllabification we get the result of about 20,665 syllables with accuracy of 99%.This paper is presented an algorithm for dividing Assamese words into syllables. The algorithm is based on grammatical rules proposed in section (4.2) and does not require high computational cost. Syllabification is an important component of many speech and language processing system[4], and it is expected that this algorithm is going to be a significant contribution to the researchers working on various aspects of the Assamese language. X. Acknowledgement We would like to express my sincere gratitude and heartfelt thanks to my guide Prof. Pran Hari Talukdar, Professor, Department of Instrumentation and USIC, Gauhati University. we would not have been able to complete the research work and shape it in the form of the research paper without his consistent advice, and never ending enthusiasm, positivity, encouragement, support and understanding. we are very fortunate for having an opportunity to work with him from which I benefited enormously. we are also grateful to Gauhati University for giving me to access the data from GU_Galo_Adi corpus. [7]. [8]. [9]. [10]. [11]. [12]. [13]. [14]. [15]. www.ijera.com Rules for Bodo Language, International Journal Of Computational Engineering Research, vol. 2 issue. 6, pp 110-114, October 2012. Banikanta Kakati. Assamese, its formation and development. Ruvan Weerasinghe, Asanka Wasala and Kumudu Gamage. A Rule Based Syllabification Algorithm for Sinhala Parminder Singh, Gurpreet Singh Lehal. Syllables Selection for the Development of Speech Database for Punjabi TTS System, IJCSI International Journal of Computer Science Issues, vol. 7, Issue 6, November 2010. Nungleppam Gopil Singh, Purnendu Acharjee, Prof. P. H. Talukdar. An Improved Automatic Syllabification Rules for BODO Language. International Journal of Computing, Communications and Networking, vol. 1, No.3, NovemberDecember 2012. Jehangir zaman khan. Syllabification rules in pashto Heriberto Cuay´ahuitl. A Syllabification Algorithm for Spanish Upendra Nath Goswami. An Introduction to Assamese Hemchandra Barua. HEMKOSHA Couto I., Neto N., Tadaiesky, V. Klautau, A. Maia, R.2010. An open source HMMbased text-to-speech system for Brazilian Portuguese, in Proc. 7th International Telecommunications Symposium Manaus. Kevin Hock, Julian Smart, Stefan Csomor. Cross-Platform GUI Programming with Wxwidgets, Pearson edition, 2006. Juliette Blevins,The syllable in phonological theory,1995 George Kiraz and Bernd M¨obius. Multilingual syllabification using weighted finite-state transducers, in Proc. of the 3rd Workshop on Speech Synthesis, 1998. Robert Damper. Learning about speech from data: Beyond NETtalk, in DataDriven Techniques in Speech Synthesis, pp 1–25, 2001 References [1]. [2]. ChandanSarma, U.Sharma, C.K.Nath, S.Kalita, P.H.Talukdar. Selection of Units and Development of Speech Database for Natural Sounding Bodo TTS System, CISP Guwahati, March 2012. Jyotismita Talukdar, Chandan Sarma, Prof. P.H Talukdar. Automatic Syllabification www.ijera.com 449 | P a g e
  • 5. Laba Kr. Thakuria et al Int. Journal of Engineering Research and Applications ISSN : 2248-9622, Vol. 4, Issue 2( Version 1), February 2014, pp.446-450 www.ijera.com Appendix A: Word List Table 5 : Word list Assamese Structure Type এই ei V monosyllabic এক ek Vc monosyllabic এ e V monosyllabic ঊধ্ uudh Vc monosyllabic ঊণ্ uun Vc monosyllabic ঊস্ uux Vc monosyllabic ককছিল koisil cv/cvc Disyllabic ককছি koise cv/cv Disyllabic কৈছিল goisil cv/cvc Disyllabic কৈছিল thoisil cv/cvc Disyllabic কৈছি thoise cv/cv Disyllabic দূৰৈৈ duuroir cv/cvc Disyllabic কৈছি hoise cv/cv Disyllabic এৰৈৱা ekhoiwaa v/cv/cv Trisyllabic ককছিলা koisilaa cv/cv/cv Trisyllabic ককছিছল koisile cv/cv/cv Trisyllabic কৈণীছেক ত াৰলছক ghoiniiyek toloike cv/cv/cvc cv/cv/cv Trisyllabic Trisyllabic কদৱছ াৈ doiwajog cv/cv/cvc Trisyllabic এপৈৰলছক ত ছ োৰলছক www.ijera.com Roman eparaloike tetiyaaloike v/cv/cv/cv/cv cv/cv/cv/cv/cv polysyllabic polysyllabic 450 | P a g e