SlideShare una empresa de Scribd logo
1 de 7
Descargar para leer sin conexión
Asian Journal of Applied Science and Technology (AJAST)
Volume 7, Issue 1, Pages 132-138, January-March 2023
ISSN: 2456-883X
132
Artificial Intelligent Decoding of Rare Words in Natural Language Translation Using
Lexical Level Context
D. Samson1*
, Dr. Ignatius A Herman2
, Dr. Ibrahim Olanya Bwalya3
& Dr. Kitraj Penchinkilas4
1
Research Scholar, Gideon Robert University, Lusaka, Zambia. 2
Director-DMI Group of Institutions, Africa. 3
Professor, Gideon
Robert University, Lusaka, Zambia. 4
Department of Computer Science, Gideon Robert University, Lusaka, Zambia.
Corresponding Author (D. Samson) - swansamson@gmail.com*
DOI: https://doi.org/10.38177/ajast.2023.7112
Copyright: © 2023 D.Samson et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits
unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Article Received: 29 January 2023 Article Accepted: 28 February 2023 Article Published: 30 March 2023
░ Objective
(a) To build an end-to-end solution for the language translation using NMT;
(b) To detect the source sentence and target sentence rare words using FSR with hierarchical clustering and perform
embedding;
(c) To design and train CNN to estimate the sentence context as LTR;
(d) To obtain the internal vector representation by training the encoder model using source embedding [Seq-to-Vec
representation];
(e) To predict the target word by training the decoder model using target embedding [Vec-to-Seq representation].
░ 1. Introduction
As in the current scenario there exist several languages in the world. All the languages can’t be understood, so there
is a need for a language translator between the source and target language. Natural Language Translation, which is
a NLP task, is the process of converting source language into the target language using a Seq-Seq Encoder -
Decoder model [1]. A machine translator is a NLP task which solves the problem of taking the input sentence of the
source language and given to the Neural Machine Translator (NMT) for processing the language and gives the
output of the target language [2]. Neural machine translation (NMT) uses a model that considers the context of the
entire sentence to produce an accurate translation. In this method, an artificial neural network is used to calculate
the likelihood of a given word sequence. Using this approach, the translation process may proceed without a hitch
[3]. The modules required to build an end-to-end Encoder Decoder system for translation is described here. From
the figure 1 the high-level block diagram where the dataset of bilingual parallel English-German corpus is
ABSTRACT
Artificial intelligence based Machine Translation is a Natural Language Processing, attaining significant attention to fully automate the system that
can translate source language content into the target languages. The proposed method is a Neural Machine Translation, end-to-end system, with the
lexical level context for converting German sentences to English sentences. The Transformer based Machine Translation is used for translation for
handling the occurrence of rare words in the lexical level context. The FSR group rare words together and represent sentence level context as Latent
Topic Representation. WMT En-De bilingual parallel corpus is used for translation handling the Out of Vocabulary words using clustering of
<UNK> tags. In the existing method there is a mismatch of translation but the proposed system is more superior due to the sentence context inclusion.
The model performance is enhanced with hyper parameter optimization obtaining a BLEU score with a better translation of source to target language.
Finally minimizing the TER score to attain a better translation rate.
Keywords: Artificial intelligence; Natural language translation; Natural language processing; Machine learning.
Asian Journal of Applied Science and Technology (AJAST)
Volume 7, Issue 1, Pages 132-138, January-March 2023
ISSN: 2456-883X
133
preprocessed to get a clean sentence by removing special characters and punctuation marks. The cleaned dataset is
splitted into separate files of German sentences and English sentences. The words which are OOV are given to the
FSR most commonly related together under a universal <UNK> tag [4].
Figure 1. High level block diagram
The embedding module converts the sentence sequence into its corresponding vector representation. The positional
encoding module uses sinusoidal functions to indicate the location of the words in the sentence at both source
language and target side language [5]. The context of the sentence is handled as Latent Topic Representation which
uses CNN to extract the context of the sentence [6-12]. The transformer encoder and decoder which uses a
self-attention mechanism to translate. Finally, the decoded vector is passed to the linear classifier and softmax
functions. Finally, the google translator is integrated into our NMT system to check the correctness of the predicted
sentence using the sentence prediction module.
░ 2. Proposed Method
2.1. Dataset description
Dataset Name - WMT’14
Accessibility - Free
Modality - Text sentences
2.2. Description
Train (4.5M sentence pairs): [train.en] [train.de]
Test:[newstest2012.en],[newstest2012.de],[newstest2013.en],[newstest2013.de],
[newstest2014.en],[newstest2014.de],[newstest2015. en ],[newstest2015.de]
Vocabularies (top 50K frequent words): [vocab.50K.en] [vocab.50K.de]
Asian Journal of Applied Science and Technology (AJAST)
Volume 7, Issue 1, Pages 132-138, January-March 2023
ISSN: 2456-883X
134
Dictionary (extracted from alignment data): [dict.en-de]
2.3. Experimental Setup
The proposed model is a Transformer based Neural Machine Translation with Self Attention Mechanism which is
added at both the Encoder and Decoder layer for translating the source language sentence (German) to the target
language sentence (English). The objective is to build an end to end system for Language Translation using NMT.
From the dataset 50K sentence pairs are taken and split in the ratio of 80:20 for the training and testing purpose
respectively.
A. Pre-Processing
Data preprocessing is a vital phase in the data mining process. The preprocessing module takes care of the
representation and quality of data firs t and foremost before running any analysis. The translation corpus includes
data loading, data cleaning and data splitting.
B. Data Loading
The dataset is loaded in google drive which is zipped file. The path to the file should be mentioned for downloading
the data and unzipping ‘En-De.zip’ file and extracting Bilingual corpus Deu.txt file.
C. Data Cleaning
The file contains punctuations, uppercase, lowercase and special German characters which is not of much
importance for translation purposes so it can be removed using the Regular Expression. The file is converted from
Unicode to ASCII characters for each word in a sentence. Finally <start> and <end> tags are added for every
sentence which indicates the start and end of Encoder and Decoder respectively.
D. Data Splitting
The cleaned data of the corpus is stripped to break the bilingual corpus line by line. Every line contains the English
and its corresponding German sentence. The next step is to split the bilingual sentences into English and German
sentences separately. German sentences are given to the encoder and English sentences are given to the decoder.
The training set with sentence length less than or equal to 50 is taken for consideration at both source and target
files. Padding is done to make all sentences of the same length so that encoding and decoding becomes easier.
E. Word Embedding
INPUT: English Sentence (Input Embedding)
German Sentences (Output Embedding) OUTPUT: Word vector representation The embedding module is
responsible for SeqtoVec conversion which specifically uses the Word2Vec model for this purpose. The layers of
the Neural Network are dense and fully connected to attain better translation prediction.
The input layer node consists of every word in a sentence which is a vector and based on the predefined weights
from CBOW model is used of size VxN for each word with C words in a sentence. The hidden layer is the weighted
sum of input nodes and its corresponding weights with a linear activation function and output layer gives the vectors
of a sentence which uses sigmoid activation function.
Asian Journal of Applied Science and Technology (AJAST)
Volume 7, Issue 1, Pages 132-138, January-March 2023
ISSN: 2456-883X
135
Figure 2. Layer Representation of Word Embedding
The results obtained from the Neural Machine Translation for German to English Conversion is discussed in this
paper. The preprocessing, word tokenizer and the translation sentences are added as the snapshots. The Google
Translate module is integrated for the checking correct translation of the German sentences to its corresponding
English sentences. The Model is trained by limiting the corpus to 50000 sentence examples.
Figure 3. Preprocessing Steps for English German Dataset Training Time
The total time taken by the batches per epoch is tabulated below. Each epoch has 6 batches of size 100 to be trained
and the sum of all the time of batches per epoch is calculated. The GPU with RAM of 25 GB is used to run the
model and the time taken per epoch is given in seconds (sec). The training time for every 5 epochs increases by a
slight margin. From the training time for 50000 training sentences the time for every epoch increases slightly and
the average time taken for every epoch is almost 45 seconds. To minimize the time per epoch, a faster GPU with
increased RAM size is needed.
Asian Journal of Applied Science and Technology (AJAST)
Volume 7, Issue 1, Pages 132-138, January-March 2023
ISSN: 2456-883X
136
░ 3. Performance Analysis
Figure 4. Bar Plot for Loss and Accuracy with Epochs
Table 1. Machine translation test cases
Test Cases Input (German) Actual Output
Expected
Output
Pass/Fail BLEU Score
T1 mir gehts gut. I’m good. i m fine. Pass 0.93
T2 folge mir. Follow me. follow me. Pass 0.99
T3
das scheint der
plan zu sein.
That seems to
be the plan.
it looks like half. Fail 0.18
T4
ich habe
gewonnen!
I have won! i won! Pass 0.85
Figure 5. Graph for Training Loss Score Vs Epochs
The performance of the system is assessed based on the BLEU score and the TER score. The aim is to maximize the
BLEU score so that the predicted translation is closely related to the reference translation (human interference) and
Asian Journal of Applied Science and Technology (AJAST)
Volume 7, Issue 1, Pages 132-138, January-March 2023
ISSN: 2456-883X
137
to minimize the TER score so that the error in the translation becomes less. For sample 5 test cases are evaluated
using the above two scores and this can be extended to the entire corpus if required.
░ 4. Conclusion
In this work, building an end to end Language Translation system which uses a Transformer based NMT to convert
German Language sentences to English Language sentences. The context of a sentence is taken into account so that
the target sentence translation will be of the same meaning of the source sentence. The Self Attention mechanism
used in the transformer predicts a better translated sentence. The rare words which are OOV in the sentence are
handed using FSR with the help of hierarchical clustering which is integrated with the transformer model. The
context of the sentence acts as the LTR which uses CNN is also integrated within the system. The Sequence to
Sequence system for the dataset takes a significant time for modelling the system. The experiments indicated that
the proposed NMT system achieves good performance with a better BLEU score compared to conventional MT
systems and minimizes the TER significantly.
Declarations
Source of Funding
This research work did not receive any grant from funding agencies in the public or not-for-profit sectors.
Competing Interests Statement
The authors declare the total absence of conflicts of interest, both during the conduct of the research and during the
written drafting of this work.
Consent for Publication
The authors declare that they consented to the publication of this research work.
References
[1] Samant, Rahul Manohar, Mrinal R. Bachute, Shilpa Gite, and Ketan Kotecha (2022). Framework for deep
learning-based language models using multi-task learning in natural language understanding: A systematic
literature review and future directions. IEEE Access, 10: 17078-17097.
[2] Tonja, Atnafu Lambebo, Olga Kolesnikova, Muhammad Arif, Alexander Gelbukh, and Grigori Sidorov (2022).
Improving neural machine translation for low resource languages using mixed training: The case of ethiopian
languages. In Advances in Computational Intelligence: 21st Mexican International Conference on Artificial
Intelligence, MICAI 2022, Monterrey, Mexico, October 24–29, 2022, Proceedings, Part II, pp. 30-40, Cham:
Springer Nature, Switzerland.
[3] Bharathi, S., & T. Ananth Kumar (2020). Translation its results and insinuation in language learning. PalArch's
Journal of Archaeology of Egypt/Egyptology, 17(9): 5081-5090.
[4] Heafield, Kenneth, Elaine Farrow, Jelmer Van der Linde, Gema Ramírez‐Sánchez, and Dion Wiggins (2022).
The EuroPat Corpus: A Parallel Corpus of European Patent Data. In Proceedings of the Thirteenth Language
Resources and Evaluation Conference, Pages 732-740.
Asian Journal of Applied Science and Technology (AJAST)
Volume 7, Issue 1, Pages 132-138, January-March 2023
ISSN: 2456-883X
138
[5] Baniata, Laith H., Sangwoo Kang, and Isaac KE Ampomah (2022). A Reverse Positional Encoding Multi-Head
Attention-Based Neural Machine Translation Model for Arabic Dialects. Mathematics, 10(19): 3666.
[6] Venugopalan, Manju, and Deepa Gupta (2022). An enhanced guided LDA model augmented with BERT based
semantic strength for aspect term extraction in sentiment analysis. Knowledge-based systems, 246: 108668.
[7] Kumar, K. Suresh, A. S. Radhamani, and T. Ananth Kumar (2022). Sentiment lexicon for cross-domain
adaptation with multi-domain dataset in Indian languages enhanced with BERT classification model. Journal of
Intelligent & Fuzzy Systems, Preprint, Pages 1-18.
[8] Chitnis, Rohan, and John DeNero (2015). Variable-length word encodings for neural translation models. In
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Pages 2088-2093.
[9] Uddin, Farid, Yibo Chen, Zuping Zhang, and Xin Huang (2022). Corpus Statistics Empowered Document
Classification. Electronics, 11(14): 2168.
[10] Sun, Haipeng, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao (2020).
Unsupervised neural machine translation with cross-lingual language representation agreement. IEEE/ACM
Transactions on Audio, Speech, and Language Processing, 28: 1170-1182.
[11] Geng, Xinwei, Longyue Wang, Xing Wang, Mingtao Yang, Xiaocheng Feng, Bing Qin, and Zhaopeng Tu
(2022). Learning to refine source representations for neural machine translation. International Journal of Machine
Learning and Cybernetics, 13(8): 2199-2212.
[12] Lin, Yan, Huaiyu Wan, Shengnan Guo, and Youfang Lin (2021). Pre-training context and time aware location
embeddings from spatial-temporal trajectories for user next location prediction. In Proceedings of the AAAI
Conference on Artificial Intelligence, 35(5): 4241-4248.

Más contenido relacionado

Similar a Artificial Intelligent Decoding of Rare Words in Natural Language Translation Using Lexical Level Context

EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...ijnlc
 
Improving the role of language model in statistical machine translation (Indo...
Improving the role of language model in statistical machine translation (Indo...Improving the role of language model in statistical machine translation (Indo...
Improving the role of language model in statistical machine translation (Indo...IJECEIAES
 
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET Journal
 
The first FOSD-tacotron-2-based text-to-speech application for Vietnamese
The first FOSD-tacotron-2-based text-to-speech application for VietnameseThe first FOSD-tacotron-2-based text-to-speech application for Vietnamese
The first FOSD-tacotron-2-based text-to-speech application for VietnamesejournalBEEI
 
A study on the techniques for speech to speech translation
A study on the techniques for speech to speech translationA study on the techniques for speech to speech translation
A study on the techniques for speech to speech translationIRJET Journal
 
IRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine LearningIRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine LearningIRJET Journal
 
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGESOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGEIJCI JOURNAL
 
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGESOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGEIJCI JOURNAL
 
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKSTEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKSijdms
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...IRJET Journal
 
Recent Trends in Translation of Programming Languages using NLP Approaches
Recent Trends in Translation of Programming Languages using NLP ApproachesRecent Trends in Translation of Programming Languages using NLP Approaches
Recent Trends in Translation of Programming Languages using NLP ApproachesIRJET Journal
 
An Efficient Approach to Produce Source Code by Interpreting Algorithm
An Efficient Approach to Produce Source Code by Interpreting AlgorithmAn Efficient Approach to Produce Source Code by Interpreting Algorithm
An Efficient Approach to Produce Source Code by Interpreting AlgorithmIRJET Journal
 
Automatic keyword extraction.pptx
Automatic keyword extraction.pptxAutomatic keyword extraction.pptx
Automatic keyword extraction.pptxBiswarupDas18
 
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKSTEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKSijdms
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech TranslationIRJET Journal
 
EXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODEL
EXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODELEXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODEL
EXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODELijaia
 
Overlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammarsOverlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammarsIAEME Publication
 
IRJET- On-Screen Translator using NLP and Text Detection
IRJET- On-Screen Translator using NLP and Text DetectionIRJET- On-Screen Translator using NLP and Text Detection
IRJET- On-Screen Translator using NLP and Text DetectionIRJET Journal
 

Similar a Artificial Intelligent Decoding of Rare Words in Natural Language Translation Using Lexical Level Context (20)

[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
 
Moses
MosesMoses
Moses
 
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
 
Improving the role of language model in statistical machine translation (Indo...
Improving the role of language model in statistical machine translation (Indo...Improving the role of language model in statistical machine translation (Indo...
Improving the role of language model in statistical machine translation (Indo...
 
IRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation SystemIRJET- Speech to Speech Translation System
IRJET- Speech to Speech Translation System
 
The first FOSD-tacotron-2-based text-to-speech application for Vietnamese
The first FOSD-tacotron-2-based text-to-speech application for VietnameseThe first FOSD-tacotron-2-based text-to-speech application for Vietnamese
The first FOSD-tacotron-2-based text-to-speech application for Vietnamese
 
A study on the techniques for speech to speech translation
A study on the techniques for speech to speech translationA study on the techniques for speech to speech translation
A study on the techniques for speech to speech translation
 
IRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine LearningIRJET - Pseudocode to Python Translation using Machine Learning
IRJET - Pseudocode to Python Translation using Machine Learning
 
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGESOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
 
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGESOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
SOFTWARE TOOL FOR TRANSLATING PSEUDOCODE TO A PROGRAMMING LANGUAGE
 
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKSTEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
 
Recent Trends in Translation of Programming Languages using NLP Approaches
Recent Trends in Translation of Programming Languages using NLP ApproachesRecent Trends in Translation of Programming Languages using NLP Approaches
Recent Trends in Translation of Programming Languages using NLP Approaches
 
An Efficient Approach to Produce Source Code by Interpreting Algorithm
An Efficient Approach to Produce Source Code by Interpreting AlgorithmAn Efficient Approach to Produce Source Code by Interpreting Algorithm
An Efficient Approach to Produce Source Code by Interpreting Algorithm
 
Automatic keyword extraction.pptx
Automatic keyword extraction.pptxAutomatic keyword extraction.pptx
Automatic keyword extraction.pptx
 
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKSTEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
TEXT ADVERTISEMENTS ANALYSIS USING CONVOLUTIONAL NEURAL NETWORKS
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
 
EXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODEL
EXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODELEXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODEL
EXTRACTIVE SUMMARIZATION WITH VERY DEEP PRETRAINED LANGUAGE MODEL
 
Overlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammarsOverlapping optimization with parsing through metagrammars
Overlapping optimization with parsing through metagrammars
 
IRJET- On-Screen Translator using NLP and Text Detection
IRJET- On-Screen Translator using NLP and Text DetectionIRJET- On-Screen Translator using NLP and Text Detection
IRJET- On-Screen Translator using NLP and Text Detection
 

Más de IIJSRJournal

Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...
Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...
Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...IIJSRJournal
 
Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...
Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...
Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...IIJSRJournal
 
Agriculture in Indian Economy and Contribution of Science and Technology
Agriculture in Indian Economy and Contribution of Science and Technology  Agriculture in Indian Economy and Contribution of Science and Technology
Agriculture in Indian Economy and Contribution of Science and Technology IIJSRJournal
 
The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...
The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...
The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...IIJSRJournal
 
Basic Criteria for Building the Third Renaissance in Uzbekistan
Basic Criteria for Building the Third Renaissance in Uzbekistan Basic Criteria for Building the Third Renaissance in Uzbekistan
Basic Criteria for Building the Third Renaissance in Uzbekistan IIJSRJournal
 
Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...
Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...
Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...IIJSRJournal
 
Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...
Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...
Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...IIJSRJournal
 
Factors Influencing Professional Project Management Ethical Practices in Buil...
Factors Influencing Professional Project Management Ethical Practices in Buil...Factors Influencing Professional Project Management Ethical Practices in Buil...
Factors Influencing Professional Project Management Ethical Practices in Buil...IIJSRJournal
 
Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...
Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...
Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...IIJSRJournal
 
Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...
Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...
Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...IIJSRJournal
 
On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...
On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...
On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...IIJSRJournal
 
Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...
Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...
Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...IIJSRJournal
 
Machine Learning Based House Price Prediction Using Modified Extreme Boosting
Machine Learning Based House Price Prediction Using Modified Extreme Boosting Machine Learning Based House Price Prediction Using Modified Extreme Boosting
Machine Learning Based House Price Prediction Using Modified Extreme Boosting IIJSRJournal
 
Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...
Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...
Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...IIJSRJournal
 
Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...
Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...
Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...IIJSRJournal
 
Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals
Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals
Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals IIJSRJournal
 
Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...
Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...
Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...IIJSRJournal
 
Comparison of Glucose in Urine with Likening of Pigeons as Pets
Comparison of Glucose in Urine with Likening of Pigeons as Pets Comparison of Glucose in Urine with Likening of Pigeons as Pets
Comparison of Glucose in Urine with Likening of Pigeons as Pets IIJSRJournal
 
Non-unique Fixed Points of Self Mappings in Bi-metric Spaces
Non-unique Fixed Points of Self Mappings in Bi-metric Spaces Non-unique Fixed Points of Self Mappings in Bi-metric Spaces
Non-unique Fixed Points of Self Mappings in Bi-metric Spaces IIJSRJournal
 
Research on the Impact of Short-Form Video Advertising on Social Media on the...
Research on the Impact of Short-Form Video Advertising on Social Media on the...Research on the Impact of Short-Form Video Advertising on Social Media on the...
Research on the Impact of Short-Form Video Advertising on Social Media on the...IIJSRJournal
 

Más de IIJSRJournal (20)

Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...
Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...
Analysis of Artificial Intelligence Techniques for Network Intrusion Detectio...
 
Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...
Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...
Methodologies for Enhancing Data Integrity and Security in Distributed Cloud ...
 
Agriculture in Indian Economy and Contribution of Science and Technology
Agriculture in Indian Economy and Contribution of Science and Technology  Agriculture in Indian Economy and Contribution of Science and Technology
Agriculture in Indian Economy and Contribution of Science and Technology
 
The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...
The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...
The Effect of Kronecker Tensor Product Values on ECG Rates: A Study on Savitz...
 
Basic Criteria for Building the Third Renaissance in Uzbekistan
Basic Criteria for Building the Third Renaissance in Uzbekistan Basic Criteria for Building the Third Renaissance in Uzbekistan
Basic Criteria for Building the Third Renaissance in Uzbekistan
 
Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...
Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...
Assessment of Neglected and Under-Utilized Crop Species of African Horned Mel...
 
Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...
Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...
Prevalence of Trichomonas Vaginalis Infection Among Married Pregnant Women in...
 
Factors Influencing Professional Project Management Ethical Practices in Buil...
Factors Influencing Professional Project Management Ethical Practices in Buil...Factors Influencing Professional Project Management Ethical Practices in Buil...
Factors Influencing Professional Project Management Ethical Practices in Buil...
 
Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...
Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...
Assessment of Water Occupancy Rates of the Çamlıgöze Dam Lake between 2010-20...
 
Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...
Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...
Sustainability of Pod Yields of Groundnut through Crop Seasonal Rainfall, Len...
 
On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...
On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...
On the Modulation of Biocompatibility of Hydrogels with Collagen and Guar Gum...
 
Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...
Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...
Incorporation of Se (IV) Complexes based on Amino Acids in Biomatrixes in Hyd...
 
Machine Learning Based House Price Prediction Using Modified Extreme Boosting
Machine Learning Based House Price Prediction Using Modified Extreme Boosting Machine Learning Based House Price Prediction Using Modified Extreme Boosting
Machine Learning Based House Price Prediction Using Modified Extreme Boosting
 
Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...
Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...
Preliminary Evaluation on Vegetative of Rambutan (Nephelium lappaceum) in San...
 
Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...
Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...
Analysis of Physicochemical and Microbiological Parameters of Wine Produced f...
 
Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals
Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals
Cohesive and Thermal Properties of Sodium Cyanide-Halide Mixed Crystals
 
Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...
Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...
Discussion on Analysis of Effects of Short-Form Video Advertising on the Purc...
 
Comparison of Glucose in Urine with Likening of Pigeons as Pets
Comparison of Glucose in Urine with Likening of Pigeons as Pets Comparison of Glucose in Urine with Likening of Pigeons as Pets
Comparison of Glucose in Urine with Likening of Pigeons as Pets
 
Non-unique Fixed Points of Self Mappings in Bi-metric Spaces
Non-unique Fixed Points of Self Mappings in Bi-metric Spaces Non-unique Fixed Points of Self Mappings in Bi-metric Spaces
Non-unique Fixed Points of Self Mappings in Bi-metric Spaces
 
Research on the Impact of Short-Form Video Advertising on Social Media on the...
Research on the Impact of Short-Form Video Advertising on Social Media on the...Research on the Impact of Short-Form Video Advertising on Social Media on the...
Research on the Impact of Short-Form Video Advertising on Social Media on the...
 

Último

Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoordharasingh5698
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptMsecMca
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01KreezheaRecto
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756dollysharma2066
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Intro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdfIntro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdfrs7054576148
 

Último (20)

Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01Double rodded leveling 1 pdf activity 01
Double rodded leveling 1 pdf activity 01
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Intro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdfIntro To Electric Vehicles PDF Notes.pdf
Intro To Electric Vehicles PDF Notes.pdf
 

Artificial Intelligent Decoding of Rare Words in Natural Language Translation Using Lexical Level Context

  • 1. Asian Journal of Applied Science and Technology (AJAST) Volume 7, Issue 1, Pages 132-138, January-March 2023 ISSN: 2456-883X 132 Artificial Intelligent Decoding of Rare Words in Natural Language Translation Using Lexical Level Context D. Samson1* , Dr. Ignatius A Herman2 , Dr. Ibrahim Olanya Bwalya3 & Dr. Kitraj Penchinkilas4 1 Research Scholar, Gideon Robert University, Lusaka, Zambia. 2 Director-DMI Group of Institutions, Africa. 3 Professor, Gideon Robert University, Lusaka, Zambia. 4 Department of Computer Science, Gideon Robert University, Lusaka, Zambia. Corresponding Author (D. Samson) - swansamson@gmail.com* DOI: https://doi.org/10.38177/ajast.2023.7112 Copyright: © 2023 D.Samson et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Article Received: 29 January 2023 Article Accepted: 28 February 2023 Article Published: 30 March 2023 ░ Objective (a) To build an end-to-end solution for the language translation using NMT; (b) To detect the source sentence and target sentence rare words using FSR with hierarchical clustering and perform embedding; (c) To design and train CNN to estimate the sentence context as LTR; (d) To obtain the internal vector representation by training the encoder model using source embedding [Seq-to-Vec representation]; (e) To predict the target word by training the decoder model using target embedding [Vec-to-Seq representation]. ░ 1. Introduction As in the current scenario there exist several languages in the world. All the languages can’t be understood, so there is a need for a language translator between the source and target language. Natural Language Translation, which is a NLP task, is the process of converting source language into the target language using a Seq-Seq Encoder - Decoder model [1]. A machine translator is a NLP task which solves the problem of taking the input sentence of the source language and given to the Neural Machine Translator (NMT) for processing the language and gives the output of the target language [2]. Neural machine translation (NMT) uses a model that considers the context of the entire sentence to produce an accurate translation. In this method, an artificial neural network is used to calculate the likelihood of a given word sequence. Using this approach, the translation process may proceed without a hitch [3]. The modules required to build an end-to-end Encoder Decoder system for translation is described here. From the figure 1 the high-level block diagram where the dataset of bilingual parallel English-German corpus is ABSTRACT Artificial intelligence based Machine Translation is a Natural Language Processing, attaining significant attention to fully automate the system that can translate source language content into the target languages. The proposed method is a Neural Machine Translation, end-to-end system, with the lexical level context for converting German sentences to English sentences. The Transformer based Machine Translation is used for translation for handling the occurrence of rare words in the lexical level context. The FSR group rare words together and represent sentence level context as Latent Topic Representation. WMT En-De bilingual parallel corpus is used for translation handling the Out of Vocabulary words using clustering of <UNK> tags. In the existing method there is a mismatch of translation but the proposed system is more superior due to the sentence context inclusion. The model performance is enhanced with hyper parameter optimization obtaining a BLEU score with a better translation of source to target language. Finally minimizing the TER score to attain a better translation rate. Keywords: Artificial intelligence; Natural language translation; Natural language processing; Machine learning.
  • 2. Asian Journal of Applied Science and Technology (AJAST) Volume 7, Issue 1, Pages 132-138, January-March 2023 ISSN: 2456-883X 133 preprocessed to get a clean sentence by removing special characters and punctuation marks. The cleaned dataset is splitted into separate files of German sentences and English sentences. The words which are OOV are given to the FSR most commonly related together under a universal <UNK> tag [4]. Figure 1. High level block diagram The embedding module converts the sentence sequence into its corresponding vector representation. The positional encoding module uses sinusoidal functions to indicate the location of the words in the sentence at both source language and target side language [5]. The context of the sentence is handled as Latent Topic Representation which uses CNN to extract the context of the sentence [6-12]. The transformer encoder and decoder which uses a self-attention mechanism to translate. Finally, the decoded vector is passed to the linear classifier and softmax functions. Finally, the google translator is integrated into our NMT system to check the correctness of the predicted sentence using the sentence prediction module. ░ 2. Proposed Method 2.1. Dataset description Dataset Name - WMT’14 Accessibility - Free Modality - Text sentences 2.2. Description Train (4.5M sentence pairs): [train.en] [train.de] Test:[newstest2012.en],[newstest2012.de],[newstest2013.en],[newstest2013.de], [newstest2014.en],[newstest2014.de],[newstest2015. en ],[newstest2015.de] Vocabularies (top 50K frequent words): [vocab.50K.en] [vocab.50K.de]
  • 3. Asian Journal of Applied Science and Technology (AJAST) Volume 7, Issue 1, Pages 132-138, January-March 2023 ISSN: 2456-883X 134 Dictionary (extracted from alignment data): [dict.en-de] 2.3. Experimental Setup The proposed model is a Transformer based Neural Machine Translation with Self Attention Mechanism which is added at both the Encoder and Decoder layer for translating the source language sentence (German) to the target language sentence (English). The objective is to build an end to end system for Language Translation using NMT. From the dataset 50K sentence pairs are taken and split in the ratio of 80:20 for the training and testing purpose respectively. A. Pre-Processing Data preprocessing is a vital phase in the data mining process. The preprocessing module takes care of the representation and quality of data firs t and foremost before running any analysis. The translation corpus includes data loading, data cleaning and data splitting. B. Data Loading The dataset is loaded in google drive which is zipped file. The path to the file should be mentioned for downloading the data and unzipping ‘En-De.zip’ file and extracting Bilingual corpus Deu.txt file. C. Data Cleaning The file contains punctuations, uppercase, lowercase and special German characters which is not of much importance for translation purposes so it can be removed using the Regular Expression. The file is converted from Unicode to ASCII characters for each word in a sentence. Finally <start> and <end> tags are added for every sentence which indicates the start and end of Encoder and Decoder respectively. D. Data Splitting The cleaned data of the corpus is stripped to break the bilingual corpus line by line. Every line contains the English and its corresponding German sentence. The next step is to split the bilingual sentences into English and German sentences separately. German sentences are given to the encoder and English sentences are given to the decoder. The training set with sentence length less than or equal to 50 is taken for consideration at both source and target files. Padding is done to make all sentences of the same length so that encoding and decoding becomes easier. E. Word Embedding INPUT: English Sentence (Input Embedding) German Sentences (Output Embedding) OUTPUT: Word vector representation The embedding module is responsible for SeqtoVec conversion which specifically uses the Word2Vec model for this purpose. The layers of the Neural Network are dense and fully connected to attain better translation prediction. The input layer node consists of every word in a sentence which is a vector and based on the predefined weights from CBOW model is used of size VxN for each word with C words in a sentence. The hidden layer is the weighted sum of input nodes and its corresponding weights with a linear activation function and output layer gives the vectors of a sentence which uses sigmoid activation function.
  • 4. Asian Journal of Applied Science and Technology (AJAST) Volume 7, Issue 1, Pages 132-138, January-March 2023 ISSN: 2456-883X 135 Figure 2. Layer Representation of Word Embedding The results obtained from the Neural Machine Translation for German to English Conversion is discussed in this paper. The preprocessing, word tokenizer and the translation sentences are added as the snapshots. The Google Translate module is integrated for the checking correct translation of the German sentences to its corresponding English sentences. The Model is trained by limiting the corpus to 50000 sentence examples. Figure 3. Preprocessing Steps for English German Dataset Training Time The total time taken by the batches per epoch is tabulated below. Each epoch has 6 batches of size 100 to be trained and the sum of all the time of batches per epoch is calculated. The GPU with RAM of 25 GB is used to run the model and the time taken per epoch is given in seconds (sec). The training time for every 5 epochs increases by a slight margin. From the training time for 50000 training sentences the time for every epoch increases slightly and the average time taken for every epoch is almost 45 seconds. To minimize the time per epoch, a faster GPU with increased RAM size is needed.
  • 5. Asian Journal of Applied Science and Technology (AJAST) Volume 7, Issue 1, Pages 132-138, January-March 2023 ISSN: 2456-883X 136 ░ 3. Performance Analysis Figure 4. Bar Plot for Loss and Accuracy with Epochs Table 1. Machine translation test cases Test Cases Input (German) Actual Output Expected Output Pass/Fail BLEU Score T1 mir gehts gut. I’m good. i m fine. Pass 0.93 T2 folge mir. Follow me. follow me. Pass 0.99 T3 das scheint der plan zu sein. That seems to be the plan. it looks like half. Fail 0.18 T4 ich habe gewonnen! I have won! i won! Pass 0.85 Figure 5. Graph for Training Loss Score Vs Epochs The performance of the system is assessed based on the BLEU score and the TER score. The aim is to maximize the BLEU score so that the predicted translation is closely related to the reference translation (human interference) and
  • 6. Asian Journal of Applied Science and Technology (AJAST) Volume 7, Issue 1, Pages 132-138, January-March 2023 ISSN: 2456-883X 137 to minimize the TER score so that the error in the translation becomes less. For sample 5 test cases are evaluated using the above two scores and this can be extended to the entire corpus if required. ░ 4. Conclusion In this work, building an end to end Language Translation system which uses a Transformer based NMT to convert German Language sentences to English Language sentences. The context of a sentence is taken into account so that the target sentence translation will be of the same meaning of the source sentence. The Self Attention mechanism used in the transformer predicts a better translated sentence. The rare words which are OOV in the sentence are handed using FSR with the help of hierarchical clustering which is integrated with the transformer model. The context of the sentence acts as the LTR which uses CNN is also integrated within the system. The Sequence to Sequence system for the dataset takes a significant time for modelling the system. The experiments indicated that the proposed NMT system achieves good performance with a better BLEU score compared to conventional MT systems and minimizes the TER significantly. Declarations Source of Funding This research work did not receive any grant from funding agencies in the public or not-for-profit sectors. Competing Interests Statement The authors declare the total absence of conflicts of interest, both during the conduct of the research and during the written drafting of this work. Consent for Publication The authors declare that they consented to the publication of this research work. References [1] Samant, Rahul Manohar, Mrinal R. Bachute, Shilpa Gite, and Ketan Kotecha (2022). Framework for deep learning-based language models using multi-task learning in natural language understanding: A systematic literature review and future directions. IEEE Access, 10: 17078-17097. [2] Tonja, Atnafu Lambebo, Olga Kolesnikova, Muhammad Arif, Alexander Gelbukh, and Grigori Sidorov (2022). Improving neural machine translation for low resource languages using mixed training: The case of ethiopian languages. In Advances in Computational Intelligence: 21st Mexican International Conference on Artificial Intelligence, MICAI 2022, Monterrey, Mexico, October 24–29, 2022, Proceedings, Part II, pp. 30-40, Cham: Springer Nature, Switzerland. [3] Bharathi, S., & T. Ananth Kumar (2020). Translation its results and insinuation in language learning. PalArch's Journal of Archaeology of Egypt/Egyptology, 17(9): 5081-5090. [4] Heafield, Kenneth, Elaine Farrow, Jelmer Van der Linde, Gema Ramírez‐Sánchez, and Dion Wiggins (2022). The EuroPat Corpus: A Parallel Corpus of European Patent Data. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, Pages 732-740.
  • 7. Asian Journal of Applied Science and Technology (AJAST) Volume 7, Issue 1, Pages 132-138, January-March 2023 ISSN: 2456-883X 138 [5] Baniata, Laith H., Sangwoo Kang, and Isaac KE Ampomah (2022). A Reverse Positional Encoding Multi-Head Attention-Based Neural Machine Translation Model for Arabic Dialects. Mathematics, 10(19): 3666. [6] Venugopalan, Manju, and Deepa Gupta (2022). An enhanced guided LDA model augmented with BERT based semantic strength for aspect term extraction in sentiment analysis. Knowledge-based systems, 246: 108668. [7] Kumar, K. Suresh, A. S. Radhamani, and T. Ananth Kumar (2022). Sentiment lexicon for cross-domain adaptation with multi-domain dataset in Indian languages enhanced with BERT classification model. Journal of Intelligent & Fuzzy Systems, Preprint, Pages 1-18. [8] Chitnis, Rohan, and John DeNero (2015). Variable-length word encodings for neural translation models. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Pages 2088-2093. [9] Uddin, Farid, Yibo Chen, Zuping Zhang, and Xin Huang (2022). Corpus Statistics Empowered Document Classification. Electronics, 11(14): 2168. [10] Sun, Haipeng, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao (2020). Unsupervised neural machine translation with cross-lingual language representation agreement. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28: 1170-1182. [11] Geng, Xinwei, Longyue Wang, Xing Wang, Mingtao Yang, Xiaocheng Feng, Bing Qin, and Zhaopeng Tu (2022). Learning to refine source representations for neural machine translation. International Journal of Machine Learning and Cybernetics, 13(8): 2199-2212. [12] Lin, Yan, Huaiyu Wan, Shengnan Guo, and Youfang Lin (2021). Pre-training context and time aware location embeddings from spatial-temporal trajectories for user next location prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, 35(5): 4241-4248.