Question Answering on
Romanian, English and
French Languages
"Al. I. Cuza" University of Iași, Romania
Faculty of Computer Science
 Introduction
 System components
◦ Questions analysis
◦ Index creation and information retrieval
◦ Answer extraction
 Results
 Application of QA system
◦ eLearning
◦ Robotics
◦ CriES 2010
 Conclusions
 Our group has participated in CLEF exercises since 2006:
◦ 2006 – Ro–En (English collection) – 9.47% right answers
◦ 2007 – Ro–Ro (Romanian Wikipedia) – 12%
◦ 2008 – Ro–Ro (Romanian Wikipedia) – 31%
◦ 2009 – Ro–Ro, En–En (JRC-Acquis) – 47.2% (48.6%)
◦ 2010 – Ro–Ro, En–En, Fr–Fr (JRC-Acquis, Europarl) – 47.5% (42.5%, 27%)
System architecture (diagram):
◦ Resources: JRC-Acquis corpus, EUROPARL corpus, Romanian grammar
◦ Initial questions → Question analysis (tokenization and lemmatization; focus, keyword and named entity identification; question classification) → Lucene queries
◦ Corpora → Lucene index
◦ Lucene queries + Lucene index → Information retrieval → Relevant snippets
◦ Relevant snippets → Definition answer extraction, Reason answer extraction, Other answer extraction → Final answers
Q1: What percentage of people in Italy relies on television for
information?
<q q_id="0001" source_lang="EN" target_lang="RO">
<string>Ce procent al populaţiei din Italia contează pe televiziune
pentru a obţine informaţii</string>
<focus>procent</focus>
<verb>contează obţine</verb>
<noun>populaţiei televiziune informaţii</noun>
<nameEntities>Italia</nameEntities>
<luceneQuery>procent~0.7 populaţiei~0.7 Italia^3 (contează^2
conta) televiziune~0.7 obţine informaţii~0.7 </luceneQuery>
<questionType>FACTOID</questionType> ~ 40 patterns
<answerType>MEASURE</answerType> ~ 30 patterns
</q>
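The way the analysis fields feed the Lucene query can be sketched as follows. This is only an illustration that reproduces the query string shown above; the `Token` class, the `role` labels and the exact weighting rules (fuzzy `~0.7` for focus and nouns, boost `^3` for named entities, `^2` for a verb form paired with its lemma) are assumptions read off this single example, not the system's documented rules.

```python
from dataclasses import dataclass


@dataclass
class Token:
    form: str   # surface form as it appears in the question
    lemma: str  # dictionary form produced by lemmatization
    role: str   # hypothetical labels: "focus", "noun", "verb", "entity"


def term(tok: Token) -> str:
    """Turn one analysed token into a Lucene query term (assumed rules)."""
    if tok.role == "entity":
        return f"{tok.form}^3"                   # boost named entities
    if tok.role == "verb":
        if tok.lemma != tok.form:
            return f"({tok.form}^2 {tok.lemma})"  # inflected form or lemma
        return tok.form
    return f"{tok.form}~0.7"                      # fuzzy match for focus and nouns


def build_query(tokens) -> str:
    """Join the terms in question order."""
    return " ".join(term(t) for t in tokens)


example_tokens = [
    Token("procent", "procent", "focus"),
    Token("populaţiei", "populaţie", "noun"),
    Token("Italia", "Italia", "entity"),
    Token("contează", "conta", "verb"),
    Token("televiziune", "televiziune", "noun"),
    Token("obţine", "obţine", "verb"),
    Token("informaţii", "informaţie", "noun"),
]
query = build_query(example_tokens)
print(query)
```

Keeping the terms in question order makes the generated query easy to compare with the original question when debugging the analysis step.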
 We used Lucene to build two indexes, one at paragraph level and one at document level
 For every question, we ran its Lucene query against these indexes with the Lucene search engine to obtain a ranked list of snippets as candidate answers
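The idea of keeping one index per granularity and retrieving a ranked list from each can be sketched without Lucene. The toy inverted index below uses a simple TF-IDF-style score as a stand-in for Lucene's ranking; the sample documents, the identifiers and the scoring formula are all illustrative assumptions, not the system's implementation.

```python
import math
import re
from collections import Counter


def tokenize(text):
    return re.findall(r"\w+", text.lower())


def build_index(units):
    """units: dict mapping a unit id to its text. Returns (term counts per unit, document frequencies)."""
    tf = {uid: Counter(tokenize(text)) for uid, text in units.items()}
    df = Counter(term for counts in tf.values() for term in counts)
    return tf, df


def search(index, query, top_k=3):
    """Return up to top_k (unit id, score) pairs, best first."""
    tf, df = index
    n = len(tf)
    scores = {}
    for uid, counts in tf.items():
        score = sum(counts[t] * math.log(1 + n / df[t])
                    for t in tokenize(query) if t in counts)
        if score > 0:
            scores[uid] = score
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]


# Hypothetical mini-corpus: each document is a list of paragraphs.
documents = {
    "doc1": ["Television is the main source of information in Italy.",
             "Radio usage is declining."],
    "doc2": ["The committee adopted the report."],
}
paragraph_index = build_index(
    {f"{d}#p{i}": p for d, paras in documents.items() for i, p in enumerate(paras)})
document_index = build_index({d: " ".join(paras) for d, paras in documents.items()})

print(search(paragraph_index, "television Italy"))
print(search(document_index, "television Italy"))
```

The paragraph-level index returns short snippets that can be used directly as answer candidates, while the document-level index gives the wider context.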
 Answer selection depends on the Lucene score; in addition, we built special modules to extract answers for questions of type DEFINITION, REASON-PURPOSE, PROCEDURE and OPINION
 Two threshold values
◦ A higher one: the system returns many NOA (no answer) responses – RA (right answers) is affected, but c@1 is higher
◦ A lower one: the system returns only a few NOA responses – RA is higher, but c@1 is lower
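The effect of the threshold can be illustrated with a small sketch: when the best candidate's score falls below the threshold, the system returns no answer (NOA) instead of a risky one. The function name, the candidate scores and the threshold values are hypothetical; the slides do not give the actual values.

```python
def select_answer(candidates, threshold):
    """candidates: list of (answer, score) pairs. Returns the best answer, or None for NOA."""
    if not candidates:
        return None
    best, score = max(candidates, key=lambda c: c[1])
    return best if score >= threshold else None


# Hypothetical scored candidates for one question.
candidates = [("30%", 0.82), ("12%", 0.41)]

print(select_answer(candidates, threshold=0.5))  # answers with the best candidate
print(select_answer(candidates, threshold=0.9))  # withholds the answer (NOA)
```

Raising the threshold turns more low-confidence answers into NOA, which trades right answers for fewer wrong ones.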
Results (two columns per language pair):

                    RO-RO        EN-EN        FR-FR
answered right      95   102     85    78     54    47
answered wrong      74    93     98    99    124   153
total answered     169   195    183   177    178   200
unanswered right     0     0      0     0      0     0
unanswered wrong     0     0      0     0      0     0
unanswered empty    31     5     17    23     22     0
total unanswered    31     5     17    23     22     0
c@1 measure       0.55  0.42   0.46  0.43   0.30  0.24
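The c@1 measure rewards leaving a question unanswered over answering it wrongly. A minimal sketch, assuming the usual definition c@1 = (n_R + n_U * n_R / n) / n, where n_R is the number of right answers, n_U the number of unanswered questions and n the total number of questions; the slides report the values but do not state the formula, so the definition here is an assumption.

```python
def c_at_1(n_right: int, n_unanswered: int, n_total: int) -> float:
    """c@1 = (n_R + n_U * n_R / n) / n, under the assumed definition."""
    if n_total <= 0:
        raise ValueError("n_total must be positive")
    return (n_right + n_unanswered * n_right / n_total) / n_total


# First RO-RO column of the results: 95 right, 31 unanswered, 200 questions.
first_ro_column = c_at_1(n_right=95, n_unanswered=31, n_total=200)
print(round(first_ro_column, 2))  # 0.55
```

With no unanswered questions the measure reduces to plain accuracy, as in the last FR-FR column (47 / 200).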
 eLearning – fast answers for ~30% of questions
 Robotics – communication
 CriES 2010 – identifying experts on Yahoo! Answers
eLearning interface (screenshot, translated from Romanian)

Similar questions (columns: similar question, answer, Ok?; each row has a "Go" button):
◦ Are there applications in which using design patterns is not efficient? – Obviously (for example, a Hello World program)...
◦ What else could design patterns be applied to? – Finding the solution to a problem, creating an advanced programmer vocabulary, writing documentation, discussions with colleagues at a software company.
◦ What is the difference between a pattern and a coding idiom? – I have not heard the expression "coding idiom", but it seems to be something specific, whereas a design pattern is general...
◦ What is the difference between a pattern and classes? – A design pattern is a solution to a problem and is therefore composed of a hierarchy of related classes.
◦ Is a design pattern different from a pattern? Why was this name chosen? – A design pattern is a pattern in the field of software engineering. I do not know why this name was chosen... :)
◦ Do we use design patterns in the same application, or in different applications? – In the same application.
◦ What is a design pattern? – First of all: a name, a problem and a solution.

Question list (columns: question, answer, priority, status, details):
◦ What are design patterns used for? – priority: normal; status: nevoieNeaparat; details: Go, "Answer the question" form (answer field, Go)
◦ Can exception handling in Java be considered an application of the Decorator pattern? – priority: urgent; status: nevoieNeaparat; details: Go, "Answer the question" form (answer field, Go)
◦ Are there applications in which using design patterns is not efficient? – Obviously (for example, a Hello World program)...; priority: normal; status: doarAsa
◦ What else could design ... – Finding the solution to a problem, creating an advanced programmer vocabulary, in ...; status: saAfluM
 With Swoogle we extend the knowledge base
 The ontologies returned are then converted to AIML
format and saved in the robot’s memory
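Converting ontology facts into AIML can be sketched as below. This is only one possible mapping: the triple format, the "WHAT IS ..." pattern and the sentence template are illustrative assumptions, since the slides do not describe how the returned ontologies are actually translated. Only the `aiml`, `category`, `pattern` and `template` element names come from AIML itself.

```python
import xml.etree.ElementTree as ET


def triples_to_aiml(triples):
    """Build an AIML document with one category per (subject, relation, object) triple."""
    aiml = ET.Element("aiml", version="1.0.1")
    for subject, relation, obj in triples:
        category = ET.SubElement(aiml, "category")
        # Assumed mapping: the subject becomes the question pattern...
        ET.SubElement(category, "pattern").text = f"WHAT IS {subject.upper()}"
        # ...and the whole triple becomes the answer template.
        ET.SubElement(category, "template").text = f"{subject} {relation} {obj}."
    return ET.tostring(aiml, encoding="unicode")


# Hypothetical fact extracted from an ontology.
aiml_text = triples_to_aiml([("robot", "is a", "machine")])
print(aiml_text)
```

Each `category` pairs an input pattern with a response, so every fact added this way becomes a question the robot can answer.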
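CriES 2010 expert identification (diagram):
◦ Initial Yahoo! Answers collections (en, fr, ge, sp) → Eliminate stop words → Domain keywords → Relevant words for domains
◦ Initial user questions → Eliminate stop words → Question keywords → Relevant words for questions
◦ Relevant words for questions + relevant words for domains → Similarity score between questions and domains
◦ Other labels: Initial digraph; Run 0, Run 1, Run 2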
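The steps named in the diagram (stop-word elimination, keyword extraction, similarity between questions and domains) can be sketched as follows. Jaccard overlap is used here purely as a stand-in, since the slides do not say which similarity measure was used; the stop-word list and the domain texts are also illustrative assumptions.

```python
import re

# Tiny illustrative stop-word list; a real system would use a full list per language.
STOP_WORDS = {"the", "a", "an", "is", "of", "in", "to", "how", "what", "do", "i", "for"}


def keywords(text):
    """Lowercase, tokenize and drop stop words."""
    return {w for w in re.findall(r"\w+", text.lower()) if w not in STOP_WORDS}


def similarity(question, domain_keywords):
    """Jaccard overlap between question keywords and domain keywords (assumed measure)."""
    q = keywords(question)
    if not q or not domain_keywords:
        return 0.0
    return len(q & domain_keywords) / len(q | domain_keywords)


# Hypothetical domains built from sample texts.
domains = {
    "cooking": keywords("How to bake bread in the oven"),
    "programming": keywords("What is a design pattern in Java"),
}
question = "How do I apply a design pattern"
best_domain = max(domains, key=lambda d: similarity(question, domains[d]))
print(best_domain)
```

Once a question is mapped to its closest domain, the users who answer most often in that domain are natural expert candidates.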
 The UAIC QA system has evolved over time (from 9% in 2006 to 47.5% in 2010)
 The main problem is the quality and quantity of the Romanian resources involved
 At present we are focusing on using QA components in other applications in order to improve their capabilities

Editor's notes

  1. Swoogle – a semantic web search engine