SlideShare una empresa de Scribd logo
1 de 38
THE 26TH INTERNATIONAL
CONFERENCE ON
COMPUTATIONAL LINGUISTICS
DECEMBER 11-16, 2016
OSAKA, JAPAN
COLING 2016
STATISTICS
• 3rd CoLing in Japan after Tokyo & Kyoto
• ~1100 presenters in total
• Almost 400 students
• 1039 papers submitted
• 32% acceptance rate
• 337 accepted papers
• 135 presentations
• 202 posters
• ~25% neural network papers
AREA-WISE ACCEPTANCE RATES
Area Total
submissio
ns
Total (%)
accepted
Paraphrasing, Textual Entailment 32 9 (28%)
Sentiment Analysis, Computational Argumentation 85 22 (26%)
Information Retrieval, Information Extraction, Question
Answering
126 26 (21%)
Applications 76 28 (37%)
Dialog Processing and Dialog Systems, Multimodal
Interfaces
29 12 (41%)
Speech Recognition, Text-to-Speech, Spoken Language
Understanding
24 11 (46%)
Machine Translation 88 31 (35%)
Resources, Software and Tools 56 25 (45%)
ROBOTS
QUOTES
• Probably the last PBMT paper ever
• People working on digital humanities don't really know what digital
humanities are…
• Kids learn language having heard a very small amount – to further
advance AI we need to focus on low resourced conditions instead of
big data
• Home Made Restaurant Warmly
• to make by hand taste
HYTRA 6
SIXTH WORKSHOP ON HYBRID APPROACHES TO TRANSLATION
KEYNOTE: MARK SELIGMAN, SPOKEN
TRANSLATION, INC.
PERCEPTUALLY GROUNDED DEEP SEMANTICS
IN FUTURE HYBRID MACHINE TRANSLATION
Nine Issues in Speech Translation
– Discourse
– Speech acts
– Topic tracking
– Domain
– Prosody
– Pauses
– Pitch, stress
– Translation mismatches
– System architecture, data
structures
Improve Statistical MT
• User feedback + machine learning
• More, better data
• Parsing > hybrid MT
KEYNOTE: MARK SELIGMAN, SPOKEN
TRANSLATION, INC.
PERCEPTUALLY GROUNDED DEEP SEMANTICS
IN FUTURE HYBRID MACHINE TRANSLATION
車
_car
を
_obj
運転
_driving
する
_do
人
_person
Syntactic
structure
NP
VP
Semantic
structure
PP V
N NP VN V
drive
person
person
car
mod
agt obj
The Return of Semantics:
Interlingua/Ontologies
Grounded Semantics
MAIN CONFERENCE
KEYNOTES:
JOAKIM NIVRE, REIKO MAZUKA, DINA DEMNER-FUSHMAN, SIMONE
TEUFEL
JOAKIM NIVRE
UPPSALA UNIVERSITY, SWEDEN
Universal Dependencies - Dubious Linguistics and Crappy Parsing?
• Maximize parallelism – but don’t overdo it
• Don’t annotate the same thing in different ways
• Don’t make different things look the same
• Don’t annotate things that are not there
• Universal taxonomy with language-specific elaboration
• Languages select from a universal pool of categories
• Allow language-specific extensions
JOAKIM NIVRE
UPPSALA UNIVERSITY, SWEDEN
Manning's law
1. UD needs to be satisfactory on linguistic analysis grounds for individual languages.
2. UD needs to be good for linguistic typology, i.e., providing a suitable basis for bringing
out cross-linguistic parallelism across languages and language families.
3. UD must be suitable for rapid, consistent annotation by a human annotator.
4. UD must be suitable for computer parsing with high accuracy.
5. UD must be easily comprehended and used by a non-linguist, whether a language
learner or an engineer with prosaic needs for language processing.
6. UD must support well downstream language understanding tasks (relation extraction,
reading comprehension, machine translation, …).
JOAKIM NIVRE
UPPSALA UNIVERSITY, SWEDEN
Dubious linguistics?
• Lexical dependencies and functional relations encoded in a
single tree
• Grounded in linguistic typology and dependency grammar
traditions
Crappy parsing?
• Not so bad with existing parsers, especially for cross-lingual
parsing
• Learn richer parsing models grounded in linguistic typology
REIKO MAZUKA
RIKEN BRAIN SCIENCE INSTITUTE, JAPAN
• 12month old babies are called 'old babies‘
• Medical stuff has lots of data, lots of problems
• … let alone …
DINA DEMNER-FUSHMAN
U.S. NATIONAL LIBRARY OF MEDICINE, U.S.A.
SIMONE TEUFEL
UNIVERSITY OF CAMBRIDGE, U.K.
PRESENTATIONS
MAIN CONFERENCE
CHARNER: CHARACTER-LEVEL
NAMED ENTITY RECOGNITION
Onur Kuru, Ozan Arkan Can, Deniz Yuret
• Stacked bidirectional LSTMs
• inputs characters
• outputs tag probabilities for each character
• Probabilities are then converted to word level named entity tags using a
Viterbi decoder
• Close to state-of-the-art NER performance in seven languages with the
same basic model using only labeled NER data and no hand-engineered
features or other external resources like syntactic taggers or Gazetteers
WHAT TOPIC DO YOU WANT TO HEAR ABOUT?
A BILINGUAL TALKING ROBOT
USING ENGLISH AND JAPANESE WIKIPEDIAS
Graham Wilcock, Kristiina Jokinen
PHRASE-BASED MACHINE TRANSLATION
USING MULTIPLE PREORDERING
CANDIDATES
Yusuke Oda, Taku Kudo, Tetsuji Nakagawa, Taro Watanabe
INTERACTIVE ATTENTION
FOR NEURAL MACHINE TRANSLATION
Fandong Meng, Zhengdong Lu, Hang Li, Qun Liu
INTERACTIVE ATTENTION
FOR NEURAL MACHINE TRANSLATION
Fandong Meng, Zhengdong Lu, Hang Li, Qun Liu
• Models the interaction between the decoder and the
representation of source sentence during translation by both
reading and writing operations
• Can keep track of the interaction history and therefore improve
the translation performance
SUB-WORD SIMILARITY BASED SEARCH FOR
EMBEDDINGS: INDUCING RARE-WORD
EMBEDDINGS FOR WORD SIMILARITY TASKS AND
LANGUAGE MODELLING
Mittul Singh, Clayton Greenberg, Youssef Oualil, Dietrich Klakow
• Training good word embeddings requires large amounts of data.
Out-of-vocabulary words will still be encountered.
• Existing methods use computationally-intensive morphological
analysis to generate embeddings
• The proposed system applies a computationally-simpler sub-word
search on words that have existing embeddings
• Up to 50% reduction in rare word perplexity in comparison to other
more complex language models
MULTI-ENGINE AND MULTI-ALIGNMENT BASED
AUTOMATIC POST-EDITING
AND ITS IMPACT ON TRANSLATION
PRODUCTIVITY
Santanu Pal, Sudip Kumar Naskar, Josef van Genabith
• Parallel system combination in the APE stage of a sequential MT-
APE combination
• Substantial translation improvements
• automatic evaluation (+5.9%)
• productivity in post-editing (21.76%)
• System combination on the level of APE alignments yields further
improvements
POSTERS
MAIN CONFERENCE
Achieves the state-of-the-art conversion Fscore 95.6
IMPROVING ATTENTION MODELING WITH
IMPLICIT DISTORTION AND FERTILITY FOR
MACHINE TRANSLATION
BEST PAPERS
2+1
PREDICTING HUMAN SIMILARITY JUDGMENTS
WITH DISTRIBUTIONAL MODELS:
THE VALUE OF WORD ASSOCIATIONS
Simon De Deyne, Amy Perfors, Daniel J Navarro
• Internal language models, that are more closely aligned to the
mental representations of words
• Count based model for text corpora
• Predicting structure from text corpora using word embeddings
• Count based model for word associations
• A spreading activation approach to semantic structure
EXTENDING THE USE OF ADAPTOR GRAMMARS
FOR UNSUPERVISED MORPHOLOGICAL
SEGMENTATION OF UNSEEN LANGUAGES
Ramy Eskander, Owen Rambow, Tianchun Yang
• Segmentation of words in a language into a sequence of
morphs
• Without rewriting or normalizing morphs
• Without identifying the stem
• Without identifying morphological features
KEYSTROKE DYNAMICS
AS SIGNAL FOR SHALLOW SYNTACTIC
PARSING
Barbara Plank
• Runnuer-up for best paper
WHAT’S NEXT
FUTURE COLING
2018
COLING 2018
• Santa Fe, New Mexico, USA
• August 20-25, 2018
LREC 2018
• Miyazaki, Japan
• May 7-12, 2018
LESS COLING & MORE OSAKA
HTTP://LIELAKEDA.LV HTTP://EJ.UZ/COLING2016

Más contenido relacionado

La actualidad más candente

Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing
Mustafa Jarrar
 
An Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense DisambiguationAn Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense Disambiguation
Surabhi Verma
 

La actualidad más candente (20)

Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)
 
Introduction to natural language processing
Introduction to natural language processingIntroduction to natural language processing
Introduction to natural language processing
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.net
 
What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...What can typological knowledge bases and language representations tell us abo...
What can typological knowledge bases and language representations tell us abo...
 
Enriching Word Vectors with Subword Information
Enriching Word Vectors with Subword InformationEnriching Word Vectors with Subword Information
Enriching Word Vectors with Subword Information
 
AINL 2016: Nikolenko
AINL 2016: NikolenkoAINL 2016: Nikolenko
AINL 2016: Nikolenko
 
Research data as an aid in teaching technical competence in subtitling
Research data as an aid in teaching technical competence in subtitlingResearch data as an aid in teaching technical competence in subtitling
Research data as an aid in teaching technical competence in subtitling
 
Building NLP solutions using Python
Building NLP solutions using PythonBuilding NLP solutions using Python
Building NLP solutions using Python
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing
 
Deep Learning for Information Retrieval
Deep Learning for Information RetrievalDeep Learning for Information Retrieval
Deep Learning for Information Retrieval
 
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Gender Classification of Blog Authors: With Feature Engineering and Deep Lear...
Gender Classification of Blog Authors: With Feature Engineering and Deep Lear...Gender Classification of Blog Authors: With Feature Engineering and Deep Lear...
Gender Classification of Blog Authors: With Feature Engineering and Deep Lear...
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
AINL 2016: Filchenkov
AINL 2016: FilchenkovAINL 2016: Filchenkov
AINL 2016: Filchenkov
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
An Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense DisambiguationAn Improved Approach to Word Sense Disambiguation
An Improved Approach to Word Sense Disambiguation
 
Nlp
NlpNlp
Nlp
 

Similar a CoLing 2016

1 Introduction.ppt
1 Introduction.ppt1 Introduction.ppt
1 Introduction.ppt
tanishamahajan11
 
naturallanguageprocessing-160722053804.pdf
naturallanguageprocessing-160722053804.pdfnaturallanguageprocessing-160722053804.pdf
naturallanguageprocessing-160722053804.pdf
shakeelAsghar6
 
Natural Language Processing-(NLP).pptx
Natural Language Processing-(NLP).pptxNatural Language Processing-(NLP).pptx
Natural Language Processing-(NLP).pptx
SHIBDASDUTTA
 
NLP presentation.pptx
NLP presentation.pptxNLP presentation.pptx
NLP presentation.pptx
pysgpa
 
• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx
• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx
• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx
odiliagilby
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
VishnuRajuV
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
Abdullah al Mamun
 

Similar a CoLing 2016 (20)

2010 INTERSPEECH
2010 INTERSPEECH 2010 INTERSPEECH
2010 INTERSPEECH
 
Speech-Recognition.pptx
Speech-Recognition.pptxSpeech-Recognition.pptx
Speech-Recognition.pptx
 
L1 nlp intro
L1 nlp introL1 nlp intro
L1 nlp intro
 
1 Introduction.ppt
1 Introduction.ppt1 Introduction.ppt
1 Introduction.ppt
 
NLP,expert,robotics.pptx
NLP,expert,robotics.pptxNLP,expert,robotics.pptx
NLP,expert,robotics.pptx
 
Morphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil LanguageMorphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil Language
 
International journal on natural language computing(ijnlc)
International journal on natural language computing(ijnlc)International journal on natural language computing(ijnlc)
International journal on natural language computing(ijnlc)
 
naturallanguageprocessing-160722053804.pdf
naturallanguageprocessing-160722053804.pdfnaturallanguageprocessing-160722053804.pdf
naturallanguageprocessing-160722053804.pdf
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language Processing-(NLP).pptx
Natural Language Processing-(NLP).pptxNatural Language Processing-(NLP).pptx
Natural Language Processing-(NLP).pptx
 
Amadou
AmadouAmadou
Amadou
 
NLP presentation.pptx
NLP presentation.pptxNLP presentation.pptx
NLP presentation.pptx
 
• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx
• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx
• COMMUNICATEBUSINESS VISION• WHAT TO EXPECT• .docx
 
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptxLiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
LiDeng-BerlinOct2015-ASR-GenDisc-4by3.pptx
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Teaching Machines to Listen: An Introduction to Automatic Speech Recognition
Teaching Machines to Listen: An Introduction to Automatic Speech RecognitionTeaching Machines to Listen: An Introduction to Automatic Speech Recognition
Teaching Machines to Listen: An Introduction to Automatic Speech Recognition
 
Building a Neural Machine Translation System From Scratch
Building a Neural Machine Translation System From ScratchBuilding a Neural Machine Translation System From Scratch
Building a Neural Machine Translation System From Scratch
 
Understanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational DatabasesUnderstanding Natural Language Queries over Relational Databases
Understanding Natural Language Queries over Relational Databases
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
 

Más de Matīss ‎‎‎‎‎‎‎  

Más de Matīss ‎‎‎‎‎‎‎   (20)

日本のお風呂
日本のお風呂日本のお風呂
日本のお風呂
 
Thrifty Food Tweets on a Rainy Day
Thrifty Food Tweets on a Rainy DayThrifty Food Tweets on a Rainy Day
Thrifty Food Tweets on a Rainy Day
 
私の趣味
私の趣味私の趣味
私の趣味
 
How Masterly Are People at Playing with Their Vocabulary?
How Masterly Are People at Playing with Their Vocabulary?How Masterly Are People at Playing with Their Vocabulary?
How Masterly Are People at Playing with Their Vocabulary?
 
私の町リガ
私の町リガ私の町リガ
私の町リガ
 
大学への交通手段
大学への交通手段大学への交通手段
大学への交通手段
 
小学生に 携帯電話
小学生に 携帯電話小学生に 携帯電話
小学生に 携帯電話
 
Tracing multisensory food experience on twitter
Tracing multisensory food experience on twitterTracing multisensory food experience on twitter
Tracing multisensory food experience on twitter
 
ラトビア大学
ラトビア大学ラトビア大学
ラトビア大学
 
私の趣味
私の趣味私の趣味
私の趣味
 
富士山りょこう
富士山りょこう富士山りょこう
富士山りょこう
 
Tips and Tools for NMT
Tips and Tools for NMTTips and Tools for NMT
Tips and Tools for NMT
 
Hybrid Machine Translation by Combining Multiple Machine Translation Systems
Hybrid Machine Translation by Combining Multiple Machine Translation SystemsHybrid Machine Translation by Combining Multiple Machine Translation Systems
Hybrid Machine Translation by Combining Multiple Machine Translation Systems
 
The Impact of Corpora Qulality on Neural Machine Translation
The Impact of Corpora Qulality on Neural Machine TranslationThe Impact of Corpora Qulality on Neural Machine Translation
The Impact of Corpora Qulality on Neural Machine Translation
 
Advancing Estonian Machine Translation
Advancing Estonian Machine TranslationAdvancing Estonian Machine Translation
Advancing Estonian Machine Translation
 
Debugging neural machine translations
Debugging neural machine translationsDebugging neural machine translations
Debugging neural machine translations
 
Effective online learning implementation for statistical machine translation
Effective online learning implementation for statistical machine translationEffective online learning implementation for statistical machine translation
Effective online learning implementation for statistical machine translation
 
Neirontulkojumu atkļūdošana
Neirontulkojumu atkļūdošanaNeirontulkojumu atkļūdošana
Neirontulkojumu atkļūdošana
 
Hybrid machine translation by combining multiple machine translation systems
Hybrid machine translation by combining multiple machine translation systemsHybrid machine translation by combining multiple machine translation systems
Hybrid machine translation by combining multiple machine translation systems
 
Paying attention to MWEs in NMT
Paying attention to MWEs in NMTPaying attention to MWEs in NMT
Paying attention to MWEs in NMT
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 

CoLing 2016

  • 1. THE 26TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS DECEMBER 11-16, 2016 OSAKA, JAPAN
  • 3. STATISTICS • 3rd CoLing in Japan after Tokyo & Kyoto • ~1100 presenters in total • Almost 400 students • 1039 papers submitted • 32% acceptance rate • 337 accepted papers • 135 presentations • 202 posters • ~25% neural network papers
  • 4. AREA-WISE ACCEPTANCE RATES Area Total submissio ns Total (%) accepted Paraphrasing, Textual Entailment 32 9 (28%) Sentiment Analysis, Computational Argumentation 85 22 (26%) Information Retrieval, Information Extraction, Question Answering 126 26 (21%) Applications 76 28 (37%) Dialog Processing and Dialog Systems, Multimodal Interfaces 29 12 (41%) Speech Recognition, Text-to-Speech, Spoken Language Understanding 24 11 (46%) Machine Translation 88 31 (35%) Resources, Software and Tools 56 25 (45%)
  • 6. QUOTES • Probably the last PBMT paper ever • People working on digital humanities don't really know what digital humanities are… • Kids learn language having heard a very small amount – to further advance AI we need to focus on low resourced conditions instead of big data • Home Made Restaurant Warmly • to make by hand taste
  • 7.
  • 8. HYTRA 6 SIXTH WORKSHOP ON HYBRID APPROACHES TO TRANSLATION
  • 9. KEYNOTE: MARK SELIGMAN, SPOKEN TRANSLATION, INC. PERCEPTUALLY GROUNDED DEEP SEMANTICS IN FUTURE HYBRID MACHINE TRANSLATION Nine Issues in Speech Translation – Discourse – Speech acts – Topic tracking – Domain – Prosody – Pauses – Pitch, stress – Translation mismatches – System architecture, data structures Improve Statistical MT • User feedback + machine learning • More, better data • Parsing > hybrid MT
  • 10. KEYNOTE: MARK SELIGMAN, SPOKEN TRANSLATION, INC. PERCEPTUALLY GROUNDED DEEP SEMANTICS IN FUTURE HYBRID MACHINE TRANSLATION 車 _car を _obj 運転 _driving する _do 人 _person Syntactic structure NP VP Semantic structure PP V N NP VN V drive person person car mod agt obj The Return of Semantics: Interlingua/Ontologies Grounded Semantics
  • 11. MAIN CONFERENCE KEYNOTES: JOAKIM NIVRE, REIKO MAZUKA, DINA DEMNER-FUSHMAN, SIMONE TEUFEL
  • 12. JOAKIM NIVRE UPPSALA UNIVERSITY, SWEDEN Universal Dependencies - Dubious Linguistics and Crappy Parsing? • Maximize parallelism – but don’t overdo it • Don’t annotate the same thing in different ways • Don’t make different things look the same • Don’t annotate things that are not there • Universal taxonomy with language-specific elaboration • Languages select from a universal pool of categories • Allow language-specific extensions
  • 13. JOAKIM NIVRE UPPSALA UNIVERSITY, SWEDEN Manning's law 1. UD needs to be satisfactory on linguistic analysis grounds for individual languages. 2. UD needs to be good for linguistic typology, i.e., providing a suitable basis for bringing out cross-linguistic parallelism across languages and language families. 3. UD must be suitable for rapid, consistent annotation by a human annotator. 4. UD must be suitable for computer parsing with high accuracy. 5. UD must be easily comprehended and used by a non-linguist, whether a language learner or an engineer with prosaic needs for language processing. 6. UD must support well downstream language understanding tasks (relation extraction, reading comprehension, machine translation, …).
  • 14. JOAKIM NIVRE UPPSALA UNIVERSITY, SWEDEN Dubious linguistics? • Lexical dependencies and functional relations encoded in a single tree • Grounded in linguistic typology and dependency grammar traditions Crappy parsing? • Not so bad with existing parsers, especially for cross-lingual parsing • Learn richer parsing models grounded in linguistic typology
  • 15. REIKO MAZUKA RIKEN BRAIN SCIENCE INSTITUTE, JAPAN • 12month old babies are called 'old babies‘ • Medical stuff has lots of data, lots of problems • … let alone … DINA DEMNER-FUSHMAN U.S. NATIONAL LIBRARY OF MEDICINE, U.S.A. SIMONE TEUFEL UNIVERSITY OF CAMBRIDGE, U.K.
  • 17. CHARNER: CHARACTER-LEVEL NAMED ENTITY RECOGNITION Onur Kuru, Ozan Arkan Can, Deniz Yuret • Stacked bidirectional LSTMs • inputs characters • outputs tag probabilities for each character • Probabilities are then converted to word level named entity tags using a Viterbi decoder • Close to state-of-the-art NER performance in seven languages with the same basic model using only labeled NER data and no hand-engineered features or other external resources like syntactic taggers or Gazetteers
  • 18. WHAT TOPIC DO YOU WANT TO HEAR ABOUT? A BILINGUAL TALKING ROBOT USING ENGLISH AND JAPANESE WIKIPEDIAS Graham Wilcock, Kristiina Jokinen
  • 19. PHRASE-BASED MACHINE TRANSLATION USING MULTIPLE PREORDERING CANDIDATES Yusuke Oda, Taku Kudo, Tetsuji Nakagawa, Taro Watanabe
  • 20. INTERACTIVE ATTENTION FOR NEURAL MACHINE TRANSLATION Fandong Meng, Zhengdong Lu, Hang Li, Qun Liu
  • 21. INTERACTIVE ATTENTION FOR NEURAL MACHINE TRANSLATION Fandong Meng, Zhengdong Lu, Hang Li, Qun Liu • Models the interaction between the decoder and the representation of source sentence during translation by both reading and writing operations • Can keep track of the interaction history and therefore improve the translation performance
  • 22. SUB-WORD SIMILARITY BASED SEARCH FOR EMBEDDINGS: INDUCING RARE-WORD EMBEDDINGS FOR WORD SIMILARITY TASKS AND LANGUAGE MODELLING Mittul Singh, Clayton Greenberg, Youssef Oualil, Dietrich Klakow • Training good word embeddings requires large amounts of data. Out-of-vocabulary words will still be encountered. • Existing methods use computationally-intensive morphological analysis to generate embeddings • The proposed system applies a computationally-simpler sub-word search on words that have existing embeddings • Up to 50% reduction in rare word perplexity in comparison to other more complex language models
  • 23. MULTI-ENGINE AND MULTI-ALIGNMENT BASED AUTOMATIC POST-EDITING AND ITS IMPACT ON TRANSLATION PRODUCTIVITY Santanu Pal, Sudip Kumar Naskar, Josef van Genabith • Parallel system combination in the APE stage of a sequential MT- APE combination • Substantial translation improvements • automatic evaluation (+5.9%) • productivity in post-editing (21.76%) • System combination on the level of APE alignments yields further improvements
  • 25. Achieves the state-of-the-art conversion Fscore 95.6
  • 26.
  • 27.
  • 28.
  • 29.
  • 30.
  • 31. IMPROVING ATTENTION MODELING WITH IMPLICIT DISTORTION AND FERTILITY FOR MACHINE TRANSLATION
  • 33. PREDICTING HUMAN SIMILARITY JUDGMENTS WITH DISTRIBUTIONAL MODELS: THE VALUE OF WORD ASSOCIATIONS Simon De Deyne, Amy Perfors, Daniel J Navarro • Internal language models, that are more closely aligned to the mental representations of words • Count based model for text corpora • Predicting structure from text corpora using word embeddings • Count based model for word associations • A spreading activation approach to semantic structure
  • 34. EXTENDING THE USE OF ADAPTOR GRAMMARS FOR UNSUPERVISED MORPHOLOGICAL SEGMENTATION OF UNSEEN LANGUAGES Ramy Eskander, Owen Rambow, Tianchun Yang • Segmentation of words in a language into a sequence of morphs • Without rewriting or normalizing morphs • Without identifying the stem • Without identifying morphological features
  • 35. KEYSTROKE DYNAMICS AS SIGNAL FOR SHALLOW SYNTACTIC PARSING Barbara Plank • Runnuer-up for best paper
  • 37. 2018 COLING 2018 • Santa Fe, New Mexico, USA • August 20-25, 2018 LREC 2018 • Miyazaki, Japan • May 7-12, 2018
  • 38. LESS COLING & MORE OSAKA HTTP://LIELAKEDA.LV HTTP://EJ.UZ/COLING2016