SlideShare una empresa de Scribd logo
1 de 20
Natural Language to Visualization
by Neural Machine Translation
Yuyu Luo, Nan Tang, Guoliang Li, Jiawei Tang,
Chengliang Chai, Xuedi Qin
VIS 2021
2021. 11. 23
Hyunwook Lee
Contents
• Overview
• Related Work
• Design Requirements
• Background and Problem Formulation
• Model – ncNet
• Evaluation
• Limitation and Future Work
2
What is neural machine translation (NMT)?
• Given input sequence, translate them into output sequence (seq2seq)
 e.g., English to Spanish
 (input, output) can be any pair of (sequence, sequence)!
3
Overview
4
Contribution
• Proposing ncNet: Transformer-based seq2seq model for NL2VIS
• Presenting visualization-grammar, Vega-Zero for Vega-Lite
• Attention forcing and Visualization-aware translation
A. Satyanarayan, D. Moritz, K. Wongsuphasawat, and J. Heer. Vega-lite: A grammar of interactive graphics. IEEE TVCG, 23(1):341–350, 2017
5
Related Work
NL4DV
FlowSense
• Utilize semantic parsers, instead of using deep-learning based approach
B. Yu and C. T. Silva. Flowsense: A natural language interface for visual data exploration within a dataflow system. IEEE TVCG, pp. 1–11, 2020.
A. Narechania, A. Srinivasan, and J. T. Stasko. NL4DV: A toolkit for generating analytic specifications for data visualization from natural language queries. In VIS, 2020.
6
Design Requirements
• Easy-to-use
 Simple query from novice(to create visualization) user  Suitable visualization
• End-to-End
 Semantic parser  hard coded, should be tuned separately
 Need to be end-to-end manner
• Language-agnostic (Language-independent)
 The output sequence of ncNet should be utilized in various visualization languages
• e.g., Vega-Lite, D3, ggplot2…
7
Background – Data, NL Query
• Data D: tabular dataset from JSON, CSV, or related table from DB
• NL Query: user query for corresponding data D
• Objective: given NL query with dataset D, predict Vega-Zero specification
(Input Sequence, Output Sequence)
8
Language-Agnostic query: Vega-Zero
• A visualization grammar
 Simplified version of Vega-Lite
 Model friendly
• Direct use of Vega-Lite?
 User friendly rather than model friendly
 hierarchical grammar  hard to train
• Language-agnostic
 By simple formatting
 Vega-Lite, ggplot2
: unit
9
Background – Chart Templates
• User input
• Constrain output sequence
10
Methods: Overview
11
Methods: Input Handling
• Input Tokenization
 Tokenize three types of input data
• Input Embedding
12
What is Transformer?
• One of the most famous NLP model
• The model only utilize attention mechanism
• Parallelly handle input sequence
 better than RNN, but need positional encoding
• Well-motivated
13
Methods: Attention Forcing
14
Methods: visualization-aware translation
• Modified version of Beam Search
• Tokens in specific steps have appropriate tokens (e.g., scatter only in [T])
• Top-k with heuristic rules
 If there is no appropriate tokens, select top-1 token
15
Evaluation
• Dataset
 nvBench: 25,750 data in format (NL query, VIS) with ~780 tables from 105 domains
16
Evaluation: User Study
• 4 experts, 2 non-experts
• Four failure cases – fail on predicting 1) columns; 2) chart type;
3) top-1 selection; 4) Vega-Zero fails but vis match
17
Evaluation: Comparison
• Compare sota NL2SQL, ncNet w/o C, ncNet w/ C
• ncNet performs better than sota NL2SQL model
18
Limitation and Future work
• (Limitation) Benchmark data are relatively small and limited
• (Limitation) Supporting only one-shot NL queries
• (Future work) supporting conversational NL queries  dialog system?
• (Future work) Chart embedding methods
Natural Language to Visualization by Neural Machine Translation

Más contenido relacionado

La actualidad más candente

Bert pre_training_of_deep_bidirectional_transformers_for_language_understanding
Bert  pre_training_of_deep_bidirectional_transformers_for_language_understandingBert  pre_training_of_deep_bidirectional_transformers_for_language_understanding
Bert pre_training_of_deep_bidirectional_transformers_for_language_understanding
ThyrixYang1
 

La actualidad más candente (20)

On using monolingual corpora in neural machine translation
On using monolingual corpora in neural machine translationOn using monolingual corpora in neural machine translation
On using monolingual corpora in neural machine translation
 
Neural Machine Translation (D3L4 Deep Learning for Speech and Language UPC 2017)
Neural Machine Translation (D3L4 Deep Learning for Speech and Language UPC 2017)Neural Machine Translation (D3L4 Deep Learning for Speech and Language UPC 2017)
Neural Machine Translation (D3L4 Deep Learning for Speech and Language UPC 2017)
 
Moving to neural machine translation at google - gopro-meetup
Moving to neural machine translation at google  - gopro-meetupMoving to neural machine translation at google  - gopro-meetup
Moving to neural machine translation at google - gopro-meetup
 
Machine Translation Introduction
Machine Translation IntroductionMachine Translation Introduction
Machine Translation Introduction
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
 
Notes on attention mechanism
Notes on attention mechanismNotes on attention mechanism
Notes on attention mechanism
 
Advanced Neural Machine Translation (D4L2 Deep Learning for Speech and Langua...
Advanced Neural Machine Translation (D4L2 Deep Learning for Speech and Langua...Advanced Neural Machine Translation (D4L2 Deep Learning for Speech and Langua...
Advanced Neural Machine Translation (D4L2 Deep Learning for Speech and Langua...
 
Julia language: inside the corporation
Julia language: inside the corporationJulia language: inside the corporation
Julia language: inside the corporation
 
BERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from TransformersBERT: Bidirectional Encoder Representations from Transformers
BERT: Bidirectional Encoder Representations from Transformers
 
[Paper review] BERT
[Paper review] BERT[Paper review] BERT
[Paper review] BERT
 
Bert
BertBert
Bert
 
Attention Mechanism in Language Understanding and its Applications
Attention Mechanism in Language Understanding and its ApplicationsAttention Mechanism in Language Understanding and its Applications
Attention Mechanism in Language Understanding and its Applications
 
BERT
BERTBERT
BERT
 
BERT
BERTBERT
BERT
 
Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machi...
Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machi...Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machi...
Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machi...
 
Master Thesis of Computer Engineering: OpenTranslator
Master Thesis of Computer Engineering: OpenTranslatorMaster Thesis of Computer Engineering: OpenTranslator
Master Thesis of Computer Engineering: OpenTranslator
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
 
A Feature-Based Model for Nested Named-Entity Recognition at VLSP-2018 NER Ev...
A Feature-Based Model for Nested Named-Entity Recognition at VLSP-2018 NER Ev...A Feature-Based Model for Nested Named-Entity Recognition at VLSP-2018 NER Ev...
A Feature-Based Model for Nested Named-Entity Recognition at VLSP-2018 NER Ev...
 
Bert pre_training_of_deep_bidirectional_transformers_for_language_understanding
Bert  pre_training_of_deep_bidirectional_transformers_for_language_understandingBert  pre_training_of_deep_bidirectional_transformers_for_language_understanding
Bert pre_training_of_deep_bidirectional_transformers_for_language_understanding
 
A neural conversational_model
A neural conversational_modelA neural conversational_model
A neural conversational_model
 

Similar a Natural Language to Visualization by Neural Machine Translation

[論文紹介]A syntactic neural model for general purpose code generation
[論文紹介]A syntactic neural model for general purpose code generation [論文紹介]A syntactic neural model for general purpose code generation
[論文紹介]A syntactic neural model for general purpose code generation
OgataTomoya
 
Superworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueSuperworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and Fugue
Databricks
 

Similar a Natural Language to Visualization by Neural Machine Translation (20)

[論文紹介]A syntactic neural model for general purpose code generation
[論文紹介]A syntactic neural model for general purpose code generation [論文紹介]A syntactic neural model for general purpose code generation
[論文紹介]A syntactic neural model for general purpose code generation
 
Recurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRU
 
Nlp and transformer (v3s)
Nlp and transformer (v3s)Nlp and transformer (v3s)
Nlp and transformer (v3s)
 
230922 Semantic Exploration from Language Abstractions and Pretrained Represe...
230922 Semantic Exploration from Language Abstractions and Pretrained Represe...230922 Semantic Exploration from Language Abstractions and Pretrained Represe...
230922 Semantic Exploration from Language Abstractions and Pretrained Represe...
 
Deep Learning Architectures for NLP (Hungarian NLP Meetup 2016-09-07)
Deep Learning Architectures for NLP (Hungarian NLP Meetup 2016-09-07)Deep Learning Architectures for NLP (Hungarian NLP Meetup 2016-09-07)
Deep Learning Architectures for NLP (Hungarian NLP Meetup 2016-09-07)
 
Transformer Mods for Document Length Inputs
Transformer Mods for Document Length InputsTransformer Mods for Document Length Inputs
Transformer Mods for Document Length Inputs
 
node2vec: Scalable Feature Learning for Networks.pptx
node2vec: Scalable Feature Learning for Networks.pptxnode2vec: Scalable Feature Learning for Networks.pptx
node2vec: Scalable Feature Learning for Networks.pptx
 
Natural Language Interface to Knowledge Graph
Natural Language Interface to Knowledge GraphNatural Language Interface to Knowledge Graph
Natural Language Interface to Knowledge Graph
 
ECCV2022 paper reading - MultiMAE: Multi-modal Multi-task Masked Autoencoders...
ECCV2022 paper reading - MultiMAE: Multi-modal Multi-task Masked Autoencoders...ECCV2022 paper reading - MultiMAE: Multi-modal Multi-task Masked Autoencoders...
ECCV2022 paper reading - MultiMAE: Multi-modal Multi-task Masked Autoencoders...
 
Superworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueSuperworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and Fugue
 
Gnn overview
Gnn overviewGnn overview
Gnn overview
 
Resnet.pdf
Resnet.pdfResnet.pdf
Resnet.pdf
 
5_RNN_LSTM.pdf
5_RNN_LSTM.pdf5_RNN_LSTM.pdf
5_RNN_LSTM.pdf
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
A Generalization of Transformer Networks to Graphs.pptx
A Generalization of Transformer Networks to Graphs.pptxA Generalization of Transformer Networks to Graphs.pptx
A Generalization of Transformer Networks to Graphs.pptx
 
008 GNNs at Scale With Graph Data Science Sampling and Python Client Integrat...
008 GNNs at Scale With Graph Data Science Sampling and Python Client Integrat...008 GNNs at Scale With Graph Data Science Sampling and Python Client Integrat...
008 GNNs at Scale With Graph Data Science Sampling and Python Client Integrat...
 
Transfer Learning for Low Resource Languages and Domains
Transfer Learning for Low Resource Languages and DomainsTransfer Learning for Low Resource Languages and Domains
Transfer Learning for Low Resource Languages and Domains
 
A Software Engineering Perspective on SDN Programmability
A Software Engineering Perspective on SDN ProgrammabilityA Software Engineering Perspective on SDN Programmability
A Software Engineering Perspective on SDN Programmability
 
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
 
Talk from NVidia Developer Connect
Talk from NVidia Developer ConnectTalk from NVidia Developer Connect
Talk from NVidia Developer Connect
 

Más de ivaderivader

Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
ivaderivader
 
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
ivaderivader
 

Más de ivaderivader (20)

Argument Mining
Argument MiningArgument Mining
Argument Mining
 
Papers at CHI23
Papers at CHI23Papers at CHI23
Papers at CHI23
 
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsDDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
 
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
 
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
 
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
 
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
 
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial NetworksA Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
 
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
 
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for VisualizationPerception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
 
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
 
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-PoolingNeural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
 
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
 
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTubeBad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
 
Invertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise RemovalInvertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise Removal
 
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural NetworkTraffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
 
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training  MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
 
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI ComponentsScreen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
 
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
 
Recommending What Video to Watch Next: A Multitask Ranking System
Recommending What Video to Watch Next: A Multitask Ranking SystemRecommending What Video to Watch Next: A Multitask Ranking System
Recommending What Video to Watch Next: A Multitask Ranking System
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 

Natural Language to Visualization by Neural Machine Translation

  • 1. Natural Language to Visualization by Neural Machine Translation Yuyu Luo, Nan Tang, Guoliang Li, Jiawei Tang, Chengliang Chai, Xuedi Qin VIS 2021 2021. 11. 23 Hyunwook Lee
  • 2. Contents • Overview • Related Work • Design Requirements • Background and Problem Formulation • Model – ncNet • Evaluation • Limitation and Future Work
  • 3. 2 What is neural machine translation (NMT)? • Given input sequence, translate them into output sequence (seq2seq)  e.g., English to Spanish  (input, output) can be any pair of (sequence, sequence)!
  • 5. 4 Contribution • Proposing ncNet: Transformer-based seq2seq model for NL2VIS • Presenting visualization-grammar, Vega-Zero for Vega-Lite • Attention forcing and Visualization-aware translation A. Satyanarayan, D. Moritz, K. Wongsuphasawat, and J. Heer. Vega-lite: A grammar of interactive graphics. IEEE TVCG, 23(1):341–350, 2017
  • 6. 5 Related Work NL4DV FlowSense • Utilize semantic parsers, instead of using deep-learning based approach B. Yu and C. T. Silva. Flowsense: A natural language interface for visual data exploration within a dataflow system. IEEE TVCG, pp. 1–11, 2020. A. Narechania, A. Srinivasan, and J. T. Stasko. NL4DV: A toolkit for generating analytic specifications for data visualization from natural language queries. In VIS, 2020.
  • 7. 6 Design Requirements • Easy-to-use  Simple query from novice(to create visualization) user  Suitable visualization • End-to-End  Semantic parser  hard coded, should be tuned separately  Need to be end-to-end manner • Language-agnostic (Language-independent)  The output sequence of ncNet should be utilized in various visualization languages • e.g., Vega-Lite, D3, ggplot2…
  • 8. 7 Background – Data, NL Query • Data D: tabular dataset from JSON, CSV, or related table from DB • NL Query: user query for corresponding data D • Objective: given NL query with dataset D, predict Vega-Zero specification (Input Sequence, Output Sequence)
  • 9. 8 Language-Agnostic query: Vega-Zero • A visualization grammar  Simplified version of Vega-Lite  Model friendly • Direct use of Vega-Lite?  User friendly rather than model friendly  hierarchical grammar  hard to train • Language-agnostic  By simple formatting  Vega-Lite, ggplot2 : unit
  • 10. 9 Background – Chart Templates • User input • Constrain output sequence
  • 12. 11 Methods: Input Handling • Input Tokenization  Tokenize three types of input data • Input Embedding
  • 13. 12 What is Transformer? • One of the most famous NLP model • The model only utilize attention mechanism • Parallelly handle input sequence  better than RNN, but need positional encoding • Well-motivated
  • 15. 14 Methods: visualization-aware translation • Modified version of Beam Search • Tokens in specific steps have appropriate tokens (e.g., scatter only in [T]) • Top-k with heuristic rules  If there is no appropriate tokens, select top-1 token
  • 16. 15 Evaluation • Dataset  nvBench: 25,750 data in format (NL query, VIS) with ~780 tables from 105 domains
  • 17. 16 Evaluation: User Study • 4 experts, 2 non-experts • Four failure cases – fail on predicting 1) columns; 2) chart type; 3) top-1 selection; 4) Vega-Zero fails but vis match
  • 18. 17 Evaluation: Comparison • Compare sota NL2SQL, ncNet w/o C, ncNet w/ C • ncNet performs better than sota NL2SQL model
  • 19. 18 Limitation and Future work • (Limitation) Benchmark data are relatively small and limited • (Limitation) Supporting only one-shot NL queries • (Future work) supporting conversational NL queries  dialog system? • (Future work) Chart embedding methods

Notas del editor

  1. Neural Machine Translation은 입력 시퀀스를 받아 출력 시퀀스를 만듭니다. 예를 들어, 아래와 같이 영어를 스페인어로 바꿀 수 있습니다. 그러나, translation 자체를 폭넓게 본다면, 이러한 방법론은 어느 분야에서도 동일하게 사용될 수 있습니다. 데이터가 충분하다는 전제하에, 예를 들어서, 입력 음악의 장르를 바꾸는 것 또한 가능합니다. 조금 더 명확하게 하자면, tokenize 가능한 모든 입력 시퀀스 데이터를 통해 tokenize 가능한 모든 출력 시퀀스를 만들어 낼 수 있을 것입니다. 물론, 다른 여러 tokenize 불가능한 데이터에도 활용 가능하지만, 그런 경우 tokenize를 이용하는 MT를 바로 사용하는 것은 불가능에 가까우니까요 
  2. N  실제 입력 Query, Visualization에 대한 자세한 요구사항. D  실제 시각화에 사용되는 데이터 집합 C  유저 입력 (Constraint, Optional), 차트의 타입, sorting 방식 등을 정할 수 있음, 출력에 관여하는 부분  이 논문의 이미지를 보고 무슨 모델이라고 생각이 드느냐  사실 이 모델은 정말 완벽하게 seq2seq NMT이다.  1) 포장기술이 중요하다 2) NMT란 생각보다 폭넓게 사용될 수 있다
  3. 이 외에도 Evaluation 등을 했다는 부분이 있지만, 사실상 중요한 부분이 아니므로 스킵
  4. FlowSense (street speed change를 시각화), NL4DV (영화 rating 등의 시각화) Semantic parsing은 간단하게 말하자면 NL을 기계가 이해가능한 논리로 변경하는 작업 해당 작업은 Deep Learning 등이 들어가지 않으며, 그에 따라 굉장히 시각화 가능한 부분이 한정적이며, 유동적으로 어느 부분이 중요한지 등을 잘 판단하지 못함
  5. Easy-to-use, End-to-End은 NL 모델과 관련되어 있는 항목이다. 해당 사항은 semantic parser로는 이룰 수 없다. Language-agnostic은 출력 sequence의 모델링에 관한 항목으로, 출력 sequence는 어느 프로그래밍 언어에도 바로 적용할 수 있어야한다.
  6. Output sequence를 표현하기 위해서 새로운 language-agnostic gramma를 만들 필요가 있음  Vega-Zero! Vega-Lite 등의 다른 language를 바로 표현하는 데에는 어려움이 있음 Vega-Lite, ggplot2에는 바로 적용가능, 다른 language에도 빠르게 적용 가능함
  7. 방법론 전체 Overview입니다. 우리는 1) 입력 처리에 관해서 알아볼 것이고 2) Transformer에 관해 간단하게 훑어보고 넘어갈 것이고 3) 이 논문의 추가적인 방법론인 Attention Forcing과 Visualization-aware Translation에 대해 알아볼 겁니다.
  8. T_N  tokenization of NL Query T_C  tokenization of Chart Templates and other be-filled objects T_D  tokenization of dataset, with special characters 각 Tokenization 결과를 concatenate해서 Embedding layer에 입력으로 사용
  9. 실제 모델에 주요한 방법론으로 사용된 것은 attention 뿐이지만, 이 외에도 많은 방법론이 들어갔다. 각 단어의 position 정보를 표현하기 위한 Positional Encoding, 단어에서의 중요도 / 유사도를 통한 QKV attention, 각 SubLayer(Multi-head attention) 이후의 Residual Connection, Layer Normalization, Decoder에서 치팅을 막기위한 Masked Multi-head attention, Encoder에서 encoding된 정보를 활용한 Encoder-Decoder attention… Design rationale가 굉장히 잘 설명되어있고, 모두 reasonable해서 한번쯤 읽어보면 좋다.
  10. 왼쪽, 오른쪽의 예시와 함께 X, Y, Z를 확인하면, 사실 X, Y, Z는 결국 columns에 따라 바뀐다라는 것을 알 수 있습니다. 이에 따라, 사실 Columns가 아닌 value를 집중할 필요가 없다는 것을 모델에 직접적으로 넣어줄 수 있다는겁니다. 이 외에도 AggFunction이나 K의 경우 NL Tokens에 대해서만 attention을 하면 될 것이라고 생각할 수 있습니다.
  11. Beam Search란 자연어 처리에서 자주 사용되는 방법론으로, Best-First Search를 사용하되, 각 step별로 기억해야할 (선택해야할) 단어를 제한하여 효율성을 높이는 방식이다.
  12. Hardness의 경우 Vega-Zero 등의 Vis Query가 얼마나 길고 복잡한지에 따라 매겨짐
  13. User study를 진행, Columns 예측 실패의 경우, (a)의 1번째 (좌상단) 과 같이, age of 45를 모델이 정확히 판별해내지 못함 의 경우 chart template를 받지 않는 경우에 일어났음 Vega-Zero에서 top-1이 출력으로 나오는데, 실제 맞는 값이 top-k에서 나온 경우 예를 들어, 입력 쿼리에서 차량 대수, Vega-Zero에서는 엔진 개수
  14. Vis에서 분석을 위해서 여러 쿼리를 써야할 수 있다. Overview First는 사실 Vis가 아니라 전체 논문 작성을 포함한 여러 분야에서 사용될 수 있는 말이다. NLP를 포함한 다른 분야에서도 Overview first 혹은 적어도 단계에 따른 분석 등을 진행하며, 항상 새기면 좋겠다. Transformer나 이 논문같은 경우, well-motivated 되어있고 굉장히 clarification, justification이 잘 되어있다. 어디에 무엇을 쓰면 왜 썼는지 등이 정확하게 기입되어있어, 이러한 부분을 논문 쓸 때 참고하면 좋다.