Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
VCWE[論文読み会]
1. VCWE:Visual Character-Enhanced
Word Embedding
Chi Sun, Xipeng Qiu, Xuanjing Huang
Shanghai Key Laboratory of Intelligent Information Processing,
Fudan University School of Computer Science,
Fudan University 825 Zhangheng Road, Shanghai, China
3. Chi Sun, Xipeng Qiu, Xuanjing Huang
VCWE:Visual Character-Enhanced Word Embedding
In Proceedings of the 2019 Conference of the North
American Chapter of the Association for Computational
Linguistics: Human Language Technologies
arxiv: https://arxiv.org/abs/1902.08795
3
文献情報
8. 背景
- Image-based Character Embedding for Document
Classification [Shimada+ 2016]
- CAEで文字画像を学習させることで
文字の形を考慮した文字の分散表現の獲得
[Shimada+ 2016] D. Shimada, R. Kotani, and H. Iyatomi, “Document classification through image-based character
embedding and wildcard training,” IEEE International Conference on Big Data, pp. 3922–3927, 2016. 8
9. 背景
- End-to-End Text Classification via
Image-based Embedding using
Character-level Networks [Kitada+ 2018]
- 文書分類タスクとcharacter encoderを
繋げたEnd-to-Endのモデル
- 画像空間と特徴空間による
data augmentationを行うことで精度を向上
- Wikipediaタイトルによる
カテゴリ分類にてstate of the art を達成
9
[Kitada+ 2018] Shunsuke Kitada, Ryunosuke Kotani, Hitoshi Iyatomi, "End-to-End Text Classification via
Image-based Embedding using Character-level Networks”, IEEE Applied Imagery Pattern Recognition
(AIPR) 2018 workshop
10. 先行研究
- 中国語における文字構造を考慮した単語分散表現
- CWE[Yin+ 2015]
- Word2vec + 文字の部首情報を追加
- GWE[Su+ 2017]
- CWE+文字画像による視覚的特徴量の追加
- JWE[Yu+ 2017]
- 単語や文字、部首をそれぞれ別々に扱う
10
[Yin+ 2016] Rongchao Yin, Quan Wang, Rui Li, Peng Li, Bin Wang, “Multi-Granularity Chinese Word
Embedding”, Empirical Methods in Natural Language Processing 2016
[Su and Lee+ 2017] Tzu-Ray Su, Hung-Yi Lee, “Learning Chinese Word Representations From Glyphs Of
Characters”, arXiv preprint arXiv:1708.04755.
[Yu+ 2017], Jinxing Yu, Xun Jian, Hao Xin, and Yangqiu Song, “Joint embeddings of chinese words,
characters, and fine-grained subcharacter components”, Empirical Methods in Natural Language Processing
2017