3. /22
List of papers
(Links point to manuscripts uploaded by the authors. This talk covers Sahraeian et al., Samarakoon et al., and Zhao et al.)
Zhang et al. (Cambridge), “Joint optimisation of tandem systems using Gaussian
mixture density neural network discriminative sequence training” (paper)
Gupta et al. (CMU), “Visual features for context-aware speech recognition” (video)
Sahraeian et al. (KU Leuven), “Exploiting sequential low-rank factorization for
multilingual DNNs” (paper)
Jyothi et al. (Indian Institute of Technology Bombay), “Low-resource
grapheme-to-phoneme conversion using recurrent neural networks” (paper)
Samarakoon et al. (National University of Singapore), “An investigation into learning
effective speaker subspaces for robust unsupervised DNN adaptation” (can be found
with a web search)
Zhao et al. (Microsoft), “Extended low-rank plus diagonal adaptation for deep and
recurrent neural networks” (link is to the ICASSP 2016 paper)
4. /22
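Overview of the papers introduced
The task differs across the three papers
– [Sahraeian et al.] … language adaptation in multilingual speech recognition
– [Zhao et al.] … speaker adaptation in monolingual speech recognition
– [Samarakoon et al.] … same as above
Use of DNNs (deep neural networks) [all papers]
– Goal: reduce the number of DNN model parameters so that adaptation is robust
(i.e., works even with little adaptation data)
Low-rank adaptation of DNNs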
– Sequential low-rank adaptation [Sahraeian et al.]
– Low-rank plus diagonal adaptation [Zhao et al.]
– SVD-based vs. FHL-based [Samarakoon et al.]
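As a rough illustration of why low-rank structure reduces the number of parameters to adapt, here is a minimal NumPy sketch. It is not the implementation from any of the three papers. The layer size (512 x 512), the rank (32), and the helper names `low_rank_factorize` and `diag_plus_low_rank` are all assumptions made for this example. The first helper is a generic truncated-SVD factorization; the second builds a square matrix of the form diag(d) + PQ, which is only the general "low-rank plus diagonal" shape suggested by the name of the method.

```python
import numpy as np


def low_rank_factorize(W, rank):
    """Approximate W (m x n) as A @ B, with A (m x rank) and B (rank x n),
    using a truncated SVD. Hypothetical helper for illustration only."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]  # scale each kept column by its singular value
    B = Vt[:rank, :]
    return A, B


def diag_plus_low_rank(d, P, Q):
    """Build the square matrix diag(d) + P @ Q. Hypothetical helper."""
    return np.diag(d) + P @ Q


rng = np.random.default_rng(0)
m, n, k = 512, 512, 32  # assumed layer size and rank

# Stand-in for a trained weight matrix (random values, not real model weights).
W = rng.standard_normal((m, n))

# Low-rank factorization: two thin matrices replace one full matrix.
A, B = low_rank_factorize(W, k)
full_params = W.size                 # m * n
low_rank_params = A.size + B.size    # k * (m + n)

# Diagonal-plus-low-rank adaptation matrix, initialised to the identity so
# that the adapted network starts out identical to the unadapted one.
d = np.ones(n)
P = np.zeros((n, k))
Q = np.zeros((k, n))
W_adapt = diag_plus_low_rank(d, P, Q)
dpl_params = d.size + P.size + Q.size  # n + 2 * n * k

print(full_params, low_rank_params, dpl_params)
```

With these assumed sizes, the full matrix has 262144 entries, the rank-32 factorization has 32768, and the diagonal-plus-low-rank form has 33280, so an adaptation that touches only the factored parameters has roughly one eighth as many values to estimate from the adaptation data.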