8. データと論文
Corpora
・ AlJohri/OpenSubtitles
Get a lot of raw movie subtitles (~1.2Gb)
・ Cornell Movie-Dialogs Corpus
~ 40Mb after clearing out the technical data.
Papers
[1] Sequence to Sequence Learning with Neural Networks
[2] A Neural Conversational Model
https://github.com/nicolas-ivanov/seq2seq_chatbot_links