nlp reinforcement learning nmt dialogue machine translation text generation style transfer naacl natural language processing customer care pointer generator minimum risk training mrt sts simile bleu mt text style transfer naacl2019 human bandit feedback reinforcement learning translation natural language processing acl emnlp back translation neural network paraphrase model