Why Self-Attention? A Targeted Evaluation of Neural Machine Translation Architectures

Why Self-Attention? A Targeted Evaluation of Neural Machine Translation Architectures