An Examination of the Strengths and Weaknesses of Neural Machine Translation
As Neural MT gains momentum we see more studies that explain why it is being seen as a major step forward, and we are now beginning to understand some of the very specific reasons for this momentum. This summary by Antonio Toral and Víctor M. Sánchez-Cartagena highlights how NMT provides some specific advantages using well-understood data and comparative systems. Their main findings are presented below, but I saw an additional comment in the paper that I am also including here. The paper also provides BLEU scores for all the systems that were used, which are consistent with the scores shown here, and it is interesting that Russian is still a language where rule-based systems still produce the highest scores in tests like this. The fact that NMT systems perform so well on translations going out of English should be especially interesting to the localization industry. Now we need some evidence of how NMT systems can be domain-adapted and SYSTRAN will soon provide some details.
The fact that NMT systems do not do well on very long sentences can be managed by making these sentences shorter. I tend to write really long sentences but 40-45 words in a sentence seems really long to me and in a localization setting I think this can be managed.
The fact that NMT systems do not do well on very long sentences can be managed by making these sentences shorter. I tend to write really long sentences but 40-45 words in a sentence seems really long to me and in a localization setting I think this can be managed.
"The best NMT system clearly outperforms the best PBMT system for all language directions out of English (relative improvements range from 5.5% for EN > RO to 17.6% for EN > FI) and the human evaluation (Bojar et al., 2016, Sec. 3.4) confirms these results. In the opposite direction, the human evaluation shows that the best NMT system outperforms the best PBMT system for all language directions except when the source language is Russian."
System From EN | CS | DE | FI | RO | RU |
PBMT | 23.7 | 30.6 | 15.3 | 27.4 | 24.3 |
NMT | 25.9 | 34.2 | 18.0 | 28.9 | 26.0 |
|
|
|
|
|
|
Into EN |
|
|
|
|
|
PBMT | 30.4 | 35.2 | 23.7 | 35.4 | 29.3 |
NMT | 31.4 | 38.7 | - | 34.1 | 28.2 |
BLEU scores of the best NMT and PBMT systems for each language pair at WMT16’s news translation task.
|