Understanding MT Quality: BLEU Scores
This is the first in a series of posts discussing various aspects of MT quality from the context of enterprise use and value, where linguistic quality is important, but not the only determinant of suitability in a structured MT technology evaluation process. A cleaner, more polished, and shorter studio version of this post is available here . You can consider this post a first draft, or the live stage performance (stream of consciousness) version. What is BLEU (Bilingual Evaluation Understudy)? As the use of enterprise machine translation expands, it becomes increasingly more important for users and practitioners to understand MT quality issues in a relevant, meaningful, and accurate way. The BLEU score is a string-matching algorithm that provides basic output quality metrics for MT researchers and developers . In this first post, we will review and look more closely at the BLEU score , which is probably the most widely used MT quality assessment metric in use by MT researchers an...
Comments
Post a Comment