Adding Statistical Machine Translation Adaptation to Computer-Assisted Translation

Report No. ARL-TR-6637
Authors: Robert P. Winkler, Somiya Metu, Stephen A. LaRocca, and John J. Morgan
Date/Pages: September 2013; 22 pages
Abstract: Statistical machine translation (SMT) has proven effective for general purpose language translation but not for highly specialized domains like medicine, military operations, and law enforcement, which have their own technical jargon. We present a novel approach for iteratively incorporating a human translator in the loop to adapt SMT models to a particular domain. We show how these models can be made accessible via Web services and integrated with computer-assisted translation (CAT) tools. In this report, we describe a novel human-in-the-loop post-editing domain adaptation algorithm for refining SMT models using the Joshua decoder and integrate it with a CAT tool called OmegaT.
Distribution: Approved for public release
  Download Report ( 0.981 MBytes )
If you are visually impaired or need a physical copy of this report, please visit and contact DTIC.

Last Update / Reviewed: September 1, 2013