Open Source Tools for Machine Translation
All of these are documented in my MT Marathon paper. Each tarball contains a README with compilation instructions. Code I wrote is under the LGPL and all dependencies are open source.- system combination.tar.gz
- My research: the multi-engine machine translation system. See the README.
- language model filter.tar.gz
- Fast filtering of language models to multiple vocabularies. Yields a 92% reduction in model size for system combination and 36% for translation systems. Updated June 9, 2010.
- scoring.tar.gz
- Script that makes it easy to score machine translation output using NIST's BLEU and NIST, TER, and METEOR. Use plaintext with one segment per line instead of three different formats. Puts all the scores on a single line ready for inclusion in a table. Updated June 10, 2010.