The Cambridge Statistical Machine Translation system

This open source package contains the Cambridge SMT system, a set of tools for statistical machine translation that rely on OpenFST. You can download it here, or clone it directly with git:

git clone https://github.com/ucam-smt/ucam-smt.git

It includes the following features:


For this release, we have prepared an extensive tutorial that explains how to use these tools. It is available at: http://ucam-smt.github.io/tutorial

The tutorial is intended as a guide to using the tools, but our research publications contain the best descriptions of the algorithms and modelling techniques described here. A complete list of our publications can be found at http://divf.eng.cam.ac.uk/smt/Main/SmtPapers

Authors and Contributors

This package grew out of the Ph.D. thesis work of Gonzalo Iglesias, in which he developed HiFST, a hierarchical phrase-based statistical machine translation system based on OpenFST.

Contributors to this release and the tutorial are:

With thanks to Cyril Allauzen and Michael Riley (OpenFST).

Support or Contact

Questions? Problems? Please leave a message at https://groups.google.com/forum/#!forum/ucam-smt