Machine Translation

Todd Sullivan and Pavani Vantimitta
Stanford's Natural Language Processing Course
Project 2 of 3
Stanford Department of Computer Science

In this project we implemented IBM Model One and IBM Model Two for the purposes of translating French text to English. Our implementation scaled well and could easily handle more than 200,000 training sentences. The project's recommended number to beat to get scaling "extra credit" was 100,000 sentences.

Technical Report

Member Contributions

The following list details all group contributions. These contributions were not the original tasks assigned to each group member, but were the end result due to each member's abilities and other issues.

  • I implemented Model Two.
  • Pavani and I pair programmed the rest of the assignment.
  • I performed all experiments, collected all data, and organized all results.
  • Pavani selected the examples and created the graphs.
  • I wrote sections 1 through 3 of the report.
  • Pavani wrote the initial version of Section 4 while I edited it.
  • I applied all formatting and presentation features to the report.

Source Code

Since this assignment will be used in future versions of the natural language processing course, I am not releasing the code at this time.