What We’re Reading: Neural Machine Translation with Reconstruction

by Spence Green
2 Minute Read

Neural MT systems generate translations one word at a time. They can still generate fluid translations because they choose each word based on all of the words generated so far. Typically, these systems are just trained to generate the next word correctly, based on all previous words. One systematic problem with this word-by-word approach to training and translating is that the translations are often too short and omit important content.

In the paper Neural Machine Translation with Reconstruction, the authors describe a clever new way to train and translate. During training, their system is encouraged not only to generate each next word correctly but also to correctly generate the original source sentence based on the translation that was generated. In this way, the model is rewarded for generating a translation that is sufficient to describe all of the content in the original source.

When translating, their system generates multiple alternatives and chooses the final translation such that two conditions are met simultaneously: the translation is predictable given the source, and the source is predictable given the translation. Ensuring this two-way coherence provides a substantial benefit in translation quality. In their experiments, translation lengths were more similar to human reference lengths, and the system was less prone to generating too much or too little content compared to a strong neural MT baseline.

A more technical summary of this paper when it was originally released was provided by Denny Britz, and the first author Zhaopeng Tu responded to those comments.

Paper: Neural Machine Translation with Reconstruction

Authors: Zhaopeng Tu, Yang Liu, Lifeng Shang, Xiaohua Liu, and Hang Li

Venue: Association for the Advancement of Artificial Intelligence (AAAI) 2017

screen_shot_2019-02-23_at_1.22.36_pm

Product update: improved translation review categories

1 Minute Read

We’re pleased to announce a new feature in our interface that improves the translation review process. Previously, reviewers could make general text comments about errors and introduce categories by using hashtags to indicate the type of error (such as #punctuation or #mistranslation). However, it was still a manual process to collect the different hashtags and identify precisely where the error occurred.

Read More

Understanding Artificial Neural Networks

4 Minute Read

What is an Artificial Neural Network? We rely on computers to perform tasks at an exponentially faster rate than the majority of humans can accomplish. Whether it’s crunching the numbers in a long mathematical sequence or compiling a vast database, a computer will always have the edge. However, human brains are remarkable - we can understand abstract ideas, formulate relatively sound decisions, and create new concepts through imagination. However, we may not retain this sole advantage for much longer. Loosely modeled off of our brain's structure, developers have crafted an artificial intelligence-based interconnected network - artificial neural networks. An artificial intelligence network is a powerful tool designed to learn and reason, drastically shaping our future. In this article, we’re going to explain what an artificial neural network is (with examples and use cases) and how companies use neural networking technology to grow.

Read More