What We’re Reading: Learning to Decode for Future Success

by Spence Green

August, 7, 2017 1 Minute Read

When doing beam search in sequence to sequence models, one explores next words in order of their likelihood. However, during decoding, there may be other constraints we have or objectives we wish to maximize. For example, sequence length, BLEU score, or mutual information between the target and source sentences. In order to accommodate these additional desiderata, the authors add an additional term Q onto the likelihood capturing the appropriate criterion and then choose words based on this combined objective.

The difficulty here is that we don’t know the values of these quantities until we have completed our decoding. Eg, we don’t know how long the sequence we are going to output is until we have actually finished decoding the sentence. In order to solve this issue, the authors learn Q as a function that has the following inputs: the source sentence, the prefix of previously outputted target symbols, and the current hidden state of the decoder. Based off of this information, it predicts the quantity in question. In the sequence length example, it predicts number of output tokens that the decoder will generate.

Paper: Learning to Decode for Future Success

Authors: Jiwei Li, Will Monroe, Dan Jurafsky

Publication: Stanford University

View All Posts

March, 29, 2023

Introducing Lilt Contextual AI: In-Context Learning for Enterprise Localization

7 Minute Read

Today we’re excited to announce the Lilt Contextual AI Engine, a new generative language model that implements in-context learning (ICL). With 5x more parameters than the previous Lilt model it replaces, Contextual AI powers the Verified (with human verification) and Instant (fully automatic) enterprise localization workflows. Contextual AI outperforms both GPT-4 and Google Translate while remaining over 1000x more compact than GPT-4, meaning that self-managed customers of the Lilt AI Platform can deploy on their own compute.

November, 17, 2020

Lilt Awarded AFWERX Small Business Innovation Contract

2 Minute Read

Language is one of the most powerful influences in our daily lives - the way we learn, the people we interact with, and the information we have access is all heavily dependent on the language we are born into.