Topics
The simplest decoding method which selects the most probable token at each step .
In an encoder-decoder architecture where we condition on input and previous token outputs
Note that in practice, we don’t directly condition on input , but use the context variable produced by the encoder to represent the input sequence.