About language model applications
In encoder-decoder architectures, the decoder's intermediate representation provides the queries, while the outputs of the encoder blocks provide the keys and values, so that the resulting decoder representation is conditioned on the encoder. This form of attention is called cross-attention.
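The mechanism can be illustrated with a minimal single-head sketch in NumPy. It follows the standard Transformer convention, in which queries are projected from the decoder states and keys and values are projected from the encoder outputs. The function name `cross_attention`, the projection matrices `w_q`, `w_k`, `w_v`, and all dimensions are hypothetical choices made for illustration, not taken from any particular library or model.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(decoder_states, encoder_outputs, w_q, w_k, w_v):
    """Single-head scaled dot-product cross-attention (illustrative sketch).

    decoder_states:  (T_dec, d_model) array, source of the queries
    encoder_outputs: (T_enc, d_model) array, source of the keys and values
    """
    q = decoder_states @ w_q    # (T_dec, d_k)
    k = encoder_outputs @ w_k   # (T_enc, d_k)
    v = encoder_outputs @ w_v   # (T_enc, d_k)

    # Each decoder position scores every encoder position.
    scores = q @ k.T / np.sqrt(q.shape[-1])   # (T_dec, T_enc)
    weights = softmax(scores, axis=-1)        # each row sums to 1

    # Weighted sum of encoder values: decoder representation
    # conditioned on the encoder.
    return weights @ v, weights


# Hypothetical sizes: 5 encoder positions, 3 decoder positions.
rng = np.random.default_rng(0)
d_model, d_k = 8, 4
encoder_outputs = rng.normal(size=(5, d_model))
decoder_states = rng.normal(size=(3, d_model))
w_q, w_k, w_v = (rng.normal(size=(d_model, d_k)) for _ in range(3))

context, weights = cross_attention(decoder_states, encoder_outputs, w_q, w_k, w_v)
print(context.shape)  # (3, 4)
print(weights.shape)  # (3, 5)
```

Each row of `weights` is a distribution over encoder positions for one decoder position, which is what makes the output conditioned on the encoder. Real implementations typically add multiple heads, an output projection, and padding masks, all of which are omitted here for brevity.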