Home

Decoder vs Encoder-Decoder

Note: This is a work in progress to help me understand the differences between the two architectures.

Decoder

Encoder-Decoder

Relevant Papers

It's possible that decorder-only + RLHF (e.g. ChatGPT) is better than encoder-decoder + multi-task fine tuning (e.g. Flan-T5).


© Mike Surowiec