Hacker Newsnew | past | comments | ask | show | jobs | submit | moritzdubois's commentslogin

> By now, everyone has heard the explanation that ChatGPT is a transformer encoder-decoder that ...

Except it is wrong. GPT models are decoder-only transformers. See Andrej Karpathy's outstanding series on implementing a toy-scale GPT model.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: