GPT 3

Lecture 12.3 Famous transformers (BERT, GPT-2, GPT-3)



DLVU

In this lecture we look at the details of some famous transformer models. How were they trained, and what could they do after they were trained.