Andrej Karpathy
We build a Generatively Pretrained Transformer (GPT), following the paper “Attention is All You Need” and OpenAI’s GPT-2 / GPT-3. We talk about connections to ChatGPT, which has taken the world by storm. We watch GitHub Copilot, itself a GPT, help us write a GPT (meta :D!). I recommend people watch the earlier makemore videos to get comfortable with the autoregressive language modeling framework and basics of tensors and PyTorch nn, which we take for granted in this video.
Links:
– Google colab for the video: https://colab.research.google.com/drive/1JMLa53HDuA-i7ZBmqV7ZnA3c_fvtXnx-?usp=sharing
– GitHub repo for the video: https://github.com/karpathy/ng-video-lecture
– Playlist of the whole Zero to Hero series so far: https://www.youtube.com/watch?v=VMj-3S1tku0&list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ
– nanoGPT repo: https://github.com/karpathy/nanoGPT
– my website: https://karpathy.ai
– my twitter: https://twitter.com/karpathy
– our Discord channel: https://discord.gg/3zy8kqD9Cp
Supplementary links:
– Attention is All You Need paper: https://arxiv.org/abs/1706.03762
– OpenAI GPT-3 paper: https://arxiv.org/abs/2005.14165
– OpenAI ChatGPT blog post: https://openai.com/blog/chatgpt/
– The GPU I’m training the model on is from Lambda GPU Cloud, I think the best and easiest way to spin up an on-demand GPU instance in the cloud that you can ssh to: https://lambdalabs.com . If you prefer to work in notebooks, I think the easiest path today is Google Colab.
Suggested exercises:
– EX1: The n-dimensional tensor mastery challenge: Combine the `Head` and `MultiHeadAttention` into one class that processes all the heads in parallel, treating the heads as another batch dimension (answer is in nanoGPT; a sketch also follows this list).
– EX2: Train the GPT on your own dataset of choice! What other data could be fun to blabber on about? (A fun advanced suggestion if you like: train a GPT to do addition of two numbers, i.e. a+b=c. You may find it helpful to predict the digits of c in reverse order, as the typical addition algorithm (that you’re hoping it learns) would proceed right to left too. You may want to modify the data loader to simply serve random problems and skip the generation of train.bin, val.bin; a sketch of such a loader follows this list. You may want to mask out the loss at the input positions of a+b that just specify the problem using y=-1 in the targets (see CrossEntropyLoss ignore_index). Does your Transformer learn to add? Once you have this, swole doge project: build a calculator clone in GPT, for all of +-*/. Not an easy problem. You may need Chain of Thought traces.)
– EX3: Find a dataset that is very large, so large that you can’t see a gap between train and val loss. Pretrain the transformer on this data, then initialize with that model and finetune it on tiny shakespeare with a smaller number of steps and lower learning rate. Can you obtain a lower validation loss by the use of pretraining?
– EX4: Read some transformer papers and implement one additional feature or change that people seem to use. Does it improve the performance of your GPT?
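For EX1, here is a minimal sketch of what a combined, batched multi-head attention class could look like, roughly what nanoGPT’s CausalSelfAttention does. The class name BatchedMultiHeadAttention and its constructor arguments are mine; the shapes, causal mask, and scaling follow the lecture’s single-head `Head` module.

```python
import torch
import torch.nn as nn
from torch.nn import functional as F

class BatchedMultiHeadAttention(nn.Module):
    """All heads computed in parallel, with the head count folded into an
    extra batch-like dimension (sketch; names are illustrative)."""
    def __init__(self, n_embd, n_head, block_size, dropout=0.0):
        super().__init__()
        assert n_embd % n_head == 0
        self.n_head = n_head
        self.head_size = n_embd // n_head
        # one linear layer produces queries, keys and values for all heads at once
        self.qkv = nn.Linear(n_embd, 3 * n_embd, bias=False)
        self.proj = nn.Linear(n_embd, n_embd)
        self.dropout = nn.Dropout(dropout)
        self.register_buffer('tril', torch.tril(torch.ones(block_size, block_size)))

    def forward(self, x):
        B, T, C = x.shape
        q, k, v = self.qkv(x).split(C, dim=2)                 # each (B, T, C)
        # fold the heads into a batch-like dimension: (B, n_head, T, head_size)
        q = q.view(B, T, self.n_head, self.head_size).transpose(1, 2)
        k = k.view(B, T, self.n_head, self.head_size).transpose(1, 2)
        v = v.view(B, T, self.n_head, self.head_size).transpose(1, 2)
        # scaled dot-product attention with the causal mask, as in the lecture
        wei = q @ k.transpose(-2, -1) * self.head_size**-0.5  # (B, n_head, T, T)
        wei = wei.masked_fill(self.tril[:T, :T] == 0, float('-inf'))
        wei = F.softmax(wei, dim=-1)
        wei = self.dropout(wei)
        out = wei @ v                                          # (B, n_head, T, head_size)
        out = out.transpose(1, 2).contiguous().view(B, T, C)   # re-assemble head outputs
        return self.proj(out)
```

The key step is the view/transpose pair that treats the head count as another batch dimension, so a single matrix multiply scores all heads at once.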
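For the EX2 addition variant, a sketch of a data loader that serves random a+b=c problems and masks the prompt positions with -1 so CrossEntropyLoss(ignore_index=-1) only scores the answer digits. The token ids (digits 0-9 as themselves, ‘+’ as 10, ‘=’ as 11), the fixed digit widths, and the function name are all assumptions for illustration.

```python
import torch

# Sketch of a random-problem batch generator for the a+b=c exercise.
# Assumed token ids: digits 0-9 map to themselves, '+' is 10, '=' is 11.
def get_addition_batch(batch_size, n_digits=2, device='cpu'):
    def to_digits(n, width):
        return [int(d) for d in str(n).zfill(width)]  # fixed-width digit list

    xs, ys = [], []
    for _ in range(batch_size):
        a = torch.randint(0, 10**n_digits, (1,)).item()
        b = torch.randint(0, 10**n_digits, (1,)).item()
        c = a + b
        # emit the digits of c in reverse order, as the exercise suggests
        c_digits = to_digits(c, n_digits + 1)[::-1]
        seq = to_digits(a, n_digits) + [10] + to_digits(b, n_digits) + [11] + c_digits
        x, y = seq[:-1], seq[1:]
        # mask the loss on the "a+b" prompt so only the answer digits are scored
        eq_pos = seq.index(11)
        y = [-1 if i < eq_pos else t for i, t in enumerate(y)]
        xs.append(x)
        ys.append(y)
    x = torch.tensor(xs, dtype=torch.long, device=device)
    y = torch.tensor(ys, dtype=torch.long, device=device)
    return x, y

# later, in the training loop (sketch):
#   loss = F.cross_entropy(logits.view(-1, vocab_size), yb.view(-1), ignore_index=-1)
```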
Chapters:
00:00:00 intro: ChatGPT, Transformers, nanoGPT, Shakespeare
baseline language modeling, code setup
00:07:52 reading and exploring the data
00:09:28 tokenization, train/val split
00:14:27 data loader: batches of chunks of data
00:22:11 simplest baseline: bigram language model, loss, generation
00:34:53 training the bigram model
00:38:00 port our code to a script
Building the “self-attention”
00:42:13 version 1: averaging past context with for loops, the weakest form of aggregation
00:47:11 the trick in self-attention: matrix multiply as weighted aggregation
00:51:54 version 2: using matrix multiply
00:54:42 version 3: adding softmax
00:58:26 minor code cleanup
01:00:18 positional encoding
01:02:00 THE CRUX OF THE VIDEO: version 4: self-attention
01:11:38 note 1: attention as communication
01:12:46 note 2: attention has no notion of space, operates over sets
01:13:40 note 3: there is no communication across batch dimension
01:14:14 note 4: encoder blocks vs. decoder blocks
01:15:39 note 5: attention vs. self-attention vs. cross-attention
01:16:56 note 6: “scaled” self-attention. why divide by sqrt(head_size)
Building the Transformer
01:19:11 inserting a single self-attention block to our network
01:21:59 multi-headed self-attention
01:24:25 feedforward layers of transformer block
01:26:48 residual connections
01:32:51 layernorm (and its relationship to our previous batchnorm)
01:37:49 scaling up the model! creating a few variables. adding dropout
Notes on Transformer
01:42:39 encoder vs. decoder vs. both (?) Transformers
01:46:22 super quick walkthrough of nanoGPT, batched multi-headed self-attention
01:48:53 back to ChatGPT, GPT-3, pretraining vs. finetuning, RLHF
01:54:32 conclusions
Corrections:
00:57:00 Oops “tokens from the _future_ cannot communicate”, not “past”. Sorry! 🙂
01:20:05 Oops I should be using the head_size for the normalization, not C
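For context on the second correction, the attention scores should be scaled by the head size rather than the embedding dimension. A sketch of the corrected line, assuming q and k of shape (B, T, head_size) as in the lecture:

```python
# inside Head.forward (sketch): scale by 1/sqrt(head_size), not 1/sqrt(C)
wei = q @ k.transpose(-2, -1) * head_size**-0.5  # (B, T, T)
```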
If I order a GPT assembly kit from Amazon, what would it deliver? How much would the kit cost?
this is fantastic – thanks for putting this together.
For anyone getting an error after adding the multi-head attention block at 1:23:46:
I think current PyTorch wants an explicit integer for the head_size passed to MultiHeadAttention().
This fixed my error:
self.self_attention_heads = MultiHeadAttention(4, int(n_embd/4))
I had to wrap n_embd/4 with int() to make sure it was an integer (integer division, n_embd // 4, works too).
This is a public service. So glad I just came upon this.
REVENGE OF THE WORMS
Thank you!
Thank you so much for all that you do, Sir.
I'm really grateful for this detailed guide; you make navigating through the process a breeze! Now, I've been wondering, how does this stack up against fine-tuning something like gpt-3.5-turbo-0613? Especially when dealing with extensive code bases or large repositories? 3.5 would be a different ballpark, wouldn't it, as far as abilities?
Awesome video, really loved it
This video is worth a lot to me.
Thank you Sir
Why are so many people in the comments saying they have been given an understanding of transformers after watching a 2-hour video? It takes 10,000 hours. A 2-hour video is a helpful start, especially if it ignites a lifelong interest, but on its own it is not going to make you understand much.
how does "dot-product" relates to "affinity" @ 1:08:41
@Andrej Karpathy please make a from-scratch tutorial in VS Code
Awesome
For those who replicated this video, what's the approximate training cost using Lambda?
Andrej, love your videos. keep doing this.
Definitely not for the casual watcher. Need to do some homework to fully understand this.
Just finished watching this and there is a lot to digest, but will note: The output of your Shakespeare model reminded me of the Markov chains that I experimented with many years ago.
Such awesome content for learning the Transformer. Many, many thanks…
Thank you very much! This was quite a course. Epic!
This video deserves two thumbs up (or more)! I spent a lot of time watching and rewatching parts of this, coding the model "the hard way", and it was totally worth it. Thank you!
amazing thank you, this is really a wonderful lecture
Andrej, you are an EXTREMELY intelligent individual! Excellent presentation…
Love your way of teaching, thank you!!
When he prints out the first 1000 elements of the PyTorch tensor, I was like: this is the Matrix.
Thanks for making such a logically smooth tutorial! It helps to see why we use this structure. It's also cool that you explain almost everything that appears in the model, even though it might be considered classic in the field. Very nice job, bravo!
Amazing video. Thank you.
It is truly amazing to see such content. Andrej, thanks a ton for the effort. Your expertise in the topic is clear from how you are able to simplify the lecture.
He loved the infinity softmax trick lol
Fun fact – only 0.1% of people actually did it in practice.
Thank you very much for such a lesson. I guess there is a point in the lecture where you enlightened me about vision transformers. Very well done. May you come up with a nice one on reinforcement learning next.
Thank you Andrej. The best transformer tutorial on the internet 🔥❤
I had to install a sound booster extension to listen to the video 😅
A huge thank you from me!
I would like to know how the "auto-complete" feature in VS Code used in the video is implemented.
https://youtu.be/JYqXD_n0P64
How exactly does backpropagation work for transformers? Is it similar to backpropagation through time for RNNs?
Do you need to know calculus to understand and follow this video?
Long live Andrej.
Thank you 😀
First I want to express my gratitude for this amazing content.
Second, is there a way to have a 15-30 minute talk with you to ask you some questions?
Thanks. This class is wonderful because now I have a basic understanding of how ChatGPT was pre-trained. I can freely move on to my fine-tuning. Thanks a million.
thanks a bundle man
he is not karpathy, he is empathy, the great man – andrej empathy.
Your material, way of explaining, everything you said in the lecture is super awesome. I saw this video twice, once just watching and understanding and next time following and writing code with you. It has been very very helpful. I feel like I knew nothing and now I am well-informed and motivated to learn and implement it. All thanks to you. Thank you very much for sharing such an amazing lecture with us for free. Much appreciated. 🌟