Computerphile
AI image generators are massive, but how are they creating such interesting images? Dr Mike Pound explains what’s going on.
Thumbnail image partly created by DALL-E with the prompt: “Computerphile YouTube Video presenter Mike Pound Explains Diffusion AI methods thumbnail with green computer style title text on a black background with grey binary”
https://www.facebook.com/computerphile
https://twitter.com/computer_phile
This video was filmed and edited by Sean Riley.
Computer Science at the University of Nottingham: https://bit.ly/nottscomputer
Computerphile is a sister project to Brady Haran’s Numberphile. More at http://www.bradyharan.com
I find the name of this channel quite interesting
Is this simplified explanation of the noise diffusion process true?
Theoretically, it’s like burying an ‘ice cream’ mosaic under hundreds of extra tesserae (the small rectangular slabs used to build a mosaic) and then asking a highly skilled artist to watch them being removed so that the original image is restored. During this process the artist learns how to understand and reinterpret the ‘ice cream’ image in other mosaics. The artist is trained this way on millions of other mosaics, so that eventually they can create entirely new ones based on the requests (or text prompts) of the person commissioning them.
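If it helps to make the analogy concrete, here is a minimal sketch of the training loop it describes: add a random amount of noise to a clean image and train a network to predict that noise. The tiny linear "network", the image size, and the noise schedule below are stand-ins of my own, not the actual DALL-E or Stable Diffusion code.

```python
# Sketch of diffusion training: the network learns to predict the noise
# that was mixed into a clean image (illustrative, not the real code).
import torch
import torch.nn as nn

T = 1000                                   # number of noising steps (assumed)
betas = torch.linspace(1e-4, 0.02, T)      # noise schedule (assumed)
alphas_cumprod = torch.cumprod(1 - betas, dim=0)

model = nn.Sequential(                     # stand-in for a real U-Net
    nn.Flatten(), nn.Linear(3 * 32 * 32, 3 * 32 * 32))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

def train_step(x0):                        # x0: clean images, shape (B, 3, 32, 32)
    t = torch.randint(0, T, (x0.shape[0],))         # random timestep per image
    eps = torch.randn_like(x0)                       # the noise that gets added
    a = alphas_cumprod[t].view(-1, 1, 1, 1)
    x_t = a.sqrt() * x0 + (1 - a).sqrt() * eps       # the "mosaic buried in tesserae"
    eps_pred = model(x_t).view_as(x0)                # network guesses the noise
    loss = ((eps_pred - eps) ** 2).mean()            # trained to predict the noise
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return loss.item()

loss = train_step(torch.rand(8, 3, 32, 32))          # one step on dummy images
```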
10:24 This is the part that boggles my mind intuitively. Wouldn't the second step just try to remove the noise and get back to the imperfect result produced by the first step? Presumably, when you imperfectly remove the noise the first time around and get back some vague shape that kind of resembles the true image, you also lose some of the actual information in that image, because you weren't removing the noise perfectly. So wouldn't the second step just remove noise to try to get back to that imperfect image (and do so imperfectly, losing even more actual information)? I just don't see how that would make the result better and better with each iteration, rather than worse and worse.
EDIT: Oh, but I guess the point is that this is used specifically when generating a new image from scratch, so there wasn't really any "true" image to begin with; all that matters is producing a crisp image that fits the prompt, so "losing information" about some true image doesn't matter because there was none to begin with? If that is the case, would my intuition above still hold if this approach were used to remove noise from an actual existing image, rather than to generate something new?
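Part of the answer is that sampling doesn't jump straight to the network's first guess: each iteration takes only a small step toward the estimated clean image and then re-injects a little fresh noise, so early mistakes can be revised on later steps. A rough sketch of that reverse loop, loosely following DDPM (function and variable names are my own, illustrative only):

```python
# Sketch of the reverse (sampling) loop: repeatedly estimate the noise,
# step only partway toward a cleaner image, and add back a little fresh
# noise so later iterations can correct earlier estimates.
import torch

@torch.no_grad()
def sample(model, shape, betas):
    alphas = 1 - betas
    alphas_cumprod = torch.cumprod(alphas, dim=0)
    x = torch.randn(shape)                          # start from pure noise
    for t in reversed(range(len(betas))):
        eps_pred = model(x).view(shape)             # estimate the noise currently in x
        a_t, ab_t = alphas[t], alphas_cumprod[t]
        # small denoising step (the mean of p(x_{t-1} | x_t)),
        # not a full jump to the network's guess of the clean image
        x = (x - betas[t] / (1 - ab_t).sqrt() * eps_pred) / a_t.sqrt()
        if t > 0:
            x = x + betas[t].sqrt() * torch.randn_like(x)   # re-inject fresh noise
    return x

# e.g. with the toy model and schedule from the training sketch above:
# img = sample(model, (1, 3, 32, 32), torch.linspace(1e-4, 0.02, 1000))
```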
You cannot subtract out random noise. Random noise minus random noise is just different random noise.
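A quick numerical check of that point (illustrative only): the difference of two independent noise samples is still noise, just with twice the variance.

```python
import numpy as np

a = np.random.randn(1_000_000)   # one random noise sample
b = np.random.randn(1_000_000)   # a different, independent noise sample
print(np.var(a - b))             # ~2.0: nothing cancels, it's still noise
```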
Can someone translate this?
Why Noise? Because it is something like a pixelated disintegration of the image that can be stored mathematically? So it's easier to compare its structure with other images?
How does it "know" what a frog looks like when I give it the text "frog"?
It uses a GPT-style transformer embedding of the prompt text.
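More concretely, the prompt is turned into a sequence of embedding vectors by a pretrained text encoder, and those vectors are what the denoising network is conditioned on. A minimal sketch using a CLIP text encoder, which is roughly what Stable Diffusion v1 does; the exact checkpoint name and shapes here are just to illustrate the idea:

```python
# The prompt becomes a sequence of embedding vectors via a pretrained
# text encoder; these vectors condition the denoiser at every step.
from transformers import CLIPTokenizer, CLIPTextModel

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
text_encoder = CLIPTextModel.from_pretrained("openai/clip-vit-large-patch14")

tokens = tokenizer(["a photo of a frog"], padding=True, return_tensors="pt")
embedding = text_encoder(**tokens).last_hidden_state   # (1, num_tokens, 768)
print(embedding.shape)
```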
Can any of you AI losers tell me how AI taking the jobs of actors and scriptwriters is any different from AI art? Why do you hold the labour and monetary gains of one group of artists above those of another, often poorer, group of artists? Many commission artists right now rely on the money they make from their work to survive; it's a shame many of them will be replaced by AI, and equally shameful that you don't seem to care. The theft of labour for the gain of those who control an AI capable of creating the greatest soulless piece of garbage seems like a negative to me in both cases, AI art as well as AI acting and scriptwriting. Sorry to break the daydream, but we still live in a capitalist society, meaning people still need money to live, and even if we didn't, do you not believe the labourers whose work is used within the AI should be compensated? Why is it that within a socialist community I have found people willing to rob others of their livelihood and labour for their own greed? Doesn't seem very socialist to me…
My final opinion on AI art is that if you want to use it, it should have to be licensed, with a legally mandatory charge: a payment to each artist whose work is put into it, every single time it is run through an AI. Too expensive for anyone to afford, let alone be worth using, or too difficult to trace because too many people's work was scanned? That's just too bad; you'll have to either learn art yourself or buy from a person who actually creates through the labour the AI depends on to function. If we are to replace creatives with machines, then the creatives should be paid in full for their work so they can go on creating. And if those conditions cannot be met, then the use of AI art in the media, and the sale of AI art, should be prevented through legal means as well as through TOS. As for the use of AI art for one's own personal enjoyment, such as posters for your own room, a personal phone wallpaper, or anything kept to yourself or a small in-group like friends or family, that is essentially unavoidable, and while still damaging to the art community, the actual art community, it is damage that cannot really be stopped in any reasonable fashion. I also believe it should be legally mandatory for AI art to be labelled as AI art, and that it should not be allowed in areas such as journalism and other forms of media that are meant to report on reality; I have already found highly misleading AI clickbait photos in news articles (and we wonder why we've been seeing a massive spike in right-leaning conspiracy theorists and people detached from reality).
the whole "it injects an embedding from the input string" is a bit glossed over. so its just back to using a GAN, or what? the whole point is that it's generating images based on this input string, and it feels like you didn't talk about it at all how it does that
So, to sum it up in one word: PHOTOBASHING
(It would have been ethically better to have paid the artists instead of taking their work off the internet without permission to train these models)
This was a pretty awesome explanation! Thank you!
Computerphile, this is very good and intuitive 😊
This is how DALL-E works in a nutshell:
"Read user prompt. Decide it's against their arbitrary moral codex. Emit error."
Excellent vid btw. Explained something complex in a very easy way.
Amazing explanation. Thank you!
Hello, can DALL-E and MidJourney be considered DCGANs?
Here's a question. Let's assume we have a nicely trained diffusion model. Then we take an image I, add noise to it, and get I_noised. If we apply the diffusion network to I_noised, can we really get the original image I back, or just a realistic image similar to the training set? I guess we cannot get the same original image, because whenever you train on an image you add RANDOM Gaussian noise, so if you train on an image for 100 epochs you train with 100 different versions of I_noised. And I_noised is in fact almost pure multivariate Gaussian noise and carries essentially no information about the original image. The intermediate process can learn the statistics of going from t to t-1, but you cannot get the original image I_0 back from I_noised = I_T. So, in short, the reverse network learns how to convert the probability distribution of I_t to that of I_(t-1), so that finally I_1 is similar to the distribution of the original dataset, but it cannot reconstruct the specific original image.
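That matches the maths of the forward process: it has the closed form x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps, and for large t the weight on x_0 is essentially zero, so x_T keeps almost nothing of the specific original image and the reverse process can only sample something plausible. A small sketch with a typical linear schedule (the numbers are illustrative assumptions):

```python
# How much of the original image survives the forward noising process:
# by the final step the weight on x_0 is ~0.006, i.e. essentially nothing,
# so the exact original cannot be recovered from x_T, only a plausible sample.
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)          # a typical linear schedule
alpha_bar = torch.cumprod(1 - betas, dim=0)

for t in [0, 100, 500, 999]:
    signal = alpha_bar[t].sqrt().item()         # weight on the original image x_0
    noise = (1 - alpha_bar[t]).sqrt().item()    # weight on the added noise
    print(f"t={t:4d}  signal weight={signal:.4f}  noise weight={noise:.4f}")
```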
All i got from this video is…NOISE
I think a lot of the fear artists have comes from the failure of sites like this to explain what the algorithms are doing in the initial stages, how they are trained, and what is actually happening at each step to alter the image file. So far I haven't seen a single site explain that stuff beyond "the AI is trained on many images", so artists take that as "it's stealing pieces of my art" when it isn't doing that at all. I'm honestly tired of explaining this stuff to angry idiots on social media.
Please explain that part better. Even lawyers and media companies don't seem to understand this aspect of these image generators, further feeding the fear of artists.
I love that he's doing all of this on 1980s printer paper. Proper geek
You did a great job explaining how the process works and provided visual examples. Nice work with this video.
I wish I could understand this. You must be a genius!
imagine if you had used dall-e to create visuals for this instead of just drawing the same "useful" box on paper 30 times?