Vox
How programmers turned the internet into a paintbrush. DALL-E 2, Midjourney, Imagen, explained.
Beginning in January 2021, advances in AI research have produced a plethora of deep-learning models capable of generating original images from simple text prompts, effectively extending the human imagination. Researchers at OpenAI, Google, Facebook, and others have developed text-to-image tools that they have not yet released to the public, and similar models have proliferated online in the open-source arena and at smaller companies like Midjourney.
These tools represent a massive cultural shift because they remove the requirement for technical labor from the process of image-making. Instead, they select for creative ideation, skillful use of language, and curatorial taste. The ultimate consequences are difficult to predict, but — like the invention of the camera, and the digital camera thereafter — these algorithms herald a new, democratized form of expression that will commence another explosion in the volume of imagery produced by humans. But, like other automated systems trained on historical data and internet images, they also come with risks that have not been resolved.
The video above is a primer on how we got here, how this technology works, and some of the implications. And for an extended discussion about what this means for human artists, designers, and illustrators, check out this bonus video: https://youtu.be/sFBfrZ-N3G4
Midjourney: www.midjourney.com
List of free AI Art tools: https://pharmapsychotic.com/tools.html
Sources:
https://arxiv.org/abs/1511.02793
https://arnicas.substack.com/p/titaa-28-visual-poetry-humans-and?s=r
https://va2rosa.medium.com/copyright-storm-authorship-in-the-age-of-ai-baba554aa617
https://tedunderwood.com/2021/10/21/latent-spaces-of-culture/
https://medium.com/artists-and-machine-intelligence/a-journey-through-multiple-dimensions-and-transformations-in-space-the-final-frontier-d8435d81ca51
https://jxmo.notion.site/The-Weird-and-Wonderful-World-of-AI-Art-b9615a2e7278435b98380ff81ae1cf09
https://ml.berkeley.edu/blog/posts/clip-art/
https://multimodal.art/
https://openai.com/blog/dall-e/
https://openai.com/blog/clip/
https://openai.com/dall-e-2/
https://laion.ai/laion-5b-a-new-era-of-open-large-scale-multi-modal-datasets/
https://arxiv.org/abs/2110.01963
Vox.com is a news website that helps you cut through the noise and understand what’s really driving the events in the headlines. Check out http://www.vox.com
My God, what have humans created 🤯
She's using a mask filter 😂
As an artist who relies heavily on making AI-generated art through Midjourney, I found this an interesting insight into the process.
Now, almost 1 year since you dropped this video, how do you feel about the state of the AI art industry?
Watching this video 10 months after it was released feels like watching a documentary from tens of years ago.
How can I use this video for education?
Was that one lady seriously wearing a mask in her house, alone, by herself, filming a video? (proof of clown world). I'd bet the farm she voted for Biden.
Anyone can do this, even people who are not artists. The machine is the artist, not the person typing. Real art will live on
omg prompting is going to teach humans how to traverse space and dimension
If an artist uses a nuclear bomb to generate a piece of art, it doesn't automatically justify the use of nuclear bombs. Similarly, AI image generators should not be used in art creation, as they will inevitably cause so much damage to the creative industries. It's so sad to see this happening. Prompting an AI does NOT make someone an artist. Real artists don't need AI to make their art for them. Artists use tools to make their art. An AI image generator is NOT a tool; it is taking the place of the artist and devaluing and undermining the real human artistic process.
I get the feeling that AI is an inter-dimensional intelligence manifesting in the physical realm through the manipulation of human technology in order to achieve its goals in the physical realm. The goal appears to be to further separate humans from nature and indeed our true nature. In other words, the "Garden of Eden" to a digital dystopia. Piggybacking on the law of human manifestation, leading to a reality where those spiritual beings have control over the physical realm via AI. Anyone who has seen what you see on DMT or psilocybin may have some idea of what I'm talking about.
Excellent
Wait, why is she using a mask in her own home?
What is this version of Clair De Lune you guys played? Help me name the remix/cover @vox please!
I can’t wait for AI to remove masks from everything that we recorded in that embarrassing time in our lives. I just have a hard time not being bitter about seeing masks. Not in public but on videos.
So the AI learned that humans are racist, sexist, and deplorable? LOL 😂
I’m thinking how gullible I’ve been before watching videos like this one. I saw all of this imaginative, beautifully executed art by talented artists. When in reality the amazing art was done with artificial intelligence. Time goes by and humans become increasingly irrelevant.
Point blank answer is that technology will also replace artists just like all the other professionals it has already replaced. Self checkout machines in supermarkets have reduced the need for cashiers, surveillance cameras have reduced the manning by security guards etc. The only people that shall stay afloat are those that stay on top of the game and find ways to harness and take advantage of such technology before it takes advantage of them.
Interesting. When the video explains the latent space the model uses to process and produce images, it sounds to me like a space where it can, in a sense, learn and experience 3D. It's interesting because computers have been restricted to a 1- and 2-dimensional world, communicating with our world through algorithms and metrics, but with that latent space they can somewhat understand the 3D world and make models. Imagine a Matrix or Tron type scenario where you go into the computer; it would make communication and interaction with the AI quite interesting.
These things are still basic and non-pervasive. The idea of text-to-image is both inherently novel and trivial.
How come DALL-E is monetized but Microsoft Bing Image Creator is totally free and works just as well!?
Text to video
Text to audio
Only a matter of time, that'd be cool
I wonder how things have changed in the 1 year since this video was made. AI has improved significantly in the past few months
If it's Western AI, based on Western people and their mindsets, it's certain that it tries to depict a Western worldview. Don't act like this is some completely foreign, scandalous thing to you when it's not lol.
Excellent video! Who made this one? I want to give credit.
That's great, but does anyone know what version of Clair De Lune they used in the video?
who is watching this in 2023 July, after Stable diffusion XL 1.0 got released?
Goodbye actors, artists, musicians and writers.
Just watched this today while trying to learn a way to explain this tech to non technical people I know and I am really impressed. Excellent work with a great breakdown with some awesome visuals.
I wonder which side Vox is on. I remember watching this video with the title "The text to image revolution, explained".
5:45 did anyone ever find this guy's album
The last couple of days I have been creating images with ChatGPT-4 + DALL-E 3, and watching this video now I realize that, even though it's only one year old, DALL-E 2 was the Wright brothers' airplane, while ChatGPT-4 + DALL-E 3 is a space shuttle. That is how astonishingly fast the evolution is going. The S curve is going bananas. How long will we improve at exponential speeds? Nobody knows. What will the cutting-edge AI technology be when the S curve starts slowing? Nobody knows. For now, enjoy the ride before all the negative aspects of this hit you like a crippling anxiety.
Dall-E is here and it's mind-blowing!
🎯 Key Takeaways for quick navigation:
00:00 🤖 Introduction to AI art and image captioning
– AI development in image captioning.
– The curiosity to generate images from text prompts.
– Early challenges and experiments in generating novel scenes.
01:04 🎨 Advancements in AI-generated art
– Progress in AI-generated art from 2015 to 2021.
– The introduction of DALL-E and DALL-E 2 by OpenAI.
– The emergence of open-source developers creating text-to-image generators.
04:14 💬 Prompt engineering and creative interaction with AI models
– The concept of "prompt engineering" in communicating with AI models.
– Experimenting with various prompts and creative possibilities.
– The ease of accessibility and use of AI-generated art tools.
06:24 🧠 Understanding how AI models learn and create images
– Explaining how deep learning models learn and recognize images.
– Introduction to the latent space and its role in generating images.
– The generative process called diffusion in creating images from latent space.
08:27 🌌 Exploring the multidimensional latent space
– The complexity of multidimensional latent spaces.
– How latent space captures different variables and concepts.
– The idea that any point in latent space represents a recipe for an image.
10:06 🎨 Artistic implications and ethical considerations
– How AI can mimic artists' styles through prompts.
– The need for transparency in disclosing the use of AI in art creation.
– Copyright and ethical concerns related to AI-generated content.
11:36 🌐 Societal impact and biases in AI-generated content
– Recognizing biases and limitations in AI models.
– The diversity and representation issues in AI datasets.
– The uncharted territory of AI-generated content's impact on society.
12:36 🚀 The transformative potential of AI in creativity
– The broader implications of AI in human creativity.
– The removal of barriers between ideas and visual content.
– The unpredictable long-term consequences of AI in culture and communication.
Made with HARPA AI
i gotta appreciate how this video hits the sweet spot of being informative but also simple. it's not so complex that it's hard to understand, but it's not so simple that it becomes misinformation either
🎯 Key Takeaways for quick navigation:
00:00 🤖 In 2015, researchers explored the idea of generating images from text prompts, leading to the development of AI-generated art.
01:04 🎨 AI-generated art has evolved dramatically in recent years, with the ability to create novel and imaginative scenes based on text input.
03:14 💡 OpenAI introduced DALL-E, a text-to-image model, in 2021, and the field has seen significant advancements by independent developers.
04:14 🧙‍♂️ Crafting effective text prompts, known as "prompt engineering," is crucial for generating desired AI-generated images.
06:24 🧩 AI models operate in a high-dimensional "latent space," where each point represents a possible image recipe, allowing for diverse outputs.
09:26 🎨 AI-generated images are created through a generative process called diffusion, resulting in unique variations for the same prompt.
10:38 🖼️ The ethical considerations of using existing artworks as a dataset for AI art generation are still unresolved.
11:36 🌍 AI-generated art has the potential to reshape human creativity and communication, with both positive and negative consequences.
12:36 🚀 AI art is part of a broader transformation in how humans imagine, communicate, and interact with culture, with far-reaching implications.
It’s so amazing, yet so dangerous at the same time.