Language

Semantic Pyramid for Image Generation



Henry AI Labs

This video explores a new GAN model for generating images by conditioning them on features from pre-trained image classifiers! This is really interesting for visualizing what is contained in pre-trained image classifiers as well as controllable image editing. The authors also show that this can be used for semantic image composition such as copying a tree and pasting it into a snow landscape or image relabeling by changing the embedded logit from the pre-trained classifier to produce an image of a new class while retaining as much of the original image as possible.
Thanks for watching! Please Subscribe!

Paper Links:
Semantic Pyramid for Image Generation: https://arxiv.org/abs/2003.06221
Corresponding Github Page: https://semantic-pyramid.github.io/supmat.html
Neural Style Transfer: https://arxiv.org/pdf/1508.06576.pdf
Zoom In: An Introduction to Circuits: https://distill.pub/2020/circuits/zoom-in/
EfficientDet: https://arxiv.org/pdf/1911.09070.pdf
Generative Teaching Networks: https://arxiv.org/pdf/1912.07768.pdf
DermGAN: https://arxiv.org/pdf/1911.08716.pdf
Classification Accuracy Score for Conditional Generative Models: https://arxiv.org/pdf/1905.10887.pdf
GauGAN: https://arxiv.org/pdf/1903.07291.pdf
SinGAN: https://arxiv.org/pdf/1905.01164.pdf
StyleGAN2 Distillation: https://arxiv.org/pdf/2003.03581.pdf
Semi-Supervised StyleGAN for Disentanglement Learning: https://arxiv.org/pdf/2003.03461.pdf

Thanks for watching! Please Subscribe!