You Won't Believe But These Photos Were Generated By AI
We officially can no longer trust anything we see on the internet. AI is starting to distort reality â all with the help of machine learning. Researchers at NVIDIA have harnessed the power of a GAN â a class of neural network â to generate some extremely realistic faces. The results are more impressive than anything weâve seen before. These AI generated images are viral now.
Generative Adversarial Network (GAN)
A GAN can iteratively generate images based on genuine photos it learns from. Then it evaluates the new images against the original. In this instance, the researchers taught a GAN a number of âstylesâ. For example, faces modeled after subjects who were old, young, wearing glasses, or had different hairstyles. The results are spectacular. Even small seemingly random details like freckles, skin pores or stubble have convincing distribution in the images the project generated. Behind the new feature is a technique NVIDIA calls âstyle-mixing.â  âTo further encourage the styles to localize, we employ mixing regularization, where a given percentage of images are generated using two random latent codes instead of one during training. When generating such an image, we simply switch from one latent code to another â an operation we refer to as style mixing â at a randomly selected point in the synthesis network.â - From the paper. Â
Stochastic Variation
Stochastic variation is another key property allowing GANs to realize the randomization of detailed facial features, such as the placement of facial hair, stubble density, freckles, pores, etc. The paper proposes adding per-pixel noise after each convolution layer. The added noise does not affect the overall composition or the high-level attributes of images. And the changing noise in different layers produces matching stochastic variation results. To quantify interpolation quality and disentanglement, the paper proposes two new, automated methods â perceptual path length and linear separability â that are applicable to any generator architecture. Researchers saw impressive results using the new generator to forge images of bedrooms, cars, and cats. It was developed with the Large-scale Scene Understanding (LSUN) dataset. Alongside todayâs paper, NVIDIA has also released a huge new dataset of human faces. Flickr- Faces-HQ (FFHQ) contains 70,000 high-quality images at 1024 resolution. The dataset will soon be available to the public. Although Nvidiaâs results may be the best so far, there are still tiny discrepancies if you stare long enough.
Recognizing The Images
Kyle McDonald, an artist working with code. He published a list of things to look out for when trying to recognize an image generated by an AI system. âAt low resolutions, almost all the images in the paper are indistinguishable from photographs. There are only a few artifacts that stand out to me that I will try to address,â he said in a blog post. McDonald points out what he calls âthe missing earringâ problem. Often in the images, there is small circular glitch below the ears. This comes from the GAN attempting to add in earrings previously seen in photos. There are other slight oddities too. For example, like the asymmetry of facial features, lack of detailing in teeth, strange hair strands, and patterns in clothing.
The Future
Itâs not clear how easy it is to replicate Nvidiaâs results, however. A spokesperson said that the paper was currently under peer review. And the submission rules donât allow any preliminary discussions with the press until the paper has been published. According to the paper, however, there are 26.2 million parameters that can be tweaked during the training process, so itâs probably not a project to take on if you havenât got the money for the hardware and compute. Read the full article










