The Evolution of Deep Learning: Ian Goodfellow on GANs, Neural Networks, and AI Security

In this illuminating conversation with Lex Fridman, Ian Goodfellow—the mind behind Generative Adversarial Networks (GANs) and co-author of the definitive textbook on deep learning—shares his perspectives on the current state and future direction of artificial intelligence. From the foundational concepts of deep learning to the revolutionary impact of adversarial networks, Goodfellow offers both technical insights and philosophical reflections on what it means to build truly intelligent systems.

Understanding Deep Learning's Place in AI

Goodfellow begins by addressing how deep learning fits within the broader landscape of artificial intelligence. Referencing the "Russian doll" diagram from his textbook, which positions deep learning as a subset of representation learning, itself a subset of machine learning and ultimately of AI, he acknowledges the current limitations of deep learning while remaining optimistic about its potential.

"One of the biggest limitations of deep learning is that right now it requires really a lot of data, especially labeled data," Goodfellow explains. While unsupervised and semi-supervised learning algorithms can reduce the amount of labeled data needed, they still require substantial unlabeled data. Similarly, reinforcement learning requires millions of experiences—far more than a human would need to learn comparable skills.

Goodfellow challenges the traditional view of deep learning as merely learning representations. Instead, he conceptualizes neural networks as multi-step programs that refine their understanding through sequential processing.

"I think of deep learning as basically learning programs that have more than one step," he says. "What we got with deep learning was really the ability to have steps of a program that run in sequence...I think of it as a program that makes several updates and arrives at better and better understandings."

This perspective represents a significant shift from earlier thinking about deep learning. When deep learning first gained academic attention in 2006, researchers thought of each layer as learning different levels of abstraction—edges in the first layer, corners in the second, and eventually object recognition in higher layers. Today's understanding is more nuanced, viewing neural networks as refining representations over multiple iterations.

Beyond Gradient Descent: The Future of Learning Algorithms

While back-propagation and gradient-based methods have been central to deep learning's success, Goodfellow believes we may discover more efficient learning algorithms in the future. He envisions systems that could update themselves with new information much more rapidly than current approaches.

"If I tell you the meeting today is at 3 p.m., I don't need to say over and over again 'it's at 3 p.m., it's at 3 p.m., it's at 3 p.m.' for you to do a gradient step on each one. You just hear it once and you remember it," Goodfellow points out.

This limitation of current deep learning systems might be overcome through new optimization approaches or different ways of applying existing optimization algorithms. Goodfellow suggests that techniques like self-attention and memory-based mechanisms offer promising directions, though he believes we haven't yet "nailed it" in terms of replicating human-like short-term memory.

When asked about Geoff Hinton's suggestion to "throw away back-propagation and start all over," Goodfellow takes a balanced view. He believes back-propagation will remain useful for specific applications, similar to how algorithms like logistic regression continue to serve important functions in certain domains despite no longer being at the cutting edge of AI research.

The Birth and Evolution of Generative Adversarial Networks

One of the most fascinating segments of the conversation revolves around Goodfellow's creation of Generative Adversarial Networks (GANs), which he developed after a debate with friends at a bar. While his colleagues were skeptical that the approach would work, Goodfellow went home and implemented it that same night, proving its viability.

Goodfellow explains GANs as a two-player game where a generator creates data (like images) and a discriminator tries to determine whether that data is real or fake. As the two networks compete, the generator becomes increasingly adept at creating realistic outputs that the discriminator cannot distinguish from real data.

"What's specific about GANs is that we have a two-player game in the game theoretic sense, and as the players in this game compete, one of them becomes able to generate realistic data," Goodfellow explains. "The discriminator takes images as input and guesses whether they're real or fake... As the two players compete in this game, the discriminator tries to become better at recognizing whether images are real or fake, and the generator becomes better at fooling the discriminator."

Since their introduction in 2014, GANs have evolved dramatically. Goodfellow highlights several milestones:

  • The original GAN paper demonstrated the concept worked, though the results were primitive
  • LAPGAN (Laplacian pyramid GAN), from Facebook AI Research, achieved high-resolution photo generation for the first time
  • DCGAN (Deep Convolutional GAN) provided a recipe for more realistic image generation, sparking what Goodfellow calls "the Cambrian explosion of GANs"

Beyond just generating increasingly realistic images, GANs have found applications in semi-supervised learning, fairness in machine learning, and even creating differentially private synthetic data for sensitive domains like healthcare.

The Paradox of Generative Models

Goodfellow expresses genuine amazement at how well generative models work, despite theoretical reasons suggesting they shouldn't be as effective as they are.

"If they really did what we asked them to do, they would do nothing but memorize the training data," he points out. "The way that you obtain the maximum likelihood for a specific training set is you assign all of your probability mass to the training examples and nowhere else."

Yet generative models produce novel, compelling images rather than just memorizing or producing garbage. Goodfellow suggests part of the explanation might lie in the architecture of models like convolutional networks, which inherently capture something fundamental about the structure of images even before learning begins.
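Goodfellow's maximum-likelihood point can be verified with a toy discrete example: among all distributions, the empirical distribution of the training set (pure "memorization") literally maximizes training-set likelihood. The dataset and the smoothed alternative below are invented for illustration.

```python
import math
from collections import Counter

train = [0, 1, 1, 2]  # toy training set

# Empirical ("memorizing") distribution: all mass on the seen values.
counts = Counter(train)
empirical = {x: c / len(train) for x, c in counts.items()}

# A smoothed model that also reserves mass for an unseen value, 3.
smoothed = {0: 0.25, 1: 0.25, 2: 0.25, 3: 0.25}

def log_likelihood(dist, data):
    # Values outside the model's support get a vanishing probability.
    return sum(math.log(dist.get(x, 1e-12)) for x in data)

ll_empirical = log_likelihood(empirical, train)
ll_smoothed = log_likelihood(smoothed, train)
```

The smoothed model generalizes (it assigns probability to the unseen value) but pays for it in training likelihood, which is exactly why the generalization of deep generative models is theoretically surprising.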

Security, Fairness, and the Future of AI

Looking toward the future, Goodfellow emphasizes several critical areas for development:

On the issue of "deep fakes" and synthetic media, Goodfellow is less concerned about the long-term impacts than the immediate future: "I think there will be a kind of bumpy cultural transition as people encounter this idea that there can be very realistic videos and audio that aren't real. I think 20 years from now, people will mostly understand that you shouldn't believe something is real just because you saw a video of it."

He believes authentication systems, rather than fake detection tools, will ultimately provide the solution. This might involve cryptographic signing of media at the source, such as within smartphone cameras.
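One way such source-side authentication could work is a signature computed over the media bytes inside the capture device itself. The sketch below uses a symmetric HMAC for brevity; a real deployment would presumably use asymmetric (public-key) signatures so anyone can verify without holding the secret. All names and the key-provisioning scheme here are assumptions, not details from the episode.

```python
import hmac
import hashlib

# Assumption: a per-device secret provisioned at manufacture (hypothetical).
DEVICE_KEY = b"hypothetical-per-device-secret"

def sign_media(media_bytes: bytes, key: bytes) -> str:
    """Produce an authentication tag over the raw media bytes."""
    return hmac.new(key, media_bytes, hashlib.sha256).hexdigest()

def verify_media(media_bytes: bytes, tag: str, key: bytes) -> bool:
    """Check the tag; any edit to the bytes invalidates it."""
    expected = sign_media(media_bytes, key)
    return hmac.compare_digest(expected, tag)

video = b"raw video frames..."
tag = sign_media(video, DEVICE_KEY)
```

Verification then distinguishes untouched footage from anything edited after capture, shifting the burden from detecting fakes to proving authenticity.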

Regarding fairness in AI, Goodfellow describes how adversarial approaches can help create models that don't inadvertently discriminate based on sensitive variables. He explains a technique similar to domain adversarial learning where one model component extracts features while another ensures these features cannot be used to infer protected attributes like gender.
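The domain-adversarial idea reduces to a single objective: the feature extractor minimizes its task loss while maximizing the loss of an adversary that tries to recover the protected attribute from the features. A minimal sketch of that trade-off follows; the weight `lam` and the numeric loss values are assumptions for illustration, not from the conversation.

```python
def debiasing_objective(task_loss: float, adversary_loss: float, lam: float = 1.0) -> float:
    """Feature-extractor objective: perform the task well (low task_loss)
    while making the protected attribute unrecoverable (high adversary_loss)."""
    return task_loss - lam * adversary_loss

# Features that leak the protected attribute let the adversary achieve a low
# loss, so the extractor is penalized relative to features that do not leak.
leaky = debiasing_objective(task_loss=0.3, adversary_loss=0.1)
private = debiasing_objective(task_loss=0.3, adversary_loss=0.7)
```

In practice this min-max objective is optimized with a gradient-reversal layer or alternating updates, as in domain-adversarial training.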

When asked about the path to human-level intelligence, Goodfellow emphasizes the importance of diverse environments for training agents: "I think that the agent really needs to interact and have a variety of experiences within the same lifespan... We don't really have an agent that goes seamlessly from one type of experience to another and really integrates all the different things that it does over the course of its life."

The Challenge of AI Security

Goodfellow concludes by identifying adversarial robustness as the problem he's most excited to solve: making machine learning systems secure against adversaries who want to interfere with or control them.

"Almost anything you can do with machine learning, you would like to make sure that people can't get it to do what they want rather than what you want just by showing it a funny QR code or a funny input pattern," he says.

He proposes that dynamic models—those that change with each prediction rather than remaining static after training—might offer better security by making it harder for adversaries to exploit consistent vulnerabilities.

Key Points

  1. Deep learning is best understood as learning multi-step programs rather than just representations, with each layer refining understanding rather than simply building higher levels of abstraction.
  2. Current deep learning systems require too much data compared to humans; developing algorithms that can learn from fewer examples remains a critical research direction.
  3. GANs represent a game-theoretic approach to generative modeling, where a generator network competes with a discriminator network to create increasingly realistic outputs.
  4. The surprising effectiveness of generative models challenges our theoretical understanding, as they produce novel outputs rather than simply memorizing training data.
  5. Adversarial approaches offer promising directions for improving fairness in AI by preventing models from using sensitive attributes even indirectly.
  6. Human-level AI will likely require agents that can seamlessly integrate diverse experiences across multiple domains within the same "lifespan."
  7. Making AI systems resistant to adversarial attacks—inputs specifically designed to fool models—represents one of the most important security challenges for the field.

Ian Goodfellow: Generative Adversarial Networks (GANs) | Lex Fridman Podcast #19
https://www.youtube.com/watch?v=Z6rxFNMGdn0
