SC 395: Image Generative Models in Computer Vision

Overview

Course Description

This short course provides a rigorous overview of the current state-of-the-art in generative modeling, transitioning from foundational adversarial techniques to modern diffusion and flow-based paradigms.

Designed for senior undergraduate and graduate students, the curriculum balances theoretical derivation (SDEs, ODEs, Flow Matching) with practical architectural implementation (Diffusion Transformers, LoRA, ControlNet). The course concludes with an exploration of frontier applications in engineering sciences and the ethical implications of synthetic media.

Prerequisites

Probability Theory (Probability Distributions, Conditional Probability, Gaussian Distribution, Divergence)
Linear Algebra (Matrix decompositions, Vector spaces)
Deep Learning Fundamentals (CNNs, Transformers, Backpropagation) & Python / PyTorch

📚 Refresher Materials

🎲

Probability Theory CS229 Stanford (PDF)

📐

Linear Algebra CS229 Stanford (PDF)

🧠

Deep Learning Fundamentals (Slides)

🔥

Intro to PyTorch Framework Basics (Slides)

Curriculum & Schedule

Syllabus

01

Introduction to Generative Modeling
Monday • 5:00 PM - 6:20 PM • Venue: AB 7/103

Introduction and Motivation: why studying generative models is important?, Taxonomy of Generative Models (Implicit vs. Explicit), Likelihood Maximization.

Slides

📖 Reading List

📄
GAN Tutorial Goodfellow (2016)
📄
GAN Paper Goodfellow et al. (2014)
📄
GAN Inversion Xia et al. (2021)
📄
StyleGAN2 Karras et al. (2020)
📄
Clean-FID Parmar et al. (2022)
📄
GigaGAN Kang et al. (2023)
02

Generative Advrsarial Networks
Wednesday • 5:00 PM - 6:20 PM • Venue: AB 7/102

Adversarial Learning Dynamics (GAN Min-Max objective), WGAN, and the StyleGAN Paradigm (Disentanglement, AdaIN), GAN Applications.

Slides

📖 Reading List

📄
GAN Tutorial Goodfellow (2016)
📄
GAN Paper Goodfellow et al. (2014)
📄
GAN Inversion Xia et al. (2021)
📄
StyleGAN2 Karras et al. (2020)
📄
Clean-FID Parmar et al. (2022)
📄
GigaGAN Kang et al. (2023)
03

Diffusion Models
Friday • 5:00 PM - 6:20 PM • Venue: AB 7/102

GANs Summary, Auto-encoders, Diffusion Models, Mathematical derivation of DDPM objective.

Slides

📖 Reading List

📄
Unified Perspective Luo (2022)
📄
DDPM Paper Ho et al. (2020)
📄
DDIM Paper Song et al. (2020)
📺
How I understand DM? Concept Video
📺
DDPM vs. DDIM Comparison Video
📺
Math of Diffusion Full Course Playlist
04

Advances in Diffusion Models
Monday • 5:00 PM - 6:20 PM • Venue: AB 7/101

DDPM Implementation, Conditional Generation with Diffusion Models, Latent Diffusion Models, Faster Inference and Distillation.

Slides

📖 Reading List

📄
Latent Diffusion Models Rombach et al. (2021)
📄
Classifier-free Guidance Ho & Salimans (2022)
📝
Diffusion Distillation Sander Dieleman (2024)
📝
LoRA Implementation Hugging Face Blog
🌐
Diffusion into a GAN Kang et al. (Project Page)
05

Flow Matching & Modern Architectures (DiT)
Friday • 5:00 PM - 6:20 PM • Venue: AB 7/102

Math Preliminaries of Flow Matching (ODEs, Vector Fields, and Probability Paths). Diffusion Transformers (DiT)

Slides

📖 Reading List

📺
Flow Matching Intro Concept Video
🌐
Flow Matching Notes MIT Course
📺
Advances in Flow Research Talk
🌐
Diffusion vs. Flow Project Page
Supplementary Lectures Monday

S1

Applications of Diffusion Models
5:00 PM - 6:00 PM

Supplementary Lecture covering the applications of Diffusion Models.

Slides

S2

Theory of Diffusion Models
6:00 PM - 7:00 PM

Supplementary Lecture covering the theory of Diffusion Models.

Slides
06

Frontiers in Science, Engineering & Ethics
Friday, 5 pm to 6:20 pm • Venue: AB 7/102

Part 1: Cultural Inconsistencies and Ethical Issues in Image Generative Models
Discussion on cross-cultural performance gaps, fairness metrics, bias mitigation, and safety in generative AI.

Slides (Part 1)

Guest Lecture (Part 2)
Need for Sovereign AI Foundation Models for India: BharatGen Story

Dr. Maneesh Singh VP of Machine Learning, BharatGen

Slides (Part 2): Available on request (via email)

Image Generative Models in Computer Vision

Course Description

Syllabus

Introduction to Generative Modeling

Generative Advrsarial Networks

Diffusion Models

Advances in Diffusion Models

Flow Matching & Modern Architectures (DiT)

Applications of Diffusion Models

Theory of Diffusion Models

Frontiers in Science, Engineering & Ethics

Laboratory Sessions

Lab A: Foundations of GANs and Diffusion Models

Lab B: Flow Matching