Stanford CS348K, Spring 2026

VISUAL COMPUTING SYSTEMS

Visual and spatial computing tasks such as computational photography, 3D graphics, generative-AI-based world creation (images, videos, 3D scenes, interactive world models), and embodied agentic AI are key responsibilities of modern computer systems ranging from sensor-rich smartphones and autonomous robots to large datacenters. These workloads demand exceptional system efficiency. This course examines the key ideas, techniques, and challenges associated with the design of parallel, heterogeneous systems that accelerate visual and spatial computing applications. This course is intended for systems students interested in architecting efficient graphics and visual AI platforms (both new hardware architectures and domain-specific programming frameworks for these platforms) and for graphics, vision, robotics, and AI students who wish to understand throughput computing principles in order to design new algorithms that map efficiently to these platforms.

Basic Info

Tues/Thurs 10:30-11:50pm

Location: Lathrop 016

Instructor: Kayvon Fatahalian

Welcome to CS348K Spring 2026. Please see the course info page for more info on policies and logistics, as well as answers to common questions like "Am I prepared to take this class?" This course is a paper-reading and in-class discussion-based course. Live attendance is expected of all participants.

Spring 2026 Schedule

Mar 31		Introduction: Importance of Goals and Constraints in System Design Broad survey of course topics, why establishing goals and constraints is so important to clear systems design and thinking. A few in-class design exercises.
Apr 02		Digital Camera Pipeline (Part I) - Algorithms Multi-shot alignment/merging, multi-scale processing with Gaussian and Laplacian pyramids, HDR (local tone mapping), advanced image understanding responsibilities of digital cameras (portrait mode, autofocus, etc.)
Apr 07		Digital Camera Pipeline (Part II) - Programming Abstractions The Frankencamera, modern camera APIs, intro to Halide programming language
Apr 09		Scheduling Image Processing Algorithms in Halide Key optimization ideas, a detailed look at Halide's scheduling algebra
Apr 14		Generating ML Kernels using LLMs + Domain Specific Languages for Targeting Tensor Cores Trends in using LLMs to write efficient ML kernel code, motivation for DSLs like CuTile, ThunderKittens, and Triton.
Apr 16		The Design of Automatic Differentiation Primitives in Slang Guest lecture by Yong He (NVIDIA). How the Slang language provides flexible support for auto-differentiation. Understanding the difference between mechanism and policy in system design.
Apr 21		Designing 2D/3D Procedural Models for AutoDiff Recovering 3D content with differentiable rendering, differentiable procedural modeling
Apr 23		Making Useful Controls for Visual Generative AI The importance of predictable control in content creation. Techniques for inserting new forms of control into generative image synthesis, the role of human-interpretable abstractions, neurosymbolic representations, and the role of code as a representation for content.
Apr 28		Controlling Generative AI (Part II) Continuing our discussion of control for visual and spatial generative AI
Apr 30		Neural Postprocessing: Making a Renderer's Output Look Photoreal using AI Guest lecture from Miloš Hašan (NVIDIA). Basics of techniques like DLSS. How the role of a modern and future renderer is to produce inputs for a final AI post-processing pass.
May 05		Generating 3D Objects and 3D Scenes Techniques for generating complex 3D scenes. Generating code vs generating content. Direct 3D generation architectures, recovering 3D training data from image and video models.
May 07		Introduction to World Models Action-conditioned video models, non-pixel-based models, latent world representations, the challenge of maintaining long-term coherence, and what applications a world model is good for anyway (Entertainment? Training AI agents?)
May 12		Autoregressive, Action-Conditioned Video Models (Part II) Guest lecture by Ryan Po (Stanford)
May 14		Automating Design Feedback Using Agents The role of AI agents in dealing with the increasing velocity of 3D content creation.
May 19		Building Blocks for 3D World Generation Guest lecture by Ben Mildenhall and Justin Johnson (WorldLabs)
May 21		How Generative AI is Being Used to Create Interactive Worlds at Roblox Guest lecture by Kiran Bhat (Roblox)
May 26		High-Performance Simulators for Training AI Agents Fast simulation platforms for training agents in virtual worlds, batch simulation concepts and challenges.
May 28		Performance Optimizations for Autoregressive Video Models How a combination of model distillation, KV caching, and token/feature caching is leading to real-time interactive video models.
Jun 02		Final Project Presentation Day Have a great summer!

Assignments

Jun 2

Term Project Information