Picture for Maneesh Agrawala

Maneesh Agrawala

Pretraining Frame Preservation in Autoregressive Video Memory Compression

Add code
Dec 29, 2025
Viaarxiv icon

LouvreSAE: Sparse Autoencoders for Interpretable and Controllable Style Transfer

Add code
Dec 22, 2025
Viaarxiv icon

StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation

Add code
Nov 10, 2025
Figure 1 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Figure 2 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Figure 3 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Figure 4 for StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
Viaarxiv icon

Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration

Add code
Oct 25, 2025
Figure 1 for Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
Figure 2 for Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
Figure 3 for Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
Figure 4 for Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
Viaarxiv icon

Taming Flow-based I2V Models for Creative Video Editing

Add code
Sep 26, 2025
Viaarxiv icon

Mixture of Contexts for Long Video Generation

Add code
Aug 28, 2025
Viaarxiv icon

Captain Cinema: Towards Short Movie Generation

Add code
Jul 24, 2025
Viaarxiv icon

Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation

Add code
Jun 24, 2025
Viaarxiv icon

Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders

Add code
Jun 24, 2025
Viaarxiv icon

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Add code
Apr 21, 2025
Figure 1 for Packing Input Frame Context in Next-Frame Prediction Models for Video Generation
Figure 2 for Packing Input Frame Context in Next-Frame Prediction Models for Video Generation
Figure 3 for Packing Input Frame Context in Next-Frame Prediction Models for Video Generation
Figure 4 for Packing Input Frame Context in Next-Frame Prediction Models for Video Generation
Viaarxiv icon