Picture for Agrim Gupta

Agrim Gupta

MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation

Add code
Feb 18, 2025
Viaarxiv icon

PhaseMO: Future-Proof, Energy-efficient, Adaptive Massive MIMO

Add code
Jan 08, 2025
Figure 1 for PhaseMO: Future-Proof, Energy-efficient, Adaptive Massive MIMO
Figure 2 for PhaseMO: Future-Proof, Energy-efficient, Adaptive Massive MIMO
Figure 3 for PhaseMO: Future-Proof, Energy-efficient, Adaptive Massive MIMO
Figure 4 for PhaseMO: Future-Proof, Energy-efficient, Adaptive Massive MIMO
Viaarxiv icon

HourVideo: 1-Hour Video-Language Understanding

Add code
Nov 07, 2024
Figure 1 for HourVideo: 1-Hour Video-Language Understanding
Figure 2 for HourVideo: 1-Hour Video-Language Understanding
Figure 3 for HourVideo: 1-Hour Video-Language Understanding
Figure 4 for HourVideo: 1-Hour Video-Language Understanding
Viaarxiv icon

A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation

Add code
May 22, 2024
Figure 1 for A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Figure 2 for A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Figure 3 for A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Figure 4 for A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Viaarxiv icon

Densify & Conquer: Densified, smaller base-stations can conquer the increasing carbon footprint problem in nextG wireless

Add code
Mar 20, 2024
Viaarxiv icon

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Add code
Dec 21, 2023
Figure 1 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 2 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 3 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 4 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Viaarxiv icon

Photorealistic Video Generation with Diffusion Models

Add code
Dec 11, 2023
Viaarxiv icon

Holistic Evaluation of Text-To-Image Models

Add code
Nov 07, 2023
Figure 1 for Holistic Evaluation of Text-To-Image Models
Figure 2 for Holistic Evaluation of Text-To-Image Models
Figure 3 for Holistic Evaluation of Text-To-Image Models
Figure 4 for Holistic Evaluation of Text-To-Image Models
Viaarxiv icon

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Add code
Oct 09, 2023
Viaarxiv icon

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation

Add code
Jun 20, 2023
Figure 1 for RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation
Figure 2 for RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation
Figure 3 for RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation
Figure 4 for RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation
Viaarxiv icon