Picture for Aditya Grover

Aditya Grover

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

Add code
Mar 15, 2025
Viaarxiv icon

VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation

Add code
Mar 09, 2025
Viaarxiv icon

Enabling Autoregressive Models to Fill In Masked Tokens

Add code
Feb 09, 2025
Viaarxiv icon

MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants

Add code
Dec 17, 2024
Viaarxiv icon

OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows

Add code
Dec 02, 2024
Figure 1 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 2 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 3 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Figure 4 for OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
Viaarxiv icon

Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback

Add code
Oct 30, 2024
Viaarxiv icon

PopAlign: Population-Level Alignment for Fair Text-to-Image Generation

Add code
Jun 28, 2024
Viaarxiv icon

LICO: Large Language Models for In-Context Molecular Optimization

Add code
Jun 27, 2024
Viaarxiv icon

Probing the Decision Boundaries of In-context Learning in Large Language Models

Add code
Jun 17, 2024
Viaarxiv icon

VideoPhy: Evaluating Physical Commonsense for Video Generation

Add code
Jun 05, 2024
Figure 1 for VideoPhy: Evaluating Physical Commonsense for Video Generation
Figure 2 for VideoPhy: Evaluating Physical Commonsense for Video Generation
Figure 3 for VideoPhy: Evaluating Physical Commonsense for Video Generation
Figure 4 for VideoPhy: Evaluating Physical Commonsense for Video Generation
Viaarxiv icon