Mohit Bansal

Shammie

Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level

Nov 15, 2024

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

Nov 07, 2024

Self-Consistency Preference Optimization

Nov 06, 2024

Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM

Nov 03, 2024

On Positional Bias of Faithfulness for Long-form Summarization

Oct 31, 2024

Unbounded: A Generative Infinite Game of Character Life Simulation

Oct 24, 2024

Teaching Models to Balance Resisting and Accepting Persuasion

Oct 18, 2024

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation

Oct 16, 2024

Adapt-$\infty$: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection

Oct 14, 2024

LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints

Oct 09, 2024