Picture for Ishita Dasgupta

Ishita Dasgupta

SKALD: Learning-Based Shot Assembly for Coherent Multi-Shot Video Creation

Add code
Mar 11, 2025
Viaarxiv icon

Decoupling the components of geometric understanding in Vision Language Models

Add code
Mar 05, 2025
Viaarxiv icon

The in-context inductive biases of vision-language models differ across modalities

Add code
Feb 03, 2025
Viaarxiv icon

ReMI: A Dataset for Reasoning with Multiple Images

Add code
Jun 13, 2024
Figure 1 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 2 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 3 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 4 for ReMI: A Dataset for Reasoning with Multiple Images
Viaarxiv icon

HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances

Add code
Mar 04, 2024
Figure 1 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Figure 2 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Figure 3 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Figure 4 for HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Viaarxiv icon

How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?

Add code
Feb 13, 2024
Viaarxiv icon

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Add code
Feb 12, 2024
Figure 1 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 2 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 3 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 4 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models

Add code
Nov 01, 2023
Figure 1 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Figure 2 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Figure 3 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Figure 4 for A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
Viaarxiv icon

The Impact of Depth and Width on Transformer Language Model Generalization

Add code
Oct 30, 2023
Figure 1 for The Impact of Depth and Width on Transformer Language Model Generalization
Figure 2 for The Impact of Depth and Width on Transformer Language Model Generalization
Figure 3 for The Impact of Depth and Width on Transformer Language Model Generalization
Figure 4 for The Impact of Depth and Width on Transformer Language Model Generalization
Viaarxiv icon