Picture for Chitta Baral

Chitta Baral

Shammie

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving

Add code
Feb 22, 2025
Viaarxiv icon

Dual Caption Preference Optimization for Diffusion Models

Add code
Feb 09, 2025
Viaarxiv icon

Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning

Add code
Feb 08, 2025
Figure 1 for Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
Figure 2 for Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
Figure 3 for Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
Figure 4 for Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
Viaarxiv icon

TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives

Add code
Nov 04, 2024
Figure 1 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 2 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 3 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 4 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Viaarxiv icon

ToW: Thoughts of Words Improve Reasoning in Large Language Models

Add code
Oct 21, 2024
Figure 1 for ToW: Thoughts of Words Improve Reasoning in Large Language Models
Figure 2 for ToW: Thoughts of Words Improve Reasoning in Large Language Models
Figure 3 for ToW: Thoughts of Words Improve Reasoning in Large Language Models
Figure 4 for ToW: Thoughts of Words Improve Reasoning in Large Language Models
Viaarxiv icon

VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks

Add code
Oct 17, 2024
Figure 1 for VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Figure 2 for VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Figure 3 for VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Figure 4 for VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Viaarxiv icon

ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions

Add code
Oct 17, 2024
Figure 1 for ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Figure 2 for ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Figure 3 for ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Figure 4 for ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions
Viaarxiv icon

Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?

Add code
Oct 17, 2024
Figure 1 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Figure 2 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Figure 3 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Figure 4 for Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?
Viaarxiv icon

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models

Add code
Aug 05, 2024
Viaarxiv icon

Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?

Add code
Jul 20, 2024
Viaarxiv icon