Picture for Arash Eshghi

Arash Eshghi

Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs

Add code
Oct 26, 2024
Viaarxiv icon

Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models

Add code
Sep 21, 2024
Viaarxiv icon

Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling

Add code
Sep 09, 2024
Figure 1 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 2 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 3 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 4 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Viaarxiv icon

AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Add code
Jun 19, 2024
Viaarxiv icon

Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers

Add code
Apr 21, 2024
Viaarxiv icon

Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

Add code
Nov 07, 2023
Viaarxiv icon

Learning to generate and corr- uh I mean repair language in real-time

Add code
Aug 22, 2023
Viaarxiv icon

No that's not what I meant: Handling Third Position Repair in Conversational Question Answering

Add code
Jul 31, 2023
Viaarxiv icon

'What are you referring to?' Evaluating the Ability of Multi-Modal Dialogue Models to Process Clarificational Exchanges

Add code
Jul 28, 2023
Viaarxiv icon

The Dangers of trusting Stochastic Parrots: Faithfulness and Trust in Open-domain Conversational Question Answering

Add code
May 25, 2023
Viaarxiv icon