Picture for Babak Damavandi

Babak Damavandi

Doppelgänger's Watch: A Split Objective Approach to Large Language Models

Add code
Sep 09, 2024
Viaarxiv icon

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Add code
Mar 07, 2024
Figure 1 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 2 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 3 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 4 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Viaarxiv icon

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Add code
Sep 27, 2023
Viaarxiv icon

Navigating Connected Memories with a Task-oriented Dialog System

Add code
Nov 15, 2022
Viaarxiv icon

Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation

Add code
Nov 08, 2022
Viaarxiv icon

IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text

Add code
Oct 26, 2022
Viaarxiv icon

Connecting What to Say With Where to Look by Modeling Human Attention Traces

Add code
May 12, 2021
Figure 1 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Figure 2 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Figure 3 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Figure 4 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Viaarxiv icon

SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations

Add code
Apr 18, 2021
Figure 1 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Figure 2 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Figure 3 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Figure 4 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Viaarxiv icon

NN-grams: Unifying neural network and n-gram language models for Speech Recognition

Add code
Jun 23, 2016
Figure 1 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 2 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 3 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 4 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Viaarxiv icon