Arushi Goel

OMCAT: Omni Context Aware Transformer

Oct 15, 2024
Via arXiv

Audio Dialogues: Dialogues dataset for audio and music understanding

Apr 11, 2024
Via arXiv

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Feb 02, 2024
Via arXiv

Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in Clutter

Nov 09, 2023
Via arXiv

Semi-supervised multimodal coreference resolution in image narrations

Oct 20, 2023
Via arXiv

Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories

Jun 15, 2023
Via arXiv

Controllable Video Generation by Learning the Underlying Dynamical System with Neural ODE

Mar 09, 2023
Via arXiv

Who are you referring to? Weakly supervised coreference resolution with multimodal grounding

Nov 26, 2022
Via arXiv

WiCV 2022: The Tenth Women In Computer Vision Workshop

Aug 24, 2022
Via arXiv

WiCV 2021: The Eighth Women In Computer Vision Workshop

Mar 11, 2022
Via arXiv