Picture for David M. Chan

David M. Chan

Analyzing The Language of Visual Tokens

Add code
Nov 07, 2024
Figure 1 for Analyzing The Language of Visual Tokens
Figure 2 for Analyzing The Language of Visual Tokens
Figure 3 for Analyzing The Language of Visual Tokens
Figure 4 for Analyzing The Language of Visual Tokens
Viaarxiv icon

Rediscovering the Latent Dimensions of Personality with Large Language Models as Trait Descriptors

Add code
Sep 16, 2024
Viaarxiv icon

An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems

Add code
Sep 16, 2024
Viaarxiv icon

Visual Haystacks: Answering Harder Questions About Sets of Images

Add code
Jul 18, 2024
Viaarxiv icon

Virtual Personas for Language Models via an Anthology of Backstories

Add code
Jul 09, 2024
Figure 1 for Virtual Personas for Language Models via an Anthology of Backstories
Figure 2 for Virtual Personas for Language Models via an Anthology of Backstories
Figure 3 for Virtual Personas for Language Models via an Anthology of Backstories
Figure 4 for Virtual Personas for Language Models via an Anthology of Backstories
Viaarxiv icon

ALOHa: A New Measure for Hallucination in Captioning Models

Add code
Apr 03, 2024
Viaarxiv icon

ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video

Add code
Jan 10, 2024
Viaarxiv icon

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

Add code
Jan 04, 2024
Viaarxiv icon

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Add code
Dec 22, 2023
Viaarxiv icon

$IC^3$: Image Captioning by Committee Consensus

Add code
Feb 16, 2023
Viaarxiv icon