Zhaojiang Lin

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Mar 07, 2024

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Feb 16, 2024

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Sep 27, 2023

Continual Dialogue State Tracking via Example-Guided Question Answering

May 23, 2023

Introducing Semantics into Speech Encoders

Nov 15, 2022

IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text

Oct 26, 2022

Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values

Oct 14, 2022

FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Dec 28, 2021

Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation

Dec 07, 2021

Few-Shot Bot: Prompt-Based Learning for Dialogue Systems

Oct 15, 2021