Picture for Tian Tan

Tian Tan

LikeBench: Evaluating Subjective Likability in LLMs for Personalization

Add code
Dec 15, 2025
Viaarxiv icon

Not All Documents Are What You Need for Extracting Instruction Tuning Data

Add code
May 18, 2025
Viaarxiv icon

Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

Add code
Jul 05, 2024
Figure 1 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 2 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 3 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 4 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Viaarxiv icon

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

Add code
Jun 22, 2024
Figure 1 for video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
Figure 2 for video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
Figure 3 for video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
Figure 4 for video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
Viaarxiv icon

Text-aware Speech Separation for Multi-talker Keyword Spotting

Add code
Jun 18, 2024
Figure 1 for Text-aware Speech Separation for Multi-talker Keyword Spotting
Figure 2 for Text-aware Speech Separation for Multi-talker Keyword Spotting
Figure 3 for Text-aware Speech Separation for Multi-talker Keyword Spotting
Figure 4 for Text-aware Speech Separation for Multi-talker Keyword Spotting
Viaarxiv icon

Can Large Language Models Understand Spatial Audio?

Add code
Jun 12, 2024
Viaarxiv icon

SALMONN: Towards Generic Hearing Abilities for Large Language Models

Add code
Oct 20, 2023
Figure 1 for SALMONN: Towards Generic Hearing Abilities for Large Language Models
Figure 2 for SALMONN: Towards Generic Hearing Abilities for Large Language Models
Figure 3 for SALMONN: Towards Generic Hearing Abilities for Large Language Models
Figure 4 for SALMONN: Towards Generic Hearing Abilities for Large Language Models
Viaarxiv icon

Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models

Add code
Oct 10, 2023
Viaarxiv icon

Connecting Speech Encoder and Large Language Model for ASR

Add code
Sep 26, 2023
Figure 1 for Connecting Speech Encoder and Large Language Model for ASR
Figure 2 for Connecting Speech Encoder and Large Language Model for ASR
Figure 3 for Connecting Speech Encoder and Large Language Model for ASR
Figure 4 for Connecting Speech Encoder and Large Language Model for ASR
Viaarxiv icon

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

Add code
Sep 14, 2023
Viaarxiv icon