Picture for Shuai Fan

Shuai Fan

Neuronal Activation States as Sample Embeddings for Data Selection in Task-Specific Instruction Tuning

Add code
Mar 19, 2025
Viaarxiv icon

NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms

Add code
Feb 26, 2025
Viaarxiv icon

Audio-FLAN: A Preliminary Release

Add code
Feb 23, 2025
Viaarxiv icon

VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization

Add code
Dec 13, 2024
Figure 1 for VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
Figure 2 for VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
Figure 3 for VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
Figure 4 for VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
Viaarxiv icon

Reducing Tool Hallucination via Reliability Alignment

Add code
Dec 05, 2024
Figure 1 for Reducing Tool Hallucination via Reliability Alignment
Figure 2 for Reducing Tool Hallucination via Reliability Alignment
Figure 3 for Reducing Tool Hallucination via Reliability Alignment
Figure 4 for Reducing Tool Hallucination via Reliability Alignment
Viaarxiv icon

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Add code
Dec 03, 2024
Viaarxiv icon

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

Add code
Jun 17, 2024
Viaarxiv icon

Sparsity-Accelerated Training for Large Language Models

Add code
Jun 03, 2024
Viaarxiv icon

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding

Add code
May 06, 2024
Viaarxiv icon

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

Add code
Apr 10, 2024
Viaarxiv icon