Picture for Kai Yu

Kai Yu

Sherman

Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency

Add code
Dec 17, 2024
Viaarxiv icon

NTC-KWS: Noise-aware CTC for Robust Keyword Spotting

Add code
Dec 17, 2024
Viaarxiv icon

VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization

Add code
Dec 13, 2024
Viaarxiv icon

Reducing Tool Hallucination via Reliability Alignment

Add code
Dec 05, 2024
Figure 1 for Reducing Tool Hallucination via Reliability Alignment
Figure 2 for Reducing Tool Hallucination via Reliability Alignment
Figure 3 for Reducing Tool Hallucination via Reliability Alignment
Figure 4 for Reducing Tool Hallucination via Reliability Alignment
Viaarxiv icon

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Add code
Dec 03, 2024
Viaarxiv icon

Unified Pathological Speech Analysis with Prompt Tuning

Add code
Nov 05, 2024
Viaarxiv icon

Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding

Add code
Oct 29, 2024
Figure 1 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Figure 2 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Figure 3 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Figure 4 for Fast and High-Quality Auto-Regressive Speech Synthesis via Speculative Decoding
Viaarxiv icon

A Survey on Speech Large Language Models

Add code
Oct 24, 2024
Figure 1 for A Survey on Speech Large Language Models
Figure 2 for A Survey on Speech Large Language Models
Figure 3 for A Survey on Speech Large Language Models
Figure 4 for A Survey on Speech Large Language Models
Viaarxiv icon

LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec

Add code
Oct 21, 2024
Viaarxiv icon

MobA: A Two-Level Agent System for Efficient Mobile Task Automation

Add code
Oct 17, 2024
Figure 1 for MobA: A Two-Level Agent System for Efficient Mobile Task Automation
Figure 2 for MobA: A Two-Level Agent System for Efficient Mobile Task Automation
Figure 3 for MobA: A Two-Level Agent System for Efficient Mobile Task Automation
Figure 4 for MobA: A Two-Level Agent System for Efficient Mobile Task Automation
Viaarxiv icon