Yaqian Zhou

MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time

Oct 18, 2024

IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities

Oct 09, 2024

SpeechAlign: Aligning Speech Generation to Human Preferences

Apr 08, 2024

Calibrating the Confidence of Large Language Models by Eliciting Fidelity

Apr 03, 2024

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation

Jan 25, 2024

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems

Jan 08, 2024

SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models

Aug 31, 2023

PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search

May 20, 2023

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities

May 19, 2023

DUB: Discrete Unit Back-translation for Speech Translation

May 19, 2023