Picture for Yaqian Zhou

Yaqian Zhou

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Add code
Oct 02, 2025
Figure 1 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 2 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 3 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 4 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Viaarxiv icon

Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Alignment in MLLM

Add code
Sep 18, 2025
Viaarxiv icon

Thus Spake Long-Context Large Language Model

Add code
Feb 24, 2025
Viaarxiv icon

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

Add code
Nov 11, 2024
Figure 1 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 2 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 3 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Figure 4 for LongSafetyBench: Long-Context LLMs Struggle with Safety Issues
Viaarxiv icon

MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time

Add code
Oct 18, 2024
Figure 1 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 2 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 3 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 4 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Viaarxiv icon

IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities

Add code
Oct 09, 2024
Figure 1 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 2 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 3 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 4 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Viaarxiv icon

SpeechAlign: Aligning Speech Generation to Human Preferences

Add code
Apr 08, 2024
Figure 1 for SpeechAlign: Aligning Speech Generation to Human Preferences
Figure 2 for SpeechAlign: Aligning Speech Generation to Human Preferences
Figure 3 for SpeechAlign: Aligning Speech Generation to Human Preferences
Figure 4 for SpeechAlign: Aligning Speech Generation to Human Preferences
Viaarxiv icon

Calibrating the Confidence of Large Language Models by Eliciting Fidelity

Add code
Apr 03, 2024
Figure 1 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Figure 2 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Figure 3 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Figure 4 for Calibrating the Confidence of Large Language Models by Eliciting Fidelity
Viaarxiv icon

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation

Add code
Jan 25, 2024
Viaarxiv icon

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems

Add code
Jan 08, 2024
Figure 1 for SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
Figure 2 for SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
Figure 3 for SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
Figure 4 for SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
Viaarxiv icon