Weizhen Bian

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Mar 03, 2025

CogSimulator: A Model for Simulating User Cognition & Behavior with Minimal Data for Tailored Cognitive Enhancement

Dec 10, 2024

IntellectSeeker: A Personalized Literature Management System with the Probabilistic Model and Large Language Model

Dec 10, 2024

EmoSpeech: A Corpus of Emotionally Rich and Contextually Detailed Speech Annotations

Dec 09, 2024

Advancing Music Therapy: Integrating Eastern Five-Element Music Theory and Western Techniques with AI in the Novel Five-Element Harmony System

Dec 09, 2024

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Apr 25, 2024