Picture for Yu Pan

Yu Pan

Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech

Add code
Feb 05, 2025
Viaarxiv icon

StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching

Add code
Dec 10, 2024
Viaarxiv icon

A Unified Platform for At-Home Post-Stroke Rehabilitation Enabled by Wearable Technologies and Artificial Intelligence

Add code
Nov 28, 2024
Figure 1 for A Unified Platform for At-Home Post-Stroke Rehabilitation Enabled by Wearable Technologies and Artificial Intelligence
Figure 2 for A Unified Platform for At-Home Post-Stroke Rehabilitation Enabled by Wearable Technologies and Artificial Intelligence
Figure 3 for A Unified Platform for At-Home Post-Stroke Rehabilitation Enabled by Wearable Technologies and Artificial Intelligence
Figure 4 for A Unified Platform for At-Home Post-Stroke Rehabilitation Enabled by Wearable Technologies and Artificial Intelligence
Viaarxiv icon

Wearable intelligent throat enables natural speech in stroke patients with dysarthria

Add code
Nov 28, 2024
Figure 1 for Wearable intelligent throat enables natural speech in stroke patients with dysarthria
Figure 2 for Wearable intelligent throat enables natural speech in stroke patients with dysarthria
Figure 3 for Wearable intelligent throat enables natural speech in stroke patients with dysarthria
Figure 4 for Wearable intelligent throat enables natural speech in stroke patients with dysarthria
Viaarxiv icon

Reward Modeling with Ordinal Feedback: Wisdom of the Crowd

Add code
Nov 19, 2024
Viaarxiv icon

CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching

Add code
Nov 04, 2024
Viaarxiv icon

Can Language Models Enable In-Context Database?

Add code
Nov 04, 2024
Viaarxiv icon

Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis

Add code
Oct 31, 2024
Figure 1 for Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
Figure 2 for Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
Figure 3 for Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
Figure 4 for Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis
Viaarxiv icon

Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching

Add code
Oct 08, 2024
Figure 1 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Figure 2 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Figure 3 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Figure 4 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Viaarxiv icon

Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling

Add code
Oct 02, 2024
Figure 1 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Figure 2 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Figure 3 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Figure 4 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Viaarxiv icon