Yu Pan

StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching

Dec 10, 2024
Via arXiv

Wearable intelligent throat enables natural speech in stroke patients with dysarthria

Nov 28, 2024
Via arXiv

A Unified Platform for At-Home Post-Stroke Rehabilitation Enabled by Wearable Technologies and Artificial Intelligence

Nov 28, 2024
Via arXiv

Reward Modeling with Ordinal Feedback: Wisdom of the Crowd

Nov 19, 2024
Via arXiv

CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching

Nov 04, 2024
Via arXiv

Can Language Models Enable In-Context Database?

Nov 04, 2024
Via arXiv

Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis

Oct 31, 2024
Via arXiv

Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching

Oct 08, 2024
Via arXiv

Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling

Oct 02, 2024
Via arXiv

MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval

Aug 05, 2024
Via arXiv