Picture for Yu Pan

Yu Pan

Can Language Models Enable In-Context Database?

Add code
Nov 04, 2024
Viaarxiv icon

CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching

Add code
Nov 04, 2024
Viaarxiv icon

Building Multi-Agent Copilot towards Autonomous Agricultural Data Management and Analysis

Add code
Oct 31, 2024
Viaarxiv icon

Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching

Add code
Oct 08, 2024
Figure 1 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Figure 2 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Figure 3 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Figure 4 for Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching
Viaarxiv icon

Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling

Add code
Oct 02, 2024
Figure 1 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Figure 2 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Figure 3 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Figure 4 for Takin-VC: Zero-shot Voice Conversion via Jointly Hybrid Content and Memory-Augmented Context-Aware Timbre Modeling
Viaarxiv icon

CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration

Add code
Aug 05, 2024
Viaarxiv icon

MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval

Add code
Aug 05, 2024
Viaarxiv icon

A Vectorization Method Induced By Maximal Margin Classification For Persistent Diagrams

Add code
Jul 31, 2024
Viaarxiv icon

MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement

Add code
Jul 16, 2024
Viaarxiv icon

FAGhead: Fully Animate Gaussian Head from Monocular Videos

Add code
Jun 27, 2024
Viaarxiv icon