Picture for Zhaoxin Fan

Zhaoxin Fan

AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline

Add code
Apr 01, 2025
Viaarxiv icon

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Add code
Mar 28, 2025
Viaarxiv icon

STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM

Add code
Mar 27, 2025
Viaarxiv icon

MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System

Add code
Mar 12, 2025
Viaarxiv icon

ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis

Add code
Mar 09, 2025
Viaarxiv icon

DH-RAG: A Dynamic Historical Context-Powered Retrieval-Augmented Generation Method for Multi-Turn Dialogue

Add code
Feb 19, 2025
Viaarxiv icon

TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding

Add code
Jan 26, 2025
Viaarxiv icon

EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers

Add code
Dec 29, 2024
Viaarxiv icon

MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing

Add code
Dec 28, 2024
Viaarxiv icon

Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images

Add code
Dec 27, 2024
Figure 1 for Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images
Figure 2 for Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images
Figure 3 for Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images
Figure 4 for Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images
Viaarxiv icon