Zhaoxin Fan

DH-RAG: A Dynamic Historical Context-Powered Retrieval-Augmented Generation Method for Multi-Turn Dialogue

Feb 19, 2025
Via arXiv

TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding

Jan 26, 2025
Via arXiv

EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers

Dec 29, 2024
Via arXiv

MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing

Dec 28, 2024
Via arXiv

Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images

Dec 27, 2024
Via arXiv

CoheDancers: Enhancing Interactive Group Dance Generation through Music-Driven Coherence Decomposition

Dec 26, 2024
Via arXiv

Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation

Dec 13, 2024
Via arXiv

Moderating the Generalization of Score-based Generative Model

Dec 10, 2024
Via arXiv

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Dec 09, 2024
Via arXiv

VGG-Tex: A Vivid Geometry-Guided Facial Texture Estimation Model for High Fidelity Monocular 3D Face Reconstruction

Sep 17, 2024
Via arXiv