Picture for Xiaofan Li

Xiaofan Li

Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception

Add code
Mar 17, 2025
Viaarxiv icon

AttFC: Attention Fully-Connected Layer for Large-Scale Face Recognition with One GPU

Add code
Mar 10, 2025
Viaarxiv icon

MSConv: Multiplicative and Subtractive Convolution for Face Recognition

Add code
Mar 08, 2025
Viaarxiv icon

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance

Add code
Mar 05, 2025
Figure 1 for DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Figure 2 for DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Figure 3 for DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Figure 4 for DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Viaarxiv icon

AdaSin: Enhancing Hard Sample Metrics with Dual Adaptive Penalty for Face Recognition

Add code
Mar 05, 2025
Viaarxiv icon

RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition

Add code
Mar 05, 2025
Figure 1 for RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition
Figure 2 for RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition
Figure 3 for RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition
Figure 4 for RVAFM: Re-parameterizing Vertical Attention Fusion Module for Handwritten Paragraph Text Recognition
Viaarxiv icon

One-for-More: Continual Diffusion Model for Anomaly Detection

Add code
Feb 27, 2025
Viaarxiv icon

Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting

Add code
Jan 30, 2025
Figure 1 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 2 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 3 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 4 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Viaarxiv icon

Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities

Add code
Dec 21, 2024
Figure 1 for Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities
Figure 2 for Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities
Figure 3 for Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities
Figure 4 for Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities
Viaarxiv icon

Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception

Add code
Dec 18, 2024
Figure 1 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 2 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 3 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Figure 4 for Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
Viaarxiv icon