Picture for Xin Geng

Xin Geng

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Add code
Mar 11, 2025
Viaarxiv icon

Label Distribution Learning with Biased Annotations by Learning Multi-Label Representation

Add code
Feb 03, 2025
Viaarxiv icon

CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification

Add code
Jan 27, 2025
Figure 1 for CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification
Figure 2 for CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification
Figure 3 for CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification
Figure 4 for CILP-FGDI: Exploiting Vision-Language Model for Generalizable Person Re-Identification
Viaarxiv icon

STHFL: Spatio-Temporal Heterogeneous Federated Learning

Add code
Jan 10, 2025
Viaarxiv icon

BatStyler: Advancing Multi-category Style Generation for Source-free Domain Generalization

Add code
Jan 02, 2025
Figure 1 for BatStyler: Advancing Multi-category Style Generation for Source-free Domain Generalization
Figure 2 for BatStyler: Advancing Multi-category Style Generation for Source-free Domain Generalization
Figure 3 for BatStyler: Advancing Multi-category Style Generation for Source-free Domain Generalization
Figure 4 for BatStyler: Advancing Multi-category Style Generation for Source-free Domain Generalization
Viaarxiv icon

SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization

Add code
Dec 06, 2024
Figure 1 for SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization
Figure 2 for SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization
Figure 3 for SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization
Figure 4 for SoPo: Text-to-Motion Generation Using Semi-Online Preference Optimization
Viaarxiv icon

MageBench: Bridging Large Multimodal Models to Agents

Add code
Dec 05, 2024
Viaarxiv icon

Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection

Add code
Dec 04, 2024
Viaarxiv icon

Redefining <Creative> in Dictionary: Towards a Enhanced Semantic Understanding of Creative Generation

Add code
Oct 31, 2024
Viaarxiv icon

Reduction-based Pseudo-label Generation for Instance-dependent Partial Label Learning

Add code
Oct 28, 2024
Figure 1 for Reduction-based Pseudo-label Generation for Instance-dependent Partial Label Learning
Figure 2 for Reduction-based Pseudo-label Generation for Instance-dependent Partial Label Learning
Figure 3 for Reduction-based Pseudo-label Generation for Instance-dependent Partial Label Learning
Figure 4 for Reduction-based Pseudo-label Generation for Instance-dependent Partial Label Learning
Viaarxiv icon