Picture for Fei Yu

Fei Yu

M3SR: Multi-Scale Multi-Perceptual Mamba for Efficient Spectral Reconstruction

Add code
Jan 13, 2026
Viaarxiv icon

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Add code
Jan 10, 2026
Viaarxiv icon

ThinkGen: Generalized Thinking for Visual Generation

Add code
Dec 29, 2025
Viaarxiv icon

From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images

Add code
Dec 09, 2025
Viaarxiv icon

Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning

Add code
Nov 08, 2025
Figure 1 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Figure 2 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Figure 3 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Figure 4 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Viaarxiv icon

Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following

Add code
Oct 16, 2025
Viaarxiv icon

HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness

Add code
Oct 10, 2025
Viaarxiv icon

TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding

Add code
Aug 11, 2025
Viaarxiv icon

AD-AVSR: Asymmetric Dual-stream Enhancement for Robust Audio-Visual Speech Recognition

Add code
Aug 11, 2025
Viaarxiv icon

Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks for Enhanced Action Understanding

Add code
Aug 10, 2025
Viaarxiv icon