Picture for Fei Yu

Fei Yu

ThinkGen: Generalized Thinking for Visual Generation

Add code
Dec 29, 2025
Viaarxiv icon

From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images

Add code
Dec 09, 2025
Viaarxiv icon

Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning

Add code
Nov 08, 2025
Viaarxiv icon

Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following

Add code
Oct 16, 2025
Viaarxiv icon

HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness

Add code
Oct 10, 2025
Viaarxiv icon

AD-AVSR: Asymmetric Dual-stream Enhancement for Robust Audio-Visual Speech Recognition

Add code
Aug 11, 2025
Viaarxiv icon

TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding

Add code
Aug 11, 2025
Viaarxiv icon

Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks for Enhanced Action Understanding

Add code
Aug 10, 2025
Viaarxiv icon

A Two-Stage Lightweight Framework for Efficient Land-Air Bimodal Robot Autonomous Navigation

Add code
Jul 30, 2025
Viaarxiv icon

JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction

Add code
Jul 23, 2025
Viaarxiv icon