Feng Lu

Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences

The Quality-Utility Paradox: Why High-Reward Data Impairs Small Model Mathematical Reasoning

Jun 15, 2026
Via arXiv

Where, What, Why, and Importance: Structured Defect Grounding for Text-to-Image Feedback

Jun 04, 2026
Via arXiv

GazeOnce360: Fisheye-Based 360° Multi-Person Gaze Estimation with Global-Local Feature Fusion

Mar 17, 2026
Via arXiv

Requesting Expert Reasoning: Augmenting LLM Agents with Learned Collaborative Intervention

Feb 26, 2026
Via arXiv

Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era

Nov 08, 2025
Via arXiv

Enhancing Sa2VA for Referent Video Object Segmentation: 2nd Solution for 7th LSVOS RVOS Track

Sep 19, 2025
Via arXiv

Pseudo-Label Enhanced Cascaded Framework: 2nd Technical Report for LSVOS 2025 VOS Track

Sep 18, 2025
Via arXiv

In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding with Gaze-Guided Prompting

Sep 09, 2025
Via arXiv

SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition

Feb 23, 2025
Via arXiv

3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation

Feb 06, 2025
Via arXiv