Feng Lu

Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences

GazeOnce360: Fisheye-Based 360° Multi-Person Gaze Estimation with Global-Local Feature Fusion

Mar 17, 2026

Requesting Expert Reasoning: Augmenting LLM Agents with Learned Collaborative Intervention

Feb 26, 2026

Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era

Nov 08, 2025

Enhancing Sa2VA for Referent Video Object Segmentation: 2nd Solution for 7th LSVOS RVOS Track

Sep 19, 2025

Pseudo-Label Enhanced Cascaded Framework: 2nd Technical Report for LSVOS 2025 VOS Track

Sep 18, 2025

In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding with Gaze-Guided Prompting

Sep 09, 2025

SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition

Feb 23, 2025

3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation

Feb 06, 2025

TdAttenMix: Top-Down Attention Guided Mixup

Jan 26, 2025

EDTformer: An Efficient Decoder Transformer for Visual Place Recognition

Dec 01, 2024