Picture for Lingyi Hong

Lingyi Hong

Fudan university

RSAgent: Learning to Reason and Act for Text-Guided Segmentation via Multi-Turn Tool Invocations

Add code
Dec 30, 2025
Viaarxiv icon

Seeing is Believing: Rich-Context Hallucination Detection for MLLMs via Backward Visual Grounding

Add code
Nov 15, 2025
Viaarxiv icon

LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops

Add code
Jun 17, 2025
Viaarxiv icon

CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms

Add code
May 22, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results

Add code
Apr 14, 2025
Viaarxiv icon

MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection

Add code
Feb 19, 2025
Figure 1 for MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection
Figure 2 for MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection
Figure 3 for MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection
Figure 4 for MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection
Viaarxiv icon

VideoPure: Diffusion-based Adversarial Purification for Video Recognition

Add code
Jan 25, 2025
Figure 1 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 2 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 3 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Figure 4 for VideoPure: Diffusion-based Adversarial Purification for Video Recognition
Viaarxiv icon

DeTrack: In-model Latent Denoising Learning for Visual Object Tracking

Add code
Jan 05, 2025
Figure 1 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 2 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 3 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Figure 4 for DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Viaarxiv icon

P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision

Add code
Dec 27, 2024
Figure 1 for P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision
Figure 2 for P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision
Figure 3 for P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision
Figure 4 for P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision
Viaarxiv icon

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation

Add code
Sep 28, 2024
Figure 1 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 2 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 3 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Figure 4 for X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation
Viaarxiv icon