Picture for Zhangquan Chen

Zhangquan Chen

OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

Add code
Feb 05, 2026
Viaarxiv icon

Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation

Add code
Feb 05, 2026
Viaarxiv icon

Dual Latent Memory for Visual Multi-agent System

Add code
Jan 31, 2026
Viaarxiv icon

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Add code
Jan 14, 2026
Viaarxiv icon

Topology-Agnostic Animal Motion Generation from Text Prompt

Add code
Dec 11, 2025
Viaarxiv icon

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Add code
Nov 14, 2025
Viaarxiv icon

SIFThinker: Spatially-Aware Image Focus for Visual Reasoning

Add code
Aug 08, 2025
Viaarxiv icon

NFR: Neural Feature-Guided Non-Rigid Shape Registration

Add code
May 28, 2025
Viaarxiv icon

VisRL: Intention-Driven Visual Perception via Reinforced Reasoning

Add code
Mar 10, 2025
Figure 1 for VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
Figure 2 for VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
Figure 3 for VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
Figure 4 for VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
Viaarxiv icon

Unsupervised Non-Rigid Point Cloud Matching through Large Vision Models

Add code
Aug 16, 2024
Figure 1 for Unsupervised Non-Rigid Point Cloud Matching through Large Vision Models
Figure 2 for Unsupervised Non-Rigid Point Cloud Matching through Large Vision Models
Figure 3 for Unsupervised Non-Rigid Point Cloud Matching through Large Vision Models
Figure 4 for Unsupervised Non-Rigid Point Cloud Matching through Large Vision Models
Viaarxiv icon