Picture for Chen Li

Chen Li

Beihang University

StableWorld: Towards Stable and Consistent Long Interactive Video Generation

Add code
Jan 21, 2026
Viaarxiv icon

FaithSCAN: Model-Driven Single-Pass Hallucination Detection for Faithful Visual Question Answering

Add code
Jan 01, 2026
Viaarxiv icon

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

Add code
Nov 18, 2025
Figure 1 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Figure 2 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Figure 3 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Figure 4 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Viaarxiv icon

Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation

Add code
Nov 08, 2025
Figure 1 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Figure 2 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Figure 3 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Figure 4 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Viaarxiv icon

V-Thinker: Interactive Thinking with Images

Add code
Nov 06, 2025
Viaarxiv icon

DAMap: Distance-aware MapNet for High Quality HD Map Construction

Add code
Oct 26, 2025
Viaarxiv icon

Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs

Add code
Oct 02, 2025
Figure 1 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Figure 2 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Figure 3 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Figure 4 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Viaarxiv icon

MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation

Add code
Oct 02, 2025
Figure 1 for MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation
Figure 2 for MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation
Figure 3 for MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation
Figure 4 for MATCH: Multi-faceted Adaptive Topo-Consistency for Semi-Supervised Histopathology Segmentation
Viaarxiv icon

Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation

Add code
Sep 26, 2025
Figure 1 for Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation
Figure 2 for Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation
Figure 3 for Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation
Figure 4 for Bézier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation
Viaarxiv icon

Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning

Add code
Sep 19, 2025
Figure 1 for Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning
Figure 2 for Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning
Figure 3 for Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning
Figure 4 for Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning
Viaarxiv icon