Picture for Chen Li

Chen Li

Beihang University

Stabilizing Decentralized Federated Fine-Tuning via Topology-Aware Alternating LoRA

Add code
Jan 31, 2026
Viaarxiv icon

HAAF: Hierarchical Adaptation and Alignment of Foundation Models for Few-Shot Pathology Anomaly Detection

Add code
Jan 24, 2026
Viaarxiv icon

Creating a biologically more accurate spider robot to study active vibration sensing

Add code
Jan 23, 2026
Viaarxiv icon

StableWorld: Towards Stable and Consistent Long Interactive Video Generation

Add code
Jan 21, 2026
Viaarxiv icon

FaithSCAN: Model-Driven Single-Pass Hallucination Detection for Faithful Visual Question Answering

Add code
Jan 01, 2026
Viaarxiv icon

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

Add code
Nov 18, 2025
Figure 1 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Figure 2 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Figure 3 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Figure 4 for ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
Viaarxiv icon

Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation

Add code
Nov 08, 2025
Figure 1 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Figure 2 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Figure 3 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Figure 4 for Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Viaarxiv icon

V-Thinker: Interactive Thinking with Images

Add code
Nov 06, 2025
Viaarxiv icon

DAMap: Distance-aware MapNet for High Quality HD Map Construction

Add code
Oct 26, 2025
Viaarxiv icon

Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs

Add code
Oct 02, 2025
Figure 1 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Figure 2 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Figure 3 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Figure 4 for Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs
Viaarxiv icon