Picture for Jiaqi Li

Jiaqi Li

LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization

Add code
Feb 02, 2026
Viaarxiv icon

PandaPose: 3D Human Pose Lifting from a Single Image via Propagating 2D Pose Prior to 3D Anchor Space

Add code
Feb 01, 2026
Viaarxiv icon

CE-GOCD: Central Entity-Guided Graph Optimization for Community Detection to Augment LLM Scientific Question Answering

Add code
Jan 29, 2026
Viaarxiv icon

Towards Mitigating Modality Bias in Vision-Language Models for Temporal Action Localization

Add code
Jan 28, 2026
Viaarxiv icon

Graph Reasoning Paradigm: Structured and Symbolic Reasoning with Topology-Aware Reinforcement Learning for Large Language Models

Add code
Jan 19, 2026
Viaarxiv icon

Joint DOA and Non-circular Phase Estimation of Non-circular Signals for Antenna Arrays: Block Sparse Bayesian Learning Method

Add code
Jan 14, 2026
Viaarxiv icon

The AI Hippocampus: How Far are We From Human Memory?

Add code
Jan 14, 2026
Viaarxiv icon

Collaborative Reconstruction and Repair for Multi-class Industrial Anomaly Detection

Add code
Dec 12, 2025
Viaarxiv icon

BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching

Add code
Nov 19, 2025
Figure 1 for BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching
Figure 2 for BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching
Figure 3 for BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching
Figure 4 for BokehFlow: Depth-Free Controllable Bokeh Rendering via Flow Matching
Viaarxiv icon

Scaling Spatial Intelligence with Multimodal Foundation Models

Add code
Nov 17, 2025
Figure 1 for Scaling Spatial Intelligence with Multimodal Foundation Models
Figure 2 for Scaling Spatial Intelligence with Multimodal Foundation Models
Figure 3 for Scaling Spatial Intelligence with Multimodal Foundation Models
Figure 4 for Scaling Spatial Intelligence with Multimodal Foundation Models
Viaarxiv icon