Picture for Qi Zheng

Qi Zheng

4KAgent: Agentic Any Image to 4K Super-Resolution

Add code
Jul 09, 2025
Viaarxiv icon

Node Splitting SVMs for Survival Trees Based on an L2-Regularized Dipole Splitting Criteria

Add code
Jun 13, 2025
Viaarxiv icon

End-to-End HOI Reconstruction Transformer with Graph-based Encoding

Add code
Mar 08, 2025
Viaarxiv icon

An Atomic Skill Library Construction Method for Data-Efficient Embodied Manipulation

Add code
Jan 25, 2025
Viaarxiv icon

ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting

Add code
Dec 19, 2024
Viaarxiv icon

Unicorn: Unified Neural Image Compression with One Number Reconstruction

Add code
Dec 11, 2024
Viaarxiv icon

Video Quality Assessment: A Comprehensive Survey

Add code
Dec 04, 2024
Viaarxiv icon

M3-CVC: Controllable Video Compression with Multimodal Generative Models

Add code
Nov 24, 2024
Figure 1 for M3-CVC: Controllable Video Compression with Multimodal Generative Models
Figure 2 for M3-CVC: Controllable Video Compression with Multimodal Generative Models
Figure 3 for M3-CVC: Controllable Video Compression with Multimodal Generative Models
Figure 4 for M3-CVC: Controllable Video Compression with Multimodal Generative Models
Viaarxiv icon

Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding

Add code
Nov 12, 2024
Figure 1 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Figure 2 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Figure 3 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Figure 4 for Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding
Viaarxiv icon

A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint

Add code
Oct 25, 2024
Figure 1 for A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint
Figure 2 for A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint
Figure 3 for A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint
Figure 4 for A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint
Viaarxiv icon