Picture for Qi Zhang

Qi Zhang

School of Information, North China University of Technology

DocFusion: A Unified Framework for Document Parsing Tasks

Add code
Dec 17, 2024
Viaarxiv icon

CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models

Add code
Dec 17, 2024
Viaarxiv icon

Point Cloud-Assisted Neural Image Compression

Add code
Dec 16, 2024
Viaarxiv icon

View Transformation Robustness for Multi-View 3D Object Reconstruction with Reconstruction Error-Guided View Selection

Add code
Dec 16, 2024
Viaarxiv icon

Large Action Models: From Inception to Implementation

Add code
Dec 13, 2024
Figure 1 for Large Action Models: From Inception to Implementation
Figure 2 for Large Action Models: From Inception to Implementation
Figure 3 for Large Action Models: From Inception to Implementation
Figure 4 for Large Action Models: From Inception to Implementation
Viaarxiv icon

Position-aware Guided Point Cloud Completion with CLIP Model

Add code
Dec 11, 2024
Viaarxiv icon

Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors

Add code
Dec 10, 2024
Viaarxiv icon

LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model

Add code
Dec 05, 2024
Figure 1 for LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model
Figure 2 for LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model
Figure 3 for LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model
Figure 4 for LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model
Viaarxiv icon

Large Language Model-Brained GUI Agents: A Survey

Add code
Dec 03, 2024
Figure 1 for Large Language Model-Brained GUI Agents: A Survey
Figure 2 for Large Language Model-Brained GUI Agents: A Survey
Figure 3 for Large Language Model-Brained GUI Agents: A Survey
Figure 4 for Large Language Model-Brained GUI Agents: A Survey
Viaarxiv icon

A Semi-Supervised Approach with Error Reflection for Echocardiography Segmentation

Add code
Dec 01, 2024
Viaarxiv icon