Picture for Guangzhi Wang

Guangzhi Wang

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Add code
Mar 17, 2025
Viaarxiv icon

S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing

Add code
Apr 11, 2024
Viaarxiv icon

Navigate Biopsy with Ultrasound under Augmented Reality Device: Towards Higher System Performance

Add code
Feb 04, 2024
Viaarxiv icon

The Efficiency Spectrum of Large Language Models: An Algorithmic Survey

Add code
Dec 01, 2023
Viaarxiv icon

SEED-Bench-2: Benchmarking Multimodal Large Language Models

Add code
Nov 28, 2023
Viaarxiv icon

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Add code
Oct 16, 2023
Viaarxiv icon

SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension

Add code
Aug 02, 2023
Viaarxiv icon

Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection

Add code
Jul 19, 2023
Figure 1 for Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection
Figure 2 for Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection
Figure 3 for Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection
Figure 4 for Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection
Viaarxiv icon

EVD Surgical Guidance with Retro-Reflective Tool Tracking and Spatial Reconstruction using Head-Mounted Augmented Reality Device

Add code
Jul 03, 2023
Viaarxiv icon

What Makes for Good Visual Tokenizers for Large Language Models?

Add code
May 23, 2023
Viaarxiv icon