Picture for Hang Xu

Hang Xu

A Study of In-Context-Learning-Based Text-to-SQL Errors

Add code
Jan 16, 2025
Viaarxiv icon

FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors

Add code
Jan 14, 2025
Viaarxiv icon

Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising

Add code
Jan 06, 2025
Figure 1 for Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising
Figure 2 for Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising
Figure 3 for Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising
Figure 4 for Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising
Viaarxiv icon

ACE: Anti-Editing Concept Erasure in Text-to-Image Models

Add code
Jan 03, 2025
Viaarxiv icon

UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity

Add code
Dec 28, 2024
Viaarxiv icon

End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework

Add code
Dec 23, 2024
Figure 1 for End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework
Figure 2 for End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework
Figure 3 for End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework
Figure 4 for End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework
Viaarxiv icon

ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance

Add code
Dec 09, 2024
Viaarxiv icon

Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning

Add code
Dec 04, 2024
Figure 1 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Figure 2 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Figure 3 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Figure 4 for Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning
Viaarxiv icon

SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images

Add code
Dec 03, 2024
Viaarxiv icon

AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

Add code
Nov 18, 2024
Viaarxiv icon