Picture for Hao Li

Hao Li

Jack

Object Style Diffusion for Generalized Object Detection in Urban Scene

Add code
Dec 18, 2024
Viaarxiv icon

NTC-KWS: Noise-aware CTC for Robust Keyword Spotting

Add code
Dec 17, 2024
Viaarxiv icon

Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency

Add code
Dec 17, 2024
Viaarxiv icon

Efficient Scaling of Diffusion Transformers for Text-to-Image Generation

Add code
Dec 16, 2024
Viaarxiv icon

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Add code
Dec 12, 2024
Viaarxiv icon

Unified Vertex Motion Estimation for Integrated Video Stabilization and Stitching in Tractor-Trailer Wheeled Robots

Add code
Dec 10, 2024
Viaarxiv icon

Political Actor Agent: Simulating Legislative System for Roll Call Votes Prediction with Large Language Models

Add code
Dec 10, 2024
Viaarxiv icon

Radiant: Large-scale 3D Gaussian Rendering based on Hierarchical Framework

Add code
Dec 07, 2024
Viaarxiv icon

LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment

Add code
Dec 06, 2024
Viaarxiv icon

LiDAR SLAMMOT based on Confidence-guided Data Association

Add code
Dec 02, 2024
Figure 1 for LiDAR SLAMMOT based on Confidence-guided Data Association
Figure 2 for LiDAR SLAMMOT based on Confidence-guided Data Association
Figure 3 for LiDAR SLAMMOT based on Confidence-guided Data Association
Figure 4 for LiDAR SLAMMOT based on Confidence-guided Data Association
Viaarxiv icon