Hongxiang Li

VideoGen-Eval: Agent-based System for Video Generation Evaluation

Mar 30, 2025

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Mar 17, 2025

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Dec 13, 2024

Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine

Dec 12, 2024

PAPR Reduction based on Deep Learning Autoencoder in Coherent Optical OFDM Systems

Aug 26, 2024

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning

May 31, 2024

Textual Inversion and Self-supervised Refinement for Radiology Report Generation

May 31, 2024

Uncertainty-aware sign language video retrieval with probability distribution modeling

May 30, 2024

Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation

Apr 03, 2024

Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction

Jan 25, 2024