Xu Li

Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics-Huazhong University of Science and Technology, China

CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs

Nov 19, 2024

Baichuan Alignment Technical Report

Oct 19, 2024

Baichuan-Omni Technical Report

Oct 11, 2024

Language-Queried Target Sound Extraction Without Parallel Training Data

Sep 14, 2024

Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility

Sep 14, 2024

EA-VTR: Event-Aware Video-Text Retrieval

Jul 10, 2024

MaskSR: Masked Language Model for Full-band Speech Restoration

Jun 04, 2024

Dynamic Resolution Guidance for Facial Expression Recognition

Apr 09, 2024

CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

Apr 02, 2024

CLAPSep: Leveraging Contrastive Pre-trained Models for Multi-Modal Query-Conditioned Target Sound Extraction

Feb 27, 2024