Xu Li

Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics-Huazhong University of Science and Technology, China

Baichuan-Omni-1.5 Technical Report

Jan 26, 2025

Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification

Jan 25, 2025

Global Semantic-Guided Sub-image Feature Weight Allocation in High-Resolution Large Vision-Language Models

Jan 24, 2025

NTC-KWS: Noise-aware CTC for Robust Keyword Spotting

Dec 17, 2024

CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs

Nov 19, 2024

Baichuan Alignment Technical Report

Oct 19, 2024

Baichuan-Omni Technical Report

Oct 11, 2024

Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility

Sep 14, 2024

Language-Queried Target Sound Extraction Without Parallel Training Data

Sep 14, 2024

EA-VTR: Event-Aware Video-Text Retrieval

Jul 10, 2024