
Fei Li

Baichuan-M1: Pushing the Medical Capability of Large Language Models

Feb 18, 2025
Via arXiv

Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation

Jan 29, 2025
Via arXiv

Baichuan-Omni-1.5 Technical Report

Jan 26, 2025
Via arXiv

M$^{3}$D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction

Dec 05, 2024
Via arXiv

Continuous Speculative Decoding for Autoregressive Image Generation

Nov 18, 2024
Via arXiv

Multiple kernel concept factorization algorithm based on global fusion

Oct 27, 2024
Via arXiv

Baichuan Alignment Technical Report

Oct 19, 2024
Via arXiv

Closed-loop Long-horizon Robotic Planning via Equilibrium Sequence Modeling

Oct 02, 2024
Via arXiv

VisionUnite: A Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge

Aug 05, 2024
Via arXiv

AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

Aug 03, 2024
Via arXiv