Picture for Weipeng Chen

Weipeng Chen

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies

Add code
Mar 19, 2025
Viaarxiv icon

Efficient Motion-Aware Video MLLM

Add code
Mar 17, 2025
Viaarxiv icon

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Add code
Feb 24, 2025
Viaarxiv icon

Baichuan-M1: Pushing the Medical Capability of Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation

Add code
Feb 11, 2025
Viaarxiv icon

Ocean-OCR: Towards General OCR Application via a Vision-Language Model

Add code
Jan 26, 2025
Figure 1 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Figure 2 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Figure 3 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Figure 4 for Ocean-OCR: Towards General OCR Application via a Vision-Language Model
Viaarxiv icon

Baichuan-Omni-1.5 Technical Report

Add code
Jan 26, 2025
Viaarxiv icon

Med-R$^2$: Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine

Add code
Jan 21, 2025
Viaarxiv icon

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Add code
Jan 03, 2025
Viaarxiv icon

Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback

Add code
Dec 20, 2024
Figure 1 for Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback
Figure 2 for Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback
Figure 3 for Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback
Figure 4 for Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback
Viaarxiv icon