Picture for Ao Zhang

Ao Zhang

Configurable Foundation Models: Building LLMs from a Modular Perspective

Add code
Sep 04, 2024
Viaarxiv icon

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Add code
Aug 03, 2024
Figure 1 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 2 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 3 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Figure 4 for MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Viaarxiv icon

Physical formula enhanced multi-task learning for pharmacokinetics prediction

Add code
Apr 16, 2024
Viaarxiv icon

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Add code
Jan 07, 2024
Figure 1 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Figure 2 for ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
Viaarxiv icon

Knowledge Enhanced Conditional Imputation for Healthcare Time-series

Add code
Jan 04, 2024
Viaarxiv icon

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

Add code
Dec 15, 2023
Viaarxiv icon

NExT-Chat: An LMM for Chat, Detection and Segmentation

Add code
Nov 13, 2023
Viaarxiv icon

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

Add code
Oct 07, 2023
Viaarxiv icon

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

Add code
Jun 01, 2023
Viaarxiv icon

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

Add code
May 21, 2023
Viaarxiv icon