Picture for Heyang Liu

Heyang Liu

Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm

Add code
Aug 16, 2024
Viaarxiv icon

Decoding Linguistic Representations of Human Brain

Add code
Jul 30, 2024
Viaarxiv icon

Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

Add code
Jun 17, 2024
Viaarxiv icon

M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset

Add code
Mar 21, 2024
Viaarxiv icon

Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview

Add code
Mar 01, 2024
Viaarxiv icon

MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception

Add code
Jan 15, 2024
Viaarxiv icon

LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework

Add code
Aug 30, 2023
Figure 1 for LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework
Figure 2 for LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework
Figure 3 for LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework
Figure 4 for LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework
Viaarxiv icon