Sizhou Chen

Bridging the Gap between Text, Audio, Image, and Any Sequence: A Novel Approach using Gloss-based Annotation

Oct 04, 2024
Via arXiv

Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks

Sep 14, 2023
Via arXiv