Ruiming Xie

Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition

Dec 19, 2024

Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens

Oct 04, 2024

AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition

Jan 18, 2024

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Apr 03, 2023