Picture for Chulayuth Asawaroengchai

Chulayuth Asawaroengchai

STAB: Speech Tokenizer Assessment Benchmark

Add code
Sep 04, 2024
Figure 1 for STAB: Speech Tokenizer Assessment Benchmark
Figure 2 for STAB: Speech Tokenizer Assessment Benchmark
Figure 3 for STAB: Speech Tokenizer Assessment Benchmark
Figure 4 for STAB: Speech Tokenizer Assessment Benchmark
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Jun 22, 2023
Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

Translatotron 3: Speech to Speech Translation with Monolingual Data

Add code
Jun 01, 2023
Viaarxiv icon