Picture for Zhengrui Ma

Zhengrui Ma

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Add code
Sep 10, 2024
Figure 1 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Figure 2 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Figure 3 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Figure 4 for LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Viaarxiv icon

Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models

Add code
Jun 12, 2024
Viaarxiv icon

Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?

Add code
Jun 11, 2024
Viaarxiv icon

CTC-based Non-autoregressive Textless Speech-to-Speech Translation

Add code
Jun 11, 2024
Viaarxiv icon

A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation

Add code
Jun 11, 2024
Viaarxiv icon

StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning

Add code
Jun 05, 2024
Viaarxiv icon

SiLLM: Large Language Models for Simultaneous Machine Translation

Add code
Feb 20, 2024
Viaarxiv icon

Non-autoregressive Machine Translation with Probabilistic Context-free Grammar

Add code
Nov 14, 2023
Viaarxiv icon

Beyond MLE: Convex Learning for Text Generation

Add code
Oct 26, 2023
Viaarxiv icon

Non-autoregressive Streaming Transformer for Simultaneous Translation

Add code
Oct 23, 2023
Viaarxiv icon