Picture for Lei Ma

Lei Ma

Kyushu University

Motus: A Unified Latent Action World Model

Add code
Dec 15, 2025
Viaarxiv icon

On-the-fly Large-scale 3D Reconstruction from Multi-Camera Rigs

Add code
Dec 09, 2025
Viaarxiv icon

TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning

Add code
Aug 06, 2025
Viaarxiv icon

FaRMamba: Frequency-based learning and Reconstruction aided Mamba for Medical Segmentation

Add code
Jul 26, 2025
Viaarxiv icon

S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder

Add code
Jun 16, 2025
Figure 1 for S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder
Figure 2 for S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder
Viaarxiv icon

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

Add code
Jun 12, 2025
Viaarxiv icon

ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech

Add code
May 20, 2025
Figure 1 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Figure 2 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Figure 3 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Figure 4 for ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech
Viaarxiv icon

The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models

Add code
May 18, 2025
Figure 1 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Figure 2 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Figure 3 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Figure 4 for The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Viaarxiv icon

Risk Assessment Framework for Code LLMs via Leveraging Internal States

Add code
Apr 20, 2025
Viaarxiv icon

PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks

Add code
Mar 31, 2025
Viaarxiv icon