Picture for Lei Ma

Lei Ma

Kyushu University

S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder

Add code
Jun 16, 2025
Viaarxiv icon

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

Add code
Jun 12, 2025
Viaarxiv icon

ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech

Add code
May 20, 2025
Viaarxiv icon

The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models

Add code
May 18, 2025
Viaarxiv icon

Risk Assessment Framework for Code LLMs via Leveraging Internal States

Add code
Apr 20, 2025
Viaarxiv icon

PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks

Add code
Mar 31, 2025
Viaarxiv icon

Pre-trained Models Succeed in Medical Imaging with Representation Similarity Degradation

Add code
Mar 11, 2025
Viaarxiv icon

Keeping Representation Similarity in Finetuning for Medical Image Analysis

Add code
Mar 10, 2025
Viaarxiv icon

From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning

Add code
Mar 08, 2025
Viaarxiv icon

Aligning Instruction Tuning with Pre-training

Add code
Jan 16, 2025
Figure 1 for Aligning Instruction Tuning with Pre-training
Figure 2 for Aligning Instruction Tuning with Pre-training
Figure 3 for Aligning Instruction Tuning with Pre-training
Figure 4 for Aligning Instruction Tuning with Pre-training
Viaarxiv icon