Picture for Zhibin Lan

Zhibin Lan

Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training

Add code
Oct 06, 2024
Viaarxiv icon

Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation

Add code
Jul 03, 2024
Viaarxiv icon

A Survey on Multi-modal Machine Translation: Tasks, Methods and Challenges

Add code
May 23, 2024
Viaarxiv icon

Exploring Better Text Image Translation with Multimodal Codebook

Add code
Jun 02, 2023
Viaarxiv icon