Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:One-Shot Multilingual Font Generation Via ViT

Dec 15, 2024

Zhiheng Wang, Jiarui Liu

Figure 1 for One-Shot Multilingual Font Generation Via ViT

Figure 2 for One-Shot Multilingual Font Generation Via ViT

Figure 3 for One-Shot Multilingual Font Generation Via ViT

Figure 4 for One-Shot Multilingual Font Generation Via ViT

Share this with someone who'll enjoy it:

Abstract:Font design poses unique challenges for logographic languages like Chinese, Japanese, and Korean (CJK), where thousands of unique characters must be individually crafted. This paper introduces a novel Vision Transformer (ViT)-based model for multi-language font generation, effectively addressing the complexities of both logographic and alphabetic scripts. By leveraging ViT and pretraining with a strong visual pretext task (Masked Autoencoding, MAE), our model eliminates the need for complex design components in prior frameworks while achieving comprehensive results with enhanced generalizability. Remarkably, it can generate high-quality fonts across multiple languages for unseen, unknown, and even user-crafted characters. Additionally, we integrate a Retrieval-Augmented Guidance (RAG) module to dynamically retrieve and adapt style references, improving scalability and real-world applicability. We evaluated our approach in various font generation tasks, demonstrating its effectiveness, adaptability, and scalability.

View paper on

Share this with someone who'll enjoy it:

Title:One-Shot Multilingual Font Generation Via ViT

Paper and Code