Picture for Jacob Zhiyuan Fang

Jacob Zhiyuan Fang

FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation

Add code
May 08, 2024
Viaarxiv icon

E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer

Add code
Nov 28, 2023
Viaarxiv icon

Text-to-image Editing by Image Information Removal

Add code
May 27, 2023
Viaarxiv icon