Picture for Fei Yu

Fei Yu

MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach

Add code
Mar 31, 2025
Viaarxiv icon

Object Isolated Attention for Consistent Story Visualization

Add code
Mar 30, 2025
Viaarxiv icon

UniSync: A Unified Framework for Audio-Visual Synchronization

Add code
Mar 20, 2025
Viaarxiv icon

Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation

Add code
Feb 27, 2025
Viaarxiv icon

Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following

Add code
Feb 24, 2025
Viaarxiv icon

Scene Understanding Enabled Semantic Communication with Open Channel Coding

Add code
Jan 24, 2025
Figure 1 for Scene Understanding Enabled Semantic Communication with Open Channel Coding
Figure 2 for Scene Understanding Enabled Semantic Communication with Open Channel Coding
Figure 3 for Scene Understanding Enabled Semantic Communication with Open Channel Coding
Figure 4 for Scene Understanding Enabled Semantic Communication with Open Channel Coding
Viaarxiv icon

Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models

Add code
Jan 09, 2025
Viaarxiv icon

Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering

Add code
Dec 30, 2024
Figure 1 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Figure 2 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Figure 3 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Figure 4 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Viaarxiv icon

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion

Add code
Dec 16, 2024
Figure 1 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 2 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 3 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 4 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Viaarxiv icon

Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization

Add code
Dec 09, 2024
Viaarxiv icon