Jiaqi Wang

Michael Pokorny

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies

Mar 19, 2025

Unified Reward Model for Multimodal Understanding and Generation

Mar 07, 2025

Visual-RFT: Visual Reinforcement Fine-Tuning

Mar 03, 2025

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Feb 25, 2025

DivIL: Unveiling and Addressing Over-Invariance for Out-of-Distribution Generalization

Feb 18, 2025

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Feb 18, 2025

Oversmoothing as Loss of Sign: Towards Structural Balance in Graph Neural Networks

Feb 17, 2025

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Feb 12, 2025

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Feb 07, 2025

ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model

Feb 05, 2025