Xiaochuang Han

Inference-time Physics Alignment of Video Generative Models with Latent World Models

Jan 15, 2026

Unified Text-Image Generation with Weakness-Targeted Post-Training

Jan 07, 2026

MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

Jun 09, 2025

FACTS&EVIDENCE: An Interactive Tool for Transparent Fine-Grained Factual Verification of Machine-Generated Text

Mar 19, 2025

When One LLM Drools, Multi-LLM Collaboration Rules

Feb 06, 2025

LMFusion: Adapting Pretrained Language Models for Multimodal Generation

Dec 26, 2024

LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

Dec 19, 2024

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Aug 15, 2024

Can LLM Graph Reasoning Generalize beyond Pattern Memorization?

Jun 23, 2024

Tuning Language Models by Proxy

Jan 16, 2024