Picture for Jian Zhu

Jian Zhu

University of British Columbia

Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos

Add code
Nov 19, 2025
Figure 1 for Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Figure 2 for Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Figure 3 for Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Figure 4 for Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Viaarxiv icon

Boomda: Balanced Multi-objective Optimization for Multimodal Domain Adaptation

Add code
Nov 11, 2025
Viaarxiv icon

MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering

Add code
Nov 08, 2025
Figure 1 for MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering
Figure 2 for MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering
Figure 3 for MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering
Figure 4 for MoEGCL: Mixture of Ego-Graphs Contrastive Representation Learning for Multi-View Clustering
Viaarxiv icon

Understanding In-Context Learning Beyond Transformers: An Investigation of State Space and Hybrid Architectures

Add code
Oct 27, 2025
Viaarxiv icon

ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning

Add code
Oct 01, 2025
Figure 1 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 2 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 3 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Figure 4 for ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
Viaarxiv icon

Generative Diffusion Contrastive Network for Multi-View Clustering

Add code
Sep 11, 2025
Viaarxiv icon

CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities

Add code
Aug 20, 2025
Viaarxiv icon

NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer

Add code
Aug 14, 2025
Viaarxiv icon

FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer

Add code
Aug 07, 2025
Viaarxiv icon

ZIPA: A family of efficient models for multilingual phone recognition

Add code
May 29, 2025
Viaarxiv icon