Picture for Junjie Zhou

Junjie Zhou

GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Add code
Dec 15, 2025
Viaarxiv icon

OmniGen2: Exploration to Advanced Multimodal Generation

Add code
Jun 23, 2025
Viaarxiv icon

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Add code
Jun 12, 2025
Viaarxiv icon

Feature Fusion Revisited: Multimodal CTR Prediction for MMCTR Challenge

Add code
Apr 26, 2025
Viaarxiv icon

DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction

Add code
Mar 12, 2025
Figure 1 for DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction
Figure 2 for DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction
Figure 3 for DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction
Figure 4 for DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction
Viaarxiv icon

Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder

Add code
Mar 12, 2025
Figure 1 for Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder
Figure 2 for Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder
Figure 3 for Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder
Figure 4 for Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder
Viaarxiv icon

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

Add code
Feb 18, 2025
Viaarxiv icon

Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval

Add code
Feb 17, 2025
Viaarxiv icon

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Add code
Dec 19, 2024
Viaarxiv icon