Wenjun Zeng

Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction

Dec 11, 2024 (via arXiv)

Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty

Dec 09, 2024 (via arXiv)

UniScene: Unified Occupancy-centric Driving Scene Generation

Dec 06, 2024 (via arXiv)

MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations

Oct 17, 2024 (via arXiv)

Open-World Reinforcement Learning over Long Short-Term Imagination

Oct 04, 2024 (via arXiv)

Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation

Oct 01, 2024 (via arXiv)

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs

Aug 16, 2024 (via arXiv)

ShieldGemma: Generative AI Content Moderation Based on Gemma

Jul 31, 2024 (via arXiv)

Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models

Jul 26, 2024 (via arXiv)

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Jul 17, 2024 (via arXiv)