Picture for Hongyang Wei

Hongyang Wei

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Add code
Jan 29, 2026
Viaarxiv icon

Skywork UniPic 3.0: Unified Multi-Image Composition via Sequence Modeling

Add code
Jan 22, 2026
Viaarxiv icon

MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition

Add code
Dec 08, 2025
Figure 1 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Figure 2 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Figure 3 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Figure 4 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Viaarxiv icon

Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model

Add code
Sep 04, 2025
Figure 1 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Figure 2 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Figure 3 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Figure 4 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Viaarxiv icon

Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models

Add code
Mar 14, 2025
Figure 1 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Figure 2 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Figure 3 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Figure 4 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Viaarxiv icon

EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene

Add code
Dec 20, 2024
Figure 1 for EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene
Figure 2 for EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene
Figure 3 for EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene
Figure 4 for EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene
Viaarxiv icon

Sparse Laneformer

Add code
Apr 11, 2024
Figure 1 for Sparse Laneformer
Figure 2 for Sparse Laneformer
Figure 3 for Sparse Laneformer
Figure 4 for Sparse Laneformer
Viaarxiv icon

ADD: An Automatic Desensitization Fisheye Dataset for Autonomous Driving

Add code
Aug 15, 2023
Viaarxiv icon