Picture for Minsu Cho

Minsu Cho

Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards

Add code
Mar 05, 2026
Viaarxiv icon

Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

Add code
Mar 05, 2026
Viaarxiv icon

Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping

Add code
Feb 25, 2026
Viaarxiv icon

Vision-aligned Latent Reasoning for Multi-modal Large Language Model

Add code
Feb 04, 2026
Viaarxiv icon

MV-SAM: Multi-view Promptable Segmentation using Pointmap Guidance

Add code
Jan 25, 2026
Viaarxiv icon

DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning

Add code
Jan 22, 2026
Viaarxiv icon

Affostruction: 3D Affordance Grounding with Generative Reconstruction

Add code
Jan 14, 2026
Viaarxiv icon

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Add code
Dec 24, 2025
Figure 1 for Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting
Figure 2 for Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting
Figure 3 for Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting
Figure 4 for Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting
Viaarxiv icon

PanoGrounder: Bridging 2D and 3D with Panoramic Scene Representations for VLM-based 3D Visual Grounding

Add code
Dec 24, 2025
Viaarxiv icon

Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection

Add code
Nov 05, 2025
Figure 1 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 2 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 3 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 4 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Viaarxiv icon