Picture for Mingming Gong

Mingming Gong

Admitting Ignorance Helps the Video Question Answering Models to Answer

Add code
Jan 15, 2025
Viaarxiv icon

A Two-Stage Pretraining-Finetuning Framework for Treatment Effect Estimation with Unmeasured Confounding

Add code
Jan 15, 2025
Viaarxiv icon

OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies

Add code
Dec 31, 2024
Viaarxiv icon

PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM

Add code
Dec 31, 2024
Viaarxiv icon

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Add code
Dec 25, 2024
Viaarxiv icon

Uncertainty Quantification in Stereo Matching

Add code
Dec 24, 2024
Viaarxiv icon

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Add code
Dec 12, 2024
Viaarxiv icon

Urban4D: Semantic-Guided 4D Gaussian Splatting for Urban Scene Reconstruction

Add code
Dec 04, 2024
Viaarxiv icon

Revisit Non-parametric Two-sample Testing as a Semi-supervised Learning Problem

Add code
Nov 30, 2024
Viaarxiv icon

TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On

Add code
Nov 26, 2024
Viaarxiv icon