Picture for Mingming Gong

Mingming Gong

Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders

Add code
Mar 13, 2025
Viaarxiv icon

I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

Add code
Mar 12, 2025
Viaarxiv icon

MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input

Add code
Mar 11, 2025
Viaarxiv icon

A Two-Stage Pretraining-Finetuning Framework for Treatment Effect Estimation with Unmeasured Confounding

Add code
Jan 15, 2025
Viaarxiv icon

Admitting Ignorance Helps the Video Question Answering Models to Answer

Add code
Jan 15, 2025
Viaarxiv icon

OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies

Add code
Dec 31, 2024
Viaarxiv icon

PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM

Add code
Dec 31, 2024
Figure 1 for PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Figure 2 for PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Figure 3 for PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Figure 4 for PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM
Viaarxiv icon

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Add code
Dec 25, 2024
Figure 1 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 2 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 3 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Figure 4 for UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
Viaarxiv icon

Uncertainty Quantification in Stereo Matching

Add code
Dec 24, 2024
Figure 1 for Uncertainty Quantification in Stereo Matching
Figure 2 for Uncertainty Quantification in Stereo Matching
Figure 3 for Uncertainty Quantification in Stereo Matching
Figure 4 for Uncertainty Quantification in Stereo Matching
Viaarxiv icon

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Add code
Dec 12, 2024
Viaarxiv icon