Mingming Gong

Agent-Centric Personalized Multiple Clustering with Multi-Modal LLMs

Mar 31, 2025

Analytic DAG Constraints for Differentiable DAG Learning

Mar 24, 2025

Hyper3D: Efficient 3D Representation via Hybrid Triplane and Octree Feature for Enhanced 3D Shape Variational Auto-Encoders

Mar 13, 2025

I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?

Mar 12, 2025

MF-VITON: High-Fidelity Mask-Free Virtual Try-On with Minimal Input

Mar 11, 2025

Admitting Ignorance Helps the Video Question Answering Models to Answer

Jan 15, 2025

A Two-Stage Pretraining-Finetuning Framework for Treatment Effect Estimation with Unmeasured Confounding

Jan 15, 2025

PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM

Dec 31, 2024

OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies

Dec 31, 2024

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Dec 25, 2024