Picture for Ming-Ming Cheng

Ming-Ming Cheng

Nankai University

Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis

Add code
Nov 11, 2024
Viaarxiv icon

Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation

Add code
Oct 29, 2024
Viaarxiv icon

ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer

Add code
Oct 18, 2024
Viaarxiv icon

OPUS: Occupancy Prediction Using a Sparse Set

Add code
Sep 14, 2024
Figure 1 for OPUS: Occupancy Prediction Using a Sparse Set
Figure 2 for OPUS: Occupancy Prediction Using a Sparse Set
Figure 3 for OPUS: Occupancy Prediction Using a Sparse Set
Figure 4 for OPUS: Occupancy Prediction Using a Sparse Set
Viaarxiv icon

From Words to Worth: Newborn Article Impact Prediction with LLM

Add code
Aug 07, 2024
Viaarxiv icon

Towards Stable 3D Object Detection

Add code
Jul 05, 2024
Viaarxiv icon

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Add code
Jun 02, 2024
Figure 1 for Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Figure 2 for Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Figure 3 for Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Figure 4 for Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
Viaarxiv icon

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Add code
May 02, 2024
Viaarxiv icon

Generative Multi-modal Models are Good Class-Incremental Learners

Add code
Mar 27, 2024
Viaarxiv icon

LSKNet: A Foundation Lightweight Backbone for Remote Sensing

Add code
Mar 22, 2024
Viaarxiv icon