Picture for Shifeng Zhang

Shifeng Zhang

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Add code
Nov 22, 2024
Viaarxiv icon

Generating Compositional Scenes via Text-to-image RGBA Instance Generation

Add code
Nov 16, 2024
Viaarxiv icon

Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look

Add code
Oct 16, 2024
Figure 1 for Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look
Figure 2 for Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look
Figure 3 for Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look
Figure 4 for Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look
Viaarxiv icon

Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model

Add code
Jun 25, 2024
Figure 1 for Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model
Figure 2 for Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model
Figure 3 for Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model
Figure 4 for Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model
Viaarxiv icon

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

Add code
Apr 03, 2024
Viaarxiv icon

V2X-PC: Vehicle-to-everything Collaborative Perception via Point Cluster

Add code
Mar 25, 2024
Viaarxiv icon

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

Add code
Mar 07, 2024
Viaarxiv icon

Accelerating Diffusion Sampling with Optimized Time Steps

Add code
Feb 27, 2024
Viaarxiv icon

Optimisation-Based Multi-Modal Semantic Image Editing

Add code
Nov 28, 2023
Viaarxiv icon

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models

Add code
Sep 10, 2023
Viaarxiv icon