Picture for Boqing Gong

Boqing Gong

Neptune: The Long Orbit to Benchmarking Long Video Understanding

Add code
Dec 12, 2024
Viaarxiv icon

Diffusion Autoencoders for Few-shot Image Generation in Hyperbolic Space

Add code
Nov 27, 2024
Viaarxiv icon

Extending Video Masked Autoencoders to 128 frames

Add code
Nov 20, 2024
Viaarxiv icon

OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities

Add code
Oct 16, 2024
Viaarxiv icon

$ε$-VAE: Denoising as Visual Decoding

Add code
Oct 05, 2024
Viaarxiv icon

SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining

Add code
Sep 26, 2024
Figure 1 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 2 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 3 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Figure 4 for SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
Viaarxiv icon

On Discrete Prompt Optimization for Diffusion Models

Add code
Jun 27, 2024
Viaarxiv icon

Understanding the Impact of Negative Prompts: When and How Do They Take Effect?

Add code
Jun 05, 2024
Viaarxiv icon

The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise

Add code
Jun 04, 2024
Viaarxiv icon

Automatic Jailbreaking of the Text-to-Image Generative AI Systems

Add code
May 28, 2024
Viaarxiv icon