Shaofeng Zhang

Motion Control for Enhanced Complex Action Video Generation

Nov 13, 2024
Via arXiv

Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation

Nov 04, 2024
Via arXiv

PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders

Aug 16, 2024
Via arXiv

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Jun 26, 2024
Via arXiv

Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach

Jan 28, 2024
Via arXiv

GMTR: Graph Matching Transformers

Nov 14, 2023
Via arXiv

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Oct 31, 2023
Via arXiv

REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets

Oct 10, 2023
Via arXiv

RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension

Aug 03, 2023
Via arXiv

Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning

Jun 23, 2023
Via arXiv