Picture for Zhiyu Tan

Zhiyu Tan

ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack

Add code
Aug 10, 2024
Viaarxiv icon

VidGen-1M: A Large-Scale Dataset for Text-to-video Generation

Add code
Aug 05, 2024
Viaarxiv icon

EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models

Add code
Jun 27, 2024
Viaarxiv icon

EvalAlign: Evaluating Text-to-Image Models through Precision Alignment of Multimodal Large Models with Supervised Fine-Tuning to Human Annotations

Add code
Jun 24, 2024
Viaarxiv icon

Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation

Add code
Mar 08, 2024
Viaarxiv icon

OVO: Open-Vocabulary Occupancy

Add code
May 25, 2023
Figure 1 for OVO: Open-Vocabulary Occupancy
Figure 2 for OVO: Open-Vocabulary Occupancy
Figure 3 for OVO: Open-Vocabulary Occupancy
Figure 4 for OVO: Open-Vocabulary Occupancy
Viaarxiv icon

Entroformer: A Transformer-based Entropy Model for Learned Image Compression

Add code
Feb 11, 2022
Figure 1 for Entroformer: A Transformer-based Entropy Model for Learned Image Compression
Figure 2 for Entroformer: A Transformer-based Entropy Model for Learned Image Compression
Figure 3 for Entroformer: A Transformer-based Entropy Model for Learned Image Compression
Figure 4 for Entroformer: A Transformer-based Entropy Model for Learned Image Compression
Viaarxiv icon

GiraffeDet: A Heavy-Neck Paradigm for Object Detection

Add code
Feb 09, 2022
Figure 1 for GiraffeDet: A Heavy-Neck Paradigm for Object Detection
Figure 2 for GiraffeDet: A Heavy-Neck Paradigm for Object Detection
Figure 3 for GiraffeDet: A Heavy-Neck Paradigm for Object Detection
Figure 4 for GiraffeDet: A Heavy-Neck Paradigm for Object Detection
Viaarxiv icon

Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search

Add code
Nov 26, 2021
Figure 1 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Figure 2 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Figure 3 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Figure 4 for Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search
Viaarxiv icon

Interpolation variable rate image compression

Add code
Sep 20, 2021
Figure 1 for Interpolation variable rate image compression
Figure 2 for Interpolation variable rate image compression
Figure 3 for Interpolation variable rate image compression
Figure 4 for Interpolation variable rate image compression
Viaarxiv icon