Picture for Yuxuan Cai

Yuxuan Cai

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

Add code
Oct 21, 2024
Viaarxiv icon

Allegro: Open the Black Box of Commercial-Level Video Generation Model

Add code
Oct 20, 2024
Figure 1 for Allegro: Open the Black Box of Commercial-Level Video Generation Model
Figure 2 for Allegro: Open the Black Box of Commercial-Level Video Generation Model
Figure 3 for Allegro: Open the Black Box of Commercial-Level Video Generation Model
Figure 4 for Allegro: Open the Black Box of Commercial-Level Video Generation Model
Viaarxiv icon

Attention-Guided Perturbation for Unsupervised Image Anomaly Detection

Add code
Aug 14, 2024
Viaarxiv icon

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Add code
Jun 06, 2024
Viaarxiv icon

High-Performance Temporal Reversible Spiking Neural Networks with $O(L)$ Training Memory and $O(1)$ Inference Cost

Add code
May 26, 2024
Viaarxiv icon

Anomaly Detection by Adapting a pre-trained Vision Language Model

Add code
Mar 14, 2024
Viaarxiv icon

Yi: Open Foundation Models by 01.AI

Add code
Mar 07, 2024
Figure 1 for Yi: Open Foundation Models by 01.AI
Figure 2 for Yi: Open Foundation Models by 01.AI
Figure 3 for Yi: Open Foundation Models by 01.AI
Figure 4 for Yi: Open Foundation Models by 01.AI
Viaarxiv icon

A Discrepancy Aware Framework for Robust Anomaly Detection

Add code
Oct 11, 2023
Viaarxiv icon

RevColV2: Exploring Disentangled Representations in Masked Image Modeling

Add code
Sep 02, 2023
Viaarxiv icon

Reversible Column Networks

Add code
Dec 22, 2022
Viaarxiv icon