Picture for Yutong Lin

Yutong Lin

Efficient and Economic Large Language Model Inference with Attention Offloading

Add code
May 03, 2024
Viaarxiv icon

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

Add code
Aug 08, 2023
Viaarxiv icon

DETR Doesn't Need Multi-Scale or Locality Design

Add code
Aug 03, 2023
Viaarxiv icon

Could Giant Pretrained Image Models Extract Universal Representations?

Add code
Nov 03, 2022
Figure 1 for Could Giant Pretrained Image Models Extract Universal Representations?
Figure 2 for Could Giant Pretrained Image Models Extract Universal Representations?
Figure 3 for Could Giant Pretrained Image Models Extract Universal Representations?
Figure 4 for Could Giant Pretrained Image Models Extract Universal Representations?
Viaarxiv icon

On Data Scaling in Masked Image Modeling

Add code
Jun 09, 2022
Figure 1 for On Data Scaling in Masked Image Modeling
Figure 2 for On Data Scaling in Masked Image Modeling
Figure 3 for On Data Scaling in Masked Image Modeling
Figure 4 for On Data Scaling in Masked Image Modeling
Viaarxiv icon

A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model

Add code
Dec 29, 2021
Figure 1 for A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model
Figure 2 for A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model
Figure 3 for A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model
Figure 4 for A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model
Viaarxiv icon

SimMIM: A Simple Framework for Masked Image Modeling

Add code
Nov 18, 2021
Figure 1 for SimMIM: A Simple Framework for Masked Image Modeling
Figure 2 for SimMIM: A Simple Framework for Masked Image Modeling
Figure 3 for SimMIM: A Simple Framework for Masked Image Modeling
Figure 4 for SimMIM: A Simple Framework for Masked Image Modeling
Viaarxiv icon

Swin Transformer V2: Scaling Up Capacity and Resolution

Add code
Nov 18, 2021
Figure 1 for Swin Transformer V2: Scaling Up Capacity and Resolution
Figure 2 for Swin Transformer V2: Scaling Up Capacity and Resolution
Figure 3 for Swin Transformer V2: Scaling Up Capacity and Resolution
Figure 4 for Swin Transformer V2: Scaling Up Capacity and Resolution
Viaarxiv icon

Bootstrap Your Object Detector via Mixed Training

Add code
Nov 04, 2021
Figure 1 for Bootstrap Your Object Detector via Mixed Training
Figure 2 for Bootstrap Your Object Detector via Mixed Training
Figure 3 for Bootstrap Your Object Detector via Mixed Training
Figure 4 for Bootstrap Your Object Detector via Mixed Training
Viaarxiv icon

Self-Supervised Learning with Swin Transformers

Add code
May 11, 2021
Figure 1 for Self-Supervised Learning with Swin Transformers
Figure 2 for Self-Supervised Learning with Swin Transformers
Figure 3 for Self-Supervised Learning with Swin Transformers
Figure 4 for Self-Supervised Learning with Swin Transformers
Viaarxiv icon