Stephen Lin

You Only Need Less Attention at Each Stage in Vision Transformers

Jun 01, 2024

Image to Pseudo-Episode: Boosting Few-Shot Segmentation by Unlabeled Data

May 14, 2024

Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence

Mar 17, 2024

Collaboratively Self-supervised Video Representation Learning for Action Recognition

Jan 15, 2024

Exploring Transferability for Randomized Smoothing

Dec 14, 2023

NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time Series Pretraining

Oct 12, 2023

Associative Transformer Is A Sparse Representation Learner

Sep 22, 2023

Randomized Quantization for Data Agnostic Representation Learning

Dec 19, 2022

ClipCrop: Conditioned Cropping Driven by Vision-Language Model

Nov 21, 2022

Local Magnification for Data and Feature Augmentation

Nov 15, 2022