Yitian Zhang

Stephen

Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces

Jan 09, 2025

Slicing Vision Transformer for Flexible Inference

Dec 06, 2024

SFDFusion: An Efficient Spatial-Frequency Domain Fusion Network for Infrared and Visible Image Fusion

Oct 30, 2024

Accessing Vision Foundation Models at ImageNet-level Costs

Jul 15, 2024

CKGConv: General Graph Convolution with Continuous Kernels

Apr 21, 2024

Don't Judge by the Look: Towards Motion Coherent Video Representation

Mar 25, 2024

Multi-resolution Time-Series Transformer for Long-term Forecasting

Nov 07, 2023

Frame Flexible Network

Mar 26, 2023

Look More but Care Less in Video Recognition

Nov 18, 2022

Parameter-Efficient Masking Networks

Oct 13, 2022