Fanyi Xiao

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Oct 22, 2024

Gen2Det: Generate to Detect

Dec 07, 2023

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

Dec 04, 2023

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Dec 01, 2023

EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding

Sep 15, 2023

Exploring Open-Vocabulary Semantic Segmentation without Human Labels

Jun 01, 2023

Going Denser with Open-Vocabulary Part Segmentation

May 18, 2023

3rd Continual Learning Workshop Challenge on Egocentric Category and Instance Level Object Understanding

Dec 13, 2022

SCVRL: Shuffled Contrastive Video Representation Learning

May 24, 2022

Hierarchical Self-supervised Representation Learning for Movie Understanding

Apr 06, 2022