Picture for Lingtong Min

Lingtong Min

Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP

Add code
Dec 13, 2024
Viaarxiv icon

Task-Adapter: Task-specific Adaptation of Image Models for Few-shot Action Recognition

Add code
Aug 01, 2024
Viaarxiv icon

VS-TransGRU: A Novel Transformer-GRU-based Framework Enhanced by Visual-Semantic Fusion for Egocentric Action Anticipation

Add code
Jul 08, 2023
Viaarxiv icon

Cross-Spatial Pixel Integration and Cross-Stage Feature Fusion Based Transformer Network for Remote Sensing Image Super-Resolution

Add code
Jul 06, 2023
Viaarxiv icon