Bozheng Li

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Dec 12, 2024
Via arXiv

Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition

Aug 22, 2024
Via arXiv

Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning

Aug 22, 2024
Via arXiv

OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning

Aug 12, 2024
Via arXiv

Fully Fine-tuned CLIP Models are Efficient Few-Shot Learners

Jul 04, 2024
Via arXiv

Zero-Shot Long-Form Video Understanding through Screenplay

Jun 25, 2024
Via arXiv