Picture for Shuailei Ma

Shuailei Ma

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Add code
Oct 07, 2024
Figure 1 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 2 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 3 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 4 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Viaarxiv icon

ControlMTR: Control-Guided Motion Transformer with Scene-Compliant Intention Points for Feasible Motion Prediction

Add code
Apr 17, 2024
Viaarxiv icon

CoReS: Orchestrating the Dance of Reasoning and Segmentation

Add code
Apr 08, 2024
Viaarxiv icon

DreamLIP: Language-Image Pre-training with Long Captions

Add code
Mar 25, 2024
Viaarxiv icon

Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model

Add code
Dec 18, 2023
Viaarxiv icon

A Simple Knowledge Distillation Framework for Open-world Object Detection

Add code
Dec 14, 2023
Viaarxiv icon

Detecting the open-world objects with the help of the Brain

Add code
Mar 21, 2023
Viaarxiv icon

FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection

Add code
Jan 08, 2023
Viaarxiv icon

CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection

Add code
Jan 05, 2023
Viaarxiv icon