Picture for Boyang Li

Boyang Li

SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models

Add code
Dec 17, 2024
Viaarxiv icon

Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses

Add code
Dec 11, 2024
Viaarxiv icon

Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events

Add code
Dec 07, 2024
Viaarxiv icon

Paint Outside the Box: Synthesizing and Selecting Training Data for Visual Grounding

Add code
Dec 01, 2024
Viaarxiv icon

KaLDeX: Kalman Filter based Linear Deformable Cross Attention for Retina Vessel Segmentation

Add code
Oct 28, 2024
Viaarxiv icon

Generating Synthetic Datasets for Few-shot Prompt Tuning

Add code
Oct 08, 2024
Viaarxiv icon

The First Competition on Resource-Limited Infrared Small Target Detection Challenge: Methods and Results

Add code
Aug 18, 2024
Viaarxiv icon

A Training Data Recipe to Accelerate A* Search with Language Models

Add code
Jul 13, 2024
Figure 1 for A Training Data Recipe to Accelerate A* Search with Language Models
Figure 2 for A Training Data Recipe to Accelerate A* Search with Language Models
Figure 3 for A Training Data Recipe to Accelerate A* Search with Language Models
Figure 4 for A Training Data Recipe to Accelerate A* Search with Language Models
Viaarxiv icon

Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines

Add code
Jun 20, 2024
Viaarxiv icon

Multilingual Synopses of Movie Narratives: A Dataset for Story Understanding

Add code
Jun 18, 2024
Figure 1 for Multilingual Synopses of Movie Narratives: A Dataset for Story Understanding
Figure 2 for Multilingual Synopses of Movie Narratives: A Dataset for Story Understanding
Figure 3 for Multilingual Synopses of Movie Narratives: A Dataset for Story Understanding
Figure 4 for Multilingual Synopses of Movie Narratives: A Dataset for Story Understanding
Viaarxiv icon