Picture for Bin Li

Bin Li

Member, IEEE

Event Signal Filtering via Probability Flux Estimation

Add code
Apr 10, 2025
Viaarxiv icon

Hierarchical Modeling for Medical Visual Question Answering with Cross-Attention Fusion

Add code
Apr 10, 2025
Viaarxiv icon

EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively

Add code
Apr 09, 2025
Viaarxiv icon

MSA-UNet3+: Multi-Scale Attention UNet3+ with New Supervised Prototypical Contrastive Loss for Coronary DSA Image Segmentation

Add code
Apr 07, 2025
Viaarxiv icon

Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models

Add code
Mar 24, 2025
Viaarxiv icon

Multi-modal Multi-platform Person Re-Identification: Benchmark and Method

Add code
Mar 21, 2025
Viaarxiv icon

UMIT: Unifying Medical Imaging Tasks via Vision-Language Models

Add code
Mar 20, 2025
Viaarxiv icon

DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling

Add code
Mar 19, 2025
Viaarxiv icon

Robot Skin with Touch and Bend Sensing using Electrical Impedance Tomography

Add code
Mar 17, 2025
Viaarxiv icon

Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation

Add code
Mar 13, 2025
Viaarxiv icon