Picture for Leonid Sigal

Leonid Sigal

The Power of One: A Single Example is All it Takes for Segmentation in VLMs

Add code
Mar 13, 2025
Viaarxiv icon

BiasConnect: Investigating Bias Interactions in Text-to-Image Models

Add code
Mar 12, 2025
Viaarxiv icon

Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation

Add code
Jan 24, 2025
Viaarxiv icon

MMFactory: A Universal Solution Search Engine for Vision-Language Tasks

Add code
Dec 24, 2024
Viaarxiv icon

What Has Been Overlooked in Contrastive Source-Free Domain Adaptation: Leveraging Source-Informed Latent Augmentation within Neighborhood Context

Add code
Dec 18, 2024
Viaarxiv icon

Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images

Add code
Dec 13, 2024
Viaarxiv icon

Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses

Add code
Dec 11, 2024
Viaarxiv icon

Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events

Add code
Dec 07, 2024
Viaarxiv icon

Four-Plane Factorized Video Autoencoders

Add code
Dec 05, 2024
Viaarxiv icon

Extending Video Masked Autoencoders to 128 frames

Add code
Nov 20, 2024
Figure 1 for Extending Video Masked Autoencoders to 128 frames
Figure 2 for Extending Video Masked Autoencoders to 128 frames
Figure 3 for Extending Video Masked Autoencoders to 128 frames
Figure 4 for Extending Video Masked Autoencoders to 128 frames
Viaarxiv icon