Picture for Serge Belongie

Serge Belongie

Cornell Tech

Taxonomy-Aware Evaluation of Vision-Language Models

Add code
Apr 07, 2025
Viaarxiv icon

Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models

Add code
Apr 03, 2025
Viaarxiv icon

Multi-Modal Framing Analysis of News

Add code
Mar 26, 2025
Viaarxiv icon

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model

Add code
Mar 20, 2025
Viaarxiv icon

Gradient Imbalance in Direct Preference Optimization

Add code
Feb 28, 2025
Viaarxiv icon

Bayesian Optimization for Controlled Image Editing via LLMs

Add code
Feb 26, 2025
Figure 1 for Bayesian Optimization for Controlled Image Editing via LLMs
Figure 2 for Bayesian Optimization for Controlled Image Editing via LLMs
Figure 3 for Bayesian Optimization for Controlled Image Editing via LLMs
Figure 4 for Bayesian Optimization for Controlled Image Editing via LLMs
Viaarxiv icon

Learning to Learn Weight Generation via Trajectory Diffusion

Add code
Feb 03, 2025
Figure 1 for Learning to Learn Weight Generation via Trajectory Diffusion
Figure 2 for Learning to Learn Weight Generation via Trajectory Diffusion
Figure 3 for Learning to Learn Weight Generation via Trajectory Diffusion
Figure 4 for Learning to Learn Weight Generation via Trajectory Diffusion
Viaarxiv icon

Explaining Context Length Scaling and Bounds for Language Models

Add code
Feb 03, 2025
Viaarxiv icon

Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes

Add code
Jan 23, 2025
Viaarxiv icon

Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation

Add code
Oct 29, 2024
Figure 1 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Figure 2 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Figure 3 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Figure 4 for Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation
Viaarxiv icon