Hilde Kuehne

VideoGEM: Training-free Action Grounding in Videos

Mar 26, 2025
Via arXiv

Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks

Mar 24, 2025
Via arXiv

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

Feb 03, 2025
Via arXiv

TimeLogic: A Temporal Logic Benchmark for Video QA

Jan 13, 2025
Via arXiv

State-Space Large Audio Language Models

Nov 24, 2024
Via arXiv

Teaching VLMs to Localize Specific Objects from In-context Examples

Nov 20, 2024
Via arXiv

Convolutional Differentiable Logic Gate Networks

Nov 07, 2024
Via arXiv

Newton Losses: Using Curvature Information for Learning with Differentiable Algorithms

Oct 24, 2024
Via arXiv

MaskInversion: Localized Embeddings via Optimization of Explainability Maps

Jul 29, 2024
Via arXiv

DASS: Distilled Audio State Space Models Are Stronger and More Duration-Scalable Learners

Jul 04, 2024
Via arXiv