Picture for Nima Mesgarani

Nima Mesgarani

MeanFlow-TSE: One-Step Generative Target Speaker Extraction with Mean Flow

Add code
Dec 21, 2025
Viaarxiv icon

Far from the Shallow: Brain-Predictive Reasoning Embedding through Residual Disentanglement

Add code
Oct 26, 2025
Viaarxiv icon

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

Add code
Sep 19, 2025
Viaarxiv icon

Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations

Add code
Sep 19, 2025
Viaarxiv icon

ZeroSep: Separate Anything in Audio with Zero Training

Add code
May 29, 2025
Viaarxiv icon

Bridging Ears and Eyes: Analyzing Audio and Visual Large Language Models to Humans in Visible Sound Recognition and Reducing Their Sensory Gap via Cross-Modal Distillation

Add code
May 11, 2025
Viaarxiv icon

XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs

Add code
Feb 27, 2025
Figure 1 for XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs
Figure 2 for XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs
Figure 3 for XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs
Figure 4 for XCOMPS: A Multilingual Benchmark of Conceptual Minimal Pairs
Viaarxiv icon

AAD-LLM: Neural Attention-Driven Auditory Scene Understanding

Add code
Feb 24, 2025
Figure 1 for AAD-LLM: Neural Attention-Driven Auditory Scene Understanding
Figure 2 for AAD-LLM: Neural Attention-Driven Auditory Scene Understanding
Figure 3 for AAD-LLM: Neural Attention-Driven Auditory Scene Understanding
Figure 4 for AAD-LLM: Neural Attention-Driven Auditory Scene Understanding
Viaarxiv icon

Exploring Finetuned Audio-LLM on Heart Murmur Features

Add code
Jan 23, 2025
Figure 1 for Exploring Finetuned Audio-LLM on Heart Murmur Features
Figure 2 for Exploring Finetuned Audio-LLM on Heart Murmur Features
Figure 3 for Exploring Finetuned Audio-LLM on Heart Murmur Features
Figure 4 for Exploring Finetuned Audio-LLM on Heart Murmur Features
Viaarxiv icon

Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning

Add code
Nov 12, 2024
Figure 1 for Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning
Figure 2 for Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning
Figure 3 for Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning
Figure 4 for Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning
Viaarxiv icon