Picture for Gopala Anumanchipalli

Gopala Anumanchipalli

Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech

Add code
Dec 25, 2025
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

How Do LLMs Use Their Depth?

Add code
Oct 21, 2025
Viaarxiv icon

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Add code
Aug 25, 2025
Viaarxiv icon

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness

Add code
Aug 05, 2025
Viaarxiv icon

MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real

Add code
Jul 03, 2025
Viaarxiv icon

RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding

Add code
Jun 12, 2025
Viaarxiv icon

Efficient Knowledge Editing via Minimal Precomputation

Add code
Jun 04, 2025
Viaarxiv icon

Sounding that Object: Interactive Object-Aware Image to Audio Generation

Add code
Jun 04, 2025
Viaarxiv icon

Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection

Add code
May 28, 2025
Viaarxiv icon