Picture for Gopala Anumanchipalli

Gopala Anumanchipalli

Asymmetric Hierarchical Anchoring for Audio-Visual Joint Representation: Resolving Information Allocation Ambiguity for Robust Cross-Modal Generalization

Add code
Feb 03, 2026
Viaarxiv icon

HuPER: A Human-Inspired Framework for Phonetic Perception

Add code
Feb 02, 2026
Viaarxiv icon

Evolutionary Strategies lead to Catastrophic Forgetting in LLMs

Add code
Jan 28, 2026
Viaarxiv icon

Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech

Add code
Dec 25, 2025
Figure 1 for Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech
Figure 2 for Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech
Figure 3 for Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech
Figure 4 for Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

How Do LLMs Use Their Depth?

Add code
Oct 21, 2025
Viaarxiv icon

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Add code
Aug 25, 2025
Viaarxiv icon

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness

Add code
Aug 05, 2025
Viaarxiv icon

MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real

Add code
Jul 03, 2025
Viaarxiv icon

RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding

Add code
Jun 12, 2025
Viaarxiv icon