Picture for Chunhui Zhang

Chunhui Zhang

SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models

Add code
Jun 15, 2025
Viaarxiv icon

Music's Multimodal Complexity in AVQA: Why We Need More than General Multimodal LLMs

Add code
May 27, 2025
Viaarxiv icon

Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks

Add code
Apr 28, 2025
Viaarxiv icon

ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs

Add code
Apr 02, 2025
Viaarxiv icon

COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking

Add code
Apr 02, 2025
Viaarxiv icon

Scaled Supervision is an Implicit Lipschitz Regularizer

Add code
Mar 19, 2025
Viaarxiv icon

Superficial Self-Improved Reasoners Benefit from Model Merging

Add code
Mar 03, 2025
Viaarxiv icon

Modality-Aware Neuron Pruning for Unlearning in Multimodal Large Language Models

Add code
Feb 21, 2025
Viaarxiv icon

Pretrained Image-Text Models are Secretly Video Captioners

Add code
Feb 19, 2025
Viaarxiv icon

Learning Musical Representations for Music Performance Question Answering

Add code
Feb 10, 2025
Viaarxiv icon