Picture for Hsin-Min Wang

Hsin-Min Wang

DeRA-MOS: Optimizing Text-to-Music Evaluation via Decoupled Listwise Ranking and Modality Alignment

Add code
Jun 08, 2026
Viaarxiv icon

Few-Shot and Pseudo-Label Guided Speech Quality Evaluation with Large Language Models

Add code
Apr 15, 2026
Viaarxiv icon

SAVe: Self-Supervised Audio-visual Deepfake Detection Exploiting Visual Artifacts and Audio-visual Misalignment

Add code
Mar 26, 2026
Viaarxiv icon

LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement

Add code
Mar 17, 2026
Viaarxiv icon

Robust Generative Audio Quality Assessment: Disentangling Quality from Spurious Correlations

Add code
Mar 17, 2026
Viaarxiv icon

MOS-Bias: From Hidden Gender Bias to Gender-Aware Speech Quality Assessment

Add code
Mar 11, 2026
Viaarxiv icon

Efficient Dialect-Aware Modeling and Conditioning for Low-Resource Taiwanese Hakka Speech Processing

Add code
Feb 26, 2026
Viaarxiv icon

TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition

Add code
Feb 25, 2026
Viaarxiv icon

Universal Robust Speech Adaptation for Cross-Domain Speech Recognition and Enhancement

Add code
Feb 04, 2026
Viaarxiv icon

Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings

Add code
Sep 03, 2025
Viaarxiv icon