Picture for Hsin-Min Wang

Hsin-Min Wang

Few-Shot and Pseudo-Label Guided Speech Quality Evaluation with Large Language Models

Add code
Apr 15, 2026
Viaarxiv icon

SAVe: Self-Supervised Audio-visual Deepfake Detection Exploiting Visual Artifacts and Audio-visual Misalignment

Add code
Mar 26, 2026
Viaarxiv icon

LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement

Add code
Mar 17, 2026
Viaarxiv icon

Robust Generative Audio Quality Assessment: Disentangling Quality from Spurious Correlations

Add code
Mar 17, 2026
Viaarxiv icon

MOS-Bias: From Hidden Gender Bias to Gender-Aware Speech Quality Assessment

Add code
Mar 11, 2026
Viaarxiv icon

Efficient Dialect-Aware Modeling and Conditioning for Low-Resource Taiwanese Hakka Speech Processing

Add code
Feb 26, 2026
Viaarxiv icon

TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition

Add code
Feb 25, 2026
Viaarxiv icon

Universal Robust Speech Adaptation for Cross-Domain Speech Recognition and Enhancement

Add code
Feb 04, 2026
Viaarxiv icon

Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings

Add code
Sep 03, 2025
Viaarxiv icon

Revealing the Role of Audio Channels in ASR Performance Degradation

Add code
Aug 12, 2025
Figure 1 for Revealing the Role of Audio Channels in ASR Performance Degradation
Figure 2 for Revealing the Role of Audio Channels in ASR Performance Degradation
Figure 3 for Revealing the Role of Audio Channels in ASR Performance Degradation
Figure 4 for Revealing the Role of Audio Channels in ASR Performance Degradation
Viaarxiv icon