Picture for Yu Tsao

Yu Tsao

Graduate Program of Data Science, National Taiwan University and Academia Sinica, Taipei, Taiwan, Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan

How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception

Add code
Nov 14, 2024
Viaarxiv icon

Understanding Audiovisual Deepfake Detection: Techniques, Challenges, Human Factors and Perceptual Insights

Add code
Nov 12, 2024
Viaarxiv icon

RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier

Add code
Oct 29, 2024
Viaarxiv icon

TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement

Add code
Oct 04, 2024
Figure 1 for TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
Figure 2 for TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
Figure 3 for TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
Figure 4 for TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement
Viaarxiv icon

MECG-E: Mamba-based ECG Enhancer for Baseline Wander Removal

Add code
Sep 27, 2024
Viaarxiv icon

MC-SEMamba: A Simple Multi-channel Extension of SEMamba

Add code
Sep 26, 2024
Viaarxiv icon

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

Add code
Sep 17, 2024
Figure 1 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 2 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 3 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Figure 4 for Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Viaarxiv icon

Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement

Add code
Sep 16, 2024
Viaarxiv icon

A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models

Add code
Sep 16, 2024
Viaarxiv icon

DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset

Add code
Sep 13, 2024
Figure 1 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Figure 2 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Figure 3 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Figure 4 for DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
Viaarxiv icon