Jiangyan Yi

Reject Threshold Adaptation for Open-Set Model Attribution of Deepfake Audio

Dec 02, 2024

From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language

Nov 20, 2024

Unification of Balti and trans-border sister dialects in the essence of LLMs and AI Technology

Nov 20, 2024

WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification

Sep 18, 2024

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Aug 11, 2024

ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild

Aug 09, 2024

Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism

Jul 31, 2024

An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio

Jul 11, 2024

Frequency-mix Knowledge Distillation for Fake Speech Detection

Jun 14, 2024

RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

Jun 10, 2024