
Jiangyan Yi

WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification

Sep 18, 2024
Via arXiv

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Aug 11, 2024
Via arXiv

ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild

Aug 09, 2024
Via arXiv

Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism

Jul 31, 2024
Via arXiv

An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio

Jul 11, 2024
Via arXiv

Frequency-mix Knowledge Distillation for Fake Speech Detection

Jun 14, 2024
Via arXiv

RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

Jun 10, 2024
Via arXiv

TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Jun 07, 2024
Via arXiv

EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark

May 15, 2024
Via arXiv

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition

Apr 29, 2024
Via arXiv