Shutong Niu

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Nov 23, 2024
Via arXiv

Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention

Oct 19, 2024
Via arXiv

Incorporating Spatial Cues in Modular Speaker Diarization for Multi-channel Multi-party Meetings

Sep 25, 2024
Via arXiv

The USTC-NERCSLIP Systems for the CHiME-8 NOTSOFAR-1 Challenge

Sep 03, 2024
Via arXiv

Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

Sep 17, 2023
Via arXiv

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge

Aug 28, 2023
Via arXiv

Semi-supervised multi-channel speaker diarization with cross-channel attention

Jul 17, 2023
Via arXiv

The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

Feb 10, 2022
Via arXiv

USTC-NELSLIP System Description for DIHARD-III Challenge

Mar 19, 2021
Via arXiv

A Two-Stage Approach to Device-Robust Acoustic Scene Classification

Nov 03, 2020
Via arXiv