Bing Yang

STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking

Oct 08, 2024

RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Jun 28, 2024

IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization

May 11, 2024

Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer

Dec 01, 2023

Aeroacoustic Source Localization

Jul 05, 2023

FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization

May 31, 2023

Attention Mechanisms in Medical Image Segmentation: A Survey

May 29, 2023

xTrimoABFold: De novo Antibody Structure Prediction without MSA

Dec 11, 2022

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Jun 27, 2022

AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation

Jun 01, 2022