Picture for Yiwen Wang

Yiwen Wang

SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation

Add code
Oct 21, 2024
Viaarxiv icon

Cross-attention Inspired Selective State Space Models for Target Sound Extraction

Add code
Sep 10, 2024
Figure 1 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Figure 2 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Figure 3 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Figure 4 for Cross-attention Inspired Selective State Space Models for Target Sound Extraction
Viaarxiv icon

DENSE: Dynamic Embedding Causal Target Speech Extraction

Add code
Sep 10, 2024
Viaarxiv icon

PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation

Add code
Sep 04, 2024
Figure 1 for PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation
Figure 2 for PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation
Figure 3 for PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation
Figure 4 for PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation
Viaarxiv icon

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Add code
Aug 13, 2024
Figure 1 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Figure 2 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Figure 3 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Figure 4 for OpenResearcher: Unleashing AI for Accelerated Scientific Research
Viaarxiv icon

RS-BNN: A Deep Learning Framework for the Optimal Beamforming Design of Rate-Splitting Multiple Access

Add code
Jul 09, 2024
Viaarxiv icon

TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information

Add code
Jun 13, 2024
Viaarxiv icon

Beware of Overestimated Decoding Performance Arising from Temporal Autocorrelations in Electroencephalogram Signals

Add code
May 27, 2024
Figure 1 for Beware of Overestimated Decoding Performance Arising from Temporal Autocorrelations in Electroencephalogram Signals
Figure 2 for Beware of Overestimated Decoding Performance Arising from Temporal Autocorrelations in Electroencephalogram Signals
Figure 3 for Beware of Overestimated Decoding Performance Arising from Temporal Autocorrelations in Electroencephalogram Signals
Figure 4 for Beware of Overestimated Decoding Performance Arising from Temporal Autocorrelations in Electroencephalogram Signals
Viaarxiv icon

Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior

Add code
Apr 25, 2024
Figure 1 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Figure 2 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Figure 3 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Figure 4 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Viaarxiv icon

Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings

Add code
May 09, 2023
Figure 1 for Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Figure 2 for Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Figure 3 for Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Figure 4 for Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Viaarxiv icon