Picture for Siyang Wang

Siyang Wang

Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation

Add code
Jul 19, 2024
Viaarxiv icon

Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model

Add code
May 16, 2024
Viaarxiv icon

On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis

Add code
Jul 11, 2023
Viaarxiv icon

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

Add code
Jun 15, 2023
Figure 1 for Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Figure 2 for Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Figure 3 for Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Viaarxiv icon

Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis

Add code
May 29, 2023
Viaarxiv icon

A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS

Add code
Mar 05, 2023
Viaarxiv icon

Integrated Speech and Gesture Synthesis

Add code
Aug 25, 2021
Figure 1 for Integrated Speech and Gesture Synthesis
Figure 2 for Integrated Speech and Gesture Synthesis
Figure 3 for Integrated Speech and Gesture Synthesis
Figure 4 for Integrated Speech and Gesture Synthesis
Viaarxiv icon

Unaligned Image-to-Sequence Transformation with Loop Consistency

Add code
Oct 09, 2019
Figure 1 for Unaligned Image-to-Sequence Transformation with Loop Consistency
Figure 2 for Unaligned Image-to-Sequence Transformation with Loop Consistency
Figure 3 for Unaligned Image-to-Sequence Transformation with Loop Consistency
Figure 4 for Unaligned Image-to-Sequence Transformation with Loop Consistency
Viaarxiv icon

Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers

Add code
Jun 06, 2019
Figure 1 for Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers
Figure 2 for Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers
Figure 3 for Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers
Figure 4 for Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers
Viaarxiv icon

Controllable Top-down Feature Transformer

Add code
Nov 04, 2018
Figure 1 for Controllable Top-down Feature Transformer
Figure 2 for Controllable Top-down Feature Transformer
Figure 3 for Controllable Top-down Feature Transformer
Figure 4 for Controllable Top-down Feature Transformer
Viaarxiv icon