Li-Wei Chen

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Sep 16, 2024, via arXiv

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

Sep 16, 2024, via arXiv

Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages

Sep 13, 2024, via arXiv

Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation

Sep 03, 2024, via arXiv

VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka

Sep 03, 2024, via arXiv

BackFlip: The Impact of Local and Global Data Augmentations on Artistic Image Aesthetic Assessment

Aug 26, 2024, via arXiv

How Temporal Unrolling Supports Neural Physics Simulators

Feb 20, 2024, via arXiv

The North System for Formosa Speech Recognition Challenge 2023

Oct 06, 2023, via arXiv

Turbulent Flow Simulation using Autoregressive Conditional Diffusion Models

Sep 04, 2023, via arXiv

Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings

May 23, 2023, via arXiv