Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thor Højhus Avenstrup

SepMamba: State-space models for speaker separation using Mamba

Oct 28, 2024

Thor Højhus Avenstrup, Boldizsár Elek, István László Mádi, András Bence Schin, Morten Mørup, Bjørn Sand Jensen, Kenny Falkær Olsen

Figure 1 for SepMamba: State-space models for speaker separation using Mamba

Figure 2 for SepMamba: State-space models for speaker separation using Mamba

Figure 3 for SepMamba: State-space models for speaker separation using Mamba

Figure 4 for SepMamba: State-space models for speaker separation using Mamba

Abstract:Deep learning-based single-channel speaker separation has improved significantly in recent years largely due to the introduction of the transformer-based attention mechanism. However, these improvements come at the expense of intense computational demands, precluding their use in many practical applications. As a computationally efficient alternative with similar modeling capabilities, Mamba was recently introduced. We propose SepMamba, a U-Net-based architecture composed primarily of bidirectional Mamba layers. We find that our approach outperforms similarly-sized prominent models - including transformer-based models - on the WSJ0 2-speaker dataset while enjoying a significant reduction in computational cost, memory usage, and forward pass time. We additionally report strong results for causal variants of SepMamba. Our approach provides a computationally favorable alternative to transformer-based architectures for deep speech separation.

Via

Access Paper or Ask Questions

Self-Supervised Learning for Time Series: A Review & Critique of FITS

Oct 23, 2024

Andreas Løvendahl Eefsen, Nicholas Erup Larsen, Oliver Glozmann Bork Hansen, Thor Højhus Avenstrup

Figure 1 for Self-Supervised Learning for Time Series: A Review & Critique of FITS

Figure 2 for Self-Supervised Learning for Time Series: A Review & Critique of FITS

Figure 3 for Self-Supervised Learning for Time Series: A Review & Critique of FITS

Figure 4 for Self-Supervised Learning for Time Series: A Review & Critique of FITS

Abstract:Accurate time series forecasting is a highly valuable endeavour with applications across many industries. Despite recent deep learning advancements, increased model complexity, and larger model sizes, many state-of-the-art models often perform worse or on par with simpler models. One of those cases is a recently proposed model, FITS, claiming competitive performance with significantly reduced parameter counts. By training a one-layer neural network in the complex frequency domain, we are able to replicate these results. Our experiments on a wide range of real-world datasets further reveal that FITS especially excels at capturing periodic and seasonal patterns, but struggles with trending, non-periodic, or random-resembling behavior. With our two novel hybrid approaches, where we attempt to remedy the weaknesses of FITS by combining it with DLinear, we achieve the best results of any known open-source model on multivariate regression and promising results in multiple/linear regression on price datasets, on top of vastly improving upon what FITS achieves as a standalone model.

* arXiv:2307.03756v3 45 pages, 36 figures

Via

Access Paper or Ask Questions