Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Oct 21, 2024

Peiji Yang, Fengping Wang, Yicheng Zhong, Huawei Wei, Zhisheng Wang

Figure 1 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Figure 2 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Figure 3 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Figure 4 for Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Share this with someone who'll enjoy it:

Abstract:Neural speech codecs have demonstrated their ability to compress high-quality speech and audio by converting them into discrete token representations. Most existing methods utilize Residual Vector Quantization (RVQ) to encode speech into multiple layers of discrete codes with uniform time scales. However, this strategy overlooks the differences in information density across various speech features, leading to redundant encoding of sparse information, which limits the performance of these methods at low bitrate. This paper proposes MsCodec, a novel multi-scale neural speech codec that encodes speech into multiple layers of discrete codes, each corresponding to a different time scale. This encourages the model to decouple speech features according to their diverse information densities, consequently enhancing the performance of speech compression. Furthermore, we incorporate mutual information loss to augment the diversity among speech codes across different layers. Experimental results indicate that our proposed method significantly improves codec performance at low bitrate.

View paper on

Share this with someone who'll enjoy it:

Title:Optimizing Neural Speech Codec for Low-Bitrate Compression via Multi-Scale Encoding

Paper and Code