Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Apr 10, 2024

Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou(+18 more)

Figure 1 for Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Figure 2 for Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Figure 3 for Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Figure 4 for Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Share this with someone who'll enjoy it:

Abstract:We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture. Our architectural design advancements include multi-headed matrix-valued states and a dynamic recurrence mechanism that improve expressivity while maintaining the inference efficiency characteristics of RNNs. We introduce a new multilingual corpus with 1.12 trillion tokens and a fast tokenizer based on greedy matching for enhanced multilinguality. We trained four Eagle models, ranging from 0.46 to 7.5 billion parameters, and two Finch models with 1.6 and 3.1 billion parameters and find that they achieve competitive performance across a wide variety of benchmarks. We release all our models on HuggingFace under the Apache 2.0 license. Models at: https://huggingface.co/RWKV Training code at: https://github.com/RWKV/RWKV-LM Inference code at: https://github.com/RWKV/ChatRWKV Time-parallel training code at: https://github.com/RWKV/RWKV-infctx-trainer

View paper on

Share this with someone who'll enjoy it:

Title:Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper and Code