O Open Source Frameworks medium

RWKV: Reinventing RNNs for the Transformer Era

by Community

Transformers have revolutionized almost all natural language processing (NLP) tasks but suffer from memory and computational complexity that scales quadratically with sequence le

Visit Community View repo Submit your build →

OSS

Added 1 June 2026

Overview

RWKV combines the efficient parallelizable training of transformers with the efficient inference of RNNs. It uses a linear-scaling architecture that avoids the quadratic complexity of traditional transformers, making it suitable for long sequences.

Best for

Best for
Developers building language models for long sequences or resource-constrained inference scenarios

Use cases

Training language models on long documents or sequences where memory is constrained
Running inference on edge devices or low-resource environments
Building efficient NLP models that require fast generation with limited computational budget

Notes

Use cases

Training language models on long documents or sequences where memory is constrained
Running inference on edge devices or low-resource environments
Building efficient NLP models that require fast generation with limited computational budget

Pros

Linear memory and computational scaling with sequence length
Efficient inference compared to Transformer-based models
Open-source community project with published research

Cons

May not match state-of-the-art transformer performance on all tasks
Relatively new architecture with smaller ecosystem and fewer pre-trained models
Implementation and optimization tools are less mature than for transformers

Indexed from awesome-llm and enriched against its public facts.

Pros

Linear memory and computational scaling with sequence length
Efficient inference compared to Transformer-based models
Open-source community project with published research

Cons

May not match state-of-the-art transformer performance on all tasks
Relatively new architecture with smaller ecosystem and fewer pre-trained models
Implementation and optimization tools are less mature than for transformers

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Uses1entry

O OSS Obs medium

PyTorch

Community

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,318 updated 23d ago

Pairs with1entry

O OSS Framework medium

llama.cpp

Community

LLM inference in C/C++

★ 114,160 updated 23d ago

← Back to Open Source Submit your own entry →