Resurrecting Recurrent Neural Networks for Long Sequences
by Community
Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are hard to optimize and slow to train. Deep state-space models (SSMs) have recently been shown to per
OSS
Resurrecting Recurrent Neural Networks for Long Sequences
Added 1 June 2026
Overview
A research paper presenting a methodology for designing deep recurrent neural networks that recover the performance of state-space models on long sequence tasks. It uses standard signal propagation arguments to guide architecture choices, achieving fast inference while addressing training difficulties.
Best for
Best for
Researchers and engineers building efficient long-sequence models requiring fast inference
Use cases
- Designing RNN architectures that match state-space model performance
- Enabling fast inference for long sequence applications
- Applying signal propagation principles to optimize RNN depth
Notes
A research paper presenting a methodology for designing deep recurrent neural networks that recover the performance of state-space models on long sequence tasks. It uses standard signal propagation arguments to guide architecture choices, achieving fast inference while addressing training difficulties.
Use cases
- Designing RNN architectures that match state-space model performance
- Enabling fast inference for long sequence applications
- Applying signal propagation principles to optimize RNN depth
Pros
- Fast inference on long sequences
- Competitive accuracy with state-space models
- Leverages well-understood recurrent network structure
Cons
- Slower to train than parallelizable alternatives
- Requires careful optimization and hyperparameter tuning
- Limited to researchers due to theoretical depth
Indexed from awesome-llm and enriched against its public facts.
Pros
- Fast inference on long sequences
- Competitive accuracy with state-space models
- Leverages well-understood recurrent network structure
Cons
- Slower to train than parallelizable alternatives
- Requires careful optimization and hyperparameter tuning
- Limited to researchers due to theoretical depth
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.