TinyZero
by Community
Minimal reproduction of DeepSeek R1-Zero
OSS
TinyZero
Added 1 June 2026
Overview
TinyZero is a minimal Python implementation that reproduces the core mechanics of DeepSeek R1-Zero, a reasoning model that learns to think through problems without supervised reasoning data. It provides a stripped-down codebase for understanding and experimenting with zero-shot chain-of-thought reasoning in language models.
Best for
Best for
Researchers and builders who want to understand and experiment with zero-shot reasoning without black-box dependencies.
Use cases
- Study how reasoning models learn without labeled reasoning traces
- Prototype and test reasoning-based model architectures
- Reproduce DeepSeek R1-Zero results on smaller datasets or models
Notes
TinyZero is a minimal Python implementation that reproduces the core mechanics of DeepSeek R1-Zero, a reasoning model that learns to think through problems without supervised reasoning data. It provides a stripped-down codebase for understanding and experimenting with zero-shot chain-of-thought reasoning in language models.
13,125 stars on GitHub. Last updated 2026-02-27. Licensed Apache-2.0.
Use cases
- Study how reasoning models learn without labeled reasoning traces
- Prototype and test reasoning-based model architectures
- Reproduce DeepSeek R1-Zero results on smaller datasets or models
Pros
- Minimal codebase makes the reasoning mechanism transparent and hackable
- Community-maintained with 13k+ stars, indicating active interest and validation
- Direct path to understanding DeepSeek R1-Zero without proprietary complexity
Cons
- Minimal scope means you handle infrastructure, scaling, and production concerns yourself
- Limited documentation typical of research reproductions, requires reading source code
- No guarantee of feature parity with the original DeepSeek implementation
Indexed from awesome-llm and enriched against its public facts.
Pros
- Minimal codebase makes the reasoning mechanism transparent and hackable
- Community-maintained with 13k+ stars, indicating active interest and validation
- Direct path to understanding DeepSeek R1-Zero without proprietary complexity
Cons
- Minimal scope means you handle infrastructure, scaling, and production concerns yourself
- Limited documentation typical of research reproductions, requires reading source code
- No guarantee of feature parity with the original DeepSeek implementation
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.