PyTorch
by Community
Tensors and Dynamic neural networks in Python with strong GPU acceleration
OSS
Added 1 June 2026
Overview
PyTorch is a Python library for building neural networks using tensors and dynamic computation graphs. It provides strong GPU acceleration through CUDA integration, allowing researchers and engineers to define and train models with flexible, define-by-run semantics.
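A minimal sketch of what define-by-run means in practice, assuming PyTorch is installed. The `forward` function, the tensor shapes, and the branching rule are hypothetical choices made for illustration and do not come from this entry. The point is that ordinary Python control flow decides which operations run, and autograd records the graph as they execute.

```python
import torch

# Use a GPU when one is available; otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")


def forward(x: torch.Tensor, w: torch.Tensor) -> torch.Tensor:
    """Hypothetical toy model whose depth depends on the input values."""
    h = x @ w
    # Plain Python branching picks the number of layers at run time;
    # the computation graph is built from whatever actually executes.
    steps = 2 if h.sum().item() > 0 else 1
    for _ in range(steps):
        h = torch.tanh(h @ w)
    return h.sum()


torch.manual_seed(0)
x = torch.randn(4, 3, device=device)
w = torch.randn(3, 3, device=device, requires_grad=True)

loss = forward(x, w)
loss.backward()  # Backpropagate through the path that was taken.
grad_shape = tuple(w.grad.shape)
```

Because the graph is rebuilt on every call, a debugger breakpoint or a `print` inside `forward` sees real tensor values, which is what the debugging advantage listed under Pros refers to.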
Best for
ML researchers and engineers building custom neural network architectures with GPU training needs
Use cases
- Training deep learning models with GPU acceleration
- Research prototyping with dynamic neural network architectures
- Building production inference pipelines for computer vision and NLP
Notes
100,318 stars on GitHub. Last updated 2026-06-01.
Pros
- Dynamic computation graphs enable intuitive debugging and flexible model design
- Mature ecosystem with extensive pre-trained models and community libraries
- Strong GPU support with optimized kernels for training at scale
Cons
- Steeper learning curve than some alternatives for beginners
- Production deployment requires additional tooling beyond the core library
- Memory usage can be high for large models without careful optimization
Indexed from awesome-llmops and enriched with the project's public facts.
Pairs with
Other entries in the index that connect to this one.
PyTorch Lightning
Community
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Jax
Community
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
TensorFlow
Community
An Open Source Machine Learning Framework for Everyone
Caffe
Community
Caffe: a fast open framework for deep learning.
Apache MXNet
Community
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
mindsdb/mindsdb
Various
Platform dedicated to building an open foundation for applied Artificial Intelligence, designed for people seeking production-ready AI systems they can truly control, extend and de
Alpaca-LoRA-Serve
Community
LLM as a Chatbot Service
Archai
Community
Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.
AudioGPT
Community
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Axolotl
Community
Go ahead and axolotl questions
BentoML
Community
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Community
2018-10
bitsandbytes
Community
Accessible large language models via k-bit quantization for PyTorch.
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Community
BigScience
Build a Large Language Model (From Scratch)
Community
How to implement LLM attention mechanisms and GPT-style transformers.
CodeGeeX
Community
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Comet
Community
Examples of Machine Learning code using Comet.ml
CTranslate2
Community
Fast inference engine for Transformer models
DeepSeek-VL-1.3|7B
Community
DeepSeek-VL model series
DeepSpeed
Community
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Determined
Community
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch
EasyEdit
Community
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
FATE
Community
An Industrial Grade Federated Learning Framework
FasterTransformer
Community
Transformer related optimization, including BERT, GPT
Fiddler AI
Community
Fiddler Auditor is a tool to evaluate language models.
finetuning-scheduler
Community
A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.
FLAML
Community
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
FlexGen
Community
Running large language models on a single GPU for throughput-oriented scenarios.
Flyflow
Community
Open source, high performance fine tuning as a service for GPT4 quality models with 5x lower latency and 3x lower cost
HELM
Community
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducib
Holistic Evaluation of Language Models
Community
Stanford
HPOlib2
Community
Collection of hyperparameter optimization benchmark problems
InternLM-XComposer2-1.8|7B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Kubeflow
Community
Machine Learning Toolkit for Kubernetes
Litgpt
Community
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Llama 3-8|70B
Community
[Llama 2-7 13 70B](https://llama.meta.com/llama2/)
lm-evaluation-harness
Community
A framework for few-shot evaluation of language models.
LMDeploy
Community
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Lora
Community
Using Low-rank adaptation to quickly fine-tune diffusion models.
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Community
CMU&Princeton
Megatron-DeepSpeed
Community
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Meta Lingua
Community
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
MiniCPM-2B
Community
The MiniCPM family of LLMs and VLLMs.
MLflow
Community
MLflow - Open Source AI Platform for Agents, LLMs & Models
Model Search
Community

Moonlight-A3B
Community
Moonshot's Compute-efficient MoE LLM, first Scaling Up of Muon Optimizer
MPT-7B
Community
Introducing MPT-7B, the first entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available fo
Multitask Prompted Training Enables Zero-Shot Task Generalization
Community
Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is
NASGym
Community
A simple OpenAI Gym environment for Neural Architecture Search (NAS)
NNI
Community
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
octoml-profile
Community
Home for OctoML PyTorch Profiler
OLMoE: Open Mixture-of-Experts Language Models
Community
We introduce OLMoE, a fully open, state-of-the-art language model leveraging sparse Mixture-of-Experts (MoE). OLMoE-1B-7B has 7 billion (B) parameters but uses only 1B per input
OneComp
Community
Python package for LLM compression
OPT: Open Pre-trained Transformer Language Models
Community
2022-05
Phi1-1.3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
PocketFlow
Community
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
PyTorch Lightning
Community
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Qwen2.5-1M-7|14B
Community
Two months after upgrading Qwen2.5-Turbo to support context length up to one mi
Resurrecting Recurrent Neural Networks for Long Sequences
Community
Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are hard to optimize and slow to train. Deep state-space models (SSMs) have recently been shown to per
ROLL
Community
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
RWKV: Reinventing RNNs for the Transformer Era
Community
Transformers have revolutionized almost all natural language processing (NLP) tasks but suffer from memory and computational complexity that scales quadratically with sequence le
RWKV-v4|5|6
Community
Org profile for RWKV on Hugging Face, the AI community building the future.
SGLang
Community
SGLang is a high-performance serving framework for large language models and multimodal models.
Text-Embeddings-Inference
Community
A blazing fast inference solution for text embeddings models
text-generation-inference
Community
Large Language Model Text Generation Inference
TinyZero
Community
Minimal reproduction of DeepSeek R1-Zero
Torchserve
Community
Serve, optimize and scale PyTorch models in production
Triton Server (TRTIS)
Community
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Tune Studio
Community
Playground for devs to finetune & deploy LLMs
unslothai
Community
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Vegas
Community
AutoML tools chain
veRL
Community
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
Voyager
Community
An Open-Ended Embodied Agent with Large Language Models
WebGPT: Browser-assisted question-answering with human feedback
Community
2021-12
whisper
Community
Robust Speech Recognition via Large-Scale Weak Supervision
x-stable-diffusion
Community
Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord community: https://discord.com/invite/TgHXuSJEk6
ZenML
Community
ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.
Yi-34B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
Community
Microsoft
Zetane Viewer
Community
ML models and internal tensors 3D visualizer
Artbreeder
Various
A collaborative tool for creating images with AI.
ComfyUI
Various
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Flux
Various
Official inference repo for FLUX.1 models
Forefront
Various
Forefront is a platform to fine-tune and inference open-source-language-models.
Hugging Face Diffusion Models Course
Various
Materials for the Hugging Face Diffusion Models Course
KREA
Various
Generate, edit, and enhance images, videos, and 3D assets with Krea
Mistral
Various
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
Unsloth
Various
Unsloth is an open-source, no-code web UI for training, running and exporting open models in one unified local interface.
Whisper
Various
Robust speech recognition via large-scale weak supervision. [#opensource](https://github.com/openai/whisper)
HumanSignal/label-studio-mcp-server
Various
Create, manage, and automate Label Studio projects, tasks, and predictions for data labeling workflows.
mindsdb/mindsdb
Various
Platform dedicated to building an open foundation for applied Artificial Intelligence, designed for people seeking production-ready AI systems they can truly control, extend and de
shuji-bonji/xcomet-mcp-server
Various
Translation quality evaluation MCP Server powered by xCOMET (eXplainable COMET).
topoteretes/cognee
Various
Memory platform for AI Agents in 6 lines of code
vectorize-io/hindsight
Various
Hindsight: Agent Memory That Learns
Accelerate
Community
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP a
Alexander Rush Series
Community
Projects
Auto-PyTorch
Community
Automatic architecture search and hyperparameter optimization for PyTorch
AutoGL
Community
An autoML framework & toolkit for machine learning on graphs.
Axolotl
Community
Go ahead and axolotl questions
bark
Community
🔊 Text-Prompted Generative Audio Model
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Community
2018-10
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Community
BigScience
BLOOMZ&mT0
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
BMTrain
Community
Efficient Training (including pre-training and fine-tuning) for Big Models
ChatGLM2-6B
Community
ChatGLM2-6B: An Open Bilingual Chat LLM
Clip-as-a-service
Community
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
CodeGeeX
Community
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeQwen1.5-7B
Community
The advent of advanced programming tools, which harness the power of large language models (LLMs), has significantly en
Codestral-7|22B
Community
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
CodeT5
Community
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
Colossal-AI
Community
Making large AI models cheaper, faster and more accessible
DeepSeek-Math-7B
Community
DeepSeek Math series
DeepSeek-R1
Community
First-generation reasoning models from DeepSeek.
DeepSeek-v2-236B-MoE
Community
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of whic
DeepSeek-V2.5
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
DeepSeek-VL-1.3|7B
Community
DeepSeek-VL model series
DeepSpeed
Community
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
disco-diffusion
Community
A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations.
distilabel
Community
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
DiVLA
Community
A continuous diffusion-based Vision-Language-Action model that integrates diffusion policies into autoregressive VLMs for robust and precise continuous robotic control.
dolly
Community
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
EasyEdit
Community
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
EasyFL
Community
An easy-to-use federated learning platform
exllama
Community
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Falcon 40B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
FastChat
Community
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FastEdit
Community
🩹Editing large language models within 10 seconds⚡
FastDatasets
Community
A powerful tool for creating high-quality training datasets for Large Language Models (LLMs) (a tool for quickly generating high-quality LLM fine-tuning datasets)
FedML
Community
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables runn
FlagAI
Community
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
Flyflow
Community
Open source, high performance fine tuning as a service for GPT4 quality models with 5x lower latency and 3x lower cost
Galactica: A Large Language Model for Science
Community
Galactica
GLM-130B: An Open Bilingual Pre-trained Model
Community
GLM-130B
GLM-6B (ChatGLM)
Community
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Gorilla
Community
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
GPT-NeoX
Community
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Grok-1-314B-MoE
Community
Grok-1-314B-MoE — indexed from awesome-llm
Hands-On Large Language Models: Language Understanding and Generation
Community
Hands-On Large Language Models
Hypernets
Community
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Improving language models by retrieving from trillions of tokens
Community
Publications — Google DeepMind
Infinity
Community
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
InternLM-XComposer2-1.8|7B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
InternLM2-1.8|7|20B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Language Is Not All You Need: Aligning Perception with Language Models
Community
A big convergence of language, multimodal perception, action, and world modeling is a key step toward artificial general intelligence. In this work, we introduce Kosmos-1, a Mult
learn2learn
Community
A PyTorch Library for Meta-learning Research
Liger-Kernel
Community
Efficient Triton Kernels for LLM Training
LeRobot
Community
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Litgpt
Community
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Llama 1-7|13|33|65B
Community
[OPT-1.3 6.7 13 30 66B](https://arxiv.org/abs/2205.01068)
Llama 2: Open Foundation and Fine-Tuned Chat Models
Community
2023-07
Llama 3.2-1|3|11|90B
Community
[Llama 3.1-8 70 405B](https://llama.meta.com/)
Llama 3-8|70B
Community
[Llama 2-7 13 70B](https://llama.meta.com/llama2/)
LLaMA: Open and Efficient Foundation Language Models
Community
2023-02
LMDeploy
Community
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Ludwig
Community
Low-code framework for building custom LLMs, neural networks, and other AI models
Lora
Community
Using Low-rank adaptation to quickly fine-tune diffusion models.
Luotuo
Community
骆驼(Luotuo): Open Sourced Chinese Language Models. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Community
CMU&Princeton
Megatron-LM
Community
Ongoing research training transformer models at scale
Meta Lingua
Community
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
MInference
Community
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to
Mixtral-8x7B
Community
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
MM ReAct
Community
Official repo for MM-REACT
Moonlight-A3B
Community
Moonshot's Compute-efficient MoE LLM, first Scaling Up of Muon Optimizer
MPT-7B
Community
Introducing MPT-7B, the first entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available fo
Multitask Prompted Training Enables Zero-Shot Task Generalization
Community
Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is
nanotron
Community
Minimalistic large language model 3D-parallelism training
NeMo Framework
Community
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech
NNI
Community
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Octo
Community
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
OLMo-7B
Community
Artifacts for the first set of OLMo models.
OLMo: Accelerating the Science of Language Models
Community
Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have be
OLMoE: Open Mixture-of-Experts Language Models
Community
We introduce OLMoE, a fully open, state-of-the-art language model leveraging sparse Mixture-of-Experts (MoE). OLMoE-1B-7B has 7 billion (B) parameters but uses only 1B per input
OLMO-eval
Community
Evaluation suite for LLMs
OpenELM-1.1|3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
OpenRLHF
Community
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
OpenPI
Community
Open-source VLA models from Physical Intelligence, including π₀ and π₀.5 — flow-based vision-language-action models pretrained on large-scale robot data with fine-tuning support.
OpenVLA
Community
OpenVLA: An open-source vision-language-action model for robotic manipulation.
OPT: Open Pre-trained Transformer Language Models
Community
2022-05
p-tuning-v2
Community
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
peft
Community
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Phi1-1.3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
PocketFlow
Community
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
Pythia-1|1.4|2.8|6.9|12B
Community
The hub for EleutherAI's work on interpretability and learning dynamics
PyTorch Lightning
Community
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Qwen-VL-7B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Qwen-1.8B|7B|14B|72B
Community
Qwen - a Qwen Collection
Qwen2-0.5B|1.5B|7B|57B-A14B-MoE|72B
Community
After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2. This time, we bring to you: Pret
Qwen2.5-1M-7|14B
Community
Two months after upgrading Qwen2.5-Turbo to support context length up to one mi
Qwen2-Audio-7B
Community
To achieve the objective of building an AGI system, the model should be capable of understanding information from different moda
Qwen2-Math-1.5B|7B|72B
Community
This model mainly supports English. We will release bilingual (English and Chinese) math models soon. Over the past year, w
Ragas
Community
Supercharge Your LLM Application Evaluations 🚀
Resurrecting Recurrent Neural Networks for Long Sequences
Community
Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are hard to optimize and slow to train. Deep state-space models (SSMs) have recently been shown to per
RoboMamba
Community
An efficient VLA model leveraging State Space Models (Mamba) instead of standard self-attention, offering linear inference complexity for efficient, recurrent robotic reasoning.
Scaling Instruction-Finetuned Language Models
Community
Flan-T5/PaLM
segment-anything (SAM)
Community
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to
SGLang
Community
SGLang is a high-performance serving framework for large language models and multimodal models.
SmolVLA
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
stable-diffusion
Community
A latent text-to-image diffusion model
StableLM-3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
StableLM-v2-12B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
StarCoder-1|3|7B
Community
All models, datasets, and demos related to StarCoder!
TensorRT-LLM
Community
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NV
text-generation-inference
Community
Large Language Model Text Generation Inference
TGI
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
The Llama 3 Herd of Models
Community
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models
Torchmeta
Community
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
torchtune
Community
PyTorch native post-training library
torchtitan
Community
A PyTorch native platform for training generative AI models
TRL
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Tune Studio
Community
Playground for devs to finetune & deploy LLMs
UQLM
Community
UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection
VisCPM-10B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
veRL
Community
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Visual Instruction Tuning
Community
Instruction tuning large language models (LLMs) using machine-generated instruction-following data has improved zero-shot capabilities on new tasks, but the idea is less explored
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
Voyager
Community
An Open-Ended Embodied Agent with Large Language Models
WebGPT: Browser-assisted question-answering with human feedback
Community
2021-12
whisper
Community
Robust Speech Recognition via Large-Scale Weak Supervision
xTuring
Community
Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: htt
Yi-34B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Yi-VL-6B|34B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Community
Stanford
DreamBench++
Community
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Community
Megatron-LM
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization
Community
2022-12
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Community
How do large language models (LLMs) develop and evolve over the course of training? How do these patterns change as models scale? To answer these questions, we introduce
Bloom
Various
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Build a DeepSeek Model (From Scratch)
Various
Learn how to build the features that set DeepSeek apart from other top LLMs! When DeepSeek started making waves in January 2025, it sounded too good to be true. How could a gener
ChatGPT
OpenAI
General-purpose AI assistant for writing, coding, analysis, and conversation. The most widely deployed consumer AI product.
Colossyan
Various
Colossyan Creator makes video creation simple and stress-free. Discover our AI video generator with real actors. Create AI videos in less than 5 minutes.
ComfyUI
Various
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
DALL·E 2
Various
Announcement of the release of DALL·E 2, an advanced image generation system with improved resolution, expanded image creation capabilities, and various safety mitigations. OpenAI
DeepSeek
Various
Org profile for DeepSeek on Hugging Face, the AI community building the future.
DragGAN
Various
Official Code for DragGAN (SIGGRAPH 2023)
Flux
Various
Official inference repo for FLUX.1 models
Forefront
Various
Forefront is a platform to fine-tune and inference open-source-language-models.
Harmonai
Various
Harmonai.org
KREA
Various
Generate, edit, and enhance images, videos, and 3D assets with Krea
LLaMA
Various
Llama LLM, a foundational, 65-billion-parameter large language model by Meta. Meta, February 23rd, 2023. #opensource
Ludwig
Various
Low-code framework for building custom LLMs, neural networks, and other AI models
Make-A-Scene
Various
Make-A-Scene by Meta is a multimodal generative AI method that puts creative control in the hands of people who use it by allowing them to describe and illustrate their vision through b
Meta AI
Various
Meta AI assistant to get things done, create AI-generated images, get answers. Built on Llama LLM.
MiniMax
Various
Building AGI with our mission Intelligence with Everyone. Global leader in multi-modal models and AI-native products with over 200 million users.
Nightcafe
Various
NightCafe Creator is an AI Art Generator app with multiple methods of AI art generation.
OPT
Various
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Resemble AI
Various
Resemble AI helps enterprises generate secure voice AI, verify proper usage, and detect deepfakes instantly. Available on-prem or via cloud. Built for enterprise scale with gover
Stable Diffusion Public Release
Various
We are delighted to announce the public release of Stable Diffusion and the launch of DreamStudio Lite.
Stable Diffusion
Various
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
SymbolicAI
Various
A neurosymbolic perspective on LLMs
TorToiSe
Various
A multi-voice TTS system trained with an emphasis on quality
Unsloth
Various
Unsloth is an open-source, no-code web UI for training, running and exporting open models in one unified local interface.
Vicuna-13B
Various
We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge s
Whisper
Various
Robust speech recognition via large-scale weak supervision. [#opensource](https://github.com/openai/whisper)
optuna/optuna-mcp
Various
The Optuna MCP Server is a Model Context Protocol (MCP) server to interact with Optuna APIs.
Aim
Community
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
Airflow
Community
Platform created by the community to programmatically author, schedule and monitor workflows.
Attention Is All You Need
Community
Transformers
Awesome Federated Learning Systems
Community
Federated Learning Systems Paper List
Awesome Tensor Compilers
Community
A list of awesome compiler projects and papers for tensor computation and deep learning.
BudgetML
Community
Deploy a ML inference service on a budget in less than 10 lines of code.
ClearML
Community
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
conda
Community
A system-level, binary package and environment manager running on all major operating systems and platforms.
Deepchecks
Community
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test
Delta-Lake
Community
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
distilabel
Community
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Feast
Community
The Open Source Feature Store for AI/ML
Featureform
Community
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Flower
Community
Flower: A Friendly Federated AI Framework
Forward
Community
A library for high performance deep learning inference on NVIDIA GPUs.
Goptuna
Community
A hyperparameter optimization framework, inspired by Optuna.
Guild AI
Community
Experiment tracking, ML developer tools
Harmonia
Community
Federated Learning Made Easy
Hopsworks
Community
Hopsworks - Data-Intensive AI platform with a Feature Store
Horovod
Community
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
HpBandSter
Community
A distributed Hyperband implementation on steroids
Hyperband
Community
Tuning hyperparams fast with Hyperband
Hyperopt
Community
Distributed Asynchronous Hyperparameter Optimization in Python
hyperunity
Community
A toolset for black-box hyperparameter optimisation.
Jupyter Notebooks
Community
Jupyter Interactive Notebook
Kubeflow Pipelines
Community
Machine Learning Pipelines for Kubeflow
LabNotebook
Community
LabNotebook is a tool that allows you to flexibly monitor, record, save, and query all your machine learning experiments.
Manifold
Community
A model-agnostic visual debugging tool for machine learning
MLRun
Community
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environ
Model Search
Community

ModelDB
Community
Open Source ML Model Versioning, Metadata, and Experiment Management
ModelFox
Community
ModelFox makes it easy to train, deploy, and monitor machine learning models.
MOE
Community
A global, black box optimization engine for real world metric optimization.
Mosec
Community
A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
netron
Community
Visualizer for neural network, deep learning and machine learning models
ONNX-MLIR
Community
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
ormb
Community
Docker for Your ML/DL Models Based on OCI Artifacts
PAI
Community
Resource scheduling and cluster management for AI
Polyaxon
Community
Open Source AI Infra & Engineering Control Plane
Sacred
Community
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
scalene
Community
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
scikit-learn
Community
scikit-learn: machine learning in Python
Spearmint
Community
Spearmint Bayesian optimization codebase
tokenizers
Community
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
TVM
Community
Open Machine Learning Compiler Framework
Upgini
Community
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, in
Weights & Biases
Community
W&B Weave helps developers evaluate, monitor, and iterate continuously to deliver generative AI applications with confidence.
whylogs
Community
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-pre
Molecular design
Various
List of Molecular and Material design using Generative AI and Deep Learning
Apache MXNet
Community
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Caffe
Community
Caffe: a fast open framework for deep learning.
Candle
Community
Minimalist ML framework for Rust
Jax
Community
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
MegEngine
Community
MegEngine is a fast, scalable, easy-to-use deep learning framework with support for automatic differentiation
MindSpore
Community
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
NCNN
Community
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Oneflow
Community
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
PaddlePaddle
Community
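PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (the 飞桨 (PaddlePaddle) core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)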
TensorFlow
Community
An Open Source Machine Learning Framework for Everyone
TNN
Community
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is distinguished by several outstanding featur
VectorFlow
Community
A minimalist neural network library optimized for sparse data and single machine environments.