ollama
by Community
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
OSS
ollama
Added 1 June 2026
Overview
Ollama is a Go-based framework for running large language models locally on your machine. It downloads and executes open-source models like Llama, Mistral, and others without requiring cloud infrastructure or API keys.
Best for
Best for
Developers building local-first applications or prototyping with open-source LLMs without cloud costs
Use cases
- Running LLMs offline for privacy-sensitive applications
- Local development and testing before deploying to production
- Building chatbots and agents that don't depend on external APIs
Notes
Ollama is a Go-based framework for running large language models locally on your machine. It downloads and executes open-source models like Llama, Mistral, and others without requiring cloud infrastructure or API keys.
172,846 stars on GitHub. Last updated 2026-06-01. Licensed MIT.
Use cases
- Running LLMs offline for privacy-sensitive applications
- Local development and testing before deploying to production
- Building chatbots and agents that don’t depend on external APIs
Pros
- No cloud dependency or API costs once models are downloaded
- Simple CLI interface with minimal setup overhead
- Supports a wide range of open-source models with one-command installation
Cons
- Requires significant local compute and storage for larger models
- Performance depends entirely on your hardware, not optimized cloud infrastructure
- Limited to open-source models, no access to proprietary models like GPT-4
Indexed from awesome-llm and enriched against its public facts.
Pros
- No cloud dependency or API costs once models are downloaded
- Simple CLI interface with minimal setup overhead
- Supports a wide range of open-source models with one-command installation
Cons
- Requires significant local compute and storage for larger models
- Performance depends entirely on your hardware, not optimized cloud infrastructure
- Limited to open-source models, no access to proprietary models like GPT-4
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
Cline
Cline
Open-source autonomous coding agent that lives inside VS Code. BYO model key, watch it work.
Continue
Continue.dev
Open-source AI code assistant for VS Code and JetBrains. Customisable, BYO model, built for enterprise.
dcostenco/prism-mcp
Various
The Mind Palace for AI Agents - HIPAA-hardened Cognitive Architecture with on-device LLM (prism-coder:7b), Hebbian learning, ACT-R spreading activation, adversarial evaluation, per
elvismdev/mem0-mcp-selfhosted
Various
Self-hosted mem0 MCP server for Claude Code. Run a complete memory server against self-hosted Qdrant + Neo4j + Ollama while using Claude as the main LLM.
escapeboy/agent-fleet-o
Various
Open-source AI agent orchestration platform — self-hosted mission control for autonomous multi-agent systems. Visual DAG workflows, 450+ MCP tools, human-in-the-loop approvals. Wor
jaspertvdm/mcp-server-ollama-bridge
Various
MCP Server - Bridge to local Ollama LLM. Part of HumoticaOS/SymbAIon ecosystem.
spranab/brainstorm-mcp
Various
MCP server for multi-round AI brainstorming debates between multiple models (GPT, DeepSeek, Groq, Ollama, etc.)
TKMD/ReftrixMCP
Various
MCP server with 39 tools for web design analysis — layout extraction, motion detection, quality scoring, accessibility audit, Core Web Vitals, design comparison, and semantic searc
VrtxOmega/Ollama-Omega
Various
Sovereign Ollama Bridge — MCP server for local and cloud Ollama models. Generated by Qwen 3.5 397B.
Anything LLM
Community
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
Continue
Community
⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI
create-t3-turbo-ai
Community
Build full-stack, type-safe, LLM-powered apps with the T3 Stack, Turborepo, OpenAI, and Langchain
deploy-llms-with-ansible
Community
Easily deploy LLMs with Ansible. Uses Docker with llama.cpp or ollama. Secured with whitelisted IPs.
distilabel
Community
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
GPT Researcher
Community
An autonomous agent that conducts deep research on any data using any LLM providers
Knowledge GPT
Community
Accurate answers and instant citations for your documents.
LLocalSearch
Community
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user ca
Local GPT
Community
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
OpenDAN
Community
OpenDAN is an open source Personal AI OS , which consolidates various AI modules in one place for your personal use.
Phidata
Community
Build, run, and manage agent platforms.
Private GPT
Community
Interact with your documents using the power of GPT, 100% privately, no data leaks
Qwen2-Math-1.5B|7B|72B
Community
GITHUB HUGGING FACE MODELSCOPE DISCORD 🚨 This model mainly supports English. We will release bilingual (English and Chinese) math models soon. Introduction Over the past year, w
Shell-Pilot
Community
A simple, lightweight shell script to interact with OpenAI or Ollama or Mistral AI or LocalAI or ZhipuAI from the terminal, and enhancing intelligent system management without any
SwarmClaw
Community
Open-source self-hosted AI agent runtime and multi-agent framework for autonomous agent swarms. Agent memory, MCP tools, schedules, delegation, and 23+ LLM providers (Claude, GPT,
ai-i18n
Various
ai-i18n is a GitHub Action that translates your app's i18n files using LLMs. It extracts strings, translates only what's changed, and commits the results back to your repo. Works w
Claude Code
Various
Anthropic's agentic coding tool for developers. Claude Code understands your codebase, edits files, runs commands, and helps you ship faster.
LibreChat
Various
LibreChat brings together all your AI conversations in one unified, customizable interface.
LLM
Various
LLM: A CLI utility and Python library for interacting with Large Language Models
Local Deep Research
Various
~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - arXiv, PubMed, your private documents. Every
Open Interpreter
Various
A natural language interface for computers
Open WebUI
Various
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
PR-Agent
Various
🚀 PR Agent: The Original Open-Source PR Reviewer. This project It is not the Qodo free tier.
BondAI Homepage/Documentation
Community
BondAI
Jwrede/llmprobe
Various
Synthetic monitoring and CI smoke tests for LLM inference endpoints.
ShipItAndPray/mcp-turboquant
Various
MCP server for LLM quantization. Compress any model to GGUF/GPTQ/AWQ in one tool call. First MCP server for model compression.
AI Getting Started
Community
A Javascript AI getting started stack for weekend projects, including image/text models, vector stores, auth, and deployment configs
AI
Community
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
awesome-japanese-llm
Community
日本語LLMまとめ - Overview of Japanese LLMs
awesome-llm-webapps
Community
A collection of open source, actively maintained web apps for LLM applications
Bifrost
Community
Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.
Casibase
Community
⚡️next-generation personal AI assistant powered by LLM, RAG and agent loops, supporting computer-use, browser-use and coding agent, demo: https://demo.openagentai.org
Chainlit
Community
A Python library for making chatbot interfaces.
Cheshire Cat
Community
AI agent microservice
CodeGeeX
Community
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Codestral-7|22B
Community
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
CrewAI
CrewAI
Role-based multi-agent framework. Define crews of agents with roles, goals, and tasks, run them as a team.
DeepSeek-Math-7B
Community
DeepSeek Math series
DeepSeek-R1
Community
First-generation reasoning models from DeepSeek.
DeepSeek-v2-236B-MoE
Community
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of whic
DeepSeek-V2.5
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
DeepSeek-VL-1.3|7B
Community
DeepSeek-VL model series
Dify
Community
Production-ready platform for agentic workflow development.
Embedchain
Community
Universal memory layer for AI Agents
Flock
Community
A multi agent desktop application built with Rust and Tauri.
Flowise
Community
Build AI Agents, Visually
Gemma
Community
Checking your browser - reCAPTCHA
GLM-2|6|10|13|70B
Community
Org profile for THUDM on Hugging Face, the AI community building the future.
Grok-1-314B-MoE
Community
Grok-1-314B-MoE — indexed from awesome-llm
Haystack
Community
Create agentic, context engineered AI systems using Haystack’s modular and customizable building blocks, built for real-world, production-ready applications.
InternLM2-1.8|7|20B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Lagent
Community
A lightweight framework for building LLM-based agents
Langchain-Chatchat
Community
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like
Langflow
Community
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Llama 1-7|13|33|65B
Community
[OPT-1.3 6.7 13 30 66B](https://arxiv.org/abs/2205.01068)
Llama 2: Open Foundation and Fine-Tuned Chat Models
Community
2023-07
Llama 3.2-1|3|11|90B
Community
[Llama 3.1-8 70 405B](https://llama.meta.com/)
llama.cpp
Community
LLM inference in C/C++
LLaMA Cult and More
Community
Large Language Models for All, 🦙 Cult and More, Stay in touch !
LLaMA: Open and Efficient Foundation Language Models
Community
2023-02
llm-ui
Community
The React library for LLMs
MemGPT
Community
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
MiniCPM-2B
Community
The MiniCPM family of LLMs and VLLMs.
Mistral 7B
Community
Mistral 7B
Mixtral-8x7B
Community
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
Moonlight-A3B
Community
Moonshot's Compute-efficient MoE LLM, first Scaling Up of Muon Optimizer
OLMoE: Open Mixture-of-Experts Language Models
Community
We introduce OLMoE, a fully open, state-of-the-art language model leveraging sparse Mixture-of-Experts (MoE). OLMoE-1B-7B has 7 billion (B) parameters but uses only 1B per input
OpenELM-1.1|3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Phi1-1.3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Qwen-1.8B|7B|14B|72B
Community
Qwen - a Qwen Collection
Qwen2-0.5B|1.5B|7B|57B-A14B-MoE|72B
Community
GITHUB HUGGING FACE MODELSCOPE DEMO DISCORD Introduction After months of efforts, we are pleased to announce the evolution from Qwen1.5 to Qwen2. This time, we bring to you: Pret
Qwen2.5-1M-7|14B
Community
Tech Report HuggingFace ModelScope Qwen Chat HuggingFace Demo ModelScope Demo DISCORD Introduction Two months after upgrading Qwen2.5-Turbo to support context length up to one mi
Qwen2.5 Technical Report
Community
In this report, we introduce Qwen2.5, a comprehensive series of large language models (LLMs) designed to meet diverse needs. Compared to previous iterations, Qwen 2.5 has been si
Qwen2.5-Max
Community
QWEN CHAT API DEMO DISCORD It is widely recognized that continuously scaling both data size and model size can lead to significant improvements in model intelligence. However, th
ray-llm
Community
RayLLM - LLMs on Ray (Archived). Read README for more info.
Rigging
Community
Lightweight LLM Interaction Framework
Rivet
Community
The open-source visual AI programming environment and TypeScript library
Semantic Kernel
Microsoft
Microsoft's enterprise-flavoured framework for AI agents. .NET-first, with Python and Java siblings.
Shell-Pilot
Community
A simple, lightweight shell script to interact with OpenAI or Ollama or Mistral AI or LocalAI or ZhipuAI from the terminal, and enhancing intelligent system management without any
SimpleAIChat
Community
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
StableLM-3B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
StableLM-v2-12B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
talkd.ai dialog
Community
RAG LLM Ops App for easy deployment and testing
TermGPT
Community
Giving LLMs like GPT-4 the ability to plan and execute terminal commands
The Llama 3 Herd of Models
Community
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models
Vercel AI SDK
Vercel
The de facto TypeScript SDK for AI apps. Streaming, tools, multi-model, and now an agent loop.
Yi-34B
Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Chatbot UI
Various
Chatbot UI
DeepSeek
Various
Org profile for DeepSeek on Hugging Face, the AI community building the future.
Forefront
Various
Forefront is a platform to fine-tune and inference open-source-language-models.
LangChain
Various
LangChain provides the engineering platform and open source frameworks developers use to build, test, and deploy reliable AI agents.
RunThisLLM
Various
Find out exactly what hardware you need to run any local LLM, image, video, or audio AI model. 275+ models with full build specs and performance estimates.
OpenLLM
Community
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Rapid-MLX
Community
The fastest local AI engine for Apple Silicon. 4.2x faster than Ollama, 0.08s cached TTFT, 100% tool calling. 17 tool parsers, prompt cache, reasoning separation, cloud routing. Dr