Wllama
by Community
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
OSS
Wllama
Added 1 June 2026
Overview
Wllama is a TypeScript library that provides WebAssembly bindings for llama.cpp, enabling large language model inference directly in the browser. It loads quantized GGUF models and runs them client-side without server dependencies.
Best for
Best for
Developers building privacy-focused or offline web apps that need on-device LLM inference
Use cases
- Running private, offline LLM inference in web applications
- Prototyping browser-based chatbots or text assistants
- Evaluating small to medium models without cloud costs
Notes
Wllama is a TypeScript library that provides WebAssembly bindings for llama.cpp, enabling large language model inference directly in the browser. It loads quantized GGUF models and runs them client-side without server dependencies.
1,095 stars on GitHub. Last updated 2026-06-01. Licensed MIT.
Use cases
- Running private, offline LLM inference in web applications
- Prototyping browser-based chatbots or text assistants
- Evaluating small to medium models without cloud costs
Pros
- No server or API key needed; fully client-side
- Leverages llama.cpp’s efficient inference in WebAssembly
- Active community with over 1,000 GitHub stars
Cons
- Limited to models that fit in browser memory (typically small quantized models)
- Performance constrained by client hardware and WebAssembly overhead
- No built-in support for GPU acceleration in most browsers
Indexed from awesome-llm and enriched against its public facts.
Pros
- No server or API key needed; fully client-side
- Leverages llama.cpp's efficient inference in WebAssembly
- Active community with over 1,000 GitHub stars
Cons
- Limited to models that fit in browser memory (typically small quantized models)
- Performance constrained by client hardware and WebAssembly overhead
- No built-in support for GPU acceleration in most browsers
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.