LLaMA: Open and Efficient Foundation Language Models
by Community
2023-02
OSS
LLaMA: Open and Efficient Foundation Language Models
Added 1 June 2026
Overview
LLaMA (Large Language Model Meta AI) is a collection of foundation language models released by Meta in February 2023, ranging from 7B to 65B parameters. It provides open weights for research, emphasizing efficient inference with smaller model footprints. The models are designed to enable reproducible research and serve as a base for fine-tuning.
Best for
Best for
Researchers and developers needing an open, efficient base model for fine-tuning and experimentation
Use cases
- Fine-tuning on domain-specific datasets for custom tasks
- Running large-scale text generation experiments locally
- Benchmarking model architectures and comparing efficiency
Notes
LLaMA (Large Language Model Meta AI) is a collection of foundation language models released by Meta in February 2023, ranging from 7B to 65B parameters. It provides open weights for research, emphasizing efficient inference with smaller model footprints. The models are designed to enable reproducible research and serve as a base for fine-tuning.
Use cases
- Fine-tuning on domain-specific datasets for custom tasks
- Running large-scale text generation experiments locally
- Benchmarking model architectures and comparing efficiency
Pros
- Competitive performance relative to model size, reducing compute requirements
- Open weights allow full transparency and reproducibility
- Efficient inference enables deployment on fewer GPUs
Cons
- Original release limited to non-commercial research use
- Requires substantial GPU memory and infrastructure for larger variants
- No built-in API or model serving infrastructure
Indexed from awesome-llm and enriched against its public facts.
Pros
- Competitive performance relative to model size, reducing compute requirements
- Open weights allow full transparency and reproducibility
- Efficient inference enables deployment on fewer GPUs
Cons
- Original release limited to non-commercial research use
- Requires substantial GPU memory and infrastructure for larger variants
- No built-in API or model serving infrastructure
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.
llama.cpp
Community
LLM inference in C/C++
vLLM
Community
A high-throughput and memory-efficient inference and serving engine for LLMs
LangChain
Community
The agent engineering platform.
ollama
Community
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.