Galactica: A Large Language Model for Science
by Community
Galactica
OSS
Galactica: A Large Language Model for Science
Added 1 June 2026
Overview
Galactica is a large language model trained on a corpus of over 48 million scientific papers, textbooks, and knowledge bases. It is designed to summarize, answer questions, and assist with reasoning across scientific domains. Originally developed by Meta AI, the model is now available as an open-weight community resource.
Best for
Best for
Researchers and students who need a quick, science-focused text assistant for literature exploration
Use cases
- Generate concise summaries of scientific papers
- Answer factual questions about research topics
- Assist with literature review and hypothesis generation
Notes
Galactica is a large language model trained on a corpus of over 48 million scientific papers, textbooks, and knowledge bases. It is designed to summarize, answer questions, and assist with reasoning across scientific domains. Originally developed by Meta AI, the model is now available as an open-weight community resource.
Use cases
- Generate concise summaries of scientific papers
- Answer factual questions about research topics
- Assist with literature review and hypothesis generation
Pros
- Trained on an extensive corpus of peer-reviewed scientific literature
- Specialized for scientific terminology and reasoning tasks
- Free and open source model weights available for community use
Cons
- Prone to generating plausible but incorrect citations and fabricated facts
- Limited to text only and does not handle multi-modal scientific data
- No longer actively maintained or updated by the original developers
Indexed from awesome-llm and enriched against its public facts.
Pros
- Trained on an extensive corpus of peer-reviewed scientific literature
- Specialized for scientific terminology and reasoning tasks
- Free and open source model weights available for community use
Cons
- Prone to generating plausible but incorrect citations and fabricated facts
- Limited to text only and does not handle multi-modal scientific data
- No longer actively maintained or updated by the original developers
Pairs with
Other entries in the index that connect to this one. Click through to see the chain.