Enterprise DNA
O Open Source Frameworks medium

Galactica: A Large Language Model for Science

by Community

Galactica

GA

OSS

Galactica: A Large Language Model for Science

Added 1 June 2026

Overview

Galactica is a large language model trained on a corpus of over 48 million scientific papers, textbooks, and knowledge bases. It is designed to summarize, answer questions, and assist with reasoning across scientific domains. Originally developed by Meta AI, the model is now available as an open-weight community resource.

Best for

Best for
Researchers and students who need a quick, science-focused text assistant for literature exploration

Use cases

  • Generate concise summaries of scientific papers
  • Answer factual questions about research topics
  • Assist with literature review and hypothesis generation

Notes

Galactica is a large language model trained on a corpus of over 48 million scientific papers, textbooks, and knowledge bases. It is designed to summarize, answer questions, and assist with reasoning across scientific domains. Originally developed by Meta AI, the model is now available as an open-weight community resource.

Use cases

  • Generate concise summaries of scientific papers
  • Answer factual questions about research topics
  • Assist with literature review and hypothesis generation

Pros

  • Trained on an extensive corpus of peer-reviewed scientific literature
  • Specialized for scientific terminology and reasoning tasks
  • Free and open source model weights available for community use

Cons

  • Prone to generating plausible but incorrect citations and fabricated facts
  • Limited to text only and does not handle multi-modal scientific data
  • No longer actively maintained or updated by the original developers

Indexed from awesome-llm and enriched against its public facts.

Pros

  • Trained on an extensive corpus of peer-reviewed scientific literature
  • Specialized for scientific terminology and reasoning tasks
  • Free and open source model weights available for community use

Cons

  • Prone to generating plausible but incorrect citations and fabricated facts
  • Limited to text only and does not handle multi-modal scientific data
  • No longer actively maintained or updated by the original developers

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.