KEMBAR78
Ai2 · GitHub
Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.1k 663

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.3k 151

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.5k 258

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 14.5k 1.1k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 888 81

Repositories

Showing 10 of 528 repositories
  • olmes Public

    Reproducible, flexible LLM evaluations

    allenai/olmes’s past year of commit activity
    Python 257 Apache-2.0 50 8 1 Updated Oct 24, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,263 Apache-2.0 452 14 (1 issue needs help) 34 Updated Oct 24, 2025
  • FlexOlmo Public

    Code and training scripts for FlexOlmo

    allenai/FlexOlmo’s past year of commit activity
    Python 108 Apache-2.0 13 3 10 Updated Oct 23, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 309 Apache-2.0 57 2 38 Updated Oct 23, 2025
  • safety-eval Public

    A simple evaluation of generative language models and safety classifiers.

    allenai/safety-eval’s past year of commit activity
    Python 69 18 0 1 Updated Oct 23, 2025
  • beaker-gantry Public

    Gantry is a CLI that streamlines running experiments in Beaker

    allenai/beaker-gantry’s past year of commit activity
    Python 27 Apache-2.0 7 2 3 Updated Oct 23, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 51 Apache-2.0 9 1 32 Updated Oct 23, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 14,463 Apache-2.0 1,086 28 9 Updated Oct 23, 2025
  • allenai/rslearn_projects’s past year of commit activity
    Python 14 Apache-2.0 3 15 13 Updated Oct 23, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    allenai/OLMo’s past year of commit activity
    Python 6,052 Apache-2.0 663 14 61 Updated Oct 23, 2025