Find all our models, codebases, and datasets
Featured
A 55 billion token dataset of mathematical and scientific documents, created for training the Llemma models.
A 14.7B token dataset of high-quality English mathematical text.
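Both datasets are distributed through the Hugging Face Hub and can be streamed with the `datasets` library. The sketch below is illustrative only: the Hub IDs (`EleutherAI/proof-pile-2`, `hoskinson-center/proof-pile`) and the subset name are assumptions, so check each dataset card for the canonical identifiers.

```python
from datasets import load_dataset

# Hub IDs and the subset name are assumptions; see the dataset cards.
# Streaming avoids downloading the full 55B-token corpus up front.
pp2 = load_dataset("EleutherAI/proof-pile-2", "algebraic-stack",
                   split="train", streaming=True)
print(next(iter(pp2))["text"][:300])

# The smaller mathematical-text dataset, streamed the same way.
pp1 = load_dataset("hoskinson-center/proof-pile", split="train", streaming=True)
print(next(iter(pp1))["text"][:300])
```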
A library implementing the Tuned Lens, along with other tools for extracting, manipulating, and studying the learned representations of transformers across layers.
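As a rough usage sketch based on the `tuned-lens` package's documented interface (the constructor name, the choice of base model, and the hidden-state indexing convention are all assumptions to verify against the project docs), a pretrained lens can be loaded alongside its model and used to decode an intermediate hidden state into vocabulary logits:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from tuned_lens.nn.lenses import TunedLens

# Model choice is illustrative; pretrained lenses exist for several EleutherAI models.
model_id = "EleutherAI/pythia-160m-deduped"
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Downloads the lens trained for this model (constructor name per the project docs).
lens = TunedLens.from_model_and_pretrained(model)

# A lens maps an intermediate hidden state to final-layer logits:
# lens(hidden_state, layer_index) -> vocabulary logits.
inputs = tokenizer("The capital of France is", return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs, output_hidden_states=True).hidden_states

layer = 4  # layer/index pairing is an assumption; see the library docs
logits = lens(hidden[layer], layer)
print(tokenizer.decode(logits[0, -1].argmax().item()))
```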
A diffusion-based model for upscaling images to higher resolution, trained by Katherine Crowson in collaboration with Stability AI.
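A minimal sketch of running the upscaler through `diffusers`, assuming the checkpoint is the one published on the Hugging Face Hub as `stabilityai/sd-x2-latent-upscaler` (an assumption; confirm against the model card):

```python
import torch
from diffusers import StableDiffusionLatentUpscalePipeline
from PIL import Image

# Hub ID is an assumption; the pipeline performs 2x latent upscaling.
upscaler = StableDiffusionLatentUpscalePipeline.from_pretrained(
    "stabilityai/sd-x2-latent-upscaler", torch_dtype=torch.float16
).to("cuda")

low_res = Image.open("input.png").convert("RGB")
upscaled = upscaler(
    prompt="",             # an empty prompt works for generic upscaling
    image=low_res,
    num_inference_steps=20,
    guidance_scale=0,      # the upscaler is typically run without guidance
).images[0]
upscaled.save("output_2x.png")
```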
A series of Korean autoregressive language models developed by the EleutherAI Polyglot team. To date we have trained and released 1.3B, 3.8B, and 5.8B parameter models.
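A short sketch of loading one of these checkpoints with `transformers`; the Hub ID `EleutherAI/polyglot-ko-1.3b` is inferred from the release naming and should be checked against the model card:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hub ID assumed from the naming scheme; the 3.8B and 5.8B variants follow it too.
model_id = "EleutherAI/polyglot-ko-1.3b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Generate a short Korean continuation ("Hello. Today's weather is ...").
inputs = tokenizer("안녕하세요. 오늘 날씨는", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=30, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```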