tuned-lens
A library implementing the Tuned Lens, along with other tools for extracting, manipulating, and studying the learned representations of transformers across layers.
https://github.com/norabelrose/tuned-lens
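For illustration, a minimal sketch of applying a tuned lens to an intermediate layer, assuming the `TunedLens.from_model_and_pretrained` loader and per-layer `forward(h, idx)` interface described in the project's README (the model name is just an example; check the repo for the current API):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from tuned_lens.nn.lenses import TunedLens

model = AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-160m-deduped")
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pythia-160m-deduped")

# Load a lens trained for this model (downloaded from the Hugging Face Hub).
lens = TunedLens.from_model_and_pretrained(model)

inputs = tokenizer("The capital of France is", return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs, output_hidden_states=True).hidden_states

# Translate an intermediate hidden state into next-token logits.
h = hidden[6][:, -1, :]          # residual stream after layer 6, last position
logits = lens.forward(h, 6)      # lens is trained per layer, hence the index
print(tokenizer.decode(logits.argmax(-1)))
```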
RWKV
RWKV is an RNN with transformer-level performance on some language modeling tasks. Unlike other RNNs, it can be scaled efficiently to tens of billions of parameters.
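To make the "RNN" claim concrete, here is a simplified NumPy sketch of RWKV's WKV operator in its recurrent form, following the formula in the RWKV paper (the log-space stabilization tricks used in real implementations are omitted). The recurrent state is O(channels) regardless of sequence length, which is why inference scales like an RNN:

```python
import numpy as np

def wkv_recurrent(w, u, k, v):
    """Simplified RWKV WKV operator in recurrent form.

    w: per-channel decay (> 0), u: per-channel bonus for the current token,
    k, v: [T, C] key/value sequences. The state (num, den) has size O(C)
    no matter how long the sequence is.
    """
    T, C = k.shape
    num = np.zeros(C)   # running decayed sum of e^{k_i} * v_i
    den = np.zeros(C)   # running decayed sum of e^{k_i}
    out = np.zeros((T, C))
    for t in range(T):
        # The current token gets an extra bonus weight e^{u + k_t}.
        out[t] = (num + np.exp(u + k[t]) * v[t]) / (den + np.exp(u + k[t]))
        # Decay past contributions and fold in token t for future steps.
        num = np.exp(-w) * num + np.exp(k[t]) * v[t]
        den = np.exp(-w) * den + np.exp(k[t])
    return out
```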
GPT-NeoX
A library for efficiently training large language models with tens of billions of parameters across multiple machines. It is currently maintained by EleutherAI.
LM Eval Harness
Our library for reproducible and transparent evaluation of LLMs.
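As a hedged example, recent versions of the harness expose a `simple_evaluate` entry point for evaluating a Hugging Face model from Python (the model and task names below are placeholders; see the repo for the current interface):

```python
import lm_eval

# Evaluate a Hugging Face causal LM on a benchmark task.
results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=EleutherAI/pythia-160m",
    tasks=["lambada_openai"],
)
print(results["results"])
```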
Mesh Transformer JAX
A JAX- and TPU-based library for model-parallel transformer training, developed by Ben Wang. It was used to train GPT-J.
https://github.com/kingoflolz/mesh-transformer-jax
GPT-Neo Library
A Mesh TensorFlow library for training language models. It was used to train the GPT-Neo models but has since been retired and is no longer maintained; we currently recommend the GPT-NeoX library for LLM training.
https://github.com/EleutherAI/gpt-neo