Organizations

@EleutherAI

StellaAthena/README.md

Hi there 👋, my name is Stella Biderman

I'm an AI researcher seeking to better understand how large language models work.

  • 🔭 I’m currently working on language model interpretability with Pythia
  • 🤔 I’m looking for help with statistical models of learning dynamics and designing custom datasets to test theories about language models.
  • 💬 Ask me about training large language models
  • 😄 Pronouns: she/her

Catch me on:

Google Scholar, Twitter, Stack Exchange

Some stats:

GitHub stats

Pinned

  1. EleutherAI/gpt-neox (Public)

    An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

    Python, 7.3k stars, 1.1k forks

  2. EleutherAI/gpt-neo (Public archive)

    An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

    Python, 8.3k stars, 963 forks

  3. EleutherAI/the-pile (Public)

    Python, 1.6k stars, 144 forks

  4. EleutherAI/pythia (Public)

    The hub for EleutherAI's work on interpretability and learning dynamics

    Jupyter Notebook, 2.7k stars, 199 forks