Popular repositories

  1. distributed-llama Public

    Forked from b4rtaz/distributed-llama

    Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

    C++