withlin
  • nil
  • Guangzhou, China

Starred repositories

The Cursor for Designers • An Open-Source Visual Vibecoding Editor • Visually build, style, and edit your React App with AI

TypeScript 14,706 850 Updated Jun 4, 2025

Efficient Triton Kernels for LLM Training

Python 5,143 345 Updated Jun 2, 2025

Fast and memory-efficient exact attention

Python 17,664 1,720 Updated Jun 4, 2025

We propose a pioneering benchmark to evaluate LLM agents' ability to improve over time in streaming scenarios

Python 45 6 Updated Oct 28, 2024

Grafana Dash-n-Grab

Go 382 34 Updated Jun 1, 2025

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java 13,748 3,443 Updated Jun 4, 2025

Telemetry and logs generator for benchmarks

C# 21 13 Updated Aug 23, 2022

Logs performance benchmark repo: Comparing Elastic, Loki and SigNoz

Shell 82 3 Updated Jan 23, 2023

Expert Kit is an efficient foundation of Expert Parallelism (EP) for MoE model inference on heterogeneous hardware

Rust 29 7 Updated Jun 4, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,868 1,909 Updated Jun 4, 2025

AOT binary translator from Linux/ELF to WebAssembly

C++ 237 12 Updated Jun 4, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 28,912 5,801 Updated Jun 2, 2025

An AI Hedge Fund Team

Python 34,568 5,998 Updated Jun 2, 2025

All PDF textbooks for primary school, junior high, senior high, and university.

Roff 35,325 7,876 Updated May 18, 2025

LogAI - An open-source library for log analytics and intelligence

Python 590 86 Updated Nov 14, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 90,531 24,339 Updated Jun 4, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,740 784 Updated May 28, 2025

AI powered Kubernetes Assistant

Go 5,956 497 Updated Jun 3, 2025

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 460 80 Updated Jun 3, 2025

⚡ Guidance, samples, and tools for HPC workloads on AKS clusters with RDMA and InfiniBand support, including GPUDirect RDMA.

Shell 12 9 Updated May 30, 2025

ring-attention experiments

Python 145 12 Updated Oct 17, 2024

An awesome list for tech leads.

32 3 Updated Jan 18, 2025

A multi-tenancy-focused solution that transparently facilitates collection of telemetry data from Kubernetes workloads.

Go 44 1 Updated Jun 3, 2025

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 84,710 44,141 Updated Jun 2, 2025

Real-time face swap and one-click video deepfake with only a single image

Python 70,483 9,981 Updated Jun 1, 2025

AI/GPU flame graph

C 150 1 Updated Jun 3, 2025

A unified scheduler for online and offline tasks

Go 607 81 Updated Mar 26, 2025

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 6,999 2,946 Updated Apr 29, 2025