Scalable data pre processing and curation toolkit for LLMs
-
Updated
Dec 31, 2025 - Python
Scalable data pre processing and curation toolkit for LLMs
GWAS summary statistics files QC tool
Predict next number in a sequence using a simple ANN. Modularized code with classes for data preparation, neural network architecture, and training.
얼굴 정렬·파싱 후 주름/모공/홍조 3���널 조건지도(heatmaps + .npy)를 만들어 cGAN 학습에 쓰는 파이프라인. (Face parsing + skin-condition maps (redness/wrinkle/pore) pipeline for conditional GAN training.)
Add a description, image, and links to the data-prep topic page so that developers can more easily learn about it.
To associate your repository with the data-prep topic, visit your repo's landing page and select "manage topics."