This repository is for testing, experimentation, and learning by getting hands dirty with core machine learning and deep learning workflows. It contains a collection of Jupyter notebooks and scripts covering NumPy, pandas, scikit-learn, PyTorch, computer vision, and more.
-
Deep Learning (PyTorch)
-
Machine Learning Libraries
-
The Annotated Transformer
-
Modular Scripts
- Overview: dl/going_modular/README.md
- Entry point: dl/going_modular/going_modular/train.py
dl/: Deep learning notebooks, small datasets, and modular scripts.going_modular/: Reusable Python scripts for training.data/: Datasets used in notebooks (e.g.,pizza_steak_sushi,FashionMNIST).models/: Exported PyTorch checkpoints.
library_learning/: Machine learning notebooks focusing on libraries like NumPy, pandas, and scikit-learn.annotated-transformer/: Implementation of "The Annotated Transformer" paper, including notebook and Python script versions.helper_functions.py: General utility functions used by multiple notebooks.
- Create a Python environment (recommended Python 3.10+).
- Install core packages used across notebooks:
pip install numpy pandas matplotlib scikit-learn jupyterlab notebook torch torchvision
- Open the desired notebook in VS Code or Jupyter and run cells.
- Some notebooks/scripts assume recent PyTorch + torchvision versions.
- For modular training scripts, read dl/going_modular/README.md first.
- Prepare data (see
going_modular.data_setup.create_dataloaders). - Run the training script:
python dl/going_modular/going_modular/train.py
- Make small, focused changes.
- Update relevant notebook outputs if you change code behavior.
This workspace is intended for learning and personal use. No license file is included.