Mila Deep Neural Network Library
Mila, version 0.9.914-alpha
Mila is targeting a v0.1.0-beta release in 10-12 weeks, featuring a stable high-level Model API, GPU-accelerated loss functions, comprehensive checkpointing, and production-quality MNIST examples reaching 98%+ accuracy. See ROADMAP.md for detailed weekly milestones, success criteria, and the path to beta.
The Mila Deep Neural Network library provides a comprehensive API for modeling, training, and evaluating deep neural networks in both research and production environments. The library implements state-of-the-art architectures, including transformers, convolutional networks, and recurrent models. Mila uses the NVIDIA CUDA runtime for high-performance GPU acceleration, enabling efficient training and inference on large-scale datasets. The library also supports distributed training across multiple GPUs and compute nodes, with automatic optimization for various hardware configurations.
A complete MNIST training example is included in the samples/mnist directory. It demonstrates how to set up a simple feedforward neural network with Mila, load the MNIST dataset, and train the model to high accuracy on handwritten digit recognition.
Documentation

Comprehensive online documentation is available:
- Online documentation: the complete API reference is hosted on GitHub Pages at https://toddthomson.github.io/Mila
- The documentation includes class references, usage examples, and architecture guides. It is updated automatically through our GitHub Actions workflow whenever changes are pushed to the master branch.
Features

Deep Neural Nets
- GPT-2 and recurrent neural networks
- GPU acceleration using the CUDA runtime

Datasets
- Batch sequence loader (sketched below)
- Optimized data processing pipelines
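To make "batch sequence loader" concrete, here is a short, self-contained sketch of the idea in plain C++. It is illustrative only, not Mila's implementation: it slices a flat token stream into dense [batch, seq_len] input/target pairs, with targets shifted one token ahead, the standard layout for language-model training.

```cpp
// Illustrative sketch of what a batch sequence loader does; NOT Mila's
// implementation, just the core idea in plain C++.
#include <cstdint>
#include <vector>

struct Batch {
    std::vector<std::int32_t> inputs;   // batch_size * seq_len tokens
    std::vector<std::int32_t> targets;  // same tokens, shifted by one
};

class BatchSequenceLoader {
public:
    BatchSequenceLoader(std::vector<std::int32_t> tokens,
                        std::size_t batch_size, std::size_t seq_len)
        : tokens_(std::move(tokens)), batch_(batch_size), seq_(seq_len) {}

    // Returns false when there are not enough tokens left for a full batch.
    bool next(Batch& out) {
        const std::size_t needed = batch_ * seq_ + 1;  // +1 for shifted targets
        if (pos_ + needed > tokens_.size()) return false;
        out.inputs.resize(batch_ * seq_);
        out.targets.resize(batch_ * seq_);
        for (std::size_t b = 0; b < batch_; ++b) {
            for (std::size_t t = 0; t < seq_; ++t) {
                const std::size_t src = pos_ + b * seq_ + t;
                out.inputs[b * seq_ + t]  = tokens_[src];
                out.targets[b * seq_ + t] = tokens_[src + 1];
            }
        }
        pos_ += batch_ * seq_;
        return true;
    }

private:
    std::vector<std::int32_t> tokens_;
    std::size_t batch_, seq_;
    std::size_t pos_ = 0;
};
```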
Training Infrastructure Complete
- Successfully trained an MNIST classifier to 97.5% test accuracy with a 3-layer MLP
- Implemented complete forward and backward passes for Linear layers using cuBLASLt (a minimal forward-pass sketch follows this list)
- AdamW optimizer fully operational with momentum, weight decay, and bias correction
- Achieved ~136,000 samples/second training throughput on CUDA
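The cuBLASLt-backed Linear forward pass mentioned above can be sketched as follows. This is a minimal illustration of the technique, not Mila's actual code: error handling is reduced to a bare status check, and the epilogue-fused bias is one of several valid configurations.

```cpp
// Minimal sketch of a linear-layer forward pass, Y = X * W^T + b, using
// cuBLASLt with a fused bias epilogue. Illustrative only.
#include <cassert>
#include <cublasLt.h>
#include <cuda_runtime.h>

#define LT_CHECK(call)                           \
    do {                                         \
        cublasStatus_t s_ = (call);              \
        assert(s_ == CUBLAS_STATUS_SUCCESS);     \
        (void)s_;                                \
    } while (0)

// Row-major shapes: X [batch, in], W [out, in], b [out], Y [batch, out].
// cuBLAS is column-major, so we compute Y^T (out x batch) = W_cm^T * X_cm,
// where W_cm [in, out] and X_cm [in, batch] are the same row-major buffers
// reinterpreted as column-major.
void linear_forward(cublasLtHandle_t lt, cudaStream_t stream,
                    const float* W, const float* X, const float* b, float* Y,
                    int batch, int in, int out) {
    cublasLtMatmulDesc_t op;
    LT_CHECK(cublasLtMatmulDescCreate(&op, CUBLAS_COMPUTE_32F, CUDA_R_32F));

    cublasOperation_t transA = CUBLAS_OP_T;  // W_cm [in, out] -> [out, in]
    LT_CHECK(cublasLtMatmulDescSetAttribute(
        op, CUBLASLT_MATMUL_DESC_TRANSA, &transA, sizeof(transA)));

    // Fuse the bias add into the matmul epilogue.
    cublasLtEpilogue_t epi = CUBLASLT_EPILOGUE_BIAS;
    LT_CHECK(cublasLtMatmulDescSetAttribute(
        op, CUBLASLT_MATMUL_DESC_EPILOGUE, &epi, sizeof(epi)));
    LT_CHECK(cublasLtMatmulDescSetAttribute(
        op, CUBLASLT_MATMUL_DESC_BIAS_POINTER, &b, sizeof(b)));

    // Layouts describe the buffers as stored (column-major view).
    cublasLtMatrixLayout_t aL, bL, dL;
    LT_CHECK(cublasLtMatrixLayoutCreate(&aL, CUDA_R_32F, in, out, in));
    LT_CHECK(cublasLtMatrixLayoutCreate(&bL, CUDA_R_32F, in, batch, in));
    LT_CHECK(cublasLtMatrixLayoutCreate(&dL, CUDA_R_32F, out, batch, out));

    const float alpha = 1.0f;
    const float beta  = 0.0f;  // 0 = overwrite Y; 1 would accumulate into it
    LT_CHECK(cublasLtMatmul(lt, op, &alpha, W, aL, X, bL, &beta, Y, dL,
                            Y, dL, /*algo=*/nullptr,
                            /*workspace=*/nullptr, 0, stream));

    cublasLtMatrixLayoutDestroy(aL);
    cublasLtMatrixLayoutDestroy(bL);
    cublasLtMatrixLayoutDestroy(dL);
    cublasLtMatmulDescDestroy(op);
}
```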
Critical Fixes
- Race Condition Resolution: Fixed GPU-CPU memory transfer synchronization issues that caused intermittent forward-pass failures (the correct pattern is sketched after this list)
- cuBLASLt Integration: Resolved stream synchronization in tensor copy operations for reliable GPU computation
- Gradient Management: Fixed gradient accumulation vs. overwrite semantics in the backward pass (GEMM beta parameter handling; see the beta comment in the sketch above)
- Optimizer Registration: Verified correct parameter and gradient tensor registration for multi-layer networks
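The race-condition fix above comes down to stream-ordering discipline: the host must not read the destination of an asynchronous device-to-host copy until the stream has drained. A minimal sketch of the correct pattern (ours, not Mila's code):

```cpp
// Sketch of the synchronization discipline behind the fix. Illustrative only.
#include <cstddef>
#include <cuda_runtime.h>

void safe_device_to_host(const float* d_src, float* h_dst, std::size_t n,
                         cudaStream_t stream) {
    // Enqueued in stream order: runs after the kernels that produced d_src.
    cudaMemcpyAsync(h_dst, d_src, n * sizeof(float),
                    cudaMemcpyDeviceToHost, stream);
    // Without this, the host can observe a partially written buffer --
    // the intermittent forward-pass failure mode described above.
    cudaStreamSynchronize(stream);
}

int main() {
    float* h = nullptr;
    cudaMallocHost(&h, 1024 * sizeof(float));  // pinned: enables true async copies
    float* d = nullptr;
    cudaMalloc(&d, 1024 * sizeof(float));
    cudaMemset(d, 0, 1024 * sizeof(float));
    cudaStream_t s;
    cudaStreamCreate(&s);
    safe_device_to_host(d, h, 1024, s);
    cudaStreamDestroy(s);
    cudaFree(d);
    cudaFreeHost(h);
    return 0;
}
```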
Performance Optimizations
- Optimized cuBLASLt matrix multiplication with proper layout configurations for forward and backward passes
- Implemented efficient bias gradient reduction kernels using CUDA warp-level operations (sketched after this list)
- Stream-ordered execution eliminates unnecessary synchronization overhead
- Zero-copy operations where possible using pinned memory for host-device transfers
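The warp-level bias-gradient reduction mentioned above can be sketched as follows; this illustrates the technique, not Mila's kernel. The bias gradient db[j] is the sum of dY[:, j] over the batch, computed here by one warp per output feature with shuffle-based reduction and no shared memory.

```cpp
// Warp-level bias-gradient reduction sketch (illustrative only).
#include <cuda_runtime.h>

__global__ void bias_grad_kernel(const float* __restrict__ dY,  // [batch, out]
                                 float* __restrict__ db,        // [out]
                                 int batch, int out) {
    const int j    = blockIdx.x;   // output feature handled by this warp
    const int lane = threadIdx.x;  // 0..31
    if (j >= out) return;

    // Each lane accumulates a strided slice of the batch dimension.
    float sum = 0.0f;
    for (int i = lane; i < batch; i += 32)
        sum += dY[i * out + j];

    // Shuffle-based reduction within the warp: no shared memory needed.
    for (int offset = 16; offset > 0; offset >>= 1)
        sum += __shfl_down_sync(0xffffffffu, sum, offset);

    if (lane == 0)
        db[j] = sum;  // overwrite; use atomicAdd(&db[j], sum) to accumulate
}

// Launch: one 32-thread warp per output feature.
// bias_grad_kernel<<<out, 32, 0, stream>>>(dY, db, batch, out);
```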
Validated Components
- Linear layer forward pass (784→128→64→10 architecture tested)
- Linear layer backward pass (input gradients, weight gradients, bias gradients)
- AdamW optimizer step with all hyperparameters (the update rule is sketched after this list)
- Gradient zeroing and accumulation
- Multi-layer network training convergence
- Test set evaluation with proper inference mode
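For reference, the validated AdamW step applies momentum, bias correction, and decoupled weight decay per parameter. Below is a sketch of the standard fused update, i.e. the textbook AdamW math, not Mila's actual kernel.

```cpp
// Fused AdamW update sketch: first/second moments, bias correction, and
// decoupled weight decay. Illustrative only.
#include <cuda_runtime.h>

__global__ void adamw_step(float* __restrict__ param,
                           const float* __restrict__ grad,
                           float* __restrict__ m,  // first moment
                           float* __restrict__ v,  // second moment
                           int n, float lr, float beta1, float beta2,
                           float eps, float weight_decay, int t) {
    const int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;

    const float g = grad[i];
    m[i] = beta1 * m[i] + (1.0f - beta1) * g;
    v[i] = beta2 * v[i] + (1.0f - beta2) * g * g;

    // Bias correction for the zero-initialized moment estimates.
    const float m_hat = m[i] / (1.0f - powf(beta1, (float)t));
    const float v_hat = v[i] / (1.0f - powf(beta2, (float)t));

    // Decoupled weight decay: applied directly to the parameter rather
    // than folded into the gradient (the "W" in AdamW).
    param[i] -= lr * (m_hat / (sqrtf(v_hat) + eps) + weight_decay * param[i]);
}
```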
In Development
- Additional activation functions (GELU, SiLU) and their backward passes (GELU's forward math is sketched after this list)
- Layer normalization and batch normalization modules
- Attention mechanism implementation for transformer models
- Gradient clipping and learning rate scheduling
- Model checkpointing and weight serialization
- Distributed training support for multi-GPU environments
- Mixed precision training (FP16/BF16) support
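Of the planned activations, GELU gives a feel for the math involved. Below is a sketch of its widely used tanh approximation (the formulation popularized by GPT-2); this illustrates the function itself, not Mila's planned implementation.

```cpp
// GELU forward pass, tanh approximation. Illustrative sketch only.
#include <cuda_runtime.h>

__global__ void gelu_forward(const float* __restrict__ x,
                             float* __restrict__ y, int n) {
    const int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    const float v = x[i];
    const float c = 0.7978845608f;  // sqrt(2 / pi)
    // GELU(v) ~= 0.5 * v * (1 + tanh(sqrt(2/pi) * (v + 0.044715 * v^3)))
    y[i] = 0.5f * v * (1.0f + tanhf(c * (v + 0.044715f * v * v * v)));
}
```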
Building Mila

Mila uses CMake as its build system. To build Mila, first clone the repository:

git clone https://github.com/toddthomson/mila.git
cd mila
Building with Visual Studio 2022

Prerequisites
- Visual Studio 2022 or newer with the "Desktop development with C++" workload
- CUDA Toolkit 13.0 (latest)
- CMake 3.31 or newer (included with Visual Studio)
Open the Project
- Launch Visual Studio
- Select "Open a local folder" and navigate to your cloned Mila repository
- Visual Studio will automatically detect the CMakeLists.txt file
Configure Project
- Visual Studio will automatically generate CMake cache
- To customize build settings, right-click on CMakeLists.txt and select "CMake Settings for MilaProject"
- Under "Configuration type", select "Release" for optimal performance
Build the Project
- Right-click on CMakeLists.txt and select "Build All"
- Alternatively, use the Build menu or press F7
Run Tests
- In the Solution Explorer, expand the "Tests" folder
- Right-click on a test project and select "Run Tests"
Building with Visual Studio Code

Prerequisites
- Visual Studio Code
- C/C++ extension
- CMake Tools extension
- CUDA Toolkit 13.0
- CMake 3.31 or newer
Open the Project
- Launch VS Code
- Open the folder containing your cloned Mila repository
- VS Code should detect the CMake project automatically
Configure Project
- Press Ctrl+Shift+P to open the command palette
- Type "CMake: Configure" and select it
- Choose your preferred generator (Ninja is recommended for faster builds)
- Select the build variant (Debug/Release)
Build the Project
- Press Ctrl+Shift+P to open the command palette
- Type "CMake: Build" and select it, or use the build button in the status bar
Run Tests
- Press Ctrl+Shift+P to open the command palette
- Type "CMake: Run Tests" and select it
- Alternatively, use the Test Explorer extension to browse and run tests
Using Docker

Prerequisites
- Docker installed on your system
- NVIDIA Docker runtime (for GPU support)
Pull the Docker Image
docker pull toddthomson/mila:latest
Run the Container
- For CPU-only usage:
docker run -it --rm toddthomson/mila:latest
- For GPU support:
docker run -it --rm --gpus all toddthomson/mila:latest
Build from Dockerfile
- Clone the repository and build locally:
git clone https://github.com/toddthomson/mila.git
cd mila
docker build -t mila:local .

Development Workflow
- Mount your local source directory for development:
docker run -it --rm -v $(pwd):/mila/src toddthomson/mila:latest
- Build inside the container:
mkdir -p build && cd build
cmake .. -G Ninja -DCMAKE_BUILD_TYPE=Release
ninja
Requirements
- C++23 support
- NVIDIA CUDA Runtime 13.0 (latest)
- CMake 3.31 or later
- GTest framework 1.17.0 for unit testing
License

Mila is licensed under the Apache License 2.0. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Contributing

We welcome contributions from the community. If you are interested in contributing to Mila, please follow these steps:
- Fork the repository on GitHub.
- Create a new branch from the master branch.
- Make your changes and commit them with clear and concise messages.
- Push your changes to your forked repository.
- Create a pull request to the master branch of the original repository.

Please ensure that your code adheres to the project's coding standards and includes appropriate tests. Refer to the contribution guidelines for details on our code of conduct and the process for submitting pull requests.