
MEMLP Library

Introduction

The MEMLP (Microcontroller Embedded Multi-Layer Perceptron) library is a lightweight and efficient library designed for implementing machine learning models on resource-constrained devices such as microcontrollers. It supports the Arduino framework and microcontrollers like the Raspberry Pi Pico.

Key Features:

  • Platform Compatibility: Written in portable C++11 and optimized for microcontrollers.
  • Reinforcement Learning: Includes support for reinforcement learning algorithms.
  • Customizability: Allows users to define custom architectures, activation functions, and loss functions.
  • Replay Memory: Implements replay memory for reinforcement learning tasks.

Supported Platforms:

  • Arduino IDE
  • Raspberry Pi Pico
  • Other microcontrollers with C++11 support

Installation Instructions

Prerequisites:

  • Arduino IDE: Ensure you have the latest version installed.
  • Earle Philhower's Arduino-Pico Library: Follow the installation instructions on the project's GitHub page.

Steps:

  1. Create a folder named src in your project directory.
  2. Add the MEMLP library as a submodule of your project:
    git submodule add https://github.com/MusicallyEmbodiedML/memlp.git src/memlp
    (Alternatively, clone it directly into that folder: git clone https://github.com/MusicallyEmbodiedML/memlp.git src/memlp)
  3. Include the library in your project:
    #include "src/memlp/MLP.h"

Usage Examples

Initializing the Library

#include "src/memlp/MLP.h"

// 6 inputs, hidden layers of 16, 8 and 8 nodes, and 12 outputs;
// one activation function for each layer after the input
MLP<float> my_mlp({6, 16, 8, 8, 12},
                  {ACTIVATION_FUNCTIONS::RELU,
                   ACTIVATION_FUNCTIONS::LINEAR,
                   ACTIVATION_FUNCTIONS::RELU,
                   ACTIVATION_FUNCTIONS::SIGMOID});

Training a Model

// Toy dataset: 2-dimensional features with 1-dimensional labels
// (the network being trained must have matching input and output sizes)
std::vector<std::vector<float>> features = {{0.1, 0.2}, {0.3, 0.4}};
std::vector<std::vector<float>> labels = {{1.0}, {0.0}};

// Learning rate 0.01, up to 1000 iterations, stop early below an error cost of 0.001
my_mlp.Train({features, labels}, 0.01, 1000, 0.001, false);

Running Inference

// Forward pass on a single input vector; the result is written into output
std::vector<float> input = {0.1, 0.2};
std::vector<float> output;
my_mlp.GetOutput(input, &output);

Implementing Reinforcement Learning

The MEMLP library provides built-in support for reinforcement learning through its ReplayMemory class and dedicated network-update methods. The following sketch shows a simple Q-learning agent; get_state() and execute_action() are placeholders for your own environment code:

#include "src/memlp/MLP.h"
#include "src/memlp/ReplayMemory.hpp"

// Create the Q-network and a target network with identical architecture.
// Both are held in shared_ptr so that SmoothUpdateWeights() can be called later.
auto q_network = std::make_shared<MLP<float>>(
    std::vector<size_t>{4, 16, 16, 2},  // State size: 4, action size: 2
    std::vector<ACTIVATION_FUNCTIONS>{ACTIVATION_FUNCTIONS::RELU,
                                      ACTIVATION_FUNCTIONS::RELU,
                                      ACTIVATION_FUNCTIONS::LINEAR});

// Target network starts as a copy of the Q-network
auto target_network = std::make_shared<MLP<float>>(*q_network);

// Initialize replay memory
ReplayMemory<trainXYItem<float>> memory;
memory.setMemoryLimit(10000);  // Store up to 10000 experiences
memory.forgettingMode = ReplayMemory<trainXYItem<float>>::FORGETMODES::RANDOM_OLDER;

// Training loop
float epsilon = 1.0f;  // Exploration rate
float gamma = 0.99f;   // Discount factor
float alpha = 0.001f;  // Soft update rate

for(int episode = 0; episode < 1000; episode++) {
    // Get the current state from the environment
    // (get_state() and execute_action() are user-supplied placeholders)
    std::vector<float> state = get_state();
    
    // Epsilon-greedy action selection
    std::vector<float> q_values;
    q_network->GetOutput(state, &q_values);
    int action = (rand() < epsilon * RAND_MAX)
                     ? rand() % 2
                     : static_cast<int>(std::max_element(q_values.begin(), q_values.end()) - q_values.begin());
    
    // Execute action and get reward
    float reward = execute_action(action);
    std::vector<float> next_state = get_state();
    
    // Compute the TD target for this transition using the target network's
    // estimate of the next state. trainXYItem only stores an input/target pair,
    // so in this sketch the target is computed here, when the experience is stored.
    std::vector<float> next_q_values;
    target_network->GetOutput(next_state, &next_q_values);
    float max_next_q = *std::max_element(next_q_values.begin(), next_q_values.end());

    std::vector<float> target = q_values;
    target[action] = reward + gamma * max_next_q;

    // Store the experience (state and its target Q-vector) in replay memory
    trainXYItem<float> experience;
    experience.X = state;
    experience.Y = target;
    memory.add(experience, episode);

    // Sample a batch from replay memory and train the Q-network on it
    if(memory.size() >= 32) {
        auto batch = memory.sample(32);
        for(auto &sample : batch) {
            q_network->Train({{sample.X}, {sample.Y}}, 0.001f, 1, 0.0001f, false);
        }

        // Soft-update the target network towards the Q-network
        target_network->SmoothUpdateWeights(q_network, alpha);
    }
    
    // Decay exploration rate
    epsilon *= 0.995f;
}

Key Features for Reinforcement Learning:

  • Experience Replay: The ReplayMemory class supports various forgetting modes (see the snippet after this list):
    • FIFO: First-in-first-out memory
    • RANDOM_EQUAL: Random removal with equal probability
    • RANDOM_OLDER: Biased removal of older memories
  • Target Networks: Use SmoothUpdateWeights() for stable Q-learning
  • Gradient Calculation: CalcGradients() supports policy gradient methods
  • Flexible Architecture: Easily create actor-critic networks
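
For example, switching a memory instance to a different forgetting mode looks like this. RANDOM_OLDER appears in the example above; the FIFO enumerator name is assumed to match the list, so check ReplayMemory.hpp for the exact spelling.

ReplayMemory<trainXYItem<float>> memory;
memory.setMemoryLimit(5000);

// Drop the oldest experience first once the limit is reached
// (enumerator name assumed from the FIFO entry in the list above)
memory.forgettingMode = ReplayMemory<trainXYItem<float>>::FORGETMODES::FIFO;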

API Documentation

Classes and Methods

MLP

  • Constructor:

    MLP(const std::vector<size_t> &layers_nodes, const std::vector<ACTIVATION_FUNCTIONS> &layers_activfuncs, loss::LOSS_FUNCTIONS loss_function = loss::LOSS_FUNCTIONS::LOSS_MSE);
    • Parameters:
      • layers_nodes: Number of nodes in each layer.
      • layers_activfuncs: Activation functions for each layer.
      • loss_function: Loss function to use (default: Mean Squared Error).
  • Methods:

    • void Train(const training_pair_t& training_sample_set_with_bias, float learning_rate, int max_iterations, float min_error_cost, bool output_log);
      • Trains the model using the provided dataset.
    • void GetOutput(const std::vector<T> &input, std::vector<T> *output);
      • Runs inference on the input data.
    • void SmoothUpdateWeights(std::shared_ptr<MLP<T>> anotherMLP, const float alpha);
      • Blends this network's weights towards those of anotherMLP by factor alpha (soft target-network update for reinforcement learning; see the sketch below).
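
A minimal sketch of a soft update between two networks, using the shared_ptr signature above; the architecture and alpha value are arbitrary.

#include <memory>
#include "src/memlp/MLP.h"

auto policy_net = std::make_shared<MLP<float>>(
    std::vector<size_t>{4, 8, 2},
    std::vector<ACTIVATION_FUNCTIONS>{ACTIVATION_FUNCTIONS::RELU,
                                      ACTIVATION_FUNCTIONS::LINEAR});

// Target network starts as a copy of the policy network
auto target_net = std::make_shared<MLP<float>>(*policy_net);

// Blend the target network's weights towards the policy network's weights
target_net->SmoothUpdateWeights(policy_net, 0.01f);  // small alpha = slowly moving target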

Dataset

  • Methods:
    • bool Add(const std::vector<float> &feature, const std::vector<float> &label);
      • Adds a new data point to the dataset.
    • std::pair<DatasetVector, DatasetVector> Sample(bool with_bias = true);
      • Samples data from the dataset.
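
A minimal sketch of the workflow described above, assuming the class is declared in "src/memlp/Dataset.hpp" and is default-constructible (both assumptions; check the headers in the repository):

#include "src/memlp/Dataset.hpp"  // header name assumed; adjust to match the repository

Dataset dataset;  // assumes a default constructor

// Add feature/label pairs one at a time
dataset.Add({0.1f, 0.2f}, {1.0f});
dataset.Add({0.3f, 0.4f}, {0.0f});

// Sample() returns a pair of feature/label collections; with_bias defaults to true
auto training_data = dataset.Sample();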

ReplayMemory

  • Methods:
    • void add(trainingItem &tp, size_t timestamp);
      • Adds a training item to the replay memory.
    • std::vector<trainingItem> sample(size_t nMemories);
      • Samples a batch of training items.
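
Outside a full training loop, the basic round trip looks like this; a sketch reusing the trainXYItem<float> element type from the reinforcement learning example above.

#include <vector>
#include "src/memlp/ReplayMemory.hpp"

ReplayMemory<trainXYItem<float>> memory;
memory.setMemoryLimit(100);  // cap the number of stored experiences

// Add a handful of items, using the loop index as the timestamp
for (size_t t = 0; t < 50; ++t) {
    trainXYItem<float> item;
    item.X = {0.01f * t, 0.5f};
    item.Y = {1.0f};
    memory.add(item, t);
}

// Draw a batch of 10 items for training
auto batch = memory.sample(10);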

Supported Algorithms

  • Activation Functions:
    • Sigmoid
    • Tanh
    • ReLU
    • Linear
  • Loss Functions:
    • Mean Squared Error (MSE)
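
The constructor's optional third argument selects the loss function. For example (ACTIVATION_FUNCTIONS::TANH is an assumed enumerator name for the Tanh entry above):

// Explicit activation and loss selection; LOSS_MSE is the only loss listed above
MLP<float> tanh_net({3, 8, 1},
                    {ACTIVATION_FUNCTIONS::TANH,      // assumed enumerator for Tanh
                     ACTIVATION_FUNCTIONS::SIGMOID},
                    loss::LOSS_FUNCTIONS::LOSS_MSE);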

Contributing Guidelines

We welcome contributions to the MEMLP library! To contribute:

  1. Fork the repository and create a new branch.
  2. Follow the coding standards outlined in the CONTRIBUTING.md file.
  3. Submit a pull request with a detailed description of your changes.

Setting Up a Development Environment

  • Install the required tools (e.g., C++ compiler, Arduino IDE).
  • Clone the repository and ensure all dependencies are installed.

License

This library is distributed under the Mozilla Public License (MPL). See the LICENSE file for more details.

Contact and Support

For support or to report issues, please use the GitHub issue tracker for this repository.
