Production-grade machine learning infrastructure for eCommerce intelligence, built with modern MLOps practices and designed for operational resilience.
Flit ML is a scalable ML platform that addresses critical business decisions across the eCommerce lifecycle. The platform architecture separates model development, experiment management, and production deployment to enable rapid iteration while maintaining production stability.
| Project | Status | Business Impact | Technical Stack |
|---|---|---|---|
| BNPL Risk Assessment | Production (Railway) | Real-time credit decisioning | Shadow Mode A/B Testing, MLflow, Redis |
| Demand Forecasting | Planned | Inventory optimization | - |
| Marketing Optimization | Planned | Customer acquisition efficiency | - |
The platform implements a three-layer separation of concerns that enables independent optimization of ML inference, deployment strategies, and business integration:
Data Layer: Integrates with flit-data-engineering for feature engineering pipelines and simtom synthetic data generation for production-like testing environments.
ML Layer: Model training, experiment tracking (MLflow), and multi-model inference within a ~2ms latency budget across all four models.
Deployment Layer: Shadow mode controller orchestrates A/B testing, experiment management, and business rule integration without impacting API response times.
```mermaid
graph TB
    subgraph "Production System"
        SIMTOM[Simtom<br/>Synthetic eCommerce]
    end

    subgraph "Flit Data Platform"
        BQ[(BigQuery<br/>Training Data<br/>Performance Tracking)]
    end

    subgraph "Flit ML Platform"
        API[ML API Endpoint]
        SC[Shadow Controller<br/>Orchestration]
        MODELS[Multi-Model Predictor<br/>4 Models in Parallel]
        REDIS[(Redis Cache<br/>Temporary Storage)]
        MLF[MLflow<br/>Experiment Tracking]
    end

    %% Inference Flow
    SIMTOM -->|1. Transaction JSON<br/>Real-time Request| API
    API -->|2. Route Request| SC
    SC -->|3. Engineer 36 Features| MODELS
    MODELS -->|4. ALL 4 Predictions| SC
    SC -->|5. Business Decision<br/>ONLY Selected Model| SIMTOM

    %% Data Persistence Flow
    SC -.->|6. Async: Cache ALL 4 Predictions<br/>+ Input Data| REDIS
    SC -.->|7. Async: Experiment Logging| MLF
    REDIS -->|8. Daily Batch Upload| BQ

    %% Training & Monitoring Flow
    BQ -.->|9. Historical Data| MODELS
    BQ -.->|10. Performance Analysis<br/>Drift Detection| SC

    style API fill:#e1f5fe
    style SC fill:#fff3e0
    style MODELS fill:#f3e5f5
    style REDIS fill:#ffe0b2
    style MLF fill:#e8f5e8
    style BQ fill:#e8eaf6
```
- Deployment: Railway platform with containerized microservices
- Storage: Redis caching (real-time) + BigQuery (analytics)
- Monitoring: MLflow experiment tracking with async logging
- Testing: Shadow mode deployment enables safe production testing
Production ML system for real-time Buy Now, Pay Later credit decisions. It implements shadow-mode A/B testing infrastructure with four production models (Ridge, Logistic Regression, Elastic Net, Voting Ensemble), achieving 0.616 AUC while maintaining <20ms API response times.
Key Technical Achievements:
- Shadow mode controller with storage abstraction for operational flexibility
- Deterministic traffic assignment using customer ID hashing for consistent experiments (sketched below)
- Async operation design maintains sub-20ms response times while logging comprehensive metrics
- Railway deployment with Redis co-location for sub-millisecond caching operations
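A minimal sketch of how hash-based, deterministic assignment can work; the arm list, experiment name, and function name are illustrative, not the shadow controller's actual API:

```python
import hashlib

# Illustrative experiment arms; the real controller reads these from its registry.
EXPERIMENT_ARMS = ["ridge", "logistic", "elastic_net", "voting_ensemble"]

def assign_arm(customer_id: str, experiment: str = "bnpl_shadow_v1") -> str:
    """Deterministically map a customer to an experiment arm.

    Hashing (experiment, customer_id) means the same customer always lands
    in the same arm for a given experiment, so repeated requests receive
    consistent decisions without any stored assignment state.
    """
    digest = hashlib.sha256(f"{experiment}:{customer_id}".encode()).hexdigest()
    bucket = int(digest, 16) % len(EXPERIMENT_ARMS)
    return EXPERIMENT_ARMS[bucket]

# Example: the assignment is stable across calls.
assert assign_arm("cust_123") == assign_arm("cust_123")
```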
Documentation:
- Project Overview - Complete BNPL implementation journey
- Shadow Controller Design - A/B testing architecture
- ML Data Infrastructure - Production data flow
- Known Issues - Model limitations and future improvements
Performance Characteristics:
- API Response: <20ms (target: <100ms)
- Model Inference: ~2ms (4 models)
- Feature Engineering: ~15ms
- Redis Operations: <1ms
Current Status: Deployed on Railway with ephemeral MLflow tracking. Phase 3 roadmap includes persistent MLflow server, BigQuery analytics integration, and advanced monitoring dashboards.
Demand Forecasting: ML-driven inventory optimization and delivery resource planning. Transferable to supply chain management and logistics optimization across industries.
Marketing Optimization: Customer acquisition cost reduction through predictive LTV modeling and channel attribution. Applicable to any digital marketing operation with multi-channel customer acquisition.
- ML Framework: scikit-learn (production models), MLflow (experiment tracking)
- API Framework: FastAPI with async operation support
- Data Processing: pandas, NumPy with BigQuery integration
- Deployment: Docker containers on Railway platform
- Storage: Redis (caching), BigQuery (analytics), MLflow artifacts (SQLite ephemeral, PostgreSQL planned)
- Testing: pytest with comprehensive unit/integration coverage
```bash
# Install dependencies
poetry install

# Setup development environment
cp .env.redis.template .env.redis
# Edit .env.redis with local Redis credentials

# Run test suite
poetry run pytest -v

# Start API locally
poetry run uvicorn flit_ml.api.main:app --reload
```

```bash
# Run all tests (organized by category)
python run_tests.py

# Specific test suites
poetry run pytest tests/unit/ -v          # Unit tests
poetry run pytest tests/integration/ -v   # Integration tests

# Individual modules
python tests/unit/features/test_feature_engineering.py
python tests/unit/models/test_multi_model_predictor.py
```

```bash
# Linting and type checking
poetry run ruff check .
poetry run mypy .
```

```
flit-ml/
├── flit_ml/                 # Core ML platform code
│   ├── api/                 # FastAPI service endpoints
│   ├── config/              # Configuration management (BigQuery, etc.)
│   ├── core/                # Shadow controller, Redis storage, registry
│   ├── data/                # Data access layer
│   ├── evaluation/          # Model evaluation frameworks
│   ├── features/            # Feature engineering pipelines
│   ├── models/              # Model implementations by project
│   │   ├── bnpl/            # BNPL-specific models
│   │   └── shared/          # Shared model utilities
│   └── monitoring/          # Observability and monitoring
├── docs/
│   ├── architecture/        # Platform architecture decisions
│   ├── data/                # Data schema and lineage
│   ├── deployment/          # Deployment guides
│   ├── models/              # Model artifacts and known issues
│   └── projects/            # Project-specific documentation
│       └── bnpl/            # BNPL technical deep dive
├── tests/
│   ├── unit/                # Component-level tests
│   │   ├── api/
│   │   ├── core/
│   │   ├── features/
│   │   └── models/
│   └── integration/         # End-to-end workflow tests
├── models/                  # Trained model artifacts
│   └── production/          # Production-ready models (.joblib files)
├── research/                # ML research and experimentation
│   ├── experiments/         # MLflow experiments
│   ├── notebooks/           # Jupyter notebooks
│   └── reports/             # Research findings
├── scripts/                 # Deployment and utility scripts
├── Dockerfile               # Railway deployment configuration
├── railway.json             # Railway platform config
└── run_tests.py             # Organized test runner
```
Source: BigQuery (flit-data-platform)
- Models trained on historical data in BigQuery tables
  - `flit_intermediate.int_bnpl_customer_tenure_adjusted`: Primary training dataset (1.9M records)
- Model retraining uses accumulated prediction data for drift correction and performance optimization
Feature Engineering: Batch processing in BigQuery → 36 engineered features → Model training
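As a rough illustration, pulling that training table into a DataFrame with the google-cloud-bigquery client might look like this; the project ID and SELECT statement are assumptions, and the real feature pipelines live in flit-data-engineering:

```python
from google.cloud import bigquery

# Assumed GCP project ID; the table name is the primary training dataset referenced above.
client = bigquery.Client(project="flit-data-platform")

query = """
    SELECT *
    FROM `flit-data-platform.flit_intermediate.int_bnpl_customer_tenure_adjusted`
"""

# Materialize the training data for feature engineering and model fitting.
training_df = client.query(query).to_dataframe()
print(training_df.shape)
```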
Source: Simtom (production synthetic eCommerce system)
- Real-time API calls with transaction JSON
- No database access during inference - <2ms latency requirement
- Feature engineering happens in-memory from API request data
Flow:
- Simtom sends transaction → ML API receives JSON
- Shadow Controller engineers 36 features in-memory
- ALL 4 models generate predictions (~2ms)
- Selected model's decision returned to Simtom
- Post-inference: ALL 4 predictions + input data cached to Redis (async, non-blocking)
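A condensed sketch of that request path, assuming hypothetical callables for feature engineering, prediction, assignment, and caching (none of these names come from the actual codebase):

```python
import asyncio
from typing import Any, Callable

async def handle_transaction(
    payload: dict[str, Any],
    engineer_features: Callable[[dict[str, Any]], dict[str, float]],
    predict_all: Callable[[dict[str, float]], dict[str, float]],
    assign_arm: Callable[[str], str],
    cache_store: Callable[..., None],
) -> dict[str, Any]:
    """Hypothetical shadow-mode request path; the callables stand in for real components."""
    features = engineer_features(payload)        # 36 features built in-memory, no DB access
    predictions = predict_all(features)          # scores from ALL 4 models (~2ms)
    arm = assign_arm(payload["customer_id"])     # deterministic experiment assignment
    decision = {"model": arm, "risk_score": predictions[arm]}

    # Fire-and-forget persistence: caching all 4 predictions never blocks the response.
    asyncio.create_task(asyncio.to_thread(cache_store, payload, predictions, decision))
    return decision
```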
Redis: Temporary cache for inference results (30-day TTL)
- Stores ALL 4 model predictions for every transaction
- Input features and metadata
- Business decisions and experiment assignments
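For example, caching one prediction record with a 30-day expiry via redis-py could look like the following; the key scheme and payload shape are illustrative:

```python
import json
import redis

THIRTY_DAYS = 30 * 24 * 60 * 60  # TTL in seconds

# REDIS_URL supplies the real connection string in production; localhost is for illustration.
r = redis.Redis.from_url("redis://localhost:6379/0")

def cache_prediction(transaction_id: str, record: dict) -> None:
    """Cache all model predictions plus input data until the daily BigQuery upload."""
    key = f"bnpl:prediction:{transaction_id}"  # illustrative key scheme
    r.set(key, json.dumps(record), ex=THIRTY_DAYS)

cache_prediction("txn_0001", {
    "predictions": {"ridge": 0.12, "logistic": 0.15, "elastic_net": 0.11, "voting_ensemble": 0.13},
    "decision": {"model": "ridge", "approved": True},
    "features": {"order_value": 84.50},  # truncated for brevity
})
```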
BigQuery: Long-term analytics and model monitoring
- Daily batch upload from Redis
- Model performance tracking
- Data drift detection
- Ground truth joining for retraining
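A sketch of the daily Redis-to-BigQuery drain, reusing the illustrative key scheme above; the destination table and scan pattern are assumptions rather than the platform's actual configuration:

```python
import json

import pandas as pd
import redis
from google.cloud import bigquery

r = redis.Redis.from_url("redis://localhost:6379/0")
bq = bigquery.Client(project="flit-data-platform")

def upload_daily_batch(destination: str = "flit_analytics.bnpl_predictions") -> None:
    """Copy cached prediction records into BigQuery for long-term analytics."""
    rows = [json.loads(r.get(key)) for key in r.scan_iter(match="bnpl:prediction:*")]
    if not rows:
        return
    df = pd.json_normalize(rows)  # flatten nested predictions/decision/features
    bq.load_table_from_dataframe(df, destination).result()  # block until the load job finishes
```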
Current deployment on Railway platform with managed Redis for caching:
```bash
# Deploy to Railway (automatic from git push)
git push origin main

# Manual deployment validation
curl https://flit-ml-api.railway.app/health
```

Detailed deployment instructions: DEPLOYMENT.md
Production environments require:
- `REDIS_URL`: Managed Redis connection string
- `BIGQUERY_PROJECT`: GCP project ID for data access
- `MLFLOW_TRACKING_URI`: MLflow server endpoint (coming in Phase 3)
See .env.production.template for complete configuration.
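A minimal settings loader for those variables, shown only as a sketch; the platform's actual configuration code lives under flit_ml/config/:

```python
import os
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class ProductionSettings:
    """Illustrative holder for the environment variables listed above."""
    redis_url: str
    bigquery_project: str
    mlflow_tracking_uri: Optional[str]  # optional until the Phase 3 MLflow server lands

    @classmethod
    def from_env(cls) -> "ProductionSettings":
        return cls(
            redis_url=os.environ["REDIS_URL"],
            bigquery_project=os.environ["BIGQUERY_PROJECT"],
            mlflow_tracking_uri=os.environ.get("MLFLOW_TRACKING_URI"),
        )
```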
- Persistent MLflow server deployment (Railway + PostgreSQL)
- BigQuery integration for long-term analytics
- Comprehensive production monitoring dashboards
- Automated model performance analysis
- Model registry with automated champion selection
- Real-time drift detection and alerting
- Load testing and performance optimization
- Multi-armed bandit experiment optimization
- Demand forecasting for inventory and delivery optimization
- Marketing optimization with LTV prediction
- Customer churn prediction
- Price optimization models
- Project Documentation: docs/projects/ - Complete technical details per project
- Architecture Decisions: docs/architecture/ - Platform design and trade-offs
- Model Artifacts: docs/models/ - Production model documentation
- Deployment Guide: DEPLOYMENT.md - Production deployment procedures
Development follows a trunk-based workflow with feature branches:
- Create a feature branch from `main`
- Implement changes with comprehensive testing
- Create PR with detailed technical documentation
- Deploy to production after review and merge
Platform Maintainer: Kevin | GitHub
Documentation Standards: See CLAUDE.md for technical documentation requirements