Skip to content
View kossisoroyce's full-sized avatar
:octocat:
:octocat:

Block or report kossisoroyce

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. train_grpo.py train_grpo.py Public

    GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Optimization) method on the GSM8K (Generalized Math 8K) datase…

    Python 22 3

  2. Gemma-3n-local-training Gemma-3n-local-training Public

    A lightweight, GPU-focused framework to run inference and LoRA fine-tuning on Google’s Gemma 3n family (`1.1B`, `2B`). Designed for small-scale deployments such as chatbots, assistants, or domain-s…

    Python 6 1

  3. vxdf vxdf Public

    VXDF (Vector eXchange Data Format) is an AI-native container for text, metadata and vector embeddings—portable, indexable and compressed.

    Python 2

  4. maxa-ai maxa-ai Public

    Maxa is an AI assistant that maintains persistent memory and theory of mind capabilities, enabling more natural and context-aware interactions over time.

    Python 1

  5. economic-policy-simulator economic-policy-simulator Public

    This project combines a large-language-model (LLM) for natural language policy input, an agent-based computational economics (ACE) core for simulation, and real-time fiscal feedback loops.

    Python 1

  6. vxdf-platform vxdf-platform Public

    TypeScript 1