Skip to content
View Pawansingh3889's full-sized avatar

Highlights

  • Pro

Block or report Pawansingh3889

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Pawansingh3889/README.md

Pawan Singh Kapkoti

Data engineer based in the UK. I build small, free, on-prem tools for data and SQL work, and send fixes upstream to the projects I use. MSc Data Analytics, Aston.

Portfolio · Governed Agent Stack · LinkedIn · PyPI

On-prem AI / agents

  • governed-agent-stack: free, on-prem building blocks for an AI agent you can point at a real database and audit. schema-scout maps the data, sql-explorer-mcp and sql-sop fence in read-only access, FloorMind reasons, and agent-blackbox keeps a tamper-evident record.
  • FloorMind: on-prem natural-language-to-SQL over manufacturing data. A LangGraph agent, a local LLM via Ollama, sql-sop validation, and a tamper-evident audit log. Eval-measured.
  • agent-blackbox: an append-only, hash-chained ledger that gives agent actions a tamper-evident audit trail. On-prem, zero dependencies.
  • sql-explorer-mcp: read-only MCP server for SQL Server, Postgres, and SQLite, with a layered safety stack. pip install sql-explorer-mcp
  • query-warden: role-based access control for SQL. Decides whether a role may touch the tables and columns a query references, before it runs. On-prem.

Tools

  • schema-scout: reverse-engineers a SQL Server database into an AI-ready catalog. Infers the undeclared foreign keys, flags PII, scores how AI-ready the schema is, and renders an offline dashboard. Runs entirely on-prem.
  • sql-sop: a SQL linter that catches dangerous and non-conforming SQL in CI and pre-commit. pip install sql-sop · browser playground
  • sql-sop-mcp: an MCP server wrapping the linter, so Claude / Cursor / any MCP client can call it. pip install sql-sop-mcp
  • pr-sop: small PR-governance checks: changelog drift, version mismatch, stale pre-commit pins. pip install pr-sop
  • morning-brief: rule-based Gmail triage, read-only, no LLM. pip install morning-brief

Data pipelines

  • production-analytics-pipeline: incremental ETL from a manufacturing ERP. FastAPI + Next.js + Power BI, Prefect orchestration, 53 tests.
  • uk-crime-pipeline: Police UK API into PostgreSQL and BigQuery. dbt marts, ~100k records, Polars ingestion.

Tech

Python · SQL / T-SQL · dbt · SQL Server / PostgreSQL / BigQuery · pandas / Polars · FastAPI · Ollama / LangGraph (on-prem AI) · MCP · Docker · GitHub Actions

Learning (2026): Microsoft Fabric · KQL · Azure · Snowflake

Open source

My pull requests across OSS projects.

Pinned Loading

  1. sql-guard sql-guard Public

    Fast rule-based SQL linter on PyPI (sql-sop). 38 rules, 149 tests, libCST injection scanner, SARIF output, browser playground. Pre-commit hook + GitHub Action. 500+ monthly downloads.

    Python 1 7

  2. uk-crime-pipeline uk-crime-pipeline Public

    End-to-end pipeline: Police UK API to PostgreSQL + BigQuery. dbt staging/marts, 65 tests, 3 CI/CD workflows, Looker Studio + Streamlit dashboards.

    Python

  3. Hackathon-mediask Hackathon-mediask Public

    MediAsk — health Q&A platform for factory workers. Flask, PostgreSQL, Gemini AI, Docker. Live on Render.

    Python

  4. forthepeople-uk forthepeople-uk Public

    UK citizen transparency platform. Free council-level dashboards: weather, population, housing, crime, health, schools, elections, benefits.

    Python

  5. manufacturing-compliance-dashboard manufacturing-compliance-dashboard Public

    BRC/HACCP food safety dashboard. Batch traceability, temperature monitoring, allergen matrix, weight variance. Streamlit + Sentry.

    Python