Governed Agent Stack

Free, on-prem building blocks for an AI agent you can point at a real database and actually audit.

Every 2026 agentic-AI report lands on the same two blockers, and neither of them is the model. The first is the data underneath it: nobody mapped the database, so the agent is working blind. The second is governance around it: nothing constrains what the agent can touch, and there is no trustworthy record of what it did. Pilots stall there, not on model quality.

This is a reference stack of small tools that each solve one of those problems, run entirely on your own hardware, and cost nothing. Each one stands on its own. Put together, they make up an agent you can place in front of a regulated database without losing sleep.

Nobody packages this free and on-prem. That is the whole point.

The layers

Layer	Tool	Job
Foundations	schema-scout	Map the database, recover undeclared relationships, flag PII, and score how ready the schema actually is for an agent.
Scoped access	sql-explorer-mcp + sql-sop + query-warden	Give the agent read-only SQL access: every query is parsed, linted, and checked against role-based access rules before it runs.
Reasoning	FloorMind	Turn a plain-English question into a checked query and a plain-English answer.
Result masking	pii-veil	Mask any PII that survives into result rows before they reach the model.
Accountability	agent-blackbox	Record every step in a tamper-evident, hash-chained log you can verify later.

Flagship: sql-steward

sql-steward bundles the scoped-access, masking, and accountability layers into one Model Context Protocol server, behind a stronger guarantee: the agent never writes SQL at all. Instead of validating SQL the model wrote, sql-steward compiles every query from a semantic layer you control (entities, joins, metrics, PII tags), so there is no run_sql tool to misuse. Blocked PII is refused before the query runs, every call can land in the agent-blackbox ledger, and the same tools work across SQL Server, Postgres, and SQLite.

Use it as the all-in-one entry point, or compose the individual pieces below yourself. They are the same building blocks either way.

How it fits together

flowchart TB
    Q["Question in plain English"] --> AG

    subgraph Foundations
        SS["schema-scout<br/>map, relationships, PII, readiness"]
    end

    subgraph Reasoning
        AG["FloorMind<br/>question to SQL to answer"]
    end

    subgraph Scoped_access["Scoped access"]
        SOP["sql-sop<br/>SQL safety lint"]
        WARD["query-warden<br/>role-based access"]
        EX["sql-explorer-mcp<br/>read-only execution"]
    end

    DB[("Your database<br/>stays on-prem")]
    VEIL["pii-veil<br/>mask PII in results"]
    A["Answer + chart"]
    BB["agent-blackbox<br/>tamper-evident log"]
    ST["sql-steward<br/>all-in-one gateway:<br/>agent never writes SQL"]

    SS -- schema context --> AG
    AG -- generated SQL --> SOP --> WARD --> EX --> DB
    DB -- rows --> VEIL --> AG --> A

    Q -. or, one governed gateway .-> ST
    ST -- compiled SQL --> DB

    AG -. every step .-> BB
    SOP -. logged .-> BB
    WARD -. logged .-> BB
    EX -. logged .-> BB
    ST -. logged .-> BB

How a question flows through it

Once, up front: point schema-scout at the database. It produces a catalog, an agent-ready context file, and a readiness score. If the score is low, you fix the foundations before going further. Re-run it on a schedule and use diff to catch drift.
A user asks a question in plain English. FloorMind uses the schema context to route the question to the right domain and tables, then drafts SQL.
Before anything touches the database, sql-sop lints the draft, query-warden checks it against the asker's role (which tables and columns they may see), and sql-explorer-mcp enforces read-only execution. Writes never run, and out-of-role access is blocked before it reaches the database.
Results come back and FloorMind explains them in plain English, with context.
agent-blackbox records the whole chain (question, SQL, result, outcome) in a hash-chained ledger. Anyone can verify later that the record was not altered after the fact.

Why on-prem, why free

Nothing leaves the building. The database, the local LLM (Ollama), and the logs all stay on your hardware. That is the whole reason this exists for regulated or privacy-sensitive data.
Read-only by enforcement, not by trust. Three layers have to agree before a query runs, so a misconfigured login is not your only protection.
Auditable by design. The log is tamper-evident, so "what did the agent do" has a real, checkable answer.
No licence cost, no per-seat fee, no vendor lock-in. Clone the pieces you need and run them.

The pieces

Each tool is its own repo with its own docs. Start with whichever problem is most urgent. Usually that is schema-scout, because everything downstream depends on knowing the data first.

sql-steward (flagship): one governed MCP server where the agent never writes SQL. Queries are compiled from a semantic layer you control, multi-dialect (SQL Server, Postgres, SQLite), with optional role checks, masking, and audit wired in.
schema-scout: maps a SQL Server database, recovers hidden foreign keys, flags PII, scores agent-readiness, and serves the catalog to an agent over MCP.
sql-explorer-mcp: read-only Model Context Protocol server for SQL Server, Postgres, and SQLite, with three layers of safety.
sql-sop: a fast rule-based SQL linter (available on PyPI) that catches dangerous and slow patterns before a query runs.
query-warden: role-based access control for SQL. Decides whether the asker's role may touch the tables and columns a query references, before it runs.
pii-veil: masks PII in query results (Microsoft Presidio when installed, regex fallback otherwise) before they reach the model.
FloorMind: an on-prem natural-language query tool for manufacturing data, eval-measured rather than vibes-based.
agent-blackbox: an append-only, hash-chained ledger that gives agent actions a tamper-evident audit trail.

Status

All eight components are public and usable today, including the sql-steward flagship that bundles them. This repo is the map that ties them together, not a separate install. Pick the layers you need, or start with sql-steward.

Governance

The stack holds itself to the same bar it helps you apply to an agent: on-prem, open, single-purpose, auditable. Those rules aren't just prose — the components are declared in stack.yaml and enforced as policy-as-code in policies/, so a new component has to pass the same check. See GOVERNANCE.md for the principles and ROADMAP.md for where it's heading.

License

MIT.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github/workflows		.github/workflows
docs		docs
policies		policies
scripts		scripts
GOVERNANCE.md		GOVERNANCE.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
stack.yaml		stack.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Governed Agent Stack

The layers

Flagship: sql-steward

How it fits together

How a question flows through it

Why on-prem, why free

The pieces

Status

Governance

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Governed Agent Stack

The layers

Flagship: sql-steward

How it fits together

How a question flows through it

Why on-prem, why free

The pieces

Status

Governance

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages