SQLite — One Store for Local, Large, and Private Data

FlowMCP Team

June 1, 2026

#data-formats#sqlite#resources

2026-06-01 · FlowMCP Team · #data-formats #sqlite #resources

A tool that calls a live API is the obvious building block — but it is not always the right one. Some data rarely changes. Some data is too large to fetch on every call. Some data is private and should never leave the machine. For these, FlowMCP offers three local mechanisms. This post explains what each one is for, and then goes deep on the most powerful of them: SQLite resources.

Three ways to anchor data

Mechanism	What it is	Reach for it when…
Preload	An optional `preload` block on a route (`enabled`, `ttl`). The runtime caches the API response locally in memory for the time-to-live.	The data comes from an API but changes slowly and is expensive to refetch — a few hundred KB to a couple of MB that you don’t want to pull on every call.
Shared Lists	Versioned, centrally maintained value sets, referenced by name and filtered per schema.	You need small, canonical value sets shared across many schemas — EVM chain IDs, fiat codes, country codes. One source of truth instead of duplicated inline lists.
SQLite resources	A local SQLite database exposed as an MCP resource (`main.resources`).	The data is large, or static reference data, or private, or generated by the agent itself.

The first two are about light data. SQLite is about heavy and local data.

SQLite: turning an awkward format into queryable tables

The strongest argument for SQLite is conversion. Many datasets ship in formats that are painful to query directly — a ZIP of dozens of CSVs, a sprawling XML dump, a custom binary. Converting such a format once into SQLite turns it into ordinary tables you can query with SQL, with indexes, without re-parsing the source on every access. GTFS transit feeds are exactly this story: the geo-gtfs-toolkit converts a feed into a sealed SQLite database that a schema simply references.

Two modes

SQLite resources come in two modes, and the difference is the whole point:

Mode	Opened as	Allowed	Use case
in-memory	`better-sqlite3` with `readonly: true`	SELECT only — the file on disk is never modified	Reference data, lookups, open data
file-based	WAL mode, writable	All SQL statements; changes persist; a `.bak` is taken before the first write	Agent-generated data: analysis results, collected metrics

Read vs. write: safe by design

Here is the part that matters for safety. In in-memory mode an AI can write SQL queries itself and run them — but only reads get through, and that guarantee holds on two independent layers:

Driver level. The database is opened with better-sqlite3 and the readonly: true flag. The driver itself refuses any write.
Module level. Before a query runs, the runtime checks that the statement begins with SELECT or WITH. Write keywords — INSERT, UPDATE, DELETE, CREATE, ALTER, DROP, REPLACE, TRUNCATE — are rejected with “Only SELECT statements are allowed in in-memory mode.”

So the AI gets full freedom to explore and query a dataset, and zero ability to mutate it. Writing is reserved for the explicit file-based mode, which only schemas marked origin: 'project' may use. The separation is deliberate: completely read-only and safe, or completely on disk and free — never an ambiguous middle.

The default methods

Every SQLite resource gets two methods injected automatically by the runtime, so the AI never has to be told the table layout in advance:

Method	What it does
`describeTables`	Returns the tables and their columns — the AI inspects the structure before querying.
`runSql`	Runs a query the AI composes itself. In in-memory mode: SELECT only. In file-based mode: all statements.

A schema author can add up to seven more named queries on top, but in practice describeTables + runSql already let an assistant explore an unknown database and answer questions about it on its own.

Choosing between them

Live, frequently changing data → a normal tool (an API call).
Slow-changing API data you don’t want to refetch → add preload to the route.
Small canonical lists shared across schemas → a shared list.
Large static reference data, or private data, or anything you’d otherwise re-parse every time → a SQLite resource (in-memory, read-only).
Data the agent produces and needs to keep → a SQLite resource (file-based, writable).

The common thread: keep schemas thin and keep data where it belongs. Preload and shared lists handle the light cases; SQLite handles the heavy and the private ones — with a read/write boundary the runtime enforces for you.

📖 Read also:

FlowMCP v4.1 — GTFS as the First Data Class with Its Own Add-on — SQLite resources in action on a real, heavy dataset.

OParl — One Schema for Germany’s Council Information Systems — the light-API counterpart, where preload (not SQLite) is the right tool.

Share: Twitter LinkedIn

SQLite — One Store for Local, Large, and Private Data

SQLite — One Store for Local, Large, and Private Data

Three ways to anchor data

SQLite: turning an awkward format into queryable tables

Two modes

Read vs. write: safe by design

The default methods

Choosing between them

Related Posts

Making the Connection — How FlowMCP Became a Mobility Framework

FlowMCP v4.2 — Grading as a Versioned Standard

FlowMCP v4.0 — Skills, Selections, Pipes