Mnema

A tamper-evident, local-first audit trail for AI-agent work. You drive, agents execute — and you can prove what happened.

Mnema is a local-first MCP server that gives external AI agents (Claude Code, Cursor, Aider, …) typed tools to do work behind workflow gates, while every action lands in a SHA-256 hash-chained audit log that records who coordinated, which agent executed, and in which run. Humans drive through the terminal and verify through the history. Mnema does not run agents — it makes their work accountable.

Not a semantic-memory layer. Mnema does not do embeddings or similarity recall — if you want an agent to remember facts across sessions, reach for Mem0 or Cognee. Mnema answers a different question: what did the agents do, who authorized it, and can you prove the record wasn't altered? It pairs cleanly with a memory layer; it doesn't replace one.

Why Mnema

When an AI agent works in your repository, three questions usually go unanswered: what exactly did it change, did it skip the steps it was supposed to follow, and can you trust the record after the fact? Mnema answers all three.

It makes agent work provable. Every mutation appends to a SHA-256 hash-chained audit log. Change one past entry and the chain breaks — mnema doctor catches edits, truncation, replays, and deletion. This is the part most agent tooling doesn't have.
It keeps the human in the loop. Agents move work through a workflow whose gates reject invalid transitions (no submitting a task with no acceptance criteria, no skipping review). You approve through the terminal; the agent can't route around you.
It records who did what. Each event carries a dual identity — the human who coordinated, the agent that executed, the run it belonged to — so the history reads like a chain of custody.
It stays yours. Local-first, zero telemetry, no remote services. SQLite + plain-text Markdown/JSONL in your repo; the files outlive Mnema and open in any editor.

Instead of…	…you get
A task tracker with no cryptographic guarantee the log is intact	A tamper-evident hash chain with `doctor` verification
A semantic memory layer (Mem0, Cognee) that recalls facts	A provable record of actions taken, not facts remembered
A heavyweight Jira/web UI	An MCP server + CLI that lives next to your code
Free-form agent prose you have to trust	Typed tools behind workflow gates that reject bad input

Quickstart

Status: Mnema is published on npm as an alpha (see Install). The surface is feature-rich and is still being hardened toward a stable 1.0.

# 1. Install and initialise a project
npm install -g @felipesauer/mnema@alpha
cd my-project
mnema init --name "My App" --key "MYAPP"

# 2. Wire your AI client to the MCP server
mnema mcp install-instructions claude-code

Step 2 prints the exact registration command and config for your client. For Claude Code it looks like this:

Register with `claude mcp add` (preferred), or paste the JSON
below into ~/.claude.json under `mcpServers`:

  claude mcp add mnema -s user -e MNEMA_AGENT_HANDLE=claude-code -- mnema mcp serve

Run that claude mcp add line, restart your client, and confirm the project is healthy:

mnema doctor          # all checks green on a fresh project

From here your agent drives Mnema through MCP tools, and you watch and approve from the terminal — walked through end to end in How the MCP loop works.

What you get

Surface	What it does
Audit log	Every action appends to a SHA-256 hash-chained JSONL log (mirrored to SQLite). `mnema doctor` detects edits, truncation, replays, and deletion.
Workflow gates	A state machine per task; each transition declares required fields and Mnema rejects invalid moves.
Agent runs & plans	Wrap every batch of mutations in a run (parent/child, max depth 5); inspect any run later via the CLI.
Dual identity	Each event records the human actor, the agent that executed, and the run — a built-in chain of custody.
Tasks, sprints, epics	Full work tracking: tasks with acceptance criteria, estimate, assignee and a token `context_budget`; one active sprint per project (with measurable metrics); epics grouping tasks under a derived lifecycle.
Decisions (ADRs)	proposed → accepted/rejected → superseded chains, each able to record which artefacts it impacts, with a shortcut to promote a note into a decision.
Traceability layer	Trace work end to end: task↔task dependencies and readiness, epic/sprint completion coverage, acceptance-criteria evidence, a read-only work-graph lint, wikilinks between artefacts, and ADR impact queries.
Full-text search	Search across tasks, decisions, notes and more — case- and accent-insensitive.
Attachments	Files attached to a task or decision, deduplicated by content hash.
Skills, memories, observations	Knowledge the agent records as it works (and humans curate) via MCP tools, mirrored to plain `.md` files so it travels with the repo (not semantic recall — see the note above). User-level skills/memories under `~/.config/mnema/` merge in read-only, with the project always shadowing them.
Workflows	4 presets (`default`, `lean`, `kanban`, `jira-classic`) plus custom JSON validated against a schema.
MCP tools	40+ universal tools plus one per workflow action; `context_bootstrap` is the canonical session entry point.

Install

The Quickstart above covers the common path (npm install -g @felipesauer/mnema@alpha). A few platform notes:

Alpha releases live under the alpha dist-tag, so install with @alpha to be explicit about what you're getting. (Until the first stable 1.x ships, latest also points at the current alpha.)
The native SQLite binding (better-sqlite3) installs a prebuilt binary with npm/npx — no compiler needed. With pnpm, run pnpm approve-builds better-sqlite3 afterwards (pnpm blocks build scripts by default). Platforms without a prebuilt binary need a C++ toolchain (python3, make, g++).

To work from source instead:

git clone https://github.com/felipesauer/mnema.git
cd mnema
pnpm install
pnpm build
ln -s "$PWD/mnema" /usr/local/bin/mnema   # optional, for global access
mnema --version

The bundled ./mnema shell script forwards to dist/index.js — useful for dogfooding without a global install.

Adopting an existing project

You don't have to start clean. mnema init --minimal then mnema adopt all eases Mnema into a repo that already has work, and mnema import markdown / mnema import github-issues pull legacy items in.

Project layout after `mnema init`

my-project/
├── AGENTS.md                 # operating manual for agents (generated by init)
├── .gitignore                # pre-seeded ignores for local state
└── .mnema/                   # everything Mnema owns
    ├── mnema.config.json     # project configuration (versioned)
    ├── audit/                # append-only event log (versioned by default)
    │   └── current.jsonl
    ├── state/                # local cache — gitignored
    │   └── state.db          #   SQLite (FTS, tasks, runs, audit metadata)
    ├── backlog/              # one .md per task, foldered by workflow state
    │   ├── DRAFT/MYAPP-1.md  #   carries its epic_key / sprint_key link
    │   ├── READY/
    │   └── …
    ├── sprints/              # one .md per sprint, mirrored from the DB
    ├── roadmap/              # one .md per epic and per decision (ADR)
    ├── memory/               # agent/human-recorded facts, mirrored to .md
    ├── skills/               # agent-recorded skills, mirrored to .md
    └── workflows/
        └── default.json      # active state machine

Common CLI commands

You drive Mnema from the terminal; agents drive the same model through MCP tools. The commands group by what you're doing — run mnema <command> --help for full flags and examples.

Set up & adopt

Command	What it does
`mnema init`	Create the full layout (use `--minimal` for adoption)
`mnema adopt <component>`	Add `skills/`, `memory/` or `roadmap/` later
`mnema import markdown --from PATH`	One-shot import from `## STATE Title` headings
`mnema import github-issues --repo OWNER/REPO`	One-shot import from GitHub Issues

Track work

Command	What it does
`mnema task create / list / show / move`	Manage tasks (`create` takes `--estimate`, `--context-budget`, `--priority`)
`mnema task assign <key> --to <handle>`	Set or clear a task's assignee (`--clear`); an unknown handle is rejected
`mnema sprint plan / start / close / show / add`	Manage sprints (one active per project)
`mnema sprint add-tasks <key> <task...>`	Attach several tasks at once (best-effort, reports per-task failures)
`mnema sprint metric <key> --name --target`	Add a measurable metric (baseline/unit/due optional)
`mnema epic create / show / add / close`	Group tasks; `show` includes the derived lifecycle
`mnema decision record / accept / reject / supersede`	Manage ADRs (`record` takes `--impact`)
`mnema note add` · `mnema attach add <task> <file>`	Annotate a task; attach a file deduped by SHA-256

Trace & verify

Command	What it does
`mnema task depends <key> <blocksKey>` · `mnema task ready`	Declare a task↔task dependency; list tasks whose blockers are all done
`mnema task evidence <key> [--criterion --kind --ref]`	List or attach evidence for acceptance criteria
`mnema sprint coverage <key>` · `mnema epic coverage <key>`	Report % of tasks in a terminal state
`mnema lint sprint <key>` · `mnema lint epic <key>`	Integrity checks (incomplete tasks, subagent-bypass, broken deps)
`mnema decision impacting <ref>`	Which ADRs affect a given artefact
`mnema search <query>`	Full-text search across the project

Inspect & operate

Command	What it does
`mnema doctor`	Read-only diagnostic — re-verifies the audit chain. Add `--rebuild-mirrors` to recreate missing `.md` from the database
`mnema history --since=today` · `mnema watch`	Compact activity view; live tail of mutations
`mnema inbox`	Tasks awaiting your review or blocked
`mnema agent inspect <run_id>` · `mnema audit query [filters]`	One run with its plans + mutations; raw log access
`mnema sync`	Rebuild the SQLite cache from the markdowns
`mnema skill lint / links / refs` · `mnema memory consolidate`	Validate skills & wikilinks; regenerate memory `INDEX.md`

Keep current after a package upgrade

Command	What it does
`mnema upgrade`	Detect everything out of date (pending migrations, stale AGENTS.md, missing mirrors, old `mnema_version`), show the plan, and apply it after confirmation (`--yes` to skip)
`mnema agents sync`	Regenerate only the Mnema-managed block of AGENTS.md, preserving your own content

Integrate (MCP)

Command	What it does
`mnema mcp serve`	Start the MCP server on stdio (called by your AI client)
`mnema mcp install-instructions <client>`	Print the right config snippet

How the MCP loop works

Your AI client (Claude Code, Cursor, …) spawns mnema mcp serve with cwd pointing at your project. Configure it once via mnema mcp install-instructions claude-code (the printed snippet already includes the right agent_handle).
The agent calls context_bootstrap first — it gets the project identity, active workflow, recent decisions and pointers to memory.
Before any mutation it calls agent_run_start({ goal }) — without an active run, mutations are rejected with NO_ACTIVE_RUN.
It then uses task_create, task_submit, task_block, … as the workflow allows. Every transition is validated against the gate (task_submit requires title, description, acceptance_criteria, estimate).
When done, agent_run_end({ status: "completed" }) flushes the sync buffer and closes the run.

A concrete pass

An agent asked to "add a rate limiter" might: start a run, create MYAPP-12, submit it through the gate (which forces acceptance criteria and an estimate), move it to IN_PROGRESS, do the work, then submit it for review. It cannot mark its own task DONE — the default workflow routes that through your approval. Meanwhile you watch and inspect from the terminal:

mnema watch                        # live tail of every mutation
mnema inbox                        # what's waiting on your review
mnema history --since=today        # formatted activity log
mnema agent inspect <run_id>       # one run, with its plans + mutations
mnema agent resume <run_id>        # reattach to an interrupted run
mnema doctor                       # re-verify the audit chain anytime

Approve with mnema task move MYAPP-12 approve, and the whole sequence — who, which agent, which run, in what order — is sitting in the hash-chained audit log, verifiable forever.

Configuration

.mnema/mnema.config.json is the only configuration. Minimal fields:

{
  "version": "1.0",
  "mnema_version": "^0.5.0-alpha.0",
  "project": { "key": "MYAPP", "name": "My Application" },
  "workflow": "default"
}

Optional fields cover custom paths, audit retention, sync flush thresholds and feature flags. Run mnema doctor after editing — it re-validates the file against the schema and reports anything that drifted.

One optional field worth calling out is enforcement_mode, which decides what a failed workflow gate means:

Mode	A failed gate…
`strict` (default)	blocks an agent; a human may override, and the override is audited
`blocking`	blocks everyone, no override
`advisory`	only warns — anyone may proceed, and the skipped gate is audited

mnema doctor prints the active mode so its effect is never a surprise.

User-level defaults

A ~/.config/mnema/config.json lets you set behavior preferences once for every project on your machine — enforcement_mode, audit_strategy, audit_retention_months, and the sync / features blocks. A project's own config always wins key-by-key; the user file only fills the gaps. It cannot set project identity, paths or workflow — those are intrinsic to a project and an attempt to set them is rejected. Example:

{ "enforcement_mode": "strict", "sync": { "mode": "push" } }

Domain-event hooks

A hooks block runs a shell command when a domain event fires — a task reaching a done state, a decision accepted, a sprint or epic closed — not on generic tool calls. The command receives the triggering audit event as JSON on stdin:

{
  "hooks": {
    "on_task_done": ["./scripts/notify.sh"],
    "on_decision_accepted": ["jq '.data.key' >> decisions.log"]
  }
}

Supported events: on_task_done, on_task_transitioned, on_decision_accepted, on_sprint_closed, on_epic_closed.

Hooks run after the triggering event is durably written, and each firing records its own hook_ran audit event (with the exit code) — a hook is part of the trail, never a phantom side effect. A failing or hung command (30s timeout) is captured and audited; it never rolls back the state that triggered it.

Workflows

Workflows are JSON files in workflows/. The default ships with seven states — DRAFT → READY → IN_PROGRESS → IN_REVIEW → DONE, with BLOCKED and CANCELED branches. Each transition declares its gate (which fields are required, with min/max/enum/format constraints expressed in a small JSON DSL) — Mnema translates the gate into Zod at boot time and surfaces one MCP tool per transition.

To switch presets, edit workflow in mnema.config.json and run mnema doctor. To author a new workflow, copy workflows/default.json and tweak.

Status

Mnema is alpha and published on npm. The accountability core — the part that's the actual differentiator — is in place and hardened: the SHA-256 hash chain, doctor tamper-detection, dual-identity capture, workflow gates, and optimistic-concurrency lost-write protection described in Why Mnema and What you get. The work-tracking and traceability surface around it is built out; the remaining road to a stable 1.0 is hardening and ergonomics, not missing pillars.

Confidence comes from how hard it's shaken out: 590 tests, 0 skipped, lint + build clean, repeated adversarial review sweeps (audit immutability, multi-actor concurrency, custom-workflow validation, input-validation parity, ReDoS), and a 13-check publish gate (scripts/publish-check.sh) plus an end-to-end smoke run before every tag. See CHANGELOG.md for the per-version history.

Getting help

Bug or unexpected behaviour? Open an issue — the bug-report template asks for mnema --version, repro steps, and (if relevant) a snippet of .mnema/audit/current.jsonl.
Question or idea? Use GitHub Discussions.
Security issue? Report it privately — see SECURITY.md.
Want to contribute? Start with CONTRIBUTING.md.

Name		Name	Last commit message	Last commit date
Latest commit History 128 Commits
.github		.github
bench		bench
scripts		scripts
src		src
templates		templates
tests		tests
workflows		workflows
.gitignore		.gitignore
.release-please-manifest.json		.release-please-manifest.json
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
biome.json		biome.json
mnema		mnema
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
release-please-config.json		release-please-config.json
tsconfig.json		tsconfig.json
tsconfig.test.json		tsconfig.test.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mnema

Table of contents

Why Mnema

Quickstart

What you get

Install

Adopting an existing project

Project layout after `mnema init`

Common CLI commands

How the MCP loop works

A concrete pass

Configuration

User-level defaults

Domain-event hooks

Workflows

Status

Getting help

Further reading

License

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Mnema

Table of contents

Why Mnema

Quickstart

What you get

Install

Adopting an existing project

Project layout after mnema init

Common CLI commands

How the MCP loop works

A concrete pass

Configuration

User-level defaults

Domain-event hooks

Workflows

Status

Getting help

Further reading

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Project layout after `mnema init`

Packages