HydraFlow

Intent in. Software out.

Log an issue. Agents handle the rest - triaging, planning, implementing, reviewing, and merging every change.

HydraFlow is a delivery kernel for GitHub repositories: it accepts intent, compiles it through a staged pipeline, enforces quality gates, and produces merged software changes.

It scales your workflow, not just your output, turning your repository into a programmable delivery engine powered by your hooks and skills. This is Harness engineering at scale.

What Makes It Different

Quality-gated pipeline, not "one-shot" agent code generation
Explicit stage controls (triage, plan, implement, review) before merge
CI checks and human-in-the-loop escalation when confidence drops
Coverage policy target across stacks: enforce 50% minimum and drive toward 70%+ on critical paths
Repeatable standards that keep output consistent as workload grows

Why Teams Use It

Label-driven workflow from issue to merged PR
Built-in planning, implementation, and review stages
CI-aware automation with human-in-the-loop escalation
Repo prep that scaffolds missing quality gates
Live dashboard for visibility into work, agents, and queue state

How It Works

HydraFlow runs a staged pipeline:

Triage: validate issue readiness and queue it for planning.
Plan: read-only exploration and concrete implementation plan.
Implement: isolated worktree changes with tests.
Review: agent review + CI monitoring + merge decision.
Escalate when needed: failures and ambiguity route to hydraflow-hitl.

See the full product walkthrough and visuals at hydraflow.ai.

Prerequisites

Python 3.11+
uv
GitHub CLI authenticated (gh auth login)
Claude CLI and/or Codex CLI available on PATH
Node.js 20.19+ or 22.12+ (dashboard only)
Beads (optional — auto-installed when needed for task decomposition)

Quick Start

# in your project root
git submodule add https://github.com/T-rav/hydra.git hydraflow
git submodule update --init --recursive
cd hydraflow

# install deps + bootstrap target repo hooks/assets/labels
make setup

# scaffold quality gates in the target repo (CI/make/tests/lint where missing)
make prep

# run orchestrator + dashboard (http://localhost:5556)
make run

Dashboard Mode (multi-repo)

Set HYDRAFLOW_DASHBOARD_ENABLED=true in .env, then:

make run

The dashboard opens at http://localhost:5556. When running as a git submodule, the parent repo is auto-registered on startup.

To manually register additional repos:

curl -X POST "http://localhost:5556/api/repos/add?path=/path/to/repo"
curl -X POST "http://localhost:5556/api/runtimes/{slug}/start"

Core Commands

make              # command help
make setup        # bootstrap .env, hooks, labels, local assets
make prep         # repo audit + scaffold + hardening loop
make run          # start backend + dashboard
make dry-run      # print actions without executing
make smoke        # critical cross-system smoke tests
make quality-lite # lint + typecheck + security
make quality      # quality-lite + tests

Admin API

HydraFlow exposes first-class HTTP endpoints for operational tasks once make run is active:

POST /api/admin/prep — sync lifecycle labels, run the prep audit, and seed manifests/memory.
POST /api/admin/scaffold — scaffold CI/tests (dry-run friendly) and report coverage posture.
POST /api/admin/clean — destroy all HydraFlow worktrees in the repo and reset local state.

The Make targets (make prep, make scaffold, make clean) invoke scripts/run_admin_task.py directly, so they work without a running server. When the server is active you can also hit the endpoints directly via the included helper:

# example: trigger prep via HTTP without leaving the terminal
PYTHONPATH=src python scripts/call_api.py POST /api/admin/prep
PYTHONPATH=src python scripts/call_api.py POST /api/admin/scaffold
PYTHONPATH=src python scripts/call_api.py POST /api/admin/clean

HydraFlow still writes prep logs, memory, manifests, and run artifacts to .hydraflow/ by default when you run make commands directly inside a repo. Set HYDRAFLOW_HOME=~/.hydraflow/<repo-slug> (or any preferred location) before starting the server if you want those artifacts stored outside the working tree.

Background Loops

Beyond the five pipeline stages, HydraFlow runs background loops for operational health:

PR unsticker — recovers stale PRs stuck in review or CI
Memory sync — compacts and syncs agent memory artifacts
Metrics sync — collects pipeline and GitHub metrics
Manifest refresh — keeps repo manifests current
Epic monitor / sweeper — tracks epic progress and generates changelogs
Verify monitor — confirms post-merge verification issues
Worktree GC — prunes stale worktrees from merged PRs
Runs GC — cleans up old run artifacts per retention policy
ADR reviewer — council-based Architecture Decision Record reviews
Report issue — processes bug reports with screenshot scanning

Self-improving harness

This repo includes a harnessed self-improvement loop (observation + session retro + memory candidate artifacts). See docs/self-improving-harness.md for imported skills and runtime behavior.

Issue Flow Labels

Pipeline lifecycle:

hydraflow-find — triage queue
hydraflow-plan — planning stage
hydraflow-ready — ready for implementation
hydraflow-review — PR under review
hydraflow-hitl / hydraflow-hitl-active / hydraflow-hitl-autofix — human-in-the-loop escalation
hydraflow-fixed — completed

Auxiliary labels:

hydraflow-epic / hydraflow-epic-child — epic lifecycle management
hydraflow-verify — post-merge verification tracking
hydraflow-visual-required / hydraflow-visual-skip — visual validation controls
hydraflow-dup — duplicate detection

All label names are overridable via .env (created from .env.sample during make setup).

Prep Output and Local Tracking

make prep stores local prep artifacts under:

.hydraflow/prep/*.md for local prep issues
.hydraflow/prep/runs/<run-id>/ for run logs/transcripts

When HYDRAFLOW_HOME is set, the same structure lives under ~/.hydraflow/<repo-slug>/prep/... so every repo can share a single machine-wide HydraFlow home.

Each prep run gets one locked run ID at start, and all logs for that run are written under the same run directory.

Dashboard

When make run is active:

UI: http://localhost:5556
Shows pipeline state, active workers, CI/review progress, and HITL queue

Development

make test
make lint
make lint-check
make lint-fix
make typecheck
make security
make quality-lite
make quality

Docker Execution Mode

HydraFlow can run agents inside Docker containers for isolation. This requires building the agent image and configuring authentication.

1. Build the agent image

docker build -f Dockerfile.agent -t ghcr.io/t-rav/hydraflow-agent:latest .

2. Enable Docker mode

Add to your .env:

HYDRAFLOW_EXECUTION_MODE=docker

3. Configure agent authentication

Agents running inside containers cannot access your host's OAuth session or macOS keychain. You must provide API credentials via environment variables.

Claude (subscription via Claude Max/Pro):

# Generate a headless auth token
claude setup-token

# Add the token to .env
CLAUDE_CODE_OAUTH_TOKEN=<token-from-setup-token>

Claude (API key):

ANTHROPIC_API_KEY=sk-ant-...

Codex:

Codex credentials are mounted automatically from ~/.codex/ on your host. No extra .env configuration is needed.

Other supported providers (set in .env as needed):

OPENAI_API_KEY, OPENROUTER_API_KEY, GEMINI_API_KEY, XAI_API_KEY, DEEPSEEK_API_KEY, MISTRAL_API_KEY, TOGETHER_API_KEY, GROQ_API_KEY

4. Verify

make run
# Create a test issue — check logs for successful agent output instead of
# "authentication_failed" or empty transcripts.

EC2 Deployment

Running HydraFlow as a 24/7 EC2 service is supported out-of-the-box:

deploy/ec2/deploy-hydraflow.sh bootstraps dependencies, syncs code, and restarts the orchestrator.
deploy/ec2/deploy-hydraflow.sh doctor runs a readiness check (git repo present, .env seeded, log/home dirs created, required binaries installed) before your first deploy.
deploy/ec2/hydraflow.service keeps the process alive under systemd; the install helper renders it with SERVICE_USER, SERVICE_GROUP, SERVICE_WORK_DIR, and SERVICE_LOG_FILE overrides so non-/opt installs work without manual edits.
FastAPI exposes GET /healthz with a ready flag plus per-component checks (orchestrator/worker/dashboard) so load balancers or monitors can make decisions without extra parsing.
deploy/ec2/deploy-hydraflow.sh health [URL] curls /healthz using the host/port from /etc/hydraflow.env (or your override) and exits non-zero when HEALTHCHECK_REQUIRE_READY=1 but ready=false, which makes it easy to wire into cron, ALB checks, or pager hooks.
deploy/ec2/deploy-hydraflow.sh wait-ready (or HEALTHCHECK_WAIT_FOR_READY=1 deploy ... deploy) polls /healthz until ready=true; tune the gating with HEALTHCHECK_WAIT_TIMEOUT_SECONDS and HEALTHCHECK_WAIT_INTERVAL_SECONDS.
/etc/hydraflow.env (override with RUNTIME_ENV_FILE) is sourced automatically so manual run commands and systemd share credentials/config.
deploy/ec2/deploy-hydraflow.sh install copies the unit into /etc/systemd/system (or your custom SYSTEMD_DIR) and runs the required systemctl enable --now incantations.

See docs/deployment/ec2.md for the full playbook, including how to bind the dashboard to 0.0.0.0 using the new HYDRAFLOW_DASHBOARD_HOST config knob and how to scope your EC2 security group safely.

Contributing

Fork it.
Run HydraFlow on HydraFlow. (via make run in the repo)
/gh-issue "thing to change"
PR it back.
See CLAUDE.md for project conventions and lifecycle labels.

Name		Name	Last commit message	Last commit date
Latest commit History 982 Commits
.claude		.claude
.codex/skills		.codex/skills
.githooks		.githooks
.github/workflows		.github/workflows
.hydraflow-hook-cache		.hydraflow-hook-cache
.pi		.pi
deploy/ec2		deploy/ec2
docs		docs
migrations/dolt		migrations/dolt
scripts		scripts
src		src
static		static
templates		templates
tests		tests
.dockerignore		.dockerignore
.env.sample		.env.sample
.gitignore		.gitignore
.lgtm.yml		.lgtm.yml
.nvmrc		.nvmrc
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
Dockerfile.agent		Dockerfile.agent
Dockerfile.agent-base		Dockerfile.agent-base
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
claude.sh		claude.sh
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HydraFlow

What Makes It Different

Why Teams Use It

How It Works

Prerequisites

Quick Start

Dashboard Mode (multi-repo)

Core Commands

Admin API

Background Loops

Self-improving harness

Issue Flow Labels

Prep Output and Local Tracking

Dashboard

Development

Docker Execution Mode

1. Build the agent image

2. Enable Docker mode

3. Configure agent authentication

4. Verify

EC2 Deployment

Contributing

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HydraFlow

What Makes It Different

Why Teams Use It

How It Works

Prerequisites

Quick Start

Dashboard Mode (multi-repo)

Core Commands

Admin API

Background Loops

Self-improving harness

Issue Flow Labels

Prep Output and Local Tracking

Dashboard

Development

Docker Execution Mode

1. Build the agent image

2. Enable Docker mode

3. Configure agent authentication

4. Verify

EC2 Deployment

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages