deep-code-security

Multi-language SAST tool with agentic verification and AI-powered fuzzing. Two analysis modes:

Static Analysis (SAST) - Uses tree-sitter for deterministic AST parsing, sandbox-verified exploit PoCs, and structured remediation guidance
Dynamic Analysis (Fuzzing) - AI-powered fuzzer with coverage-guided feedback, crash deduplication, and corpus management

Exposes all functionality via an MCP server for Claude Code integration.

Quick Start

# Install (basic)
pip install -e ".[dev]"

# Install with fuzzing support
pip install -e ".[dev,fuzz]"

# Static analysis via CLI
dcs hunt /path/to/project
dcs verify --finding-ids <id1> <id2>

# Dynamic analysis (fuzzing) via CLI
dcs fuzz /path/to/target.py
dcs replay /path/to/corpus
dcs corpus /path/to/corpus
dcs fuzz-plugins
dcs report /path/to/output

# Run via MCP server
python -m deep_code_security.mcp

Architecture

Static Analysis (SAST)

Target Codebase
    |
    v
HUNTER     tree-sitter parse → source/sink match → taint track
    |      Output: RawFinding[] (JSON, paginated)
    v
AUDITOR    PoC generation → sandbox execution → confidence scoring
    |      Output: VerifiedFinding[] (JSON)  [Exploit = 10% bonus only]
    v
ARCHITECT  context gather → guidance generation → dependency analysis
           Output: RemediationGuidance[] (JSON)

Dynamic Analysis (Fuzzing)

Target Function
    |
    v
FUZZER     LLM-guided input generation → sandboxed execution →
           crash detection → corpus management → coverage tracking
           Output: CrashReport[] (JSON, with reproducer inputs)

Supported Languages (v1)

Python (Flask, Django patterns)
Go (net/http, database/sql patterns)
C (stretch goal — basic argv/stdin patterns)

Installation

# Clone
git clone https://github.com/your-org/deep-code-security.git
cd deep-code-security

# Install with dev dependencies
pip install -e ".[dev]"

# Install with fuzzing support (requires anthropic SDK)
pip install -e ".[dev,fuzz]"

# Build sandbox images (requires Docker or Podman)
make build-sandboxes        # Build auditor/architect sandbox images
make build-fuzz-sandbox     # Build fuzzer sandbox image (Podman only)

MCP Configuration

Add to ~/.claude/settings.json:

{
  "mcpServers": {
    "deep-code-security": {
      "command": "python",
      "args": ["-m", "deep_code_security.mcp"],
      "cwd": "/path/to/deep-code-security",
      "env": {
        "DCS_REGISTRY_PATH": "/path/to/deep-code-security/registries",
        "DCS_ALLOWED_PATHS": "/path/to/projects",
        "DCS_CONTAINER_RUNTIME": "auto",
        "ANTHROPIC_API_KEY": "your-api-key-here"
      }
    }
  }
}

Optional Environment Variables

For advanced tuning, additional variables are available:

Static Analysis:

Variable	Default	Description
`DCS_MAX_RESULTS`	`100`	Max findings returned per hunt operation
`DCS_MAX_VERIFICATIONS`	`50`	Max findings to verify in auditor phase
`DCS_SANDBOX_TIMEOUT`	`30`	Per-exploit timeout in seconds
`DCS_MAX_FILES`	`10000`	Max files per scan
`DCS_MAX_CONCURRENT_SANDBOXES`	`2`	Concurrency limit for sandbox execution
`DCS_QUERY_TIMEOUT`	`5.0`	Tree-sitter query timeout in seconds
`DCS_QUERY_MAX_RESULTS`	`1000`	Max results per tree-sitter query

Dynamic Analysis (Fuzzing):

Variable	Default	Description
`ANTHROPIC_API_KEY`	(none)	API key for Claude (required for fuzzing)
`GOOGLE_CLOUD_PROJECT`	(none)	GCP project ID for Vertex AI (optional)
`CLOUD_ML_PROJECT_NUMBER`	(none)	GCP project number for Vertex AI (optional)
`ANTHROPIC_VERTEX_PROJECT_ID`	(none)	Vertex AI project override (optional)
`DCS_FUZZ_MODEL`	`claude-sonnet-4-6`	Claude model for input generation
`DCS_FUZZ_MAX_ITERATIONS`	`10`	Max fuzzing iterations
`DCS_FUZZ_INPUTS_PER_ITER`	`10`	Inputs generated per iteration
`DCS_FUZZ_TIMEOUT_MS`	`5000`	Per-input execution timeout
`DCS_FUZZ_MAX_COST_USD`	`5.0`	API cost budget
`DCS_FUZZ_OUTPUT_DIR`	`./fuzzy-output`	Corpus and report output directory
`DCS_FUZZ_CONSENT`	`false`	Pre-configured consent for CI
`DCS_FUZZ_GCP_REGION`	`us-east5`	GCP region for Vertex AI
`DCS_FUZZ_ALLOWED_PLUGINS`	`python`	Comma-separated allowlist of fuzzer plugins
`DCS_FUZZ_MCP_TIMEOUT`	`120`	Hard wall-clock timeout for MCP fuzz invocations
`DCS_FUZZ_CONTAINER_IMAGE`	`dcs-fuzz-python:latest`	Podman image used by ContainerBackend for MCP fuzz runs

MCP Tools

Tool	Description
`deep_scan_hunt`	Run Hunter phase (AST parse + taint track)
`deep_scan_verify`	Run Auditor phase (sandbox exploit verification)
`deep_scan_remediate`	Run Architect phase (remediation guidance)
`deep_scan_full`	Run all three phases sequentially
`deep_scan_status`	Check sandbox health and registry info
`deep_scan_fuzz`	Run AI-powered fuzzing (requires Podman container backend)
`deep_scan_fuzz_status`	Check fuzzer availability and configuration

Confidence Scoring

The confidence score (0-100) uses a weighted composite:

Factor	Weight	Notes
Taint path completeness	45%	Full path = 100, partial = 50, heuristic = 20
Sanitizer absence	25%	No sanitizer = 100, full sanitizer = 0
CWE severity baseline	20%	Critical = 100, High = 75, Medium = 50, Low = 25
Exploit verification	10%	Bonus only — failed PoC does not penalize

Thresholds: >=75 confirmed, >=45 likely, >=20 unconfirmed, <20 false positive

Sandbox Security Policy

Sandbox containers enforce:

--network=none — no network access
--read-only — read-only root filesystem
--tmpfs /tmp:rw,noexec,nosuid,size=64m — writable temp with noexec
--cap-drop=ALL — no Linux capabilities
--security-opt=no-new-privileges — no privilege escalation
--security-opt seccomp=seccomp-default.json — custom seccomp profile
--pids-limit=64 — no fork bombs
--memory=512m — memory ceiling
--user=65534:65534 — run as nobody

Development

make lint           # Lint with ruff
make test           # All tests (90%+ coverage required)
make test-hunter    # Hunter tests only
make test-auditor   # Auditor tests only
make test-architect # Architect tests only
make test-mcp       # MCP server tests only
make test-fuzzer    # Fuzzer tests only
make sast           # Security scan with bandit
make security       # sast + pip-audit

Registry Format

See registries/README.md for the YAML registry format documentation.

Known Limitations (v1)

Intraprocedural taint only — source and sink must be in the same function body. Expected detection rate: 10-25% of real-world injection vulnerabilities. Most web app vulnerabilities span multiple function call boundaries and will NOT be detected by v1.
Query brittleness — tree-sitter queries match specific AST shapes. The following patterns are NOT matched in v1:
- Aliased imports: req = request; req.form
- Fully-qualified names: flask.request.form
- Class attributes: self.request.form
- Chained calls: request.form.get("key") (partial match only)
PoC verification is bonus-only — most template-based PoCs fail due to missing execution context (framework setup, dependency injection, state initialization). A failed PoC does NOT mean the vulnerability is false. The exploit bonus is capped at 10 points.
No cross-language taint — Python calling C via FFI is not analyzed. Each language is analyzed independently.
No interprocedural analysis — call graphs across functions/files are not traced in v1. Deferred to v1.1.
Fuzzer requires optional dependencies — dynamic analysis requires pip install -e ".[fuzz]" to install the anthropic SDK and related packages. The fuzzer will not be available without these dependencies.
deep_scan_fuzz MCP tool requires Podman — The MCP fuzzing tool uses ContainerBackend for full isolation and is only available when Podman is installed and the dcs-fuzz-python:latest image is built via make build-fuzz-sandbox. CLI fuzzing supports both SubprocessBackend (rlimits-only) and ContainerBackend.

Security Model

The MCP server runs as a native stdio process on the host. It does NOT run inside a Docker container, avoiding the Docker socket mount attack vector. Docker or Podman is used for sandbox containers that execute exploit PoCs (auditor/architect phases). The fuzzer's ContainerBackend uses Podman exclusively for rootless container execution.

All file access goes through DCS_ALLOWED_PATHS allowlist validation with symlink resolution. All RawFinding fields are validated before exploit template interpolation. Finding provenance is verified via server-side session store (external callers cannot inject arbitrary findings).

v1.1 Roadmap

Interprocedural taint tracking (call graph construction)
Java and Rust language support
Additional registry patterns (aliased imports, fully-qualified names)
gVisor/Firecracker sandbox option for higher-risk deployments

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.claude		.claude
plans		plans
registries		registries
sandbox		sandbox
src/deep_code_security		src/deep_code_security
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Makefile		Makefile
README.md		README.md
SPIKE.md		SPIKE.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

deep-code-security

Quick Start

Architecture

Static Analysis (SAST)

Dynamic Analysis (Fuzzing)

Supported Languages (v1)

Installation

MCP Configuration

Optional Environment Variables

MCP Tools

Confidence Scoring

Sandbox Security Policy

Development

Registry Format

Known Limitations (v1)

Security Model

v1.1 Roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

deep-code-security

Quick Start

Architecture

Static Analysis (SAST)

Dynamic Analysis (Fuzzing)

Supported Languages (v1)

Installation

MCP Configuration

Optional Environment Variables

MCP Tools

Confidence Scoring

Sandbox Security Policy

Development

Registry Format

Known Limitations (v1)

Security Model

v1.1 Roadmap

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages