Agent Behavior Development (ABD) is a software development paradigm that treats agent behavior as the primary engineering artifact.
In ABD:
- Code is not the first-class citizen.
- Prompts, guardrails, output formats, review rules, and recycle loops are.
This repository does not provide tools. It defines a development paradigm, not an SDK.
Agent Behavior Development (ABD) is a paradigm where humans design and evolve the behavior of agents, and code is a byproduct validated by evidence.
graph TB
subgraph Planning
A[Product Intent or Sprint Goal]
B[Notlanot Sprint Plan with Personas]
C[Task Packets]
D[Conflict Inputs]
E[Conflict Scoring and Replan]
end
subgraph PromptOps
F[Prompt Library]
G[Prompt Compiler]
H[Guardrails]
I[Output Format Contract]
end
subgraph Execution
J[Agent Run]
K[Structured Output]
L[Evidence Production]
end
subgraph ReviewRecycle
M[Scorecard]
N[Tags and Anti Patterns]
O[Prompt Patch Rules]
P[Promote to Library]
Q[Recycle Outputs]
R[Changelog and Governance]
end
A --> B
B --> C
C --> D
D --> E
E --> G
F --> G
G --> H
H --> I
I --> J
J --> K
K --> L
K --> M
L --> M
M --> N
N --> O
O --> J
O --> P
P --> F
M --> Q
Q --> R
R --> B
style F fill:#9C27B0,color:#fff
style M fill:#FF9800,color:#fff
style O fill:#F44336,color:#fff
style L fill:#4CAF50,color:#fff
style I fill:#1565C0,color:#fff
Traditional development paradigms assume:
- Humans write specifications.
- Humans write tests.
- Humans write most of the code.
- Tools assist.
That assumption no longer holds.
Modern agents:
- Propose behaviors
- Generate test strategies
- Produce implementation drafts
Yet teams still:
- Debug code instead of behavior
- Retry prompts instead of fixing agent constraints
- Treat prompt failures as one-off accidents
ABD exists to systematize agent behavior improvement.
The defining shift of ABD is simple:
The primary refactoring target is agent behavior, not code.
In ABD, when output quality is poor:
- Fix the prompt
- Tighten guardrails
- Lock output formats
- Improve review rules
- Strengthen recycle logic
Only then adjust code.
ABD is:
- Agent-first, not agent-autonomous
- Behavior-driven, not prompt-driven
- Evidence-oriented, not test-obsessed
- Governance-heavy, not “just try again”
ABD introduces:
- Prompt-as-artifact discipline
- Behavior scorecards
- Mandatory recycle outputs
- Conflict-aware task planning
- Agent behavior retrospectives
ABD is NOT:
- AI-assisted TDD
- BDD automation
- “Let the agent do everything”
- Prompt engineering tips
- A replacement for Scrum
ABD complements Scrum by adding an agent behavior governance layer.
| Aspect | TDD | BDD | ABD |
|---|---|---|---|
| Who defines behavior | Human | Human | Agent (under rules) |
| Primary artifact | Test | Scenario | Prompt + Guardrails |
| What is refactored | Code | Code | Agent behavior |
| Failure resolution | Rewrite code | Rewrite scenario | Patch prompt/flow |
| Evidence | Tests | Scenarios | Tests, checks, demos |
| Paradigm focus | Code correctness | Business behavior | Behavior production system |
TDD and BDD are techniques in ABD, not paradigms.
- Agent output must follow a fixed format.
- Evidence must be proposed before implementation.
- Assumptions must be explicit.
- Agents may ask up to 3 questions and must stop.
- Every task must produce recycle outputs.
- Behavior changes must be versioned.
MANIFESTO.md→ Hard rules of the paradigm0_introduction/→ What ABD is and is not1_core/→ Principles and artifacts2_process/→ Lifecycle, sprint and task models3_promptops/→ Prompt operations and metrics4_templates/→ Ready-to-use templates5_examples/→ End-to-end worked examples
Day 1:
- Read MANIFESTO
- Use the core prompt template for one task
- Score the output
Day 2–3:
- Run 3 more tasks
- Tag recurring failures
Day 4:
- Patch the prompt
- Document behavior improvement
Day 5:
- Sprint review focused on agent behavior, not features