v0.8.59 execution roadmap: promoted provider, sub-agent, workflow, docs, and localization work

## Problem

The v0.8.59 release line now includes work that was originally staged for v0.8.60: provider/model correctness, sub-agent architecture, WhaleFlow workflow authoring, README/site localization, and broader cleanup. Without an execution roadmap, this turns into a flat backlog where an agent can spend time on the wrong layer or accidentally start several heavy sub-agent/workflow changes before the current TUI reliability problems are contained.

This issue is the execution roadmap for the promoted v0.8.59 push.

## Promotion note

On 2026-06-11, the open `v0.8.60` issues were promoted to `v0.8.59` and retitled where they had an explicit v0.8.60 prefix:

- #3097 TypeScript/JavaScript workflow authoring for WhaleFlow-backed headless agents
- #3096 Headless sub-agent worker runtime with lightweight TUI projections
- #3093 Korean, Spanish, and Brazilian Portuguese README + website locales
- #3092 Russian README + website localization with Cyrillic QA
- #3091 Website parity with existing Japanese and Vietnamese README locales
- #3090 README + website localization matrix, source text, and drift checks
- #3087 README rewrite with DeepSeek-TUI history, provider map, and factual onboarding
- #3086 Unified context-budget service for windows, output caps, compaction, and UI pressure
- #3085 Provider+model SKU pricing engine with provenance and UI integration
- #3084 Provider adapter contracts and conformance tests
- #3082 Workflow-style grouping for background tasks and Bash runs
- #3079 Reliable `web_search` via SearXNG JSON backend, health checks, and visible status
- #1310 First-party MiniMax provider route

This promotion does not mean every item is an equal release blocker. It means the v0.8.59 agent plan should pull these capability foundations forward and execute them in a sane order.

## Operating rules for implementation agents

- Start with user-visible reliability and invalid defaults before architecture.
- Keep PRs narrow: one track or one coherent slice per branch.
- Do not launch large parallel sub-agent fanout until #3095 and #3080 have a recovery/backpressure path.
- README work belongs in #3087; do not rewrite README opportunistically in unrelated PRs.
- Workflow authoring in #3097 must lower into typed WhaleFlow IR and Rust-owned execution; do not introduce arbitrary unsupervised Node/JS side effects.
- When a broad issue cannot be completed in one release-sized PR, land a documented contract, tests, and a compatibility path rather than a half-migration.
- Every PR should update the relevant issue with verification commands and remaining follow-up.

## Roadmap

### Track 0: Contain current v0.8.59 reliability and TUI breakage

Goal: make the app stop feeling stuck, misleading, or computer-heavy during normal usage.

Primary issues:

- #3063 release tracker
- #3095 sub-agent fanout stuck in provider wait
- #3080 interrupted sub-agents leave stale UI/task handles
- #3088 sidebar hover detail popovers dropped during live turns
- #3067 raw SGR mouse reports corrupt composer input
- #3065 sidebar right-click shows generic Paste
- #3078 auto-clear completed sub-agent cards
- #3077 resizable/collapsible sidebar controls
- #1190 task status unclear/stuck
- #1679 Windows SSE multi-agent timeout and UI corruption

Exit criteria:

- [ ] Long provider waits show actionable state, timeout, cancel, retry, and queued/running counts.
- [ ] Interrupted/cancelled sub-agents cannot leave stale running handles or fanout cards.
- [ ] Mouse capture, hover popovers, context menus, and composer input are verified together.
- [ ] Activity/sidebar cards do not grow into an unreadable wall during common multi-agent work.

### Track 1: Make provider/model routing factual and API-backed

Goal: stop shipping invalid model IDs, stale provider claims, unknown pricing, and hard-coded model facts.

Primary issues:

- #3094 invalid OpenRouter Nemotron preset
- #3071 model metadata registry
- #3072 provider API hydration with offline cache/provenance
- #3073 hard-coded model list audit/migration
- #3075 provider-aware model picker search
- #3083 provider dashboard/readiness surface
- #3084 provider adapter contracts and conformance tests
- #3085 SKU pricing engine
- #3066 non-DeepSeek cost tracking
- #3070 OpenAI Codex/ChatGPT context metadata
- #3076 provider ordering cleanup
- #1310 first-party MiniMax provider
- #2574 provider fallback chain
- #3079 reliable `web_search` backend/status

Suggested sequence:

1. Fix invalid presets and obvious route breakage first: #3094, #3064, #3079.
2. Establish a single registry/provenance model: #3071, #3072, #3073.
3. Add provider conformance and readiness: #3084, #3083, #3075.
4. Add pricing and context-window correctness: #3085, #3066, #3070.
5. Add/verify MiniMax and fallback behavior: #1310, #2574.

Exit criteria:

- [ ] CodeWhale does not advertise OpenRouter model IDs absent from OpenRouter model metadata.
- [ ] Model picker, routing, docs, prompt facts, context windows, and pricing read from one documented metadata path or an explicitly generated cache.
- [ ] Unknown pricing/context states are displayed as unknown, not silently guessed.
- [ ] Provider readiness explains auth, endpoint, capabilities, model availability, and last metadata refresh.

### Track 2: Unify context budgets, compaction, and stream pressure

Goal: make context/window pressure observable and consistent across model metadata, compaction, output caps, and UI state.

Primary issues:

- #3086 unified context-budget service
- #3070 context-window metadata
- #1120 cache hit problems
- #1060 stream stalled for 90s
- #861 thinking collapse/truncation/drop behavior

Exit criteria:

- [ ] Context windows, output caps, compaction thresholds, and UI pressure indicators share one budget service.
- [ ] Stalls/truncation/collapse paths expose clear user state and preserve model-readable recovery state.
- [ ] Tests cover small-window, large-window, unknown-window, compaction, and stalled-stream paths.

### Track 3: Renovate sub-agents into headless workers and workflow orchestration

Goal: move from ad hoc fanout/TUI-shaped sub-agents to scheduler-owned workers, compact UI projections, and authorable workflows.

Primary issues:

- #3095 immediate stuck fanout recovery
- #3080 interrupted sub-agent cleanup
- #3096 headless worker runtime
- #3082 background task grouping/workflow summaries
- #3097 TypeScript/JavaScript workflow authoring over WhaleFlow IR
- #1917 universal hook layer for cancel/pause/resume
- #2791 modular command dispatch

Suggested sequence:

1. Add scheduler-visible states and backpressure around current sub-agent fanout: queued, starting, model_wait, running_tool, completed, failed, cancelled, interrupted.
2. Define `AgentWorkerSpec` / `AgentWorkerEvent` independent of Ratatui widgets.
3. Make TUI cards/sidebar consume worker events as a projection only.
4. Connect WhaleFlow leaves to the same worker contract rather than building a second scheduler.
5. Add JS/TS workflow authoring as a compile/lowering layer into typed WhaleFlow IR, not as an unsupervised Node runtime.

Exit criteria:

- [ ] A headless test can run worker lifecycle paths without constructing TUI cards.
- [ ] The user sees compact workflow progress rather than raw nested sub-agent rows.
- [ ] Cancellation/backpressure works before, during, and after model wait.
- [ ] JS/TS-authored workflows validate as typed IR before any worker launches.

### Track 4: Clean up user-facing naming, legacy compatibility, and stale issue load

Goal: make the project look coherent after the DeepSeek-TUI -> CodeWhale rename without breaking compatibility.

Primary issues:

- #3081 show shell execution as Bash, keep `exec_shell` internal
- #3069 rename DeepSeek color names in consumer code
- #3068 audit/document `.deepseek/` compatibility paths
- #3058 clarify `.codewhale` per-folder behavior
- #3089 stale issue cleanup and duplicate pre-rename backlog
- #2766 UI refactor needed

Exit criteria:

- [ ] Normal UI does not expose implementation/tool IDs where a user-facing concept exists.
- [ ] Legacy DeepSeek paths are documented as compatibility, migrated, or intentionally retained.
- [ ] Stale issue automation exists before closing old issues in bulk.

### Track 5: Docs, website, README, and localization endcap

Goal: make the public story match the product: DeepSeek-TUI history, CodeWhale rename, provider graph, supported/experimental/self-hosted routes, and real multilingual onboarding.

Primary issues:

- #3087 README rewrite
- #3090 localization matrix, source text, drift checks
- #3091 website parity with Japanese and Vietnamese README locales
- #3092 Russian README + website localization
- #3093 Korean, Spanish, and Brazilian Portuguese locales
- #3068 legacy path docs
- #1118 configured Chinese but thinking/status remains English
- #683 force model reasoning language request

Exit criteria:

- [ ] README and website share a source localization matrix.
- [ ] Locales are grouped by support tier and verified for links, install commands, provider names, and screenshots/media references.
- [ ] Russian Cyrillic rendering is checked in README and website contexts.
- [ ] User-facing language settings affect app chrome/status where feasible, and model-language limitations are documented honestly.

## Recommended first agent packet

If a single implementation agent starts now, execute in this order:

1. #3095 + #3080: contain stuck sub-agent/provider-wait behavior.
2. #3094: fix invalid OpenRouter model metadata immediately.
3. #3088 + #3067 + #3065: finish the mouse/sidebar input cleanup cluster.
4. #3071 + #3072 + #3073: establish model metadata source of truth.
5. #3084 + #3083 + #3075: provider conformance and user-facing readiness.
6. #3086: context budget service, using metadata from the provider/model work.
7. #3096 + #3082: worker event contract and compact workflow projections.
8. #3097: JS/TS workflow authoring layer lowered into WhaleFlow IR.
9. #3087 + #3090-#3093: README/site/localization endcap.
10. #3089: stale issue cleanup policy before any broad closure pass.

## Verification

- `gh issue list --state open --label v0.8.60` should remain empty unless maintainers intentionally reopen the v0.8.60 line.
- `gh issue list --state open --label v0.8.59` should show all promoted issues and active release work.
- For code PRs: run the narrow package tests named in each linked issue, plus `cargo test -p codewhale-tui` or `cargo test -p codewhale-whaleflow --locked` when touching those areas.
- For docs/localization PRs: verify README links, website route links, provider/model naming, locale navigation, and install commands.
- For workflow/sub-agent PRs: manually run a multi-agent prompt and verify queued/running/done/cancelled states without an unreadable activity wall.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.8.59 execution roadmap: promoted provider, sub-agent, workflow, docs, and localization work #3098

Problem

Promotion note

Operating rules for implementation agents

Roadmap

Track 0: Contain current v0.8.59 reliability and TUI breakage

Track 1: Make provider/model routing factual and API-backed

Track 2: Unify context budgets, compaction, and stream pressure

Track 3: Renovate sub-agents into headless workers and workflow orchestration

Track 4: Clean up user-facing naming, legacy compatibility, and stale issue load

Track 5: Docs, website, README, and localization endcap

Recommended first agent packet

Verification

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

v0.8.59 execution roadmap: promoted provider, sub-agent, workflow, docs, and localization work #3098

Description

Problem

Promotion note

Operating rules for implementation agents

Roadmap

Track 0: Contain current v0.8.59 reliability and TUI breakage

Track 1: Make provider/model routing factual and API-backed

Track 2: Unify context budgets, compaction, and stream pressure

Track 3: Renovate sub-agents into headless workers and workflow orchestration

Track 4: Clean up user-facing naming, legacy compatibility, and stale issue load

Track 5: Docs, website, README, and localization endcap

Recommended first agent packet

Verification

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions