Dark Factory KB — 16 Expansion Gaps

Critical

3 items · Core infrastructure, zero KB coverage

1 Beads Adoption — Formula-Driven Pipeline Tracking

The factory adopted Beads for task tracking, with formula TOML files as the pipeline source of truth. The "Option B" decision chose formula-driven molecules over ad-hoc bead creation. kelly-web-build.formula.toml and kelly-research.formula.toml define every pipeline step along with its dependencies, agent assignment, and workflow path. The KB has Yegge's Beads concept and a gap analysis, but no article documents how the factory actually uses Beads: how formula TOML works, how spawn-subphase.sh resolves beads, how the bddone wrapper works, or what the dual-write requirement (DONE marker + bead closure) entails.

internal Source: memory/2026-04-27, memory/2026-04-29, AGENTS.md

AGENTS.md (Beads section) · steve-yegge-beads.md · kelly-gas-town-gap-analysis.md

2 Auto-Spawn Chain Protocol (PIP-68) — Hands-Free Pipeline Execution

The --auto-spawn flag on spawn-subphase.sh appends structured metadata (NEXT_STEP, CLOSING_BEAD, PIPELINE_DONE) after the sessions_spawn output. On wake-up after sessions_yield, the Router reads this metadata and chains the next step automatically, without operator intervention. The Kelly handbook Ch7 describes basic spawn/steer/kill; the auto-spawn chain is a significant evolution of that pattern and is documented only in AGENTS.md, not in the KB.
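
The wake-up decision can be sketched as follows. Only the three key names come from the description above; the `KEY=value` line format, the sample values, and the function names `parse_chain_metadata` and `next_action` are assumptions made for illustration, since the actual output format of spawn-subphase.sh is not shown here.

```python
CHAIN_KEYS = {"NEXT_STEP", "CLOSING_BEAD", "PIPELINE_DONE"}


def parse_chain_metadata(output: str) -> dict[str, str]:
    """Collect KEY=value lines whose key is one of the chain metadata keys."""
    meta: dict[str, str] = {}
    for line in output.splitlines():
        key, sep, value = line.strip().partition("=")
        if sep and key in CHAIN_KEYS:
            meta[key] = value.strip()
    return meta


def next_action(meta: dict[str, str]) -> str:
    """Decide what the Router does on wake-up, under the assumed format."""
    if meta.get("PIPELINE_DONE", "").lower() == "true":
        return "stop"
    if "NEXT_STEP" in meta:
        return f"spawn {meta['NEXT_STEP']}"
    return "ask-operator"  # no usable metadata: fall back to a human


if __name__ == "__main__":
    sample = "session started\nNEXT_STEP=2.2-architecture\nCLOSING_BEAD=bead-123\nPIPELINE_DONE=false\n"
    print(next_action(parse_chain_metadata(sample)))
```

Falling back to the operator when no metadata is found is a design choice of this sketch, not a documented behavior.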

internal Source: AGENTS.md, memory/2026-04-27, factory/scripts/spawn-subphase.sh

AGENTS.md (Auto-Spawn Chain section) · factory/scripts/spawn-subphase.sh

3 RALPH Refinements — Lessons from Production Use

Production use added several refinements to the handbook's basic RALPH protocol: (1) QA agents must test the full CRUD cycle, not just READ (discovered when EDIT and DELETE were found never to have been tested); (2) Lessons must go into agent skill files, not just SELF_IMPROVEMENT.md, because the Router's notes don't propagate to sub-agents automatically; (3) Parallel pipeline runs validated that RALPH works across concurrent pipelines. The existing ralph-protocol.md concept describes only the handbook version.

internal Source: memory/2026-04-22, memory/2026-04-27

ralph-protocol.md · kelly-handbook-ch7-multi-agent.md

High

6 items · Important patterns or recent external content

4 Operator Control Patterns — Queue, Hold, and Continue

Two patterns give the operator control: (1) Operator Queue (PIP-58): when the Router hits a blocker, it logs the blocker to operator.md via operator-queue.sh add instead of only mentioning it in chat; (2) Operator Hold (PIP-17): hold after {subphase-id} pauses the pipeline at a specific point, and continue {project} resumes it. Neither is documented in the KB, yet both are essential for safe automation because the operator needs control levers.
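
The hold and continue semantics can be sketched as a small state tracker. This is an illustration under assumptions, not the factory's implementation: the `OperatorHold` class, its method names, and the choice to treat a hold point as one-shot and not tied to a project are all hypothetical.

```python
class OperatorHold:
    """Tracks 'hold after <subphase>' requests and which projects are paused."""

    def __init__(self) -> None:
        self.hold_points: set[str] = set()  # subphases to pause after
        self.paused: dict[str, str] = {}    # project -> subphase it paused after

    def hold_after(self, subphase_id: str) -> None:
        self.hold_points.add(subphase_id)

    def may_proceed(self, project: str, finished_subphase: str) -> bool:
        """Called when a subphase finishes; False means the pipeline pauses."""
        if project in self.paused:
            return False
        if finished_subphase in self.hold_points:
            self.hold_points.discard(finished_subphase)  # one-shot hold (assumption)
            self.paused[project] = finished_subphase
            return False
        return True

    def continue_project(self, project: str) -> bool:
        """Release a paused project; returns False if it was not paused."""
        return self.paused.pop(project, None) is not None
```

In this sketch a paused project stays paused until `continue_project` is called, which mirrors the requirement that only the operator can release a hold.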

internal Source: AGENTS.md (Operator Queue + Operator Hold sections)

AGENTS.md (Operator Queue) · AGENTS.md (Operator Hold)

5 Gas Town Cloud — Kilo Partnership and Factory-as-a-Service

Steve Yegge's Gas Town is partnering with Kilo (a model-agnostic cloud platform) to make Gas Town available as a cloud service — not just local. 70+ engineers built factories at the first Gas City Hall workshop. This is a major shift from local-only dark factories to cloud-hosted ones. The existing Gas Town article in the KB predates this development.

external Published: ~May 20, 2026

The New Stack: Gas Town Cloud · steve-yegge-gas-town.md · steve-yegge-gas-city.md

6 Karpathy OmegaWiki — From Knowledge Base to Research Lifecycle

Karpathy's LLM Wiki gist has been extended with OmegaWiki — a full research lifecycle platform. New operations: /discover (venue-ranked reading lists), /ingest (typed entries linked into graph), /ideate + /novelty (ideas with bibliographic grounding), /exp-design → /exp-run → /exp-eval (experiment workflow), /paper-plan → /paper-draft, /poster (conference poster generation), /rebuttal (reviewer response). The original LLM Wiki pattern is in the KB; the OmegaWiki extension is not.

external Published: May 2026

Karpathy LLM Wiki Gist · karpathy-llm-wiki.md

7 LLM Wiki v2 — Confidence Scoring, Supersession, and Production Patterns

Extends Karpathy's pattern with production lessons from building agentmemory (10K GitHub stars). Key additions: (1) Confidence scoring — every fact carries a score based on source count, recency, and contradictions; (2) Supersession — new information explicitly marks old claims as stale with version control; (3) Forge testing — synthetic queries to validate wiki accuracy; (4) What breaks at scale — practical failure modes. Directly applicable to dark-factory-kb maintenance.
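
As a rough illustration of confidence scoring, the sketch below combines the three inputs named above (source count, recency, contradictions) into a single score. The functional form, the 180-day half-life, and the function name `confidence` are assumptions for illustration; this is not the formula used by LLM Wiki v2 or agentmemory.

```python
def confidence(source_count: int, age_days: float, contradictions: int,
               half_life_days: float = 180.0) -> float:
    """Score in [0, 1]; the weights and functional form are assumptions."""
    if source_count <= 0:
        return 0.0
    support = 1.0 - 0.5 ** source_count           # more sources, diminishing returns
    recency = 0.5 ** (age_days / half_life_days)  # halves every half_life_days
    penalty = 1.0 / (1.0 + contradictions)        # each contradiction lowers trust
    return support * recency * penalty


if __name__ == "__main__":
    # Two sources, 180 days old, one contradicting claim.
    print(confidence(source_count=2, age_days=180, contradictions=1))
```

A multiplicative form is used here so that a fact with no sources, or a very old fact, cannot score highly on the strength of the other factors alone.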

external Published: ~May 22, 2026

LLM Wiki v2 Gist · karpathy-gap-analysis.md

8 Coding Agent Economics — Enterprise Pricing Inflection Point

Simon Willison reports that Anthropic and OpenAI have switched enterprise plans from seat-based to API pricing. His own usage comes to the API equivalent of $1,200/month with Anthropic plus $980/month with OpenAI, against a $200 subscription. Companies are shocked at their LLM bills, and Anthropic is rumored to have had its first profitable quarter. April 2026 is an inflection point for enterprise AI spending. This validates dark factory economics: minimizing token usage via compiled KBs, bounded context, and story-by-story spawns is economically essential.

external Published: May 27, 2026

Willison — Product-Market Fit

9 Ch7 Architecture Update — Current Pattern Not Captured as Standalone Article

Ch7 describes "named leads" (research-lead, project-lead, test-lead), a structure the factory has moved past. The current architecture (formula TOML + auto-spawn chains + bead tracking + operator queue) isn't captured as a standalone KB article. Ch7 is not "stale" (it accurately describes what it describes); the issue is that the current pattern needs its own article, because no single article documents how multi-agent orchestration actually works today.

internal Source: practice drift since Ch7 was written

kelly-handbook-ch7-multi-agent.md · kelly-factory-overview.md

Medium

5 items · Important patterns, derivable from existing content

10 Story-by-Story Build — Context-Bounded Implementation

To avoid context overflow in long BUILD phases, the factory spawns one story at a time rather than handing the agent the entire sprint backlog. Each story gets its own subagent session with bounded context. The RALPH concept mentions "break tasks into smaller chunks" as a failure mode fix, but the story-by-story spawn as a default operating pattern is not documented as a standalone article. Sally loops on multi-storyboard spawns; Amelia does 24+ stories in separate sessions.
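
The story-by-story pattern can be sketched as a loop in which each session sees only a shared brief plus a single story. The function `build_story_by_story` and the `spawn_session` callable are hypothetical stand-ins; the factory's real spawn mechanism is not shown here.

```python
from typing import Callable


def build_story_by_story(stories: list[str], shared_brief: str,
                         spawn_session: Callable[[str], str]) -> list[str]:
    """Run one session per story; each prompt holds only the brief and one story."""
    results = []
    for index, story in enumerate(stories, start=1):
        # The prompt never includes the other stories, which keeps context bounded.
        prompt = f"{shared_brief}\n\nStory {index} of {len(stories)}:\n{story}"
        results.append(spawn_session(prompt))
    return results


if __name__ == "__main__":
    # Stand-in for a real subagent spawn: report the prompt size only.
    sizes = build_story_by_story(
        ["login form", "logout button"],
        "Project brief",
        lambda prompt: f"{len(prompt)} chars",
    )
    print(sizes)
```

Because the prompt size depends on the brief and one story only, it stays roughly constant no matter how long the backlog grows.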

internal Source: factory workflow files, memory/2026-04-27

kelly-handbook-ch7-multi-agent.md · kelly-handbook-ch8-memory.md

11 DONE Marker Protocol — Why Dual-Write Matters

Agents must write both a DONE marker file AND call bd update --status in_review. Neither alone is sufficient — Beads tracks work state; DONE markers track artifact output. BUG-01 (DONE marker skip) was caused by agents completing work but forgetting the DONE file, which broke prerequisite checks on subsequent spawns. The dual-write requirement is in AGENTS.md but not in the KB.
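
A completion check that enforces both halves of the dual write could look like the sketch below. The `in_review` status and the DONE file name come from the description above; the function name `missing_writes`, the idea of passing the bead status in as a string, and the `open` status used in the example are assumptions. How the status is read from Beads is deliberately left out.

```python
from pathlib import Path


def missing_writes(done_dir: Path, bead_status: str) -> list[str]:
    """List which half of the dual write is missing; empty means complete."""
    missing = []
    if not (done_dir / "DONE").is_file():
        missing.append("DONE marker")   # artifact output was not recorded
    if bead_status != "in_review":
        missing.append("bead status")   # work state was not updated
    return missing


if __name__ == "__main__":
    # "open" is a hypothetical status for a bead that was never updated.
    print(missing_writes(Path("planning/2.1-prd"), "open"))
```

Returning the list of missing halves, rather than a single boolean, makes a BUG-01 style failure (work done, marker forgotten) visible in the prerequisite check itself.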

internal Source: AGENTS.md, memory/2026-04-27

AGENTS.md — Dual-Write section

12 Formula Authoring Pitfalls — Path Conventions and Validation

kelly-web-build.formula.toml had output_path = "artifacts/planning/2.1-prd/", but actual DONE files go to planning/2.1-prd/DONE (without the artifacts/ prefix). All 30 workflow YAML frontmatters had the same bug in done_dir and prerequisites. Fixed in commit 31f00f4. The lesson for anyone authoring formulas: path conventions must match the actual filesystem layout.
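
A lint step along the following lines could catch this class of mismatch before a pipeline runs. It is a sketch under assumptions: the function name `path_mismatches` and the two input dictionaries (step id to formula `output_path`, step id to the directory where DONE is actually written) are hypothetical, and only the two example paths come from the bug described above.

```python
from pathlib import PurePosixPath


def path_mismatches(output_paths: dict[str, str],
                    done_dirs: dict[str, str]) -> list[str]:
    """Return step ids whose formula output_path differs from the DONE directory."""
    bad = []
    for step_id, output_path in output_paths.items():
        # PurePosixPath ignores trailing slashes, so "a/b/" equals "a/b".
        if PurePosixPath(output_path) != PurePosixPath(done_dirs[step_id]):
            bad.append(step_id)
    return bad


if __name__ == "__main__":
    formula = {"2.1-prd": "artifacts/planning/2.1-prd/"}
    actual = {"2.1-prd": "planning/2.1-prd"}
    print(path_mismatches(formula, actual))
```

Comparing normalized paths avoids false alarms from trailing slashes while still flagging a stray prefix such as artifacts/.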

internal Source: memory/2026-04-27, commit 31f00f4

kelly-web-build.formula.toml

13 Subagent Model Configuration — Allowlists and Provider Routing

Subagent spawns were failing with "model not allowed: fireworks/accounts/fireworks/models/minimax-m2p7" because the fireworks provider wasn't in the agents.defaults.models allowlist. Root cause: the main session could use any model, but subagents were restricted to the allowlist. Workaround: use model=opencode-go/minimax-m2.7 directly instead of agentId-based spawns. This is an important troubleshooting pattern for anyone configuring subagent models.
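
A preflight check before spawning could surface this failure early. The sketch assumes the config has been loaded into a dictionary in which `agents.defaults.models` is a collection of model ids; the exact shape of that entry in openclaw.json, the sample config, and the function name `model_is_allowed` are assumptions.

```python
def model_is_allowed(config: dict, model: str) -> bool:
    """Check a model id against the agents.defaults.models allowlist."""
    allowed = config.get("agents", {}).get("defaults", {}).get("models", [])
    # `in` works whether the allowlist is a list of ids or a mapping keyed by id.
    return model in allowed


if __name__ == "__main__":
    # Hypothetical config contents for illustration.
    config = {"agents": {"defaults": {"models": ["opencode-go/minimax-m2.7"]}}}
    for model in ("opencode-go/minimax-m2.7",
                  "fireworks/accounts/fireworks/models/minimax-m2p7"):
        print(model, "allowed" if model_is_allowed(config, model) else "not allowed")
```

Running such a check in the Router before the spawn call would turn a failed subagent session into an immediate, readable configuration error.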

internal Source: memory/2026-04-29

openclaw.json config

14 Agentic Engineering Patterns Guide — May 2026 Update

Simon Willison's agentic engineering patterns guide has been significantly expanded since the KB ingest (Feb 23, 2026). New sections: "Using Git with coding agents," "Agentic manual testing," "AI should help us produce better code" (personal manifesto), and updates on anti-patterns. The guide now has 20+ patterns vs the ~15 at ingest time. The existing KB article is stale.

external Updated: May 2026

Willison: Agentic Patterns · simon-w-agentic-engineering-patterns.md

Low

2 items · Supplementary or overlapping with existing content

15 Agent Memory Failures — Context Limits and Wiki-Based Solutions

Deep dive on how memory works in Claude Code, Codex CLI, and OpenCode — where it breaks (context limit warnings, forgetting decisions from 3 hours ago), and how Karpathy's LLM Wiki + agentmemory address the problem. Relevant to Kelly's memory architecture but supplementary — overlaps with existing memory concept pages in the KB.

external Published: May 18, 2026

AkitaOnRails: Agent Memory · kelly-handbook-ch8-memory.md

16 KB Meta-Pattern — Lint, Q&A Loop, and Filed-Back Outputs

Carson scored the dark-factory-kb at 19/100 against Karpathy's LLM Wiki pattern. Key gaps: no Q&A loop (KB is write-only), no lint gates, no filed-back query outputs, no auto-generated index. The gap analysis report exists in the KB but the meta-lesson about KB maintenance patterns is not compiled as a concept. Low priority because the gap analysis already covers this.

internal Source: memory/2026-04-27, karpathy-gap-analysis.md

karpathy-gap-analysis.md