Merge pull request 'feat: disinto predictor — daily cron-driven formula (#406)' (#417) from action/issue-406 into main

2026-03-20 19:09:02 +01:00 · 2026-03-20 19:09:02 +01:00 · 4377c0812f
commit 4377c0812f
parent 0cfba943b7 0aa6528709
5 changed files with 380 additions and 79 deletions
--- a/AGENTS.md
+++ b/AGENTS.md
@ -18,9 +18,10 @@ disinto/
 ├── review/        review-poll.sh, review-pr.sh — PR review
 ├── gardener/      gardener-run.sh — files action issue for run-gardener formula
 │                  gardener-poll.sh, gardener-agent.sh — recipe engine + grooming
+├── predictor/     predictor-run.sh — daily cron executor for run-predictor formula
 ├── planner/       planner-run.sh — direct cron executor for run-planner formula
 │                  planner/journal/ — daily raw logs from each planner run
-│                  prediction-poll.sh, prediction-agent.sh — evidence-based predictions
+│                  prediction-poll.sh, prediction-agent.sh — legacy predictor (superseded by predictor/)
 ├── supervisor/    supervisor-poll.sh — health monitoring
 ├── vault/         vault-poll.sh, vault-agent.sh, vault-fire.sh — action gating
 ├── action/        action-poll.sh, action-agent.sh — operational task execution
@ -191,43 +192,54 @@ issues — the planner is a nervous system component, not work.
 - `planner/MEMORY.md` — Persistent memory across runs (committed to git)
 - `planner/journal/*.md` — Daily raw logs from each planner run (committed to git)

-**Future direction**: The [Predictor](#predictor-planner) already reads `evidence/` JSON and files prediction issues for the planner to triage. The next step is evidence-gated deployment (see `docs/EVIDENCE-ARCHITECTURE.md`): replacing human "ship it" decisions with automated gates across dimensions (holdout, red-team, user-test, evolution fitness, protocol metrics, funnel). Not yet implemented.
+**Future direction**: The [Predictor](#predictor-predictor) files prediction issues daily for the planner to triage. The next step is evidence-gated deployment (see `docs/EVIDENCE-ARCHITECTURE.md`): replacing human "ship it" decisions with automated gates across dimensions (holdout, red-team, user-test, evolution fitness, protocol metrics, funnel). Not yet implemented.

 **Environment variables consumed**:
 - `CODEBERG_TOKEN`, `CODEBERG_REPO`, `CODEBERG_API`, `PROJECT_NAME`, `PROJECT_REPO_ROOT`
 - `PRIMARY_BRANCH`, `CLAUDE_MODEL` (set to opus by planner-run.sh)
 - `MATRIX_TOKEN`, `MATRIX_ROOM_ID`, `MATRIX_HOMESERVER`

-### Predictor (`planner/`)
+### Predictor (`predictor/`)

-**Role**: Evidence-based pattern detection (the "goblin"). Reads structured
-JSON from the project's `evidence/` directory (red-team, evolution, user-test,
-holdout, resources, protocol) plus secondary Codeberg signals (recent issues
-and merged PRs) and system resource snapshots. Asks Claude to identify
-staleness, regressions, opportunities, and risks, then files up to 5
+**Role**: Infrastructure pattern detection (the "goblin"). Runs a 3-step
+formula (preflight → collect-signals → analyze-and-predict) via interactive
+tmux Claude session (sonnet). Collects disinto-specific signals: CI pipeline
+trends (Woodpecker), stale issues, agent health (tmux sessions + logs), and
+resource patterns (RAM, disk, load, containers). Files up to 5
 `prediction/unreviewed` issues for the [Planner](#planner-planner) to triage.
-The predictor MUST NOT emit feature work — only observations about evidence
-state, metric trends, and system conditions.
+The predictor MUST NOT emit feature work — only observations about CI health,
+issue staleness, agent status, and system conditions.

-**Trigger**: `prediction-poll.sh` runs hourly via cron. It iterates over all
-`projects/*.toml` files and runs `prediction-agent.sh` for each project.
-Guarded by a global lock (`/tmp/prediction-poll.lock`) and a memory check
-(skips if available RAM < 2000 MB).
+**Trigger**: `predictor-run.sh` runs daily at 06:00 UTC via cron (1h before
+the planner at 07:00). Guarded by PID lock (`/tmp/predictor-run.lock`) and
+memory check (skips if available RAM < 2000 MB).

 **Key files**:
- `planner/prediction-poll.sh` — Cron wrapper: lock, memory guard, iterates projects, calls prediction-agent.sh per project
- `planner/prediction-agent.sh` — Scans `evidence/` subdirectories for latest + previous JSON, fetches recent Codeberg activity, collects system resource snapshot, builds prompt, invokes `claude -p --model sonnet` (one-shot), parses JSON output lines, creates `prediction/unreviewed` issues on Codeberg, notifies Matrix
+- `predictor/predictor-run.sh` — Cron wrapper + orchestrator: lock, memory guard,
+  sources disinto project config, builds prompt with formula + Codeberg API
+  reference, creates tmux session (sonnet), monitors phase file, handles crash
+  recovery via `run_formula_and_monitor`
+- `formulas/run-predictor.toml` — Execution spec: three steps (preflight,
+  collect-signals, analyze-and-predict) with `needs` dependencies. Claude
+  collects signals and files prediction issues in a single interactive session
+
+**Supersedes**: The legacy predictor (`planner/prediction-poll.sh` +
+`planner/prediction-agent.sh`) used `claude -p` one-shot, read `evidence/`
+JSON, and ran hourly. This formula-based predictor replaces it with direct
+CI/issues/logs signal collection and interactive Claude sessions, matching the
+planner's tmux+formula pattern.

 **Environment variables consumed**:
 - `CODEBERG_TOKEN`, `CODEBERG_REPO`, `CODEBERG_API`, `PROJECT_NAME`, `PROJECT_REPO_ROOT`
- `CLAUDE_TIMEOUT` — Max seconds for the Claude invocation (default 7200)
+- `PRIMARY_BRANCH`, `CLAUDE_MODEL` (set to sonnet by predictor-run.sh)
+- `WOODPECKER_TOKEN`, `WOODPECKER_SERVER` — CI pipeline trend queries (optional; skipped if unset)
 - `MATRIX_TOKEN`, `MATRIX_ROOM_ID`, `MATRIX_HOMESERVER` — Notifications (optional)

-**Lifecycle**: prediction-poll.sh (hourly cron) → lock + memory guard →
-for each project TOML: prediction-agent.sh → scan `evidence/` →
-`claude -p --model sonnet` → parse JSON predictions → create
-`prediction/unreviewed` issues → Matrix notification. The planner's Phase 1.5
-later triages these predictions into action/backlog issues or dismisses them.
+**Lifecycle**: predictor-run.sh (daily 06:00 cron) → lock + memory guard →
+load formula + context → create tmux session → Claude collects signals
+(CI trends, stale issues, agent health, resources) → dedup against existing
+open predictions → file `prediction/unreviewed` issues → `PHASE:done`.
+The planner's Phase 1 later triages these predictions.

 ### Action (`action/`)

@ -296,7 +308,7 @@ sourced as needed.
 | `lib/load-project.sh` | Parses a `projects/*.toml` file into env vars (`PROJECT_NAME`, `CODEBERG_REPO`, `WOODPECKER_REPO_ID`, monitoring toggles, Matrix config, etc.). | env.sh (when `PROJECT_TOML` is set), supervisor-poll (per-project iteration) |
 | `lib/parse-deps.sh` | Extracts dependency issue numbers from an issue body (stdin → stdout, one number per line). Matches `## Dependencies` / `## Depends on` / `## Blocked by` sections and inline `depends on #N` patterns. Not sourced — executed via `bash lib/parse-deps.sh`. | dev-poll, supervisor-poll |
 | `lib/matrix_listener.sh` | Long-poll Matrix sync daemon. Dispatches thread replies to the correct agent via well-known files (`/tmp/{agent}-escalation-reply`). Handles supervisor, gardener, dev, review, vault, and action reply routing. Run as systemd service. | Standalone daemon |
-| `lib/formula-session.sh` | `acquire_cron_lock()`, `check_memory()`, `load_formula()`, `build_context_block()`, `start_formula_session()`, `formula_phase_callback()` — shared helpers for formula-driven cron agents (lock, memory guard, formula loading, tmux session, crash recovery). | planner-run.sh |
+| `lib/formula-session.sh` | `acquire_cron_lock()`, `check_memory()`, `load_formula()`, `build_context_block()`, `start_formula_session()`, `formula_phase_callback()`, `build_prompt_footer()`, `run_formula_and_monitor()` — shared helpers for formula-driven cron agents (lock, memory guard, formula loading, prompt assembly, tmux session, monitor loop, crash recovery). | planner-run.sh, predictor-run.sh |
 | `lib/file-action-issue.sh` | `file_action_issue()` — dedup check, label lookup, and issue creation for formula-driven cron wrappers. Sets `FILED_ISSUE_NUM` on success. | gardener-run.sh |
 | `lib/agent-session.sh` | Shared tmux + Claude session helpers: `create_agent_session()`, `inject_formula()`, `agent_wait_for_claude_ready()`, `agent_inject_into_session()`, `agent_kill_session()`, `monitor_phase_loop()`, `read_phase()`. `create_agent_session(session, workdir, [phase_file])` optionally installs a PostToolUse hook (matcher `Bash\|Write`) that detects phase file writes in real-time — when Claude writes to the phase file, the hook writes a marker so `monitor_phase_loop` reacts on the next poll instead of waiting for mtime changes. Also installs a StopFailure hook (matcher `rate_limit\|server_error\|authentication_failed\|billing_error`) that writes `PHASE:failed` with an `api_error` reason to the phase file and touches the phase-changed marker, so the orchestrator discovers API errors within one poll cycle instead of waiting for idle timeout. When `MATRIX_THREAD_ID` is exported, also installs a Stop hook (`on-stop-matrix.sh`) that streams each Claude response to the Matrix thread. `monitor_phase_loop` sets `_MONITOR_LOOP_EXIT` to one of: `done`, `idle_timeout`, `idle_prompt` (Claude returned to `❯` for 3 consecutive polls without writing any phase — callback invoked with `PHASE:failed`, session already dead), `crashed`, or a `PHASE:*` string. Agents must handle `idle_prompt` in both their callback and their post-loop exit handler. | dev-agent.sh, gardener-agent.sh, action-agent.sh |

@ -331,7 +343,7 @@ Issues flow through these states:
 | `tech-debt` | Pre-existing issue flagged by AI reviewer, not introduced by a PR. | review-pr.sh (auto-created follow-ups) |
 | `underspecified` | Dev-agent refused the issue as too large or vague. | dev-poll.sh (on preflight `too_large`), dev-agent.sh (on mid-run `too_large` refusal) |
 | `vision` | Goal anchors — high-level objectives from VISION.md. | Planner, humans |
-| `prediction/unreviewed` | Unprocessed prediction filed by predictor. | prediction-agent.sh |
+| `prediction/unreviewed` | Unprocessed prediction filed by predictor. | predictor-run.sh |
 | `prediction/backlog` | Prediction triaged as WATCH — not urgent, tracked. | Planner (triage-predictions step) |
 | `prediction/actioned` | Prediction promoted or dismissed by planner. | Planner (triage-predictions step) |
 | `action` | Operational task for the action-agent to execute via formula. | Planner, humans |
--- a/formulas/run-predictor.toml
+++ b/formulas/run-predictor.toml
@ -0,0 +1,187 @@
+# formulas/run-predictor.toml — Predictor formula (disinto-specific signals)
+#
+# Executed by predictor/predictor-run.sh via cron — no action issues.
+# predictor-run.sh creates a tmux session with Claude (sonnet) and injects
+# this formula as context. Claude executes all steps autonomously.
+#
+# Steps: preflight → collect-signals → analyze-and-predict
+#
+# Disinto-specific signal sources:
+#   - CI pipeline trends (Woodpecker)
+#   - Stale issues (open issues with no recent activity)
+#   - Agent health (tmux sessions, recent logs)
+#   - Resource patterns (RAM, disk, load, containers)
+
+name        = "run-predictor"
+description = "Evidence-based prediction: CI trends, stale issues, agent health, resource patterns"
+version     = 1
+model       = "sonnet"
+
+[context]
+files = ["AGENTS.md", "RESOURCES.md"]
+
+[[steps]]
+id    = "preflight"
+title = "Pull latest code and gather environment"
+description = """
+Set up the working environment for this prediction run.
+
+1. Change to the project repository:
+     cd "$PROJECT_REPO_ROOT"
+
+2. Pull the latest code:
+     git fetch origin "$PRIMARY_BRANCH" --quiet
+     git checkout "$PRIMARY_BRANCH" --quiet
+     git pull --ff-only origin "$PRIMARY_BRANCH" --quiet
+"""
+
+[[steps]]
+id    = "collect-signals"
+title = "Collect disinto-specific signals"
+description = """
+Gather raw signal data for pattern analysis. Collect each signal category
+and store the results for the analysis step.
+
+### 1. CI pipeline trends (Woodpecker)
+
+Fetch recent builds from Woodpecker CI:
+  curl -sf -H "Authorization: Bearer $WOODPECKER_TOKEN" \
+    "${WOODPECKER_SERVER}/api/repos/${WOODPECKER_REPO_ID}/pipelines?page=1&perPage=20"
+
+Look for:
+- Build failure rate over last 20 builds
+- Repeated failures on the same step
+- Builds stuck in running/pending state
+- Time since last successful build
+
+If WOODPECKER_TOKEN or WOODPECKER_SERVER are not set, skip CI signals and note
+"CI signals unavailable — WOODPECKER_TOKEN not configured".
+
+### 2. Stale issues
+
+Fetch all open issues:
+  curl -sf -H "Authorization: token $CODEBERG_TOKEN" \
+    "$CODEBERG_API/issues?state=open&type=issues&limit=50&sort=updated&direction=asc"
+
+Identify:
+- Issues with no update in 14+ days (stale)
+- Issues with no update in 30+ days (very stale)
+- Issues labeled 'action' or 'backlog' that are stale (work not progressing)
+- Blocked issues where the blocker may have been resolved
+
+### 3. Agent health
+
+Check active tmux sessions:
+  tmux list-sessions 2>/dev/null || echo "no sessions"
+
+Check recent agent logs (last 24h of activity):
+  for log in supervisor/supervisor.log planner/planner.log planner/prediction.log \
+             gardener/gardener.log dev/dev.log review/review.log; do
+    if [ -f "$PROJECT_REPO_ROOT/$log" ]; then
+      echo "=== $log (last 20 lines) ==="
+      tail -20 "$PROJECT_REPO_ROOT/$log"
+    fi
+  done
+
+Look for:
+- Agents that haven't run recently (missing log entries in last 24h)
+- Repeated errors or failures in logs
+- Sessions stuck or crashed (tmux sessions present but no recent activity)
+- Lock files that may be stale: /tmp/*-poll.lock, /tmp/*-run.lock
+
+### 4. Resource patterns
+
+Collect current resource state:
+  free -m                          # RAM
+  df -h /                          # Disk
+  cat /proc/loadavg                # Load average
+  docker ps --format '{{.Names}} {{.Status}}' 2>/dev/null || true  # Containers
+
+Look for:
+- Available RAM < 2000MB (agents will skip runs)
+- Disk usage > 80% (approaching danger zone)
+- Load average > 3.0 (box overloaded)
+- Containers in unhealthy or restarting state
+
+### 5. Already-open predictions (deduplication)
+
+Fetch existing open predictions to avoid duplicates:
+  curl -sf -H "Authorization: token $CODEBERG_TOKEN" \
+    "$CODEBERG_API/issues?state=open&type=issues&labels=prediction%2Funreviewed&limit=50"
+
+Also check prediction/backlog (watched but not yet actioned):
+  curl -sf -H "Authorization: token $CODEBERG_TOKEN" \
+    "$CODEBERG_API/issues?state=open&type=issues&labels=prediction%2Fbacklog&limit=50"
+
+Record their titles so you can avoid duplicating them.
+"""
+needs = ["preflight"]
+
+[[steps]]
+id    = "analyze-and-predict"
+title = "Analyze signals and file prediction issues"
+description = """
+Analyze the collected signals for patterns and file up to 5 prediction issues.
+
+## What to look for
+
+**CI regression** — Build failure rate increasing or repeated failures:
+- Failure rate > 30% over last 20 builds → high confidence
+- Same step failing 3+ times in a row → high confidence
+- No successful build in 24+ hours → medium confidence
+
+**Stale work** — Issues not progressing:
+- Action issues stale 7+ days → the action agent may be stuck
+- Backlog issues stale 14+ days → work not being picked up
+- Blocked issues whose blockers are now closed → can be unblocked
+
+**Agent health** — Agents not running or failing:
+- Agent log with no entries in 24+ hours → agent may be down
+- Repeated errors in agent logs → systemic problem
+- Stale lock files (process not running but lock exists)
+
+**Resource pressure** — System approaching limits:
+- RAM < 2000MB → agents will start skipping runs
+- Disk > 80% → approaching critical threshold
+- Load sustained > 3.0 → box is overloaded, queued work backing up
+
+**Opportunity** — Good conditions for expensive work:
+- Box idle (RAM > 3000MB, load < 1.0, few active sessions) → good time
+  for expensive operations if any are pending
+
+## Filing predictions
+
+For each prediction, create a Codeberg issue with the `prediction/unreviewed` label.
+
+1. Look up the label ID:
+     curl -sf -H "Authorization: token $CODEBERG_TOKEN" \
+       "$CODEBERG_API/labels" | jq '.[] | select(.name == "prediction/unreviewed") | .id'
+
+2. For each prediction, create an issue:
+     curl -sf -X POST -H "Authorization: token $CODEBERG_TOKEN" \
+       -H "Content-Type: application/json" \
+       "$CODEBERG_API/issues" \
+       -d '{"title":"<title>","body":"<body>","labels":[<label_id>]}'
+
+   Body format:
+     <2-4 sentence description of what was observed, why it matters,
+      what the planner should consider>
+
+     ---
+     **Signal source:** <which signal triggered this>
+     **Confidence:** <high|medium|low>
+     **Suggested action:** <concrete next step for the planner>
+
+3. Send a Matrix notification for each prediction created (optional):
+     Use matrix_send if available, or skip if MATRIX_TOKEN is not set.
+
+## Rules
+- Max 5 predictions total
+- Do NOT predict feature work — only infrastructure/health/metric observations
+- Do NOT duplicate existing open predictions (checked in collect-signals)
+- Be specific: name the metric, the value, the threshold
+- Prefer high-confidence predictions backed by concrete data
+- If no meaningful patterns found, file zero issues — that is a valid outcome
+
+"""
+needs = ["collect-signals"]
--- a/lib/formula-session.sh
+++ b/lib/formula-session.sh
@ -10,6 +10,8 @@
 #   load_formula        FORMULA_FILE       — sets FORMULA_CONTENT
 #   build_context_block FILE [FILE ...]    — sets CONTEXT_BLOCK
 #   start_formula_session SESSION WORKDIR PHASE_FILE — create tmux + claude
+#   build_prompt_footer    [EXTRA_API]      — sets PROMPT_FOOTER (API ref + env + phase)
+#   run_formula_and_monitor AGENT [TIMEOUT] — session start, inject, monitor, log
 #   formula_phase_callback PHASE           — standard crash-recovery callback
 #
 # Requires: lib/agent-session.sh sourced first (for create_agent_session,
@ -123,3 +125,78 @@ formula_phase_callback() {
      ;;
  esac
 }
+
+# ── Prompt + monitor helpers ──────────────────────────────────────────────
+
+# build_prompt_footer [EXTRA_API_LINES]
+# Assembles the common Codeberg API reference + environment + phase protocol
+# block for formula prompts.  Sets PROMPT_FOOTER.
+# Pass additional API endpoint lines (pre-formatted, newline-prefixed) via $1.
+# Requires globals: CODEBERG_API, FACTORY_ROOT, PROJECT_REPO_ROOT,
+#                   PRIMARY_BRANCH, PHASE_FILE.
+build_prompt_footer() {
+  local extra_api="${1:-}"
+  # shellcheck disable=SC2034  # consumed by the calling script's PROMPT
+  PROMPT_FOOTER="## Codeberg API reference
+Base URL: ${CODEBERG_API}
+Auth header: -H \"Authorization: token \$CODEBERG_TOKEN\"
+  Read issue:  curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" '${CODEBERG_API}/issues/{number}' | jq '.body'
+  Create issue: curl -sf -X POST -H \"Authorization: token \$CODEBERG_TOKEN\" -H 'Content-Type: application/json' '${CODEBERG_API}/issues' -d '{\"title\":\"...\",\"body\":\"...\",\"labels\":[LABEL_ID]}'${extra_api}
+  List labels: curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" '${CODEBERG_API}/labels'
+NEVER echo or include the actual token value in output — always reference \$CODEBERG_TOKEN.
+
+## Environment
+FACTORY_ROOT=${FACTORY_ROOT}
+PROJECT_REPO_ROOT=${PROJECT_REPO_ROOT}
+PRIMARY_BRANCH=${PRIMARY_BRANCH}
+PHASE_FILE=${PHASE_FILE}
+
+## Phase protocol (REQUIRED)
+When all work is done:
+  echo 'PHASE:done' > '${PHASE_FILE}'
+On unrecoverable error:
+  printf 'PHASE:failed\nReason: %s\n' 'describe error' > '${PHASE_FILE}'"
+}
+
+# run_formula_and_monitor AGENT_NAME [TIMEOUT]
+# Starts the formula session, injects PROMPT, monitors phase, and logs result.
+# Requires globals: SESSION_NAME, PHASE_FILE, PROJECT_REPO_ROOT, PROMPT,
+#                   CODEBERG_REPO, CLAUDE_MODEL (exported).
+# shellcheck disable=SC2154  # SESSION_NAME, PHASE_FILE, PROJECT_REPO_ROOT, PROMPT set by caller
+run_formula_and_monitor() {
+  local agent_name="$1"
+  local timeout="${2:-7200}"
+
+  if ! start_formula_session "$SESSION_NAME" "$PROJECT_REPO_ROOT" "$PHASE_FILE"; then
+    exit 1
+  fi
+
+  agent_inject_into_session "$SESSION_NAME" "$PROMPT"
+  log "Prompt sent to tmux session"
+  matrix_send "$agent_name" "${agent_name^} session started for ${CODEBERG_REPO}" 2>/dev/null || true
+
+  log "Monitoring phase file: ${PHASE_FILE}"
+  _FORMULA_CRASH_COUNT=0
+
+  monitor_phase_loop "$PHASE_FILE" "$timeout" "formula_phase_callback"
+
+  FINAL_PHASE=$(read_phase "$PHASE_FILE")
+  log "Final phase: ${FINAL_PHASE:-none}"
+
+  if [ "$FINAL_PHASE" != "PHASE:done" ]; then
+    case "${_MONITOR_LOOP_EXIT:-}" in
+      idle_prompt)
+        log "${agent_name}: Claude returned to prompt without writing phase signal"
+        ;;
+      idle_timeout)
+        log "${agent_name}: timed out with no phase signal"
+        ;;
+      *)
+        log "${agent_name} finished without PHASE:done (phase: ${FINAL_PHASE:-none}, exit: ${_MONITOR_LOOP_EXIT:-})"
+        ;;
+    esac
+  fi
+
+  matrix_send "$agent_name" "${agent_name^} session finished (${FINAL_PHASE:-no phase})" 2>/dev/null || true
+  log "--- ${agent_name^} run done ---"
+}
--- a/planner/planner-run.sh
+++ b/planner/planner-run.sh
@ -24,6 +24,7 @@ source "$FACTORY_ROOT/lib/agent-session.sh"
 source "$FACTORY_ROOT/lib/formula-session.sh"

 LOG_FILE="$SCRIPT_DIR/planner.log"
+# shellcheck disable=SC2034  # consumed by run_formula_and_monitor
 SESSION_NAME="planner-${PROJECT_NAME}"
 PHASE_FILE="/tmp/planner-session-${PROJECT_NAME}.phase"

@ -53,6 +54,13 @@ $(cat "$MEMORY_FILE")
 fi

 # ── Build prompt ─────────────────────────────────────────────────────────
+build_prompt_footer "
+  Relabel:     curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" -X PUT -H 'Content-Type: application/json' '${CODEBERG_API}/issues/{number}/labels' -d '{\"labels\":[LABEL_ID]}'
+  Comment:     curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" -X POST -H 'Content-Type: application/json' '${CODEBERG_API}/issues/{number}/comments' -d '{\"body\":\"...\"}'
+  Close:       curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" -X PATCH -H 'Content-Type: application/json' '${CODEBERG_API}/issues/{number}' -d '{\"state\":\"closed\"}'
+"
+
+# shellcheck disable=SC2034  # consumed by run_formula_and_monitor
 PROMPT="You are the strategic planner for ${CODEBERG_REPO}. Work through the formula below. You MUST write PHASE:done to '${PHASE_FILE}' when finished — the orchestrator will time you out if you return to the prompt without signalling.

 ## Project context
@ -61,60 +69,8 @@ ${CONTEXT_BLOCK}${MEMORY_BLOCK}
 ## Formula
 ${FORMULA_CONTENT}

-## Codeberg API reference
-Base URL: ${CODEBERG_API}
-Auth header: -H \"Authorization: token \$CODEBERG_TOKEN\"
-  Read issue:  curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" '${CODEBERG_API}/issues/{number}' | jq '.body'
-  Create issue: curl -sf -X POST -H \"Authorization: token \$CODEBERG_TOKEN\" -H 'Content-Type: application/json' '${CODEBERG_API}/issues' -d '{\"title\":\"...\",\"body\":\"...\",\"labels\":[LABEL_ID]}'
-  Relabel:     curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" -X PUT -H 'Content-Type: application/json' '${CODEBERG_API}/issues/{number}/labels' -d '{\"labels\":[LABEL_ID]}'
-  Comment:     curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" -X POST -H 'Content-Type: application/json' '${CODEBERG_API}/issues/{number}/comments' -d '{\"body\":\"...\"}'
-  Close:       curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" -X PATCH -H 'Content-Type: application/json' '${CODEBERG_API}/issues/{number}' -d '{\"state\":\"closed\"}'
-  List labels: curl -sf -H \"Authorization: token \$CODEBERG_TOKEN\" '${CODEBERG_API}/labels'
-NEVER echo or include the actual token value in output — always reference \$CODEBERG_TOKEN.
+${PROMPT_FOOTER}"

-## Environment
-FACTORY_ROOT=${FACTORY_ROOT}
-PROJECT_REPO_ROOT=${PROJECT_REPO_ROOT}
-PRIMARY_BRANCH=${PRIMARY_BRANCH}
-
-## Phase protocol (REQUIRED)
-When all work is done:
-  echo 'PHASE:done' > '${PHASE_FILE}'
-On unrecoverable error:
-  printf 'PHASE:failed\nReason: %s\n' 'describe error' > '${PHASE_FILE}'"
-
-# ── Create tmux session ─────────────────────────────────────────────────
+# ── Run session ──────────────────────────────────────────────────────────
 export CLAUDE_MODEL="opus"
-if ! start_formula_session "$SESSION_NAME" "$PROJECT_REPO_ROOT" "$PHASE_FILE"; then
-  exit 1
-fi
-
-agent_inject_into_session "$SESSION_NAME" "$PROMPT"
-log "Prompt sent to tmux session"
-matrix_send "planner" "Planner session started for ${CODEBERG_REPO}" 2>/dev/null || true
-
-# ── Phase monitoring loop ────────────────────────────────────────────────
-log "Monitoring phase file: ${PHASE_FILE}"
-_FORMULA_CRASH_COUNT=0
-
-monitor_phase_loop "$PHASE_FILE" 7200 "formula_phase_callback"
-
-FINAL_PHASE=$(read_phase "$PHASE_FILE")
-log "Final phase: ${FINAL_PHASE:-none}"
-
-if [ "$FINAL_PHASE" != "PHASE:done" ]; then
-  case "${_MONITOR_LOOP_EXIT:-}" in
-    idle_prompt)
-      log "planner: Claude returned to prompt without writing phase signal"
-      ;;
-    idle_timeout)
-      log "planner: timed out after 2h with no phase signal"
-      ;;
-    *)
-      log "planner finished without PHASE:done (phase: ${FINAL_PHASE:-none}, exit: ${_MONITOR_LOOP_EXIT:-})"
-      ;;
-  esac
-fi
-
-matrix_send "planner" "Planner session finished (${FINAL_PHASE:-no phase})" 2>/dev/null || true
-log "--- Planner run done ---"
+run_formula_and_monitor "planner"
--- a/predictor/predictor-run.sh
+++ b/predictor/predictor-run.sh
@ -0,0 +1,69 @@
+#!/usr/bin/env bash
+# =============================================================================
+# predictor-run.sh — Cron wrapper: predictor execution via Claude + formula
+#
+# Runs daily (or on-demand). Guards against concurrent runs and low memory.
+# Creates a tmux session with Claude (sonnet) reading formulas/run-predictor.toml.
+# Files prediction/unreviewed issues for the planner to triage.
+#
+# Usage:
+#   predictor-run.sh [projects/disinto.toml]   # project config (default: disinto)
+#
+# Cron: 0 6 * * * cd /path/to/dark-factory && bash predictor/predictor-run.sh
+# =============================================================================
+set -euo pipefail
+
+SCRIPT_DIR="$(cd "$(dirname "$0")" && pwd)"
+FACTORY_ROOT="$(dirname "$SCRIPT_DIR")"
+
+# Accept project config from argument; default to disinto
+export PROJECT_TOML="${1:-$FACTORY_ROOT/projects/disinto.toml}"
+# shellcheck source=../lib/env.sh
+source "$FACTORY_ROOT/lib/env.sh"
+# shellcheck source=../lib/agent-session.sh
+source "$FACTORY_ROOT/lib/agent-session.sh"
+# shellcheck source=../lib/formula-session.sh
+source "$FACTORY_ROOT/lib/formula-session.sh"
+
+LOG_FILE="$SCRIPT_DIR/predictor.log"
+# shellcheck disable=SC2034  # consumed by run_formula_and_monitor
+SESSION_NAME="predictor-${PROJECT_NAME}"
+PHASE_FILE="/tmp/predictor-session-${PROJECT_NAME}.phase"
+
+# shellcheck disable=SC2034  # read by monitor_phase_loop in lib/agent-session.sh
+PHASE_POLL_INTERVAL=15
+
+log() { echo "[$(date -u +%Y-%m-%dT%H:%M:%S)Z] $*" >> "$LOG_FILE"; }
+
+# ── Guards ────────────────────────────────────────────────────────────────
+acquire_cron_lock "/tmp/predictor-run.lock"
+check_memory 2000
+
+log "--- Predictor run start ---"
+
+# ── Load formula + context ───────────────────────────────────────────────
+load_formula "$FACTORY_ROOT/formulas/run-predictor.toml"
+build_context_block AGENTS.md RESOURCES.md
+
+# ── Build prompt ─────────────────────────────────────────────────────────
+build_prompt_footer
+
+# shellcheck disable=SC2034  # consumed by run_formula_and_monitor
+PROMPT="You are the prediction agent (goblin) for ${CODEBERG_REPO}. Work through the formula below. You MUST write PHASE:done to '${PHASE_FILE}' when finished — the orchestrator will time you out if you return to the prompt without signalling.
+
+Your role: spot patterns in infrastructure signals and file them as prediction issues.
+The planner (adult) will triage every prediction before acting.
+You MUST NOT emit feature work or implementation issues — only predictions
+about CI health, issue staleness, agent status, and system conditions.
+
+## Project context
+${CONTEXT_BLOCK}
+
+## Formula
+${FORMULA_CONTENT}
+
+${PROMPT_FOOTER}"
+
+# ── Run session ──────────────────────────────────────────────────────────
+export CLAUDE_MODEL="sonnet"
+run_formula_and_monitor "predictor"