fix: fix: action-agent shares phase handler with dev-agent — review lifecycle + cleanup (#388) (#403)

Fixes #388 ## Changes Action-agent now sources dev/phase-handler.sh and enters monitor_phase_loop after prompt injection. Two paths: (A) git output triggers the same PR/CI/review lifecycle as dev-agent, (B) no-git output writes PHASE:done for cleanup. Adds docker compose down on terminal phases, escalation to supervisor on idle timeout, and proper temp file cleanup. Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/disinto/pulls/403 Reviewed-by: Disinto_bot <disinto_bot@noreply.codeberg.org>
2026-03-20 17:39:44 +01:00 · 2026-03-20 17:39:44 +01:00 · a15623b747
commit a15623b747
parent 1fb6f0a032
4 changed files with 198 additions and 65 deletions
--- a/AGENTS.md
+++ b/AGENTS.md
@ -227,9 +227,9 @@ later triages these predictions into action/backlog issues or dismisses them.
 ### Action (`action/`)

 **Role**: Execute operational tasks described by action formulas — run scripts,
-call APIs, send messages, collect human approval. Unlike the dev-agent, the
-action-agent produces no PRs: Claude closes the issue directly after executing
-all formula steps.
+call APIs, send messages, collect human approval. Shares the same phase handler
+as the dev-agent: if an action produces code changes, the orchestrator creates a
+PR and drives the CI/review loop; otherwise Claude closes the issue directly.

 **Trigger**: `action-poll.sh` runs every 10 min via cron. It scans for open
 issues labeled `action` that have no active tmux session, then spawns
@ -237,17 +237,17 @@ issues labeled `action` that have no active tmux session, then spawns

 **Key files**:
 - `action/action-poll.sh` — Cron scheduler: finds open action issues with no active tmux session, spawns action-agent.sh
- `action/action-agent.sh` — Orchestrator: fetches issue body + prior comments, creates tmux session (`action-{issue_num}`) with interactive `claude`, injects formula prompt, monitors session until Claude exits or 4h idle timeout
+- `action/action-agent.sh` — Orchestrator: fetches issue body + prior comments, creates tmux session (`action-{issue_num}`) with interactive `claude`, injects formula prompt with phase protocol, enters `monitor_phase_loop` (shared via `dev/phase-handler.sh`) for CI/review lifecycle or direct completion

 **Session lifecycle**:
 1. `action-poll.sh` finds open `action` issues with no active tmux session.
 2. Spawns `action-agent.sh <issue_num>`.
 3. Agent creates Matrix thread, exports `MATRIX_THREAD_ID` so Claude's output streams to the thread via a Stop hook (`on-stop-matrix.sh`).
-4. Agent creates tmux session `action-{issue_num}`, injects prompt (formula + prior comments).
-5. Claude executes formula steps using Bash and other tools, posts progress as issue comments. Each Claude turn is streamed to the Matrix thread for real-time human visibility.
-6. For human input: Claude sends a Matrix message and waits; the reply is injected into the session by `matrix_listener.sh`.
-7. When complete: Claude closes the issue with a summary comment. Session exits.
-8. Poll detects no active session on next run — nothing further to do.
+4. Agent creates tmux session `action-{issue_num}`, injects prompt (formula + prior comments + phase protocol).
+5. Agent enters `monitor_phase_loop` (shared with dev-agent via `dev/phase-handler.sh`).
+6. **Path A (git output):** Claude pushes branch → `PHASE:awaiting_ci` → handler creates PR, polls CI → injects failures → Claude fixes → push → re-poll → CI passes → `PHASE:awaiting_review` → handler polls reviews → injects REQUEST_CHANGES → Claude fixes → approved → merge → cleanup.
+7. **Path B (no git output):** Claude posts results as comment, closes issue → `PHASE:done` → handler cleans up (kill session, docker compose down, remove temp files).
+8. For human input: Claude sends a Matrix message and waits; the reply is injected into the session by `matrix_listener.sh`.

 **Environment variables consumed**:
 - `CODEBERG_TOKEN`, `CODEBERG_REPO`, `CODEBERG_API`, `PROJECT_NAME`, `CODEBERG_WEB`