disinto

Author	SHA1	Message	Date
openhands	70aea63521	fix: Dual curl calls for HAS_APPROVE / HAS_CHANGES create a race window (#321) Each of the three review-check sites (orphan, stuck-PR, backlog) now fetches reviews with a single curl call, storing the JSON response and jq-filtering both HAS_APPROVE and HAS_CHANGES from the cached result. This eliminates the race window where a review submitted between the two calls could cause a transient mismatch. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-20 00:59:51 +00:00
openhands	08d702b055	fix: fix: stale REQUEST_CHANGES reviews are invisible to dev-poll stuck-PR check (#319) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 22:24:48 +00:00
openhands	1ab700c87a	fix: feat: review + dev-poll skip CI gate for non-code PRs (#266) Add diff_has_code_files() and ci_required_for_pr() helpers to ci-helpers.sh. Non-code PRs (docs/, formulas/, evidence/, .md) that have no CI results now skip the CI gate instead of being stuck forever. Applied to: - review-pr.sh: CI gate skipped for non-code PRs - review-poll.sh: CI gate skipped for non-code PRs - dev-poll.sh: CI state treated as "success" for non-code PRs in orphan, stuck-PR, and backlog merge paths Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 13:48:00 +00:00
openhands	d1cea6c0bb	fix: apply same REQUEST_CHANGES/CI-pending fix to PRIORITY 1 block	2026-03-18 21:03:53 +00:00
openhands	34ddbef3fd	fix: PRIORITY 1.5 misses REQUEST_CHANGES when CI is not yet settled (#41)	2026-03-18 20:50:56 +00:00
openhands	f73d5f471e	fix: feat: dev-agent merges its own PRs via non-admin Codeberg account (#172) - phase-handler.sh: remove do_merge(); on APPROVAL inject exact API commands for agent to merge+close directly; PHASE:done now only does local cleanup (tmux, worktree, labels) — merge already done - dev-agent.sh: update PHASE_PROTOCOL_INSTRUCTIONS — Approved means merge via API, close issue, then write PHASE:done - dev-poll.sh: remove try_merge_or_rebase(); for approved+CI-green orphaned PRs, spawn dev-agent (recovery mode) to merge instead - .env.example: document new token roles (CODEBERG_TOKEN = bot for push/PR/merge; REVIEW_BOT_TOKEN = human account for approvals) - AGENTS.md: update token descriptions to match new roles Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 17:59:36 +00:00
openhands	d904192ab7	fix: Escalation write-once guard is not atomic (pre-existing) (#154) - `ci_fix_check_and_increment` now accepts an optional `check_only` arg: - count < 3, check_only: returns `ok:N` without incrementing (deferred to launch time, preserving the WAITING_PRS protection) - count < 3, non-check_only: increments and returns `ok:N` (unchanged) - count == 3 (any mode): atomically bumps to 4 and returns `exhausted_first_time:3` — only one concurrent poller can win this - count > 3 (any mode): returns `exhausted:N` with no write - `handle_ci_exhaustion` unified to a single code path for both check_only and non-check_only: - Writes escalation JSONL + matrix_send only when sentinel is `exhausted_first_time` — never on a bare integer comparison outside a lock - Removes the two separate `ci_fix_increment` bump-to-4 calls that were racy (the sentinel bump is now inside the flock in Python) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 11:44:30 +00:00
openhands	4dc64ea65b	fix: restore deferred increment for backlog path to prevent counter leak The previous commit introduced a counter leak in the backlog scan path: handle_ci_exhaustion (without check_only) atomically incremented the CI fix counter before the WAITING_PRS guard, so an exit 0 that never spawned a dev-agent would silently consume one of the three allowed fix attempts. Restore the READY_PR_FOR_INCREMENT / deferred-increment mechanism: - Backlog scan calls handle_ci_exhaustion with "check_only" (read-only, no increment) to detect exhaustion without touching the counter. - The counter is bumped atomically at LAUNCH time via handle_ci_exhaustion (without check_only), so the increment only happens when we are certain a dev-agent is being spawned. If a concurrent poller already exhausted the counter between scan and launch, the LAUNCH call returns 0 and we bail out cleanly without double-spawning. The in-progress, stuck-PR, and try_merge_or_rebase paths are unaffected: they call handle_ci_exhaustion without check_only, which continues to use the atomic ci_fix_check_and_increment to prevent concurrent double-spawning. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 10:34:41 +00:00
openhands	7bf13567fd	fix: TOCTOU in handle_ci_exhaustion: check-then-act not atomic (#125) Add ci_fix_check_and_increment() that performs read + threshold-check + conditional increment in a single flock-protected Python call, replacing the prior three-step sequence (ci_fix_count / bash check / ci_fix_increment) that allowed two concurrent poll invocations to both pass the threshold and spawn duplicate dev-agents for the same PR. handle_ci_exhaustion now calls ci_fix_check_and_increment atomically and returns the new count in CI_FIX_ATTEMPTS; all separate ci_fix_increment calls after handle_ci_exhaustion (including the deferred READY_PR_FOR_INCREMENT mechanism) are removed. Log messages updated from CI_FIX_ATTEMPTS+1 to CI_FIX_ATTEMPTS to reflect the post-increment count. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 10:22:24 +00:00
openhands	7d51e5e333	fix: Add formula guard to backlog scan path (#127)	2026-03-18 09:49:44 +00:00
openhands	32ee53517f	fix: In-progress formula issue causes infinite dev-agent respawn (#115) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 06:41:43 +00:00
openhands	1352620c3d	fix: ci_fix_count/ci_fix_increment not atomic — potential race under concurrent polls (#118) Wrap ci_fix_count(), ci_fix_increment(), and ci_fix_reset() with flock on a shared lockfile to prevent concurrent modification of the JSON tracker. Uses flock(1) in command-wrapping mode so each Python process holds an exclusive lock for the duration of its read-modify-write cycle. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 06:30:17 +00:00
openhands	cf8446b451	fix: try_merge_or_rebase rebase-failure spawn bypasses ci_fix_increment (#56) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 04:05:18 +00:00
openhands	ff02b1e653	fix: Three near-identical CI-exhaustion blocks should be a shared function (#58) Extract CI-exhaustion check/escalate logic into handle_ci_exhaustion() helper. All three call sites (orphaned PRs, stuck PRs, backlog PRs) now use the shared function, eliminating future drift between the copies. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 03:21:27 +00:00
openhands	8e600787c1	fix: ci_passed() still lives in dev/dev-poll.sh, not lib/ (#70) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 02:05:54 +00:00
openhands	bd02330b22	fix: shellcheck TODO has no enforcement — || true may never be removed (#71) - Fix SC2164: add || exit 1 to bare cd in update-prompt.sh - Fix SC2155: separate declare and assign in env.sh, supervisor-poll.sh, dev-agent.sh - Fix SC2034: inline suppression for vars used by sourced helpers - Remove unused `mergeable` declaration, rename unused loop var to `_w` - Remove || true from shellcheck CI step — failures are now blocking Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 01:53:02 +00:00
openhands	df2522a7cb	fix: address review findings from issue #67 escalation refactor - supervisor: skip *.done.jsonl in escalation glob (bug: wildcard matched harb.done.jsonl producing spurious 'pending' log noise every cycle) - supervisor: use wc -l instead of grep -c . for line counting (style nit) - supervisor: consume gardener-esc-resolved.log via fixed() so escalation resolutions appear in end-of-cycle supervisor reporting - dev-poll: update all 'escalated to supervisor' log/matrix strings to 'escalated to gardener' (lines 263, 268, 344, 420) - gardener: track _esc_total_created across all escalation entries and write count to supervisor/gardener-esc-resolved.log after processing Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 18:30:57 +00:00
openhands	150ede5605	fix: refactor: move escalation processing from supervisor to gardener (#67) - dev-poll.sh: write escalations to per-project files (supervisor/escalations-{PROJECT_NAME}.jsonl) and add "project" field so each project's escalations are isolated; update is_escalated() to read from the same per-project paths - gardener-poll.sh: add escalation processing block that reads the per-project escalation file, fetches CI logs via Woodpecker, and creates per-file ShellCheck sub-issues or generic CI failure issues labeled backlog — runs with the correct CODEBERG_API and WOODPECKER_REPO_ID already loaded from the project TOML - supervisor-poll.sh: remove the escalation processing block; replace with a simple flog report counting pending escalations per project Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 17:32:56 +00:00
openhands	13bc948b1d	fix: address review findings for escalation race condition, SQL injection, and sc_codes scope - Race condition: mv escalations.jsonl to a PID-stamped snapshot before processing so concurrent dev-poll appends go to a fresh file; rm snapshot after loop — no entries are ever silently dropped - SQL injection: validate ESC_PR_SHA is a 40-char hex string before interpolating into the wpdb query - sc_codes scope: compute per-file from file_errors (already filtered to that file) instead of the entire step log; also switch grep to -F so dots in filenames are not treated as regex wildcards - step_pid validation: reject non-integer values from Woodpecker API before passing as CLI argument - Fallback body now distinguishes "CI logs unavailable" from "logs found but issue creation API calls failed" - ESC_GENERIC_FAIL: avoid leading blank line by using conditional separator and fix code-block opening newline - is_escalated(): remove dead esc_file/done_file locals; add Python-level int() guard so empty/non-numeric issue or pr values fail cleanly instead of producing a syntax error suppressed by 2>/dev/null Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 15:11:53 +00:00
openhands	d9520f48a6	fix: feat: supervisor breaks down escalated CI failures into sub-issues (#52) - supervisor-poll.sh: replace P3 escalation log with actionable sub-issue creation. For each entry in escalations.jsonl: fetch CI logs via woodpecker-cli, create one sub-issue per file for ShellCheck failures, one combined issue for other CI failures, or a fallback investigation issue if logs are unavailable. Move processed entries to escalations.done.jsonl and clear escalations.jsonl. - dev-poll.sh: add is_escalated() helper that checks both escalations.jsonl and escalations.done.jsonl; use it (alongside ci_fix_count >= 3) in all three CI-fix spawn paths so escalated PRs are skipped even if the ci-fixes tracker is reset. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 14:32:41 +00:00
johba	531ae5cf71	fix: escalate once then continue to backlog (#59) Two bugs after #53 merged: 1. Escalation written every poll cycle (4 entries in 30min) — now writes once, bumps counter to 4 to skip 2. Exit after escalation blocked backlog work — now falls through to pick up next issue Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/disinto/pulls/59 Reviewed-by: review_bot <review_bot@noreply.codeberg.org>	2026-03-17 15:14:48 +01:00
johba	c24adc4ea2	fix: limit CI fix respawn to 3 attempts, then escalate to supervisor (#53) Dev-poll spawned a fresh agent every 10min for CI failures. Each agent started with CI_FIX_COUNT=0 — infinite loop. Now tracks attempts per PR in `/tmp/dev-poll-ci-fixes-{project}.json`. After 3 failed rounds: - Writes escalation to `supervisor/escalations.jsonl` - Sends Matrix alert - Stops respawning Part of #52 (supervisor escalation pipeline). Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/disinto/pulls/53 Reviewed-by: review_bot <review_bot@noreply.codeberg.org>	2026-03-17 13:15:49 +01:00
openhands	ef77c56217	fix: extract ci_passed() helper — fix all CI gates for no-CI projects dev-poll.sh had 5 places checking CI_STATE='success', all blocking projects without CI. Extracted ci_passed() helper that treats empty/pending/unknown as pass when WOODPECKER_REPO_ID=0.	2026-03-17 09:51:18 +00:00
openhands	1b3559bba7	fix: enforce single-threaded pipeline per project Don't start new issues while open PRs are waiting for review/CI. This prevents dev-agent from churning through backlog issues without reviews landing first.	2026-03-17 09:17:02 +00:00
openhands	249eef86c1	fix: per-project lock and log files for dev-poll Hardcoded /tmp/dev-agent.lock meant harb and disinto dev-polls shared a lock — one project's running agent blocked the other. Now uses /tmp/dev-agent-{project}.lock and dev-agent-{project}.log.	2026-03-17 08:18:24 +00:00
johba	9050413994	refactor: split supervisor into infra + per-project, make poll scripts config-driven Supervisor split (#26): - Layer 1 (infra): P0 memory, P1 disk, P4 housekeeping — runs once, project-agnostic - Layer 2 (per-project): P2 CI/dev-agent, P3 PRs/deps — iterates projects/*.toml - Adding a new project requires only a new TOML file, no code changes Poll scripts accept project TOML arg (#27): - dev-poll.sh, review-poll.sh, gardener-poll.sh accept optional project TOML as $1 - env.sh loads PROJECT_TOML if set, overriding .env defaults - Cron: `dev-poll.sh projects/versi.toml` targets that project New files: - lib/load-project.sh: TOML to env var loader (Python tomllib) - projects/versi.toml: current project config extracted from .env Backwards compatible: scripts without a TOML arg fall back to .env config. Closes #26, Closes #27 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 08:57:18 +01:00
johba	98f0c40106	refactor: rewrite parse-deps.py as pure bash, remove only Python from repo Replace lib/parse-deps.py with lib/parse-deps.sh to keep the toolchain all-bash. Rewrite supervisor P3b cycle detection and P3c stale dep check as pure bash using associative arrays and DFS. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-16 21:22:53 +01:00
johba	6cf580c010	refactor: extract shared dep parser to lib/parse-deps.py (Closes #20) Single source of truth for dependency parsing, replacing three copies: - dev-poll.sh get_deps() now calls parse-deps.py - supervisor P3b/P3c import parse_deps() via importlib Supports stdin, argument, and --json modes for different callers. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-16 21:16:49 +01:00
johba	77cb4c4643	refactor: rename factory/ → supervisor/, factory-poll → supervisor-poll The supervisor agent was confusingly named "factory" (same as the project). Rename directory, script, log, lock, status, and escalation files. Update all references across scripts and docs. FACTORY_ROOT env var unchanged (refers to project root, not agent). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-15 18:06:25 +01:00
openhands	c5fdd8ac50	fix: always rebase on merge failure, don't trust mergeable field Codeberg's mergeable field flickers between true/false — unreliable for deciding whether to rebase. Just attempt rebase on any non-200/204. Worst case it's a no-op. Also added git fetch before rebase.	2026-03-15 10:51:09 +00:00
openhands	c22f1acbdf	fix: add matrix notifications for silent failure paths dev-poll.sh: - Merge conflicts (rebase attempt + outcome) - Non-conflict merge failures (HTTP code) - Low memory skip - New issue launch review-poll.sh: - Review script failure	2026-03-15 10:27:23 +00:00
openhands	b4d14c4c98	fix: auto-rebase on merge conflict (mergeable=false) When merge returns non-200, check mergeable flag. If false, rebase the PR branch onto master via worktree. If rebase fails, spawn dev-agent to resolve. Prevents infinite 405 retry loops. Extracted try_merge_or_rebase() helper used at all 3 merge points.	2026-03-15 10:21:40 +00:00
johba	f215fbe3cf	feat: add Matrix coordination channel, replace openclaw (Closes #8) Add matrix_send() to lib/env.sh and matrix_listener.sh daemon for real-time notifications, threaded escalations, and human-in-the-loop replies. All agents now notify via Matrix instead of openclaw. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 16:25:33 +01:00
johba	90ef03a304	refactor: make all scripts multi-project via env vars Replace hardcoded harb references across the entire codebase: - HARB_REPO_ROOT → PROJECT_REPO_ROOT (with deprecated alias) - Derive PROJECT_NAME from CODEBERG_REPO slug - Add PRIMARY_BRANCH (master/main), WOODPECKER_REPO_ID env vars - Parameterize worktree prefixes, docker container names, branch refs - Genericize agent prompts (gardener, factory supervisor) - Update best-practices docs to use $-vars, prefix harb lessons All project-specific values now flow from .env → lib/env.sh → scripts. Backward-compatible: existing harb setups work without .env changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-14 13:49:09 +01:00
openhands	a3f0f7f6f3	fix: stuck PR issue extraction — check title, body (Closes #N), log skip PRs #684 and #710 had no issue number in branch name or title. Now also checks PR body for 'Closes #NNN'. If still no issue found, logs a skip (dev-agent requires an issue number to work).	2026-03-14 12:01:36 +00:00
openhands	30b31c76aa	fix: stuck PR detection only matched fix/issue-NNN branches PRs with custom branch names (fix/fitness-factory-address, chore/seed-consolidation) were invisible to priority 1.5. Now also extracts issue number from PR title (#NNN) as fallback.	2026-03-14 11:12:27 +00:00
openhands	0f979fd6c9	fix: stuck PRs priority + STATE.md in first commit + 405 bug in dev-poll 1. PRIORITY 1.5 in dev-poll: scan ALL open PRs for REQUEST_CHANGES or CI failure before picking new backlog issues. Stuck PRs get fixed first to avoid complex rebases piling up. 2. STATE.md written in worktree before claude starts (included in first commit, not a separate push that dismisses stale approvals). 3. Removed HTTP 405 from merge success check in dev-poll.sh (was fixed in dev-agent.sh but not here — 2 occurrences).	2026-03-14 07:34:47 +00:00
openhands	98210cc302	fix: dep check — trust closed state, drop merged-PR search The merged-PR search was over-engineered and caused false negatives (couldn't match PR to issue when title/body didn't contain #NNN). Issue closed = dep satisfied. Factory only closes after merging.	2026-03-13 11:25:35 +00:00
openhands	d61dead3f1	fix: dep check fallback — also check PR with same number as issue Codeberg uses shared issue/PR numbering. When a PR IS the dep issue (e.g. PR #665 fixes issue #665), the title search misses it. Fallback checks if pulls/{dep_num} is merged.	2026-03-13 11:24:14 +00:00
openhands	cb24968d9b	feat: dark factory — autonomous CI/CD agents for harb Three agents extracted from ~/scripts/harb-{dev,review}/: - dev/ — pull-based dev agent (find ready issues → implement → PR → merge) - review/ — AI code review (structured verdicts, follow-up issues) - factory/ — supervisor (bash health checks, auto-fix, escalation) All secrets externalized to .env (see .env.example). Shared env/helpers in lib/env.sh.	2026-03-12 12:44:15 +00:00

40 commits
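
Note on the atomic CI-fix counter (commits 1352620c3d, 7bf13567fd, d904192ab7): the sketch below illustrates the check-and-increment behaviour those messages describe. It is not the code from the repository. The function body, the tracker layout (a JSON object mapping PR number to count), the `.lock` suffix, and the use of `fcntl.flock` in-process (the log mentions flock(1) wrapping a Python call) are all assumptions. The log also does not say whether `ok:N` in check_only mode reports the current or the next count; this sketch reports the current one.

```python
import fcntl
import json
import os
import tempfile


def ci_fix_check_and_increment(tracker_path, pr, check_only=False, limit=3):
    """Read, threshold-check, and conditionally bump a per-PR counter under one lock.

    Hypothetical sketch: tracker_path is a JSON file of {pr: count}; the lock
    file name (tracker_path + ".lock") is an assumption, not the real path.
    """
    with open(tracker_path + ".lock", "w") as lock:
        # Exclusive lock held for the whole read-modify-write cycle;
        # released when the lock file is closed at the end of the block.
        fcntl.flock(lock, fcntl.LOCK_EX)
        try:
            with open(tracker_path) as f:
                counts = json.load(f)
        except (FileNotFoundError, json.JSONDecodeError):
            counts = {}
        key = str(pr)
        count = int(counts.get(key, 0))

        if count > limit:
            # Already escalated by an earlier caller: no write.
            return f"exhausted:{count}"
        if count == limit:
            # Only one concurrent caller can observe count == limit,
            # because the bump to limit + 1 happens inside the lock.
            counts[key] = limit + 1
            result = f"exhausted_first_time:{count}"
        elif check_only:
            # Read-only probe: report the current count without consuming an attempt.
            return f"ok:{count}"
        else:
            count += 1
            counts[key] = count
            result = f"ok:{count}"

        with open(tracker_path, "w") as f:
            json.dump(counts, f)
        return result


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        tracker = os.path.join(tmp, "ci-fixes.json")
        for _ in range(5):
            print(ci_fix_check_and_increment(tracker, 321))
```

Run as a script, the five calls print `ok:1`, `ok:2`, `ok:3`, `exhausted_first_time:3`, `exhausted:4`. A caller in the style of `handle_ci_exhaustion` would write the escalation only on the `exhausted_first_time` result, so concurrent pollers cannot both escalate the same PR.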