From 9505f0b825d447c233f3fc88f68ec6cf7a96106f Mon Sep 17 00:00:00 2001
From: openhands <openhands@all-hands.dev>
Date: Fri, 20 Mar 2026 15:40:38 +0000
Subject: [PATCH 1/2] fix: Add 'Status: Planned Architecture' banner to
 EVIDENCE-ARCHITECTURE.md (#227)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---
 docs/EVIDENCE-ARCHITECTURE.md | 28 ++++++++++++++--------------
 1 file changed, 14 insertions(+), 14 deletions(-)

diff --git a/docs/EVIDENCE-ARCHITECTURE.md b/docs/EVIDENCE-ARCHITECTURE.md
index f4ea1a5..339e189 100644
--- a/docs/EVIDENCE-ARCHITECTURE.md
+++ b/docs/EVIDENCE-ARCHITECTURE.md
@@ -1,6 +1,6 @@
 # Evidence Architecture — Roadmap
 
-> **Status:** Design document. Describes the target architecture for evidence-driven decision making in disinto. Nothing described here exists yet unless marked ✅.
+> **Status: Planned Architecture** — The infrastructure described here is not yet implemented. See AGENTS.md for the current state. Items marked **Implemented** exist in the codebase; all others are **Planned**.
 
 Disinto is purpose-built for one loop: **build software, launch it, improve it, reach market fit.**
 
@@ -24,12 +24,12 @@ Different domains have different platforms:
 
 | Domain | Platform | What it tracks | Status |
 |--------|----------|---------------|--------|
-| Code | Codeberg | Issues, PRs, reviews | ✅ Live |
-| CI/CD | Woodpecker | Build/test results | ✅ Live |
-| Protocol | Ponder / GraphQL | On-chain state, trades, positions | ✅ Live (not yet wired to evidence) |
-| Infrastructure | DigitalOcean / system stats | CPU, RAM, disk, containers | Supervisor monitors, no evidence output yet |
-| User experience | Playwright personas | Conversion, friction, journey completion | ✅ Scripts exist (`run-usertest.sh`), no evidence output yet |
-| Funnel | Analytics (future) | Bounce rate, conversion, retention | Not started |
+| Code | Codeberg | Issues, PRs, reviews | **Implemented** — Live |
+| CI/CD | Woodpecker | Build/test results | **Implemented** — Live |
+| Protocol | Ponder / GraphQL | On-chain state, trades, positions | **Implemented** — Live (not yet wired to evidence) |
+| Infrastructure | DigitalOcean / system stats | CPU, RAM, disk, containers | **Planned** — Supervisor monitors, no evidence output yet |
+| User experience | Playwright personas | Conversion, friction, journey completion | **Implemented** — Scripts exist (`run-usertest.sh`), no evidence output yet |
+| Funnel | Analytics (future) | Bounce rate, conversion, retention | **Planned** — Not started |
 
 Agents won't need to understand each platform. **Processes act as adapters** — they will read a platform's API and write structured evidence to git.
 
@@ -52,11 +52,11 @@ Will produce evidence. Change nothing. Safe to run anytime.
 
 | Process | Measures | Platform | Status |
 |---------|----------|----------|--------|
-| `run-holdout` | Code quality against blind scenarios | Playwright + docker stack | ✅ `evaluate.sh` exists (harb #977) |
-| `run-user-test` | UX quality across 5 personas | Playwright + docker stack | ✅ `run-usertest.sh` exists (harb #978) |
-| `measure-resources` | Infra state (CPU, RAM, disk, containers) | System / DigitalOcean API | Not started |
-| `measure-protocol` | On-chain health (floor, reserves, volume) | Ponder GraphQL | Not started |
-| `measure-funnel` | User conversion and retention | Analytics API | Not started |
+| `run-holdout` | Code quality against blind scenarios | Playwright + docker stack | **Implemented** — `evaluate.sh` exists (harb #977) |
+| `run-user-test` | UX quality across 5 personas | Playwright + docker stack | **Implemented** — `run-usertest.sh` exists (harb #978) |
+| `measure-resources` | Infra state (CPU, RAM, disk, containers) | System / DigitalOcean API | **Planned** |
+| `measure-protocol` | On-chain health (floor, reserves, volume) | Ponder GraphQL | **Planned** |
+| `measure-funnel` | User conversion and retention | Analytics API | **Planned** |
 
 ### Mutation processes (create change)
 
@@ -64,8 +64,8 @@ Will produce new artifacts. Consume significant resources. Results delivered via
 
 | Process | Produces | Consumes | Status |
 |---------|----------|----------|--------|
-| `run-evolution` | Better optimizer candidates (`.push3` programs) | CPU-heavy: transpile + compile + deploy + attack per candidate | ✅ `evolve.sh` exists (harb #975) |
-| `run-red-team` | Evidence (floor held?) + new attack vectors | CPU + RAM for revm evaluation | ✅ `red-team.sh` exists (harb #976) |
+| `run-evolution` | Better optimizer candidates (`.push3` programs) | CPU-heavy: transpile + compile + deploy + attack per candidate | **Implemented** — `evolve.sh` exists (harb #975) |
+| `run-red-team` | Evidence (floor held?) + new attack vectors | CPU + RAM for revm evaluation | **Implemented** — `red-team.sh` exists (harb #976) |
 
 ### Feedback loops
 

From 8ddfbfe4921550f8ba47f641d2bca54ac5818c6d Mon Sep 17 00:00:00 2001
From: openhands <openhands@all-hands.dev>
Date: Fri, 20 Mar 2026 15:44:40 +0000
Subject: [PATCH 2/2] =?UTF-8?q?fix:=20address=20review=20=E2=80=94=20rewor?=
 =?UTF-8?q?d=20banner,=20mark=20partial=20items=20(#227)?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

- Reword banner to avoid false 'not yet implemented' absolute claim
- Change document-level label from 'Planned Architecture' to 'Partially Implemented'
- Mark Protocol and User experience rows as **Partial** (upstream scripts exist but no evidence output yet)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---
 docs/EVIDENCE-ARCHITECTURE.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/docs/EVIDENCE-ARCHITECTURE.md b/docs/EVIDENCE-ARCHITECTURE.md
index 339e189..0a9a463 100644
--- a/docs/EVIDENCE-ARCHITECTURE.md
+++ b/docs/EVIDENCE-ARCHITECTURE.md
@@ -1,6 +1,6 @@
 # Evidence Architecture — Roadmap
 
-> **Status: Planned Architecture** — The infrastructure described here is not yet implemented. See AGENTS.md for the current state. Items marked **Implemented** exist in the codebase; all others are **Planned**.
+> **Status: Partially Implemented** — This document describes the target evidence architecture. Items marked **Implemented** exist in the codebase; items marked **Partial** have upstream scripts but no evidence output yet; all others are **Planned**. See AGENTS.md for the current operational state.
 
 Disinto is purpose-built for one loop: **build software, launch it, improve it, reach market fit.**
 
@@ -26,9 +26,9 @@ Different domains have different platforms:
 |--------|----------|---------------|--------|
 | Code | Codeberg | Issues, PRs, reviews | **Implemented** — Live |
 | CI/CD | Woodpecker | Build/test results | **Implemented** — Live |
-| Protocol | Ponder / GraphQL | On-chain state, trades, positions | **Implemented** — Live (not yet wired to evidence) |
+| Protocol | Ponder / GraphQL | On-chain state, trades, positions | **Partial** — Live (not yet wired to evidence) |
 | Infrastructure | DigitalOcean / system stats | CPU, RAM, disk, containers | **Planned** — Supervisor monitors, no evidence output yet |
-| User experience | Playwright personas | Conversion, friction, journey completion | **Implemented** — Scripts exist (`run-usertest.sh`), no evidence output yet |
+| User experience | Playwright personas | Conversion, friction, journey completion | **Partial** — Scripts exist (`run-usertest.sh`), no evidence output yet |
 | Funnel | Analytics (future) | Bounce rate, conversion, retention | **Planned** — Not started |
 
 Agents won't need to understand each platform. **Processes act as adapters** — they will read a platform's API and write structured evidence to git.