From 9505f0b825d447c233f3fc88f68ec6cf7a96106f Mon Sep 17 00:00:00 2001
From: openhands
Date: Fri, 20 Mar 2026 15:40:38 +0000
Subject: [PATCH 1/2] fix: Add 'Status: Planned Architecture' banner to EVIDENCE-ARCHITECTURE.md (#227)

Co-Authored-By: Claude Opus 4.6 (1M context)
---
 docs/EVIDENCE-ARCHITECTURE.md | 28 ++++++++++++++--------------
 1 file changed, 14 insertions(+), 14 deletions(-)

diff --git a/docs/EVIDENCE-ARCHITECTURE.md b/docs/EVIDENCE-ARCHITECTURE.md
index f4ea1a5..339e189 100644
--- a/docs/EVIDENCE-ARCHITECTURE.md
+++ b/docs/EVIDENCE-ARCHITECTURE.md
@@ -1,6 +1,6 @@
 # Evidence Architecture — Roadmap
 
-> **Status:** Design document. Describes the target architecture for evidence-driven decision making in disinto. Nothing described here exists yet unless marked ✅.
+> **Status: Planned Architecture** — The infrastructure described here is not yet implemented. See AGENTS.md for the current state. Items marked **Implemented** exist in the codebase; all others are **Planned**.
 
 Disinto is purpose-built for one loop: **build software, launch it, improve it, reach market fit.**
 
@@ -24,12 +24,12 @@ Different domains have different platforms:
 
 | Domain | Platform | What it tracks | Status |
 |--------|----------|---------------|--------|
-| Code | Codeberg | Issues, PRs, reviews | ✅ Live |
-| CI/CD | Woodpecker | Build/test results | ✅ Live |
-| Protocol | Ponder / GraphQL | On-chain state, trades, positions | ✅ Live (not yet wired to evidence) |
-| Infrastructure | DigitalOcean / system stats | CPU, RAM, disk, containers | Supervisor monitors, no evidence output yet |
-| User experience | Playwright personas | Conversion, friction, journey completion | ✅ Scripts exist (`run-usertest.sh`), no evidence output yet |
-| Funnel | Analytics (future) | Bounce rate, conversion, retention | Not started |
+| Code | Codeberg | Issues, PRs, reviews | **Implemented** — Live |
+| CI/CD | Woodpecker | Build/test results | **Implemented** — Live |
+| Protocol | Ponder / GraphQL | On-chain state, trades, positions | **Implemented** — Live (not yet wired to evidence) |
+| Infrastructure | DigitalOcean / system stats | CPU, RAM, disk, containers | **Planned** — Supervisor monitors, no evidence output yet |
+| User experience | Playwright personas | Conversion, friction, journey completion | **Implemented** — Scripts exist (`run-usertest.sh`), no evidence output yet |
+| Funnel | Analytics (future) | Bounce rate, conversion, retention | **Planned** — Not started |
 
 Agents won't need to understand each platform. **Processes act as adapters** — they will read a platform's API and write structured evidence to git.
 
@@ -52,11 +52,11 @@ Will produce evidence. Change nothing. Safe to run anytime.
 
 | Process | Measures | Platform | Status |
 |---------|----------|----------|--------|
-| `run-holdout` | Code quality against blind scenarios | Playwright + docker stack | ✅ `evaluate.sh` exists (harb #977) |
-| `run-user-test` | UX quality across 5 personas | Playwright + docker stack | ✅ `run-usertest.sh` exists (harb #978) |
-| `measure-resources` | Infra state (CPU, RAM, disk, containers) | System / DigitalOcean API | Not started |
-| `measure-protocol` | On-chain health (floor, reserves, volume) | Ponder GraphQL | Not started |
-| `measure-funnel` | User conversion and retention | Analytics API | Not started |
+| `run-holdout` | Code quality against blind scenarios | Playwright + docker stack | **Implemented** — `evaluate.sh` exists (harb #977) |
+| `run-user-test` | UX quality across 5 personas | Playwright + docker stack | **Implemented** — `run-usertest.sh` exists (harb #978) |
+| `measure-resources` | Infra state (CPU, RAM, disk, containers) | System / DigitalOcean API | **Planned** |
+| `measure-protocol` | On-chain health (floor, reserves, volume) | Ponder GraphQL | **Planned** |
+| `measure-funnel` | User conversion and retention | Analytics API | **Planned** |
 
 ### Mutation processes (create change)
 
@@ -64,8 +64,8 @@ Will produce new artifacts. Consume significant resources. Results delivered via
 
 | Process | Produces | Consumes | Status |
 |---------|----------|----------|--------|
-| `run-evolution` | Better optimizer candidates (`.push3` programs) | CPU-heavy: transpile + compile + deploy + attack per candidate | ✅ `evolve.sh` exists (harb #975) |
-| `run-red-team` | Evidence (floor held?) + new attack vectors | CPU + RAM for revm evaluation | ✅ `red-team.sh` exists (harb #976) |
+| `run-evolution` | Better optimizer candidates (`.push3` programs) | CPU-heavy: transpile + compile + deploy + attack per candidate | **Implemented** — `evolve.sh` exists (harb #975) |
+| `run-red-team` | Evidence (floor held?) + new attack vectors | CPU + RAM for revm evaluation | **Implemented** — `red-team.sh` exists (harb #976) |
 
 ### Feedback loops
 

From 8ddfbfe4921550f8ba47f641d2bca54ac5818c6d Mon Sep 17 00:00:00 2001
From: openhands
Date: Fri, 20 Mar 2026 15:44:40 +0000
Subject: [PATCH 2/2] =?UTF-8?q?fix:=20address=20review=20=E2=80=94=20rewor?=
 =?UTF-8?q?d=20banner,=20mark=20partial=20items=20(#227)?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

- Reword banner to avoid false 'not yet implemented' absolute claim
- Change document-level label from 'Planned Architecture' to 'Partially Implemented'
- Mark Protocol and User experience rows as **Partial** (upstream scripts exist but no evidence output yet)

Co-Authored-By: Claude Opus 4.6 (1M context)
---
 docs/EVIDENCE-ARCHITECTURE.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/docs/EVIDENCE-ARCHITECTURE.md b/docs/EVIDENCE-ARCHITECTURE.md
index 339e189..0a9a463 100644
--- a/docs/EVIDENCE-ARCHITECTURE.md
+++ b/docs/EVIDENCE-ARCHITECTURE.md
@@ -1,6 +1,6 @@
 # Evidence Architecture — Roadmap
 
-> **Status: Planned Architecture** — The infrastructure described here is not yet implemented. See AGENTS.md for the current state. Items marked **Implemented** exist in the codebase; all others are **Planned**.
+> **Status: Partially Implemented** — This document describes the target evidence architecture. Items marked **Implemented** exist in the codebase; items marked **Partial** have upstream scripts but no evidence output yet; all others are **Planned**. See AGENTS.md for the current operational state.
 
 Disinto is purpose-built for one loop: **build software, launch it, improve it, reach market fit.**
 
@@ -26,9 +26,9 @@ Different domains have different platforms:
 |--------|----------|---------------|--------|
 | Code | Codeberg | Issues, PRs, reviews | **Implemented** — Live |
 | CI/CD | Woodpecker | Build/test results | **Implemented** — Live |
-| Protocol | Ponder / GraphQL | On-chain state, trades, positions | **Implemented** — Live (not yet wired to evidence) |
+| Protocol | Ponder / GraphQL | On-chain state, trades, positions | **Partial** — Live (not yet wired to evidence) |
 | Infrastructure | DigitalOcean / system stats | CPU, RAM, disk, containers | **Planned** — Supervisor monitors, no evidence output yet |
-| User experience | Playwright personas | Conversion, friction, journey completion | **Implemented** — Scripts exist (`run-usertest.sh`), no evidence output yet |
+| User experience | Playwright personas | Conversion, friction, journey completion | **Partial** — Scripts exist (`run-usertest.sh`), no evidence output yet |
 | Funnel | Analytics (future) | Bounce rate, conversion, retention | **Planned** — Not started |
 
 Agents won't need to understand each platform. **Processes act as adapters** — they will read a platform's API and write structured evidence to git.