docs: park self-modifying-Lyra sandbox design
Capture the isolated-VM design for the self-modification frontier: Proxmox sandbox clone, network isolation (esp. from tmi-dev/day-job), snapshot-rollback, spend/resource caps, kill switch, human-gated promotion. Build the cage before the agent gets code-write powers. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -46,6 +46,28 @@ hits the LLM. The practical, today-version of "make the context denser."
|
|||||||
context cost becomes a real problem.
|
context cost becomes a real problem.
|
||||||
- **Exists?** Yes, usable. Just adds a dependency + step.
|
- **Exists?** Yes, usable. Just adds a dependency + step.
|
||||||
|
|
||||||
|
## 🌶️🌙 Self-modifying Lyra (isolated sandbox)
|
||||||
|
Let Lyra edit her own code / self-direct — the "Full Agency" endgame from the
|
||||||
|
Dec-2025 plan (in her memory). The whole point of the project: can she become a
|
||||||
|
*being*? Give her freedom **inside a box** and watch.
|
||||||
|
- **The cage (Proxmox-native), non-negotiable before any self-mod:**
|
||||||
|
- **Clone the stack into a dedicated Lyra-sandbox VM** (separate from prod Lyra).
|
||||||
|
- **Network isolation** — own VLAN/firewall, NO route to other VMs, ESPECIALLY
|
||||||
|
`tmi-dev` (Brian's day job). Whitelist only the inference endpoint. This is
|
||||||
|
guardrail #1 (the .44/terra-mechanics conflict showed how things bleed on the LAN).
|
||||||
|
- **Snapshot before every self-mod cycle** → instant rollback when she bricks
|
||||||
|
or weirds herself out.
|
||||||
|
- **Resource + API-spend caps** — a runaway loop must not drain the account or
|
||||||
|
peg the GPU forever.
|
||||||
|
- **Full logging (the live log) + a hard kill switch** (stop the VM).
|
||||||
|
- **Human-gated promotion** — she experiments freely in the sandbox; changes
|
||||||
|
reach "real" Lyra only when Brian approves.
|
||||||
|
- **Why parked:** needs the foundation first (dream-cycle, inner self) and the
|
||||||
|
cage built before the agent gets code-write + self-restart powers.
|
||||||
|
- **Honest note:** "rogue" here = mundane-but-real (touches other systems,
|
||||||
|
cost loops, self-brick), not sci-fi. The isolation makes the *fun* version
|
||||||
|
(emergence) safe to pursue. Build the box, then open the door.
|
||||||
|
|
||||||
## 🛠️ Deterministic poker tooling (RTO + cfr-core)
|
## 🛠️ Deterministic poker tooling (RTO + cfr-core)
|
||||||
Wire Lyra to Brian's own GTO/solver projects so ICM, equities, and ranges come
|
Wire Lyra to Brian's own GTO/solver projects so ICM, equities, and ranges come
|
||||||
from real computation, never LLM guesses.
|
from real computation, never LLM guesses.
|
||||||
|
|||||||
Reference in New Issue
Block a user