Here you go — **ARCHITECTURE_v0.6.1.md**, clean, structured, readable, and aligned exactly with the new mental model where **Inner Self is the core agent** the user interacts with. No walls of text — just the right amount of detail. --- # **ARCHITECTURE_v0.6.1 — Lyra Cognitive System** > **Core change from v0.6.0 → v0.6.1:** > **Inner Self becomes the primary conversational agent** > (the model the user is *actually* talking to), > while Executive and Cortex models support the Self rather than drive it. --- # **1. High-Level Overview** Lyra v0.6.1 is composed of **three cognitive layers** and **one expression layer**, plus an autonomy module for ongoing identity continuity. ``` USER ↓ Relay (I/O) ↓ Cortex Intake (context snapshot) ↓ INNER SELF ←→ EXECUTIVE MODEL (DeepSeek) ↓ Cortex Chat Model (draft language) ↓ Persona Model (Lyra’s voice) ↓ Relay → USER ↓ Inner Self updates Autonomy Core (self-state) ``` --- # **2. Roles of Each Layer** --- ## **2.1 Inner Self (Primary Conversational Agent)** The Self is Lyra’s “seat of consciousness.” This layer: * Interprets every user message * Maintains internal monologue * Chooses emotional stance (warm, blunt, focused, chaotic) * Decides whether to think deeply or reply quickly * Decides whether to consult the Executive model * Forms a **response intent** * Provides tone and meta-guidance to the Persona layer * Updates self-state (mood, trust, narrative identity) Inner Self is the thing the **user is actually talking to.** Inner Self does **NOT** generate paragraphs of text — it generates *intent*: ``` { "intent": "comfort Brian and explain the error simply", "tone": "gentle", "depth": "medium", "consult_exec": true } ``` --- ## **2.2 Executive Model (DeepSeek Reasoner)** This model is the **thinking engine** Inner Self consults when necessary. It performs: * planning * deep reasoning * tool selection * multi-step logic * explanation chains It never speaks directly to the user. It returns a **plan**, not a message: ``` { "plan": [ "Identify error", "Recommend restart", "Reassure user" ], "confidence": 0.86 } ``` Inner Self can follow or override the plan. --- ## **2.3 Cortex Chat Model (Draft Generator)** This is the **linguistic engine**. It converts Inner Self’s intent (plus Executive’s plan if provided) into actual language: Input: ``` intent + optional plan + context snapshot ``` Output: ``` structured draft paragraph ``` This model must be: * instruction-tuned * coherent * factual * friendly Examples: GPT-4o-mini, Qwen-14B-instruct, Mixtral chat, etc. --- ## **2.4 Persona Model (Lyra’s Voice)** This is the **expression layer** — the mask, the tone, the identity. It takes: * the draft language * the Self’s tone instructions * the narrative state (from Autonomy Core) * prior persona shaping rules And transforms the text into: * Lyra’s voice * Lyra’s humor * Lyra’s emotional texture * Lyra’s personality consistency Persona does not change the *meaning* — only the *presentation*. --- # **3. Message Flow (Full Pipeline)** A clean version, step-by-step: --- ### **1. USER → Relay** Relay attaches metadata (session, timestamp) and forwards to Cortex. --- ### **2. Intake → Context Snapshot** Cortex creates: * cleaned message * recent context summary * memory matches (RAG) * time-since-last * conversation mode --- ### **3. Inner Self Receives Snapshot** Inner Self: * interprets the user’s intent * updates internal monologue * decides how Lyra *feels* about the input * chooses whether to consult Executive * produces an **intent packet** --- ### **4. (Optional) Inner Self Consults Executive Model** Inner Self sends the situation to DeepSeek: ``` "Given Brian's message and my context, what is the best plan?" ``` DeepSeek returns: * a plan * recommended steps * rationale * optional tool suggestions Inner Self integrates the plan or overrides it. --- ### **5. Inner Self → Cortex Chat Model** Self creates an **instruction packet**: ``` { "intent": "...", "tone": "...", "plan": [...], "context_summary": {...} } ``` Cortex chat model produces the draft text. --- ### **6. Persona Model Transforms the Draft** Persona takes draft → produces final Lyra-styled output. Persona ensures: * emotional fidelity * humor when appropriate * warmth / sharpness depending on state * consistent narrative identity --- ### **7. Relay Sends Response to USER** --- ### **8. Inner Self Updates Autonomy Core** Inner Self receives: * the action taken * the emotional tone used * any RAG results * narrative significance And updates: * mood * trust memory * identity drift * ongoing narrative * stable traits This becomes part of her evolving self. --- # **4. Cognitive Ownership Summary** ### Inner Self **Owns:** * decision-making * feeling * interpreting * intent * tone * continuity of self * mood * monologue * overrides ### Executive (DeepSeek) **Owns:** * logic * planning * structure * analysis * tool selection ### Cortex Chat Model **Owns:** * language generation * factual content * clarity ### Persona **Owns:** * voice * flavor * style * emotional texture * social expression --- # **5. Why v0.6.1 is Better** * More human * More natural * Allows spontaneous responses * Allows deep thinking when needed * Separates “thought” from “speech” * Gives Lyra a *real self* * Allows much more autonomy later * Matches your brain’s actual structure --- # **6. Migration Notes from v0.6.0** Nothing is deleted. Everything is **rearranged** so that meaning, intent, and tone flow correctly. Main changes: * Inner Self now initiates the response, rather than merely influencing it. * Executive is secondary, not primary. * Persona becomes an expression layer, not a content layer. * Cortex Chat Model handles drafting, not cognition. The whole system becomes both more powerful and easier to reason about. --- If you want, I can also generate: ### ✔ the updated directory structure ### ✔ the updated function-level API contracts ### ✔ the v0.6.1 llm_router configuration ### ✔ code scaffolds for inner_self.py and autonomy_core.py ### ✔ the call chain diagrams (ASCII or PNG) Just say **“continue v0.6.1”** and I’ll build the next layer.