project-lyra/lyra at 5dc3fa17d7b2e291f4b11e7b057ed7f493d7fd1e - project-lyra - Serversdown Labs

serversdown/project-lyra

Files

T

History

serversdown 5dc3fa17d7 feat(web): stream chat replies token-by-token (M3)

- llm.chat_call_stream: streaming generator for all 3 backends (Ollama NDJSON,
  OpenAI/MI50 SSE), accumulating tool-call fragments by index.
- chat.respond_stream: mirrors respond()'s tool loop and persistence/compaction,
  yielding ("delta", text) / ("tool", name) / ("done", reply).
- POST /v1/chat/stream: SSE endpoint; blocking generator bridged to async via a
  worker thread + asyncio.Queue. Old completions endpoint kept as fallback.
- Client streams into a live bubble with a blinking caret; rAF-throttled render
  (no full re-parse per token) and instant scroll during stream — fixes iOS
  Safari ghosting from per-token smooth-scroll. Falls back to the blocking
  endpoint only if nothing streamed (no double-persist).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

2026-06-19 00:06:51 +00:00

..

feat: deterministic equity/board-reading tool (math via tools, not LLM)

2026-06-18 18:45:40 +00:00

feat(web): stream chat replies token-by-token (M3)

2026-06-19 00:06:51 +00:00

__init__.py

chore: project scaffold (uv, .env.example, README, lyra package)

2026-05-16 06:01:08 +00:00

__main__.py

feat: persona chat loop, web UI, and local (Ollama) embeddings

2026-06-15 18:36:31 +00:00

backfill.py

fix: backfill skips hand extraction by default (prose->replay too lossy)

2026-06-18 06:04:02 +00:00

chat.py

feat(web): stream chat replies token-by-token (M3)

2026-06-19 00:06:51 +00:00

clock.py

feat: time awareness — Lyra perceives 'now' and how long it's been

2026-06-17 02:31:40 +00:00

config.py

feat: separate CHAT_MODEL (gpt-4o) for persona fidelity

2026-06-16 21:05:47 +00:00

dream.py

feat: Lyra's journal — permanent thought record + a knowing journal note

2026-06-17 06:40:46 +00:00

equity.py

feat: deterministic equity/board-reading tool (math via tools, not LLM)

2026-06-18 18:45:40 +00:00

era.py

feat: era-rollup + narrative engine (consolidation steps 3-4)

2026-06-16 19:28:01 +00:00

ingest.py

feat: import raw ChatGPT export (new sharded format)

2026-06-16 02:40:32 +00:00

llm.py

feat(web): stream chat replies token-by-token (M3)

2026-06-19 00:06:51 +00:00

logbus.py

feat: run dream cycle as a systemd user service + journald-visible logs

2026-06-17 01:42:55 +00:00

memory.py

feat: behind-the-scenes 👍/👎 rating system (fine-tune data collection)

2026-06-18 19:32:27 +00:00

narrative.py

feat: era-rollup + narrative engine (consolidation steps 3-4)

2026-06-16 19:28:01 +00:00

persona.py

feat: persona chat loop, web UI, and local (Ollama) embeddings

2026-06-15 18:36:31 +00:00

poker.py

feat: backfill poker tracker from curated .md session logs

2026-06-18 05:55:22 +00:00

profile.py

feat: profile layer — semantic memory (consolidation step 2)

2026-06-16 04:11:19 +00:00

self_state.py

feat: break reflection repetition — varied grist, show-and-forbid, wider lens

2026-06-18 19:21:51 +00:00

session.py

feat: persona chat loop, web UI, and local (Ollama) embeddings

2026-06-15 18:36:31 +00:00

summary.py

perf: concurrent summarize-all (parallel LLM, serial DB)

2026-06-16 16:30:07 +00:00

tools.py

feat: deterministic equity/board-reading tool (math via tools, not LLM)

2026-06-18 18:45:40 +00:00