seismo-relay

Author	SHA1	Message	Date
serversdown	e8682d49ad	scripts/backfill_sidecars: cascade h5 regen when sidecar is stale + bump TOOL_VERSION Two coupled changes that close the rollout gap left by the read_blastware_file codec wiring: 1. minimateplus/event_file_io.py: bump TOOL_VERSION from 0.16.1 to 0.20.0. This is the version stamp the backfill script reads from each sidecar's source.tool_version field to detect "this sidecar was written before the current decoder shipped, regenerate it." Bumping past every value baked into existing prod sidecars flags them all as stale on the next backfill run — which is exactly what we want, since every pre-codec-wiring sidecar was written by the retracted int16-LE decoder. 2. scripts/backfill_sidecars.py: when the sidecar is being regenerated this iteration (sha mismatch, tool_version too old, or --force), also regenerate the .h5. Previously the .h5 logic only rewrote when --force was passed or the file was missing — so a tool_version-driven sidecar regen left the broken .h5 in place forever. Added a `sidecar_stale` boolean to track the "we're rewriting the sidecar this iteration" state and wired it into the h5 need-rewrite check. Path coverage (verified by trace): - sidecar missing → both regen - --force → both regen - sha mismatch → both regen - tool_ver too old → both regen (THE post-codec-wiring case) - everything OK → skip iteration entirely (h5 untouched) Operator review state (review.false_trigger, reviewer, notes) and the sidecar's extensions block are preserved across regen by the existing read-existing-sidecar / pass-into-event_to_sidecar_dict path — unchanged from prior behavior. Deploy procedure (on prod): 1. Pull this change + the read_blastware_file codec wiring. 2. `python scripts/backfill_sidecars.py --dry-run` to preview. Every sidecar with source.tool_version<0.20.0 will show as "would (re)write". 3. Run for real (drop --dry-run). Expect every pre-fix event to regen. Big stores may take a while. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-20 18:24:06 +00:00
serversdown	31d691b40b	minimateplus: wire read_blastware_file to verified body codec `read_blastware_file()` was still calling `_decode_samples_4ch_int16_le` (the retracted int16-LE-interleaved hypothesis) on the body bytes, producing ±32K noise on every channel of every BW file read from disk. This was the path watcher-forwarded events take into the system (via the import endpoint → save_imported_bw → read_blastware_file, since the watcher doesn't ship A5 frames), so every .h5 sidecar generated for a forwarded event has been wrong since the feature shipped. The fix is mechanical: pass the body bytes straight to `waveform_codec.decode_waveform_v2()` and run the result through `decoded_to_adc_counts()` for the 16x geo scaling. The body already starts with the codec's exact 7-byte preamble `00 02 00 [Tran[0] BE] [Tran[1] BE]` — confirmed by `body[:3].hex()` across all 9 fixture events. No body-slice adjustment needed. If the codec returns None (truncated/malformed file, synthetic test input with no real waveform), fall back to empty channels with a log warning. The rest of the event (timestamp, waveform_key, project strings, sensor_location, peaks-from-samples=0) is still recoverable. Verified against the bundled fixture corpus: V70 Tran/Vert/Long 3328/3328 sample-sets match .TXT ground truth within the 0.005 in/s display quantum, every row 6S0/RG0/AB0/470 (5-8-26) 3328/2304/1280/1280 samples; Vert PPVs match BW's own report within 0.02 in/s JQ0 3328 samples, Vert PPV 3.384 vs BW 3.465 SP0/SS0/SV0 (loud events) 3072–3328 samples; known walker tail-truncation 1–7 samples per channel, samples reached are byte-exact Existing `test_read_blastware_file_round_trip` (synthetic empty event) continues to pass thanks to the None-fallback. Codec verify scripts (`analysis/verify_quiet_bundle.py`, `analysis/verify_full_decode.py`) re-run unchanged. Added two regression-lock tests in tests/test_event_file_io.py: - test_read_blastware_file_decodes_via_codec[6 fixtures] — verifies sample count + Vert PPV per fixture - test_read_blastware_file_v70_samples_match_txt_truth — verifies every one of V70's 3328 sample-sets across Tran/Vert/Long matches the .TXT ground truth row-by-row within 0.003 in/s Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-20 18:13:24 +00:00
serversdown	cd20be2eff	feat: add thor/micromate compatibility v0.18.0	2026-05-19 04:32:43 +00:00
serversdown	aac1c8e06d	fix(import): derive record_type from filename suffix instead of hardcoding "Waveform" The BW ACH ingest path was inserting every event with record_type="Waveform" regardless of the actual type because read_blastware_file() had `ev.record_type = "Waveform"` hardcoded, and the live watcher-forward path parses files from a tmp path (suffix ".bw") that doesn't carry the original extension. V10.72+ MiniMate Plus firmware encodes the event type as the last character of the AB0T extension scheme (H=Histogram, W=Waveform, M=Manual, E=Event, C=Combo). This change: 1. Adds derive_record_type_from_filename() public helper in minimateplus/event_file_io.py 2. Uses it inside read_blastware_file() so direct callers (the --dry-run path of scripts/import_bw.py, tests, ad-hoc scripts) get correct types automatically 3. Overrides ev.record_type in WaveformStore.save_imported_bw() using the ORIGINAL filename (source_path.name) — required because the parser sees only the tmp file Old S338 firmware (3-char extensions ending in `0`) and any unrecognized suffix fall back to "Waveform". Existing DB rows ingested before this fix are stuck with record_type="Waveform" — a one-off SQL backfill would fix them retroactively if desired. Terra-view's event modal also derives client-side from the filename, so the UI already shows the correct type for old events even without the backfill. Version bumped to 0.16.1 in pyproject.toml, event_file_io.py TOOL_VERSION, sfm/server.py FastAPI version, and CHANGELOG.md. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 21:09:21 +00:00
serversdown	197c0630e2	chore(release): v0.16.0 — BW ACH ingestion The "BW ACH ingestion" release. Paired with series3-watcher v1.5.0, every Blastware ACH event (binary + _ASCII.TXT report) lands in SeismoDb with device-authoritative peaks, project metadata, sensor self-check, and ZC/Time-of-Peak data — without depending on the still-undecoded waveform body codec. Bumps pyproject.toml + minimateplus/event_file_io.py TOOL_VERSION to 0.16.0. README banner + CHANGELOG entry summarise the work that landed across commits cdfe4ad..f83993a on this branch.	2026-05-11 07:33:48 +00:00
serversdown	6b2a44ff02	fix(import): overlay BW report onto Event + upsert DB row on re-import Two compounding bugs caused forwarded events to land in the DB with broken-codec peak values (~10 in/s saturation on every channel) and no project info, even when the watcher correctly paired a BW ASCII report with the binary. Bug 1: save_imported_bw built the sidecar JSON with the report's authoritative peak / project values via event_to_sidecar_dict( bw_report=...), but never overlaid those onto the in-memory Event that flows to db.insert_events(). So the DB row got peak_values from read_blastware_file()._peaks_from_samples() — which runs the still-undecoded waveform body codec assuming raw int16 LE and produces ±32K-shaped noise (= ±10 in/s at Normal range) regardless of the actual signal. The sidecar JSON had the truth but the DB columns (which the webapp queries for fast filter/sort) lied. Bug 2: insert_events' IntegrityError handler only refreshed the filename/filesize/a5_pickle/sidecar columns when a duplicate (serial, timestamp) was seen. Peak values, project info, sample_rate, record_type stayed locked in at whatever the FIRST insert wrote. So even after Bug 1 was fixed, the historical events in the DB (already inserted with broken-codec peaks) would never get their values corrected, because a re-forward would just hit IntegrityError and skip the field refresh. Fix 1 (minimateplus/event_file_io.py + sfm/waveform_store.py): - New apply_report_to_event(event, report) helper folds the BW report's device-authoritative fields onto the Event in-place: per-channel PPV, peak vector sum, mic PSPL→psi, project / client / operator / sensor_location, sample_rate, record_time. - save_imported_bw() calls the helper right after parsing the report. The Event that flows to insert_events() now carries correct values. Fix 2 (sfm/database.py): - insert_events()'s IntegrityError UPDATE now refreshes every device-authoritative column from the new data: tran_ppv, vert_ppv, long_ppv, peak_vector_sum, mic_ppv, project, client, operator, sensor_location, sample_rate, record_type, plus the existing filename/filesize/a5_pickle/sidecar fields. - Preserves: id, waveform_key, session_id, created_at (immutable / FK fields), and false_trigger (operator review state). End-to-end simulation verified: - Step 1: import without report → DB has ±10 in/s peaks, no project - Step 2: re-import WITH report → upsert path fires, DB now has device-authoritative 0.005 in/s peaks + sensor_location - Step 3: operator sets false_trigger=1, re-import again → flag preserved, peaks remain correct For the user's situation: deleting the watcher state file forces a re-forward of all events. Each re-forward now pairs with its _ASCII.TXT, applies the report onto the Event, and the upsert refreshes the DB row. No DB nuke needed. Full SFM suite: 62 passed, 44 skipped.	2026-05-11 05:51:39 +00:00
serversdown	cdfe4ad3c8	feat(import): parse paired BW ASCII reports on /db/import/blastware_file Blastware's ACH writes a per-event ASCII report (.TXT) alongside each event binary, containing the rich derived per-channel fields BW computes (PPV, ZC Freq, Time of Peak, Peak Acceleration, Peak Displacement, Peak Vector Sum + time, sensor self-check Pass/Fail, monitor-log timestamps). None of this lives in the BW binary itself. When the watcher daemon forwards both files to /db/import/blastware_file in one multipart POST, we now: - Pair binaries with their .TXT partners by filename match - Parse the report into a structured BwAsciiReport - Land the rich fields in a new top-level `bw_report` block of the sidecar JSON - Overlay the report's peaks/project_info/timestamp/sample_rate/ record_time/total_samples/pretrig_samples onto the canonical sidecar fields (the report values are device-authoritative; the BW-binary STRT-derived values had bugs like reading the 0x46 record-type marker as rectime) This unblocks the monthly-summary review workflow — events become sortable/filterable by peak, location, project, etc. — without depending on the still-undecoded waveform body codec.	2026-05-08 23:56:43 +00:00
serversdown	c641d5fc10	feat: v0.15.0 ### Added - Layered event storage architecture. Each event now lands as four files in the per-serial waveform store, each with a clear role: - `<filename>` — the Blastware-readable binary (BW file). Untouched. - `<filename>.a5.pkl` — the raw 5A frames (regenerative source). - `<filename>.h5` — clean per-channel waveform arrays in physical units (in/s for geo, psi for mic) plus event metadata (HDF5 with gzip compression). This is the canonical format for downstream analysis tools. - `<filename>.sfm.json` — the modern review/metadata sidecar (peaks, project, source provenance, review state, extensions). SQLite (`seismo_relay.db`) is the searchable index over all four. - Plot-ready waveform JSON (`sfm.plot.v1`). The `/device/event/{idx}/waveform` and `/db/events/{id}/waveform.json` endpoints now return samples in physical units with explicit time-axis metadata, peak markers, and per-channel unit hints — no more guessing the ADC-to-velocity scale client-side. The webapp waveform viewer was rewritten to consume this shape. - In-app waveform viewer accuracy fix. The standalone SFM webapp viewer was scaling geophone amplitudes by `geoAdcScale / 32767` (≈ 6.206 / 32767), where `geoAdcScale = 6.206053` is the device's in/s per V hardware constant — not the ADC-counts-to-velocity factor. This silently scaled every plot ~38% too low for Normal-range geophones (the correct full-scale is 10.0 in/s, or 1.25 in/s for Sensitive). Conversion is now done server-side using the geo_range from compliance config; the client just plots. - New `sfm/event_hdf5.py` module: `write_event_hdf5()`, `read_event_hdf5()`, plus a plot-JSON helper. - Backfill script extended to also emit `.h5` for existing events. ### Dependencies - Added `h5py>=3.10` and `numpy>=1.24` for the HDF5 storage layer. - Added `python-multipart>=0.0.7` (required by FastAPI for the `/db/import/blastware_file` endpoint introduced in this release).	2026-05-08 04:39:51 +00:00

8 Commits