Tepna physiological-signal suite — cross-node perspective
Single-signal synthetic physiological generation — plausible-looking ECG, PPG, SpO₂, or CGM traces from GANs and diffusion models — is a mature research area; reproducing one channel that looks right is largely a solved problem. Yet the failures that actually break a multi-device analysis suite live almost entirely outside the waveform: in timestamp pathology, in cross-modal temporal coherence, and in the provenance metadata that lets a derived metric be trusted. Tepna's harness already models these as first-class concerns — a frozen Clock Contract, an event-bus currency that reconstructs absolute time across nodes, and a build-hash provenance gate — and does so deterministically, without any learned generator. This note records, while the observation is fresh, four directions where synthetic-data work is still genuinely open, why Tepna is unusually well-positioned to study them, and which would be worth a real paper once the suite's core scope is met. This is a perspective, not a study; it makes no empirical claims.
Keywords: synthetic data · physiological time-series · multimodal fusion · timestamp normalization · data provenance · deterministic generation · benchmarking
The question that prompted this note was simply whether Tepna is interesting for synthetic-data work, or whether the field has moved past it. The honest answer is split, and the split is itself the contribution: the obvious target (better-looking signals) is largely done elsewhere, while the targets Tepna happens to take seriously (time, cross-modal coherence, provenance) remain under-served. That asymmetry is easy to lose once the build resumes, so it is recorded here as a parked agenda rather than pursued — explicitly out of the current scope.
Generating a single physiological channel that passes visual and short-horizon statistical inspection is well-trodden: adversarial and diffusion models for ECG morphology, PPG synthesis, and CGM trajectory generation are established. If the goal were "produce fake vitals that look real," off-the-shelf approaches suffice and Tepna adds little. We therefore treat per-channel realism as a solved input, not a research question, and direct attention to the layers above it.
startEpochMs plus wall-clock event strings reconstructed to absolute time, monotonic past midnight. Correlated multimodal synthesis with shared temporal anchors is substantially harder than single-channel generation and remains largely unaddressed.k is a pure function of k — with no model training, so a fixture set is byte-reproducible and auditable. This is the right tool for adversarially probing parsers and fusion logic (malformed stamps, wrapped clocks, mixed locales, dropped channels). The open question is whether a deterministic, fully-specified fixture suite can serve as a shared "where do consumer multi-signal pipelines break" benchmark — complementary to, and more reproducible than, a learned generator.| Layer | Field maturity | Harness fit | Needs ML? |
|---|---|---|---|
| Single-channel waveform realism | mature | low | yes |
| Timestamp / clock pathology | open | high | no |
| Cross-modal temporal coherence | open | high | partly |
| Provenance & evidence metadata | open | high | no |
| Deterministic fixture benchmark | emerging | high | no |
Three of the four open directions need no learned generator at all — they are specification, generation, and verification problems, which is exactly what the deterministic cohort harness is good at. That keeps any future work cheap to start and reproducible by construction.
The most defensible first paper is the narrowest: a deterministic timestamp-pathology benchmark. Enumerate the Clock Contract's failure surface, generate a fixture corpus that exercises each case with known ground truth, run the production parsers against it, and report where consumer-style parsers (and, separately, naive new Date(str) baselines) diverge from the contract. It needs no cohort physiology, no model training, and grounds out entirely in code Tepna already ships. The multimodal-coherence and provenance-degradation directions are larger and better deferred until the cross-node Integrator scope is complete.
CLAUDE.md §"THE CLOCK CONTRACT" and each node's *-dsp.js parseTimestamp.ganglior_events / startEpochMs export contract; ganglior-provenance.js for build-hash.cohort-gen.js, cohort-harness.html, COHORT-WORKFLOW-GUIDE.md.metric-registry.js (BADGE_CSS) and each node's <node>-registry.js.CLAUDE.md (Clock Contract, provenance & evidence gates), INTEGRATOR-BUILD-BRIEF.md, COHORT-WORKFLOW-GUIDE.md, Tepna suite.