Personal Prognostic Drift Engine: A User-Centred Pre-Event Health Framework Using Repeated Low-Burden Sampling and Longitudinal Personal Data

 

Author: Truthfarian / Endarr Carlton Ramdin

Date: 18th April 2026

 

Core Thesis

The system does not wait for disease to become obvious.
It identifies biological drift before overt manifestation.

Conventional sequence: symptom → test → diagnosis → intervention
Proposed sequence: baseline → repeated sampling → drift detection → prognostic warning → early escalation

Chapter 1. The Failure of After-Event Medicine

The central weakness of contemporary healthcare is not only diagnostic difficulty. It is diagnostic timing. Most systems remain organised around confirmation after emergence rather than detection before manifestation. A person enters the medical pathway when symptoms appear, when damage becomes measurable, or when a conventional threshold is crossed late enough to trigger institutional attention. By that stage, the event is already under way. Prognosis has failed, and detection has become reactive. The purpose of this chapter is to define that failure clearly and to show why a pre-event, user-centred model is required.

1.1 The Institutional Timing Problem

The institutional health model is structured around interruption rather than continuity. A person is generally measured when they enter a clinic, report symptoms, qualify for a screening window, or reach a visible threshold of concern. Between those moments, the biological process continues, but the system is largely absent. This produces a timing defect at the core of care. The system does not observe formation. It observes consequence.

This defect is not simply administrative. It shapes the entire diagnostic sequence. Disease is often recognised only once enough material change has accumulated to become legible within an institutional frame. By then, the event is no longer emerging. It is already underway. The logic is therefore reactive by design.

The weakness can be stated plainly:

institutional medicine tends to ask, "What is wrong now?"

A prognostic system must ask, "What is forming, and how long has it been forming?"

That difference defines the timing problem.

The present structure creates four failures.

First, it fragments the human timeline. Biological change is continuous, but observation is episodic.

Second, it privileges population thresholds over personal deviation. A value may remain inside a broad clinical range while still representing significant departure from that individual's own baseline.

Third, it delays escalation until abnormality becomes sufficiently visible to outside systems, rather than sufficiently persistent within the person.

Fourth, it mistakes silence for safety. Lack of institutional detection is treated too easily as lack of underlying formation.

The result is a model that too often identifies disease after structure has already consolidated. That is the precise point at which prognosis has been replaced by delayed recognition.

This paper rejects that timing arrangement. It starts from a different premise: the person is the continuous site of biological information, and meaningful deterioration should be detected as directional drift across time rather than as a single late event.

1.1.1 Structural Consequence

When timing is late, intervention is late. When observation is intermittent, continuity is lost. When continuity is lost, disease formation is mistaken for sudden appearance.

This is why after-event medicine repeatedly gives the impression that illness arrives abruptly. In reality, many conditions develop through gradual divergence long before formal recognition.

1.1.2 Chapter Function

This section establishes the first foundation of the paper: the principal weakness of the dominant model is not merely diagnostic imperfection, but temporal misalignment.

The remaining sections of this chapter will show how that misalignment produces delayed recognition, why post-event detection is structurally weak, and why a pre-event prognostic architecture is required.

1.2 Disease as Delayed Recognition

Disease is too often treated as though it begins at the moment of discovery. That is a categorical error. In many cases, discovery is not the beginning of the event. It is the late recognition of a process that has already been forming across time.

The distinction is central.

A disease event has at least two timelines: the biological timeline, in which deviation emerges, accumulates, and stabilises; and the institutional timeline, in which the condition becomes visible to formal systems.

These two timelines are not the same. The biological sequence usually begins earlier. The institutional sequence begins later. The gap between them is where avoidable deterioration often occurs.

This paper therefore treats diagnosis, in its conventional form, as a downstream act. It is not the first appearance of structure. It is the point at which structure has become sufficiently legible to institutional thresholds, tools, or habits of recognition.

That is why the phrase delayed recognition is more accurate than late detection.

The system often does detect something. But what it detects is not emergence. It detects an already-developed signal that has crossed into visibility.

1.2.1 The Misreading of Onset

When a condition is said to have "appeared," that language usually compresses a longer formation path into a single event. It gives the illusion of suddenness. In reality, what appears sudden is often the point at which accumulated divergence can no longer remain hidden within broad tolerance bands.

The present medical sequence therefore tends to collapse three distinct stages into one: formation, visibility, recognition. They should be separated.

Formation is the period in which biological drift begins. Visibility is the stage at which the drift becomes measurable. Recognition is the point at which the system acknowledges it.

The current model places excessive weight on the third stage and insufficient attention on the first.

1.2.2 Why This Matters

If disease is understood only at the point of recognition, then the entire preventive field is reduced. The person is treated as well until proven otherwise, even where meaningful deviation may already be underway. That is not because the body is silent, but because the model is not listening continuously enough.

This creates a false binary: healthy or diseased.

The stronger interpretation is process-based: stable, drifting, persistently diverging, clinically manifest.

That sequence is more realistic and more useful. It allows prognosis to become possible because it acknowledges that deterioration is often progressive before it is undeniable.

1.2.3 Prognostic Implication

Once disease is reframed as delayed recognition, the design requirement changes immediately. The task is no longer to wait for sufficient proof after manifestation. The task is to monitor the formation path itself.

That means the system must be built to observe: small deviation, repetition, persistence, acceleration, multi-signal convergence.

These are not yet full diagnosis. But they are the architecture of emergence.

A prognostic engine must therefore operate before formal certainty. It must identify when a person is leaving their normal corridor in a way that is structured, sustained, and directionally meaningful.

1.2.4 Section Conclusion

The dominant model recognises too much too late because it treats disease as an event of visibility rather than a process of formation. This paper rejects that framing. Disease should be understood as a temporal progression that becomes institutionally legible only after a prior interval of biological drift.

That is the second foundation of the paper: what is commonly called detection is often delayed recognition of an earlier process.

1.3 Why Post-Event Detection Is Structurally Weak

Post-event detection is structurally weak because it enters the sequence after meaningful formation has already occurred. It is not weak merely because some tests are imperfect. It is weak because the model itself begins too late.

Once a condition has crossed into symptomatic or clinically visible form, several disadvantages are already in play.

First, biological divergence may have been active for a substantial period without structured observation.

Second, the person may only enter the system when damage has become difficult to reverse.

Third, intervention options may narrow because the system is now responding to an established pattern rather than a forming one.

Fourth, the interpretation of the event becomes compressed into a short institutional window, even though the actual formation path may have been long and gradual.

This means post-event detection is not simply a later version of the same task. It is a different task altogether. It is no longer the detection of emergence. It is the recognition of consequence.

1.3.1 Late Recognition Distorts the Signal

When the system waits until the event is visible, it loses access to the earlier gradient of change. That gradient matters. It is often the only place where subtle but persistent deviation can be distinguished from abrupt incidental fluctuation.

In a late model, the system sees the result but not the pathway.

That creates three distortions: compression (a long biological sequence is reduced to a short interpretive moment), threshold bias (attention is given mainly to what crosses a visible line, not to what has been drifting toward it), and context loss (the signal is judged without sufficient personal history, baseline continuity, or rate-of-change memory).

The result is a medicine of snapshots rather than trajectories.

1.3.2 The Cost of Episodic Measurement

Episodic measurement assumes that intermittent checking is enough. In many cases it is not. A person may pass through long periods of meaningful but unrecognised change between formal contacts. The health system then encounters only scattered fragments of a continuous process.

That arrangement has an obvious weakness: if measurement is sparse, then change can consolidate between observations.

This is why many conditions appear to be discovered "suddenly." The suddenness often belongs to the institution, not to the biology. The system notices late and mistakes lateness for abruptness.

1.3.3 Population Thresholds Arrive Too Coarsely

Post-event systems also rely too heavily on broad population bands. These are useful for general classification, but they are often too coarse to function as early personal warning instruments.

A person can drift materially away from their own norm while still remaining inside a wide accepted range.

So the structural weakness is not only that testing is delayed. It is also that the criteria for abnormality are often insufficiently personalised at the point when early recognition would matter most.

The model therefore fails twice: it checks too late, and when it does check, it may compare too broadly.

1.3.4 Post-Event Logic Reduces Prognosis to Reaction

A system designed around post-event confirmation naturally centres reaction. It asks what is present, what is measurable now, and what intervention follows confirmed recognition. That logic has value, but it is not prognosis.

Prognosis requires a different orientation. It requires the system to identify directional change before formal confirmation becomes unavoidable. It must be able to say that the person is not yet clinically manifest, but is no longer remaining within their own stable corridor.

That intermediate space is precisely what post-event models are poorly built to manage.

1.3.5 Section Conclusion

Post-event detection is structurally weak because it begins after formation, compresses long biological change into short institutional moments, and relies too heavily on coarse thresholds once deterioration is already underway. It is therefore reactive by architecture, not merely by accident.

This establishes the third foundation of the paper: a late-stage detection model cannot serve as a full prognostic model because it encounters disease too far downstream.

1.4 The Need for Pre-Event Prognosis

If after-event medicine is structurally late, then the necessary response is not minor optimisation of the same sequence. The necessary response is a different temporal model.

That model is pre-event prognosis.

Pre-event prognosis does not wait for disease to become obvious. It does not rely on the event being visible enough to satisfy institutional habits of recognition. It begins earlier, at the level of repeated deviation, persistent drift, and directional biological change. Its purpose is to identify the formation path before overt manifestation.

This is the central requirement of the paper.

The system must be able to recognise when a person is no longer remaining within their own stable corridor, even if formal clinical criteria have not yet been fully met. That is not guesswork. It is structured interpretation of continuity.

1.4.1 Prognosis as Temporal Reversal

Conventional detection asks: What condition is present now?

Pre-event prognosis asks: What condition is forming, how stable is that formation, and how early can intervention begin without waiting for overt manifestation?

That is a reversal of timing, but also a reversal of logic.

The emphasis moves: from confirmation to emergence, from event to trajectory, from episodic recognition to continuous interpretation, from institutional visibility to personal deviation.

This is the shift the paper advances.

1.4.2 What a Prognostic System Must Observe

A viable prognostic model must observe more than isolated values. It must track pattern across time. The relevant units are not single readings, but structured persistence.

The system must therefore be able to recognise: small but repeated deviation, rate of change, persistence across windows, convergence across signal groups, departure from personal baseline, failure to return to equilibrium.

These are the true early units of deterioration.

A person does not usually move from stable to manifest disease in one step. There is commonly an intermediate corridor in which change is present but not yet formally declared. That corridor is where prognosis operates.
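The observational units listed above can be made concrete in a short sketch. The function below summarises a recent window of readings against a stored personal baseline; the function name, the 1.5-SD threshold, and the data layout are illustrative assumptions of this sketch, not elements prescribed by the framework.

```python
from statistics import mean, stdev

def drift_observables(history, recent, z_thresh=1.5):
    """Illustrative sketch: summarise a recent window of readings against
    a personal baseline history. The threshold and names are assumptions."""
    mu, sd = mean(history), stdev(history)
    # departure from the personal baseline, in baseline standard deviations
    deviations = [(x - mu) / sd for x in recent]
    # persistence: how many recent readings sit beyond the threshold
    persistent = sum(1 for d in deviations if abs(d) > z_thresh)
    # rate of change: simple per-step slope across the recent window
    rate = (recent[-1] - recent[0]) / (len(recent) - 1)
    # failure to return: the latest reading still sits outside the corridor
    unrecovered = abs(deviations[-1]) > z_thresh
    return {"persistent": persistent, "rate": rate, "unrecovered": unrecovered}
```

A multi-signal system would run such a summary per channel and then test for convergence across channels, but the single-channel form is enough to show that each listed unit (repetition, rate, persistence, failed recovery) is computable from repeated readings alone.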

1.4.3 Why the Person Must Be the Continuity Point

No external institution sees the body continuously. The person does. That is why the individual, not the appointment, must become the continuity node in the model.

A pre-event system must therefore be user-centred by design.

It must permit repeated low-burden capture. It must store personal temporal history. It must compare each new signal against the person's own corridor. It must detect drift before clinical manifestation. It must escalate only when structured abnormality persists.

This is why the paper is not simply about a test. It is about a continuity architecture.

The pinprick model becomes important here because it makes repeated participation realistic. If the signal can be gathered with low friction, then a genuine temporal map becomes possible. Without repeatability, prognosis collapses back into occasional checking.

1.4.4 Prognosis Before Proof

A pre-event system does not require absolute proof at the earliest stage in order to be useful. Its role is not to declare certainty prematurely. Its role is to identify structured concern before damage is consolidated.

That means the system operates in a different evidential mode.

Not final diagnosis, but early warning. Not institutional closure, but monitored escalation. Not proof after the event, but structured indication before it.

This distinction is critical. It allows the system to act lawfully within uncertainty. It does not claim to know more than is present. It claims only to recognise when the trajectory has become sufficiently abnormal to justify attention.

1.4.5 Section Conclusion

Pre-event prognosis is required because delayed recognition cannot protect against processes that form before visibility. A viable health architecture must therefore begin with continuity, repeated personal measurement, and structured detection of drift prior to overt clinical manifestation.

This establishes the fourth foundation of the paper: the correct object of health intelligence is not merely the diagnosed event, but the formation path that precedes it.

1.5 Chapter Conclusion

This chapter has established the temporal failure at the centre of after-event medicine. The dominant model is built around delayed recognition. It enters too late, observes too intermittently, and relies too heavily on visibility after formation has already advanced. As a result, it often confuses late notice with sudden onset and treats consequence as though it were emergence.

The alternative proposed in this paper is pre-event prognosis. Disease is not approached as a binary event that begins at the moment of discovery. It is approached as a progressive departure from equilibrium that can be tracked earlier through repeated personal observation.

The key conclusion is therefore straightforward: the principal weakness of conventional detection is not only imperfect testing, but temporal misalignment between biological formation and institutional recognition.

That conclusion justifies the next chapter. If the timing problem is real, then the system needs a different reference instrument. That instrument is not the occasional appointment. It is the person's own longitudinal baseline.

Chapter 2. Personal Baseline as the Primary Instrument

If the failure of after-event medicine is a failure of timing, then the corrective instrument must be continuity. That continuity cannot be supplied by occasional institutional contact alone. It must be supplied by the individual baseline.

The central proposition of this chapter is simple: the most meaningful reference point in a prognostic system is not merely the average population value, but the person's own longitudinal norm. A human being is not a sequence of isolated readings. A human being is a continuous biological process with its own range, rhythm, variance, and recovery pattern. For that reason, early deterioration is often best detected not by asking whether a value is abnormal in general, but by asking whether it is abnormal for that person across time.

A baseline is therefore not a static opening measurement. It is a living corridor of expected behaviour. It records the person's normal range, fluctuation limits, directional tendencies, and recovery characteristics. Once that corridor exists, new readings can be interpreted in context rather than in isolation.

This changes the logic of detection.

Instead of asking: Is this reading outside a broad population band?

The prognostic system asks: Is this reading departing from the person's own established equilibrium, and is that departure persistent enough to matter?

That is the foundation of personal prognosis.

2.1 The Person as a Continuous Dataset

The person must be treated as the continuity point because that is where the biological process actually exists. Institutions see fragments. The individual contains the timeline.

This distinction matters because many forms of deterioration do not begin as dramatic events. They begin as small deviations that repeat, accumulate, and slowly stabilise into a pattern. A single appointment may miss that sequence entirely. A continuous personal record will not.

To treat the person as a continuous dataset is not to reduce the person to numbers. It is to recognise that health unfolds across time and that any serious prognostic system must preserve that temporal structure.

The continuous dataset includes: repeated biological readings, historical variance, behavioural rhythm, recovery behaviour, exposure pattern, personal symptom timing, family-risk context.

Taken together, these do not merely describe the person at one moment. They describe the person in motion.

That is why baseline matters. Without it, every reading is forced to stand alone. With it, each reading becomes part of a temporal sequence that can be compared, weighted, and interpreted.

2.1.1 Why Institutional Fragments Are Not Enough

An institutional system usually sees the person in episodes. A test is taken here. A symptom is reported there. A check-up occurs months later. These fragments may each be accurate in themselves, but they do not necessarily preserve the continuity of change between them.

The result is a fragmented model of health: isolated measurement, isolated interpretation, isolated response.

A prognostic architecture cannot rely on fragmentation. It must be able to observe whether a deviation is new, recurring, accelerating, recovering, or converging with other changes. None of that can be done properly without continuity.

2.1.2 The Baseline as a Living Corridor

The baseline should therefore be defined as a corridor, not a point.

A point says only what was true once. A corridor says what is normally true across time.

This corridor has structure. It includes: normal mean level, normal fluctuation range, rate of return after disturbance, seasonal or cyclical variation, interaction with lifestyle and exposure.

That is what makes it clinically useful. It permits the system to identify not only extreme abnormality, but meaningful departure from the person's own expected path.

2.1.3 Section Conclusion

The person is the proper continuity node in a prognostic model because health is continuous at the level of life, not at the level of appointments. A viable early-warning system must therefore treat the individual as a longitudinal dataset and the baseline as a living equilibrium corridor rather than a single opening value.

2.2 Baseline Versus Population Threshold

Population thresholds are useful, but they are not sufficient as the primary instrument of prognosis. They define what is broadly expected across groups. They do not necessarily define what is normal for one person across time. That distinction is decisive.

Laboratory medicine has been moving in this direction for several years. Work on personalised reference intervals argues that interpretation can be improved by using an individual's own historical results together with analytical and within-subject biological variation, rather than relying only on standard population ranges.

The practical reason is straightforward: many biomarkers vary less within a person than they vary across a population. When that is true, a person can drift materially away from their own normal setpoint while still remaining inside a broad "normal range." Recent work on haematologic setpoints found that complete blood count setpoints are stable and patient-specific, and that using those setpoints improved sensitivity and specificity for several common conditions compared with population-range interpretation alone.

That is the structural weakness of population-first interpretation. It is broad enough for classification, but often too coarse for early personal warning.

2.2.1 What a Population Threshold Actually Does

A population threshold answers this question: Does the observed value fall within the expected distribution for a reference group?

That has obvious value in general clinical screening. But it is not the same as asking whether the reading is unusual for this individual. A personalised-data review from 2024 makes this distinction directly, contrasting diagnosis based on population data with diagnosis based on personalised longitudinal data and arguing that personalised interpretation can detect change that population thresholds may miss.

So the two instruments serve different functions.

Population threshold: broad external comparison. Personal baseline: internal temporal comparison.

A prognostic system needs both, but the personal baseline must come first.

2.2.2 Why Personal Baselines Are Stronger for Early Drift

Early deterioration often begins as small but persistent movement away from an individual setpoint. That kind of movement may not cross a population threshold immediately. Yet it may still be biologically meaningful.

This is why personalised reference-interval work matters. The premise is not that population ranges are wrong. The premise is that they are incomplete where early change is subtle and person-specific. Personalised intervals and reference-change methods attempt to capture whether the new result is meaningfully different from the person's own prior distribution.

The same logic appears in digital-biomarker literature. Reviews note that individual baseline thresholds and added longitudinal data can reveal small clinical differences that standardised norms may not capture well, especially in conditions with variable or insidious onset.
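The reference-change methods mentioned above have a standard quantitative form in laboratory medicine: a difference between two serial results is treated as significant when it exceeds the reference change value, RCV = √2 × Z × √(CV_A² + CV_I²), where CV_A is the analytical coefficient of variation and CV_I the within-subject biological variation. A minimal sketch of that calculation (function names are mine):

```python
import math

def reference_change_value(cv_analytical, cv_within, z=1.96):
    """Reference change value (RCV), in percent: the smallest difference
    between two serial results exceeding combined analytical (CV_A) and
    within-subject biological (CV_I) variation. z=1.96 gives two-sided
    95% significance."""
    return math.sqrt(2) * z * math.sqrt(cv_analytical**2 + cv_within**2)

def change_is_significant(prev, curr, cv_analytical, cv_within, z=1.96):
    """Sketch: flag a serial result whose percent change exceeds the RCV."""
    pct_change = abs(curr - prev) / prev * 100
    return pct_change > reference_change_value(cv_analytical, cv_within, z)
```

For example, with CV_A = 3% and CV_I = 6%, the RCV is about 18.6%, so a rise from 100 to 115 would not be flagged while a rise from 100 to 120 would. The point for this paper is that the threshold is built from the person's own expected variation, not from a population band.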

2.2.3 The Baseline-First Rule

For the purposes of this paper, the interpretive order should therefore be: first, compare the person to themselves; second, compare the person to the population; third, evaluate whether the deviation is persistent, convergent, and directionally worsening.

That order preserves sensitivity to early drift without abandoning broader clinical context.

The principle can be stated simply: a reading should not be treated as reassuring merely because it remains inside a population band if it is simultaneously leaving the person's own historical corridor.

That principle is consistent with the broader movement toward longitudinal precision health and big biological data, where repeated within-person measurements are treated as essential to precision medicine rather than secondary to one-off classification.
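The three-step interpretive order can be sketched directly. Everything here is an illustrative assumption of the sketch (the function name, the 2-SD corridor width, the two-flag persistence rule, the output labels); the framework fixes only the ordering.

```python
from statistics import mean, stdev

def interpret_reading(value, personal_history, pop_low, pop_high,
                      recent_flags, k=2.0):
    """Illustrative sketch of the baseline-first order:
    (1) compare the reading to the person's own corridor,
    (2) compare it to the population band,
    (3) weigh persistence of prior out-of-corridor flags."""
    mu, sd = mean(personal_history), stdev(personal_history)
    outside_personal = abs(value - mu) > k * sd              # step 1
    outside_population = not (pop_low <= value <= pop_high)  # step 2
    persistent = sum(recent_flags) >= 2  # step 3: repeated prior flags
    if outside_personal and (persistent or outside_population):
        return "structured deviation"
    if outside_personal:
        return "watch"
    return "stable"
```

Note the asymmetry the sketch encodes: a reading inside the population band can still escalate if it has left the personal corridor persistently, which is exactly the principle stated above.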

2.2.4 Section Conclusion

Population thresholds remain useful for broad classification, but they are too coarse to serve as the primary reference frame for pre-event prognosis. Early warning depends on detecting meaningful change relative to the person's own stable corridor. For that reason, the baseline must be treated as the primary instrument, with population ranges functioning as a secondary external check rather than the main decision axis.

2.3 Personal Equilibrium Corridor

A personal baseline becomes clinically useful only when it is treated as a corridor rather than a single value. The corridor is the person's expected range of behaviour across time: not merely what their markers are on one occasion, but how those markers usually fluctuate, recover, and co-vary under ordinary conditions.

This is consistent with the direction of current precision-health work. Longitudinal physiology research increasingly treats the individual as having a stable personal range or setpoint, with departures from that range becoming meaningful when they persist or accumulate. Recent work on patient-specific blood count setpoints found strong intra-individual stability and showed that setpoint-based interpretation improved detection performance over population-range interpretation alone.

2.3.1 Corridor, Not Point

A point measurement says only this: what was observed at one moment.

A corridor says something stronger: what is normally expected for this person across repeated moments.

That distinction matters because a single normal-looking value can conceal directional change if the person is steadily moving away from their own prior level while still remaining inside broad reference limits. Personalised reference-interval research addresses exactly this problem by combining prior personal results with biological and analytical variation to improve interpretation.

The corridor should therefore include at least five components: mean level, ordinary variance, rate of return after disturbance, cyclical or seasonal pattern, cross-signal relationship with other markers.

This is what makes the baseline dynamic rather than static.

2.3.2 Stability Has Structure

Stability does not mean perfect sameness. A healthy system still fluctuates. The issue is whether fluctuation remains inside the person's ordinary operating band and whether recovery occurs as expected.

So the corridor is not a rigid line. It is a bounded, living zone.

That zone can widen or narrow over time. It can shift with age, environment, medication, stress, sleep, or hormonal cycles. Reviews of personalised medicine and digital biomarkers emphasise that repeated within-person data can reveal meaningful small changes precisely because normality is individualised and temporally patterned, not fixed at one universal value.

2.3.3 Early Deterioration Appears as Corridor Exit

The prognostic importance of the corridor is straightforward: early deterioration is often first visible not as catastrophic abnormality, but as repeated failure to remain inside the person's expected band.

That failure can take several forms: drift away from usual mean, increased volatility, slower recovery after disturbance, new coupling between previously separate signals, persistent low-grade deviation across multiple channels.

This is the key design move in the paper. Disease is not treated first as a named object. It is treated first as an abnormal departure from personal equilibrium.

2.3.4 Corridor Logic

The personal equilibrium corridor can be expressed simply as: expected state = personal mean plus ordinary fluctuation bounds across time.

Then each new reading is interpreted in relation to that corridor, not in isolation.

A useful operational rule is: if a signal remains outside the person's expected corridor across repeated windows, or repeatedly returns to the outer edge without full recovery, then the system should treat that as structured deviation rather than noise.

That approach is aligned with current longitudinal precision-health thinking, which treats repeated within-person change as a fundamental source of clinical information rather than as background variation.
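The operational rule above can be sketched as a trailing corridor with a consecutive-exit test. The window size, corridor width, and required run length are illustrative parameters, not values fixed by the framework.

```python
from statistics import mean, stdev

def corridor_exit_flags(series, window=6, k=2.0, consecutive=3):
    """Illustrative sketch of the corridor rule: build a trailing corridor
    (mean +/- k*SD over the last `window` readings) and flag structured
    deviation once `consecutive` successive readings fall outside it."""
    streak = 0
    for i in range(window, len(series)):
        hist = series[i - window:i]
        mu, sd = mean(hist), stdev(hist)
        outside = abs(series[i] - mu) > k * sd
        streak = streak + 1 if outside else 0
        if streak >= consecutive:
            return True, i  # structured deviation confirmed at index i
    return False, None
```

One design caveat worth stating: because the window trails the series, a sufficiently slow drift gradually widens the corridor and can absorb itself. In practice the baseline window would be frozen during a suspected drift episode, or made much longer than the detection horizon, so that the corridor remains a memory of equilibrium rather than a moving average of the deviation.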

2.3.5 Section Conclusion

The personal equilibrium corridor is the operational form of the baseline. It converts isolated readings into a temporal reference structure and allows the system to distinguish ordinary fluctuation from meaningful departure. That is the point at which prognosis becomes possible. The question is no longer whether one reading looks abnormal in general. The question is whether the person is beginning to leave their own stable corridor in a sustained and directionally significant way.

2.4 Repeated Measurement as Structure, Not Noise

A prognostic system fails if it treats repeated readings as disposable fluctuation. Repetition is not a nuisance variable. Repetition is the mechanism by which structure becomes visible.

A single measurement may show position. Repeated measurements show direction.

That distinction is central to this paper. Pre-event prognosis depends less on one isolated result than on whether small departures recur, persist, widen, or converge over time. Current work on personalised reference intervals, longitudinal biomarkers, and precision health increasingly supports this view: repeated within-person measures can reveal clinically meaningful change that broad one-off interpretation can miss.

2.4.1 Why Repetition Changes Interpretation

One reading can always be dismissed as transient variance, sampling error, temporary stress, hydration change, or random disturbance. That is reasonable. But once a similar deviation appears again, or appears across several adjacent windows, its meaning changes.

The question is no longer: Is this reading abnormal?

The question becomes: Is this person forming a repeatable departure from their own equilibrium corridor?

That is why repeated measurement is not redundant. It converts uncertainty into temporal evidence.

Longitudinal methods literature makes this point in statistical form: repeat measurements improve the ability to estimate within-person change and are specifically used to distinguish true temporal movement from noise and measurement error.

2.4.2 Sequence Reveals What Snapshots Cannot

A snapshot may look ordinary even when a sequence is not. Three or four mildly shifted readings, none of which individually appears dramatic, may together reveal a clear drift path. This is especially important in early-stage deterioration, where the biology may move gradually before any one point crosses a conventional threshold.

That is why repeated measures create an entirely different class of signal: persistence, rate of change, acceleration, failed recovery, cross-channel recurrence.

These are structural properties. They do not exist in a one-off reading.

Digital biomarker and precision-health reviews now explicitly frame continuous or repeated capture as valuable because it provides individual baseline thresholds and longitudinal context rather than isolated clinic-time observations.
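One of these structural properties, persistence, can be made concrete with a short sketch. The deviation values and the 0.2 cutoff below are assumptions for demonstration only: persistence is measured as the longest run of same-direction departures beyond the cutoff.

```python
def persistence(deviations, threshold=0.0):
    """Longest run of consecutive readings that deviate in the same
    direction beyond `threshold` (an illustrative cutoff, not a
    clinical one). Noise breaks the run; structure extends it."""
    best = run = 0
    prev_sign = 0
    for d in deviations:
        sign = (d > threshold) - (d < -threshold)  # +1, -1, or 0
        if sign != 0 and sign == prev_sign:
            run += 1                # same-direction departure continues
        else:
            run = 1 if sign != 0 else 0
        prev_sign = sign
        best = max(best, run)
    return best

# Four mildly elevated readings, none dramatic alone, form a run of four.
print(persistence([0.4, 0.5, 0.6, 0.7], threshold=0.2))  # 4
# Alternating departures never organise into a run.
print(persistence([0.4, -0.5, 0.6, -0.7], threshold=0.2))  # 1
```

This is the sense in which sequence reveals what snapshots cannot: each individual value could be dismissed, but the run length is a property only the sequence has.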

2.4.3 Noise Is What Fails to Persist

The usual objection is that repeated small changes may just be noise. That is true only if they fail to organise.

Noise fluctuates without stable direction. Structure returns, persists, or accumulates.

So the role of the prognostic engine is not to overreact to every small deviation. Its role is to test whether deviation repeats with enough consistency to cease being random.

This is where repetition becomes decisive. A system that samples once must choose between overreaction and blindness. A system that samples repeatedly can remain conservative while still becoming sensitive to pattern.

2.4.4 Repetition Supports Personal Setpoints

Repeated measurement is also what makes a personal baseline possible in the first place. Without repeated observations, there is no lawful way to estimate a person's ordinary range, ordinary variance, or ordinary recovery behaviour.

Recent work on patient-specific setpoints and repeated biomarker assessment shows exactly this logic: stable within-person values can be estimated only through repeated data, and those repeated data can improve sensitivity to meaningful change.

So repetition serves two functions at once: it builds the corridor, and it tests whether the corridor is being exited.

That dual role makes it one of the core instruments of the entire model.

2.4.5 Section Conclusion

Repeated measurement should not be treated as background clutter around a supposedly decisive single test. It is the opposite. Repetition is what reveals persistence, drift, failed recovery, and convergent deviation across time. In a prognostic architecture, repeated readings are not noise around the signal. They are the signal's temporal form.

2.5 Chapter Conclusion

This chapter has established the individual baseline as the primary reference instrument of prognosis. The person is the continuity point, not the appointment. Population thresholds remain useful, but they are too coarse to function as the main early-warning frame. The stronger instrument is the personal equilibrium corridor: a living record of the person's usual range, variance, and recovery pattern. Repeated measurements then convert that corridor into a usable prognostic system by distinguishing transient fluctuation from persistent departure.

The consequence is clear. If early warning depends on continuity, then the next requirement is operational practicality. The system must be simple enough to be repeated, owned by the person, and integrated into daily life rather than rare institutional contact.

Chapter 3. Low-Burden Sampling and the User Loop

If the personal baseline is the primary instrument, then the next requirement is practical continuity. A prognostic system cannot depend on rare, burdensome, institution-controlled sampling and still claim to observe early formation. It must be simple enough to repeat, light enough to sustain, and personal enough to remain embedded in ordinary life.

That is why low-burden sampling is central.

The purpose of this chapter is to define the operational loop through which prognosis becomes usable: repeated user-led sampling, app-based ingestion, baseline comparison, drift scoring, and early escalation. The point is not to wait until the body has become legible to institutional crisis thresholds. The point is to capture enough repeated personal signal to identify structured divergence before overt manifestation.

This approach is increasingly consistent with the direction of precision health. Longitudinal health research emphasises dense, repeated personal measurement as a route to earlier risk detection and preventive intervention, while recent studies show that finger-prick and home-capillary sampling can support remote biomarker collection in practical settings.

3.1 Why Low-Burden Sampling Matters

A prognostic system succeeds or fails at the level of repeatability. If data capture is too difficult, too expensive, too invasive, or too dependent on institutional scheduling, continuity collapses. Once continuity collapses, prognosis collapses with it.

The issue is not only technical feasibility. It is behavioural feasibility.

A person may tolerate a burdensome procedure once. They will not reliably sustain it as a lifelong monitoring architecture.

For that reason, the correct entry point is a low-friction capture model: simple enough for regular use, stable enough for repeated comparison, and structured enough to feed a longitudinal system.

Finger-prick and home-capillary methods matter here because they reduce logistical burden while extending temporal reach beyond the clinic. Recent validation work has shown that remote finger-prick blood collection can be technically usable for selected blood biomarkers, and patient-facing studies report strong acceptability for self-administered home blood tests when instructions are clear and the collection process is manageable.

3.1.1 Continuity Requires Low Friction

The first design rule is simple: the easier the sampling loop, the more likely the timeline survives.

A prognostic architecture needs repeated windows of observation. That means the user must be able to contribute data without entering a high-cost ritual each time. Low-burden sampling changes the problem from occasional institutional capture to sustained personal monitoring.

This is not merely a convenience feature. It is a structural requirement. Without repeated capture, the system cannot reliably observe: persistence, rate of change, failed recovery, cross-signal convergence, drift acceleration.

These are the actual early units of deterioration, and they only become visible when data arrive often enough.

3.1.2 The User Loop Is More Important Than the Single Test

The value of low-burden sampling is not that one finger-prick result replaces a hospital laboratory in every circumstance. The value is that repeated user-led capture creates a temporal loop.

That loop is: sample → upload → compare → score → store → repeat.

The single result matters less than the sequence it enters.

This is consistent with broader digital-biomarker and wearable literature, which repeatedly emphasises that continuous or repeated personal data can reveal patterns unavailable to isolated clinic-time measurements. Wearable and sensor reviews describe real-time health data as a route to digital biomarkers and earlier personalised insight.

3.1.3 Low-Burden Does Not Mean Low Value

There is a common assumption that easier sampling must imply weaker clinical value. That is not necessarily true. In a prognostic architecture, value comes from temporal density as much as from single-sample sophistication.

A lower-burden signal repeated across time may be more useful for early warning than a higher-burden signal obtained rarely.

That is the practical reversal this paper introduces.

The question is not: What is the most elaborate test available after concern?

The stronger question is: What is the most repeatable signal architecture that can show the earliest structured departure from personal equilibrium?

That is why low-burden sampling belongs at the centre of the user experience chapter rather than at its margins.

3.1.4 Section Conclusion

Low-burden sampling matters because prognosis depends on continuity, and continuity depends on repeatability. A user-centred health engine cannot rely on rare institutional access if its purpose is to detect pre-event formation. It needs a personal capture loop that is simple enough to sustain and stable enough to build a meaningful longitudinal record. Finger-prick and related home-capillary approaches show that this direction is technically and behaviourally plausible for at least part of the biomarker field.

3.2 The Pinprick Model

The pinprick model is the clearest practical form of low-burden prognosis. It reduces the distance between the body and the monitoring system by making repeated biological sampling ordinary, local, and personally manageable. The point is not that one pinprick replaces every laboratory pathway. The point is that recurring small-volume sampling creates continuity, and continuity is what allows early drift to become visible.

This direction is technically plausible. Recent reviews and studies describe capillary microsampling, dried blood spots, and home-sampled blood collection as increasingly usable for biomarker work, remote monitoring, and longitudinal health research, while also noting that analyte-specific validation and standardisation remain important.

3.2.1 Why the Pinprick Matters

The pinprick matters because it changes the monitoring architecture.

Venous testing belongs mainly to institutional encounters. Pinprick testing can belong to the user's own timeline.

That distinction is not minor. A prognostic model cannot depend on infrequent institutional access if its purpose is to identify pre-event formation. It needs a method that can be repeated with low friction and incorporated into ordinary life.

Home-sampled dried blood spot work already shows why this is attractive: self-sampling expands access to repeated measurement, supports remote collection, and can preserve useful proteomic and biomarker information for later analysis.

3.2.2 Small Sample, Large Temporal Value

The strength of the pinprick model is not sample size. It is temporal density.

A small sample taken repeatedly across time can reveal: baseline stability, micro-deviation, persistence, failed recovery, cross-signal drift, rate of worsening.

That is more valuable for prognosis than a rare high-information test viewed in isolation. A 2025 review of patient-centric blood sampling and analysis describes capillary blood as increasingly feasible for clinical chemistry and diagnostics in user-centred settings, particularly where convenience and repeated access matter.

So the operational logic is: small burden → more repetition → stronger baseline → earlier drift detection.

That is the real advantage.

3.2.3 From Event Testing to Continuous Capture

The institutional model usually treats blood testing as an event.

The pinprick model treats it as a loop.

That loop is: collect, upload, compare, store, repeat.

This shift is decisive because the value of the result no longer lies only in whether the current reading is above or below a fixed threshold. The value lies in how the reading sits inside a sequence.

Recent work on digital tools and self-administered home blood tests describes exactly this broader model: remote sample collection combined with digital support can expand research and monitoring workflows, improve flexibility, and create more continuous biological-data management outside traditional clinic structures.

3.2.4 Multi-Disease Applicability

The pinprick model is also useful because it is not disease-specific at the architectural level. Once a personal biological sampling loop exists, the same user pathway can support different signal classes.

That means the same sampling architecture can, in principle, be adapted for: metabolic drift, inflammatory drift, oncological signal suspicion, immune instability, treatment response monitoring, recovery monitoring.

The disease layer changes. The user loop does not.

This is consistent with the wider microsampling literature, which presents dried blood spot and related approaches as scalable platforms for biomarker discovery, epidemiology, and repeated health monitoring rather than as single-condition tools only.

3.2.5 Limits of the Pinprick Model

The claim must nonetheless remain precise. The pinprick model is not automatically universal for every analyte, and not every biomarker performs identically in home capillary formats. Reviews on microsampling and standardisation stress issues such as sample quality, matrix effects, analyte stability, collection variability, and the need for assay-specific validation.

That does not weaken the architecture. It simply means the system must be built as a validated signal platform rather than as an overclaimed universal replacement for all blood testing.

So the correct claim is: the pinprick model is the practical entry point for repeated prognostic sampling, provided the selected biomarkers are validated for that format.

That is sufficient for the paper and materially stronger than a blanket claim.

3.2.6 Section Conclusion

The pinprick model is the most direct operational form of user-centred prognosis. It lowers the burden of repeated biological capture, strengthens continuity, and allows the app-based system to interpret health as a temporal sequence rather than as a rare institutional event. Its value lies not in mimicking every hospital test, but in making repeated personal sampling realistic enough to support early drift detection across time.

3.3 App-Centred Data Continuity

The app is not an accessory to the prognostic system. It is the continuity layer. Without a continuity layer, repeated personal sampling collapses into disconnected events, and the entire logic of early drift detection is lost.

The app has one central function: to turn isolated personal readings into a longitudinal health sequence.

That is the point at which user experience becomes part of the scientific architecture rather than a cosmetic wrapper.

Current digital-health literature supports this framing. Reviews of patient-generated health data describe mobile and connected systems as mechanisms for capturing health information in daily life, outside episodic appointments, and for supplying real-time or longitudinal insight that conventional care pathways often lack. WHO likewise frames digital health as part of strengthening health systems through technologies that improve continuity and access.

3.3.1 The App as Temporal Memory

A single test result has limited prognostic value unless it is placed into sequence. The app is what stores the sequence.

It records: sampling time, signal values, deviation from baseline, persistence across windows, co-occurring symptom or contextual data, alert history, recovery or non-recovery after deviation.

This is not just storage. It is temporal memory.

That temporal memory is what allows the system to distinguish one-off disturbance from structured directional drift.

Recent reviews on electronic patient-generated health data emphasise that high-frequency or repeated personal data become clinically useful when they are collected, organised, and analysed over time rather than treated as isolated reports.

3.3.2 Why the User Loop Must Be App-Centred

A person cannot manually integrate dozens or hundreds of low-level signals across months or years with consistent precision. The app exists to perform that continuity task.

The app-centred loop is: capture, timestamp, normalise, compare to personal corridor, update drift state, store outcome, repeat.

That loop makes prognosis operational.

It also changes the ownership structure of the timeline. Instead of the health record being produced only when an institution sees the person, the health sequence is generated continuously by the person's own life process and preserved through the app. Reviews of PGHD describe this as a shift toward person-centred and real-world data capture, though they also note challenges in standardisation, presentation, and integration.
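A minimal sketch of one pass through that loop follows, assuming a simple mean-and-SD corridor and an illustrative three-reading escalation rule. The state names, corridor width `k`, and streak threshold are placeholders, not clinical parameters:

```python
import time

def process_reading(value, record, baseline, k=2.0):
    """One pass of the app loop: timestamp, compare to the personal
    corridor, update a simple drift state, and append to the record.
    The corridor width k and the three-reading rule are illustrative."""
    low = baseline["mean"] - k * baseline["sd"]
    high = baseline["mean"] + k * baseline["sd"]
    outside = value < low or value > high
    if record and outside and record[-1]["outside"]:
        streak = record[-1]["streak"] + 1   # consecutive departures accumulate
    else:
        streak = 1 if outside else 0
    event = {
        "t": time.time(),                   # time is part of the signal
        "value": value,
        "outside": outside,
        "streak": streak,
        "state": "drifting" if streak >= 3 else ("watch" if outside else "stable"),
    }
    record.append(event)                    # temporal memory, not just storage
    return event

record = []
baseline = {"mean": 63.0, "sd": 1.3}
for v in [64, 67, 68, 69]:
    event = process_reading(v, record, baseline)
print(event["state"])  # "drifting": three consecutive departures, not one alarm
```

Note that no single call decides anything; the interpretation lives in the record the loop maintains, which is the ownership shift described above.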

3.3.3 The App as a Convergence Engine

The app is not merely a diary. It is the convergence engine for multiple signal classes.

It can combine: pinprick biomarker results, wearable data, self-reported symptoms, medication timing, environmental exposure markers, sleep and stress patterns, family-history flags, past deviations and recovery behaviour.

That matters because early deterioration is often not visible in one channel alone. It becomes visible when small abnormalities recur across several channels and fail to resolve. Reviews of wearables, mobile apps, and remote monitoring repeatedly describe the value of combining multiple streams of patient-generated data for more continuous and personalised monitoring.

3.3.4 User Experience as a Scientific Constraint

The app must be simple because sustained prognosis depends on adherence. If the interface is confusing, overly clinical, or burdensome, the longitudinal record degrades.

That means UX is not secondary. It is part of the validity condition of the model.

A usable continuity app must do five things well: make sampling easy to log, show drift without overwhelming the user, distinguish low concern from escalation states, preserve history clearly, support repeated engagement without friction.

Research on mobile remote monitoring and digital self-care tools consistently finds that usability, adherence, and data presentation affect whether long-term monitoring systems actually work in practice.

3.3.5 The App Does Not Need to Diagnose to Be Useful

The app's first role is continuity and early warning, not final diagnosis. It does not need to claim absolute certainty in order to be valuable.

Its job is to determine whether: the user remains within their corridor, deviation is repeating, recovery is failing, signal groups are converging, escalation has become justified.

That is a stronger and more defensible role than pretending the app itself replaces every downstream clinical function. Reviews of digital biomarkers and PGHD repeatedly describe value in monitoring, personalisation, and earlier recognition, while also noting that interpretation and workflow integration remain important challenges.

3.3.6 Section Conclusion

The app-centred layer is what transforms repeated sampling into a prognostic system. It provides temporal memory, baseline comparison, signal convergence, and user-held continuity across time. Without it, the model falls back into isolated tests. With it, health can be interpreted as a longitudinal pattern rather than as a sequence of disconnected institutional moments.

3.4 Signal Ingestion and Personal Tracking

A prognostic engine becomes real only when personal signals can be ingested consistently and tracked longitudinally. Sampling alone is not enough. The system must be able to receive data, clean it, time-bind it, place it against the user's own corridor, and preserve the result as part of a living sequence.

That is the purpose of signal ingestion.

The ingestion layer is the bridge between raw personal input and usable prognostic structure. It converts disconnected observations into a coherent personal timeline. Digital-health and patient-generated data literature increasingly treats this layer as essential, because repeated personal data only become clinically meaningful when they are captured, standardised, and analysed over time rather than left as isolated records.

3.4.1 What the System Must Ingest

The system should be designed to ingest more than one signal type. Early deterioration often does not declare itself in a single variable. It emerges across partially linked channels.

The ingestion layer should therefore support at least six categories: biological sample values, wearable sensor values, self-reported symptom entries, contextual behaviour and exposure markers, historical health events, family-risk and inherited background flags.

This multimodal direction is consistent with current digital-biomarker and precision-health work, which increasingly combines biological, sensor, behavioural, and record-based inputs to support continuous monitoring and personalised interpretation.

3.4.2 Ingestion Is Not Mere Storage

To ingest a signal properly is not simply to save a number. The system must also preserve the context that gives the number meaning.

Each event should therefore carry: timestamp, signal type, measurement value, sampling method, confidence or quality flag, user context, link to prior baseline state.

Without that context, the value loses temporal and interpretive force.

Research on digital platforms for continuous monitoring emphasises safe and agile capture pipelines because the utility of patient-facing health systems depends on how data are structured, linked, and made available for downstream interpretation rather than merely accumulated.
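The event structure listed above can be sketched as a small record type. The field names and example values here are illustrative assumptions, not a fixed schema:

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional

@dataclass
class SignalEvent:
    """One ingested reading together with the context that gives
    the number meaning. Field names are illustrative only."""
    signal_type: str                    # e.g. "crp", "resting_hr"
    value: float                        # measurement value
    sampling_method: str                # e.g. "finger_prick", "wearable"
    quality: str = "ok"                 # confidence / quality flag
    context: Optional[str] = None       # user context, e.g. "post-exercise"
    baseline_ref: Optional[str] = None  # link to prior baseline state
    timestamp: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )

# A hypothetical finger-prick CRP reading, bound to its corridor version.
event = SignalEvent(signal_type="crp", value=2.1,
                    sampling_method="finger_prick",
                    context="morning, fasted",
                    baseline_ref="corridor-v12")
```

Because every event carries its own timestamp, method, and baseline link, a later reading can always be interpreted against the state the system was in when it arrived.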

3.4.3 Time Is Part of the Signal

The timestamp is not incidental metadata. It is part of the signal itself.

A reading at one hour, one day, or one month after a prior event does not carry the same meaning. The prognostic engine must therefore treat time gap, sequence density, and recurrence interval as first-class variables.

This is one of the main advantages of app-centred personal systems. They do not simply record what happened. They record when, how often, and in what sequence it happened. Longitudinal monitoring literature repeatedly uses frequent data collection specifically to model temporal change, recurrence, and trajectory rather than one-off status.
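A small sketch makes the point literal: the gaps between events are data in their own right. The dates below are invented for illustration:

```python
from datetime import datetime, timedelta

def recurrence_intervals(timestamps):
    """Gaps between successive events. The engine treats these
    intervals as first-class variables, not incidental metadata."""
    ordered = sorted(timestamps)
    return [b - a for a, b in zip(ordered, ordered[1:])]

# Three hypothetical sampling events.
events = [datetime(2026, 4, 1), datetime(2026, 4, 3), datetime(2026, 4, 10)]
print(recurrence_intervals(events))  # two gaps: 2 days, then 7 days
```

A widening or shrinking recurrence interval is itself a trajectory signal, which a snapshot model has no way to represent.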

3.4.4 Personal Tracking Means Corridor Updating

Once a signal is ingested, it must be compared against the user's existing baseline corridor and then written back into the longitudinal record.

So the tracking loop is: ingest, validate, timestamp, compare to personal corridor, update drift status, store outcome, prepare next comparison.

That loop does two things at once. It preserves continuity. It updates interpretation.

This is the point where user tracking becomes more than journaling. It becomes a dynamic baseline system.

Recent work on data-intelligence platforms and integrated digital health systems supports this logic by emphasising continuous personalisation, repeated updating, and integration of multiple patient-generated streams rather than static one-time classification.

3.4.5 Quality Control at the User Layer

A user-centred system still needs data discipline. Not every reading should be treated as equally reliable. The ingestion layer must therefore distinguish between strong and weak inputs.

Useful controls include: sample-completeness checks, range plausibility checks, duplicate-event detection, missing-data flags, device-source tagging, confidence scoring.

This is especially important in remote and home-based systems. Reviews of digital biomarkers and self-administered collection repeatedly note that real-world utility depends not only on access and convenience, but also on data quality, adherence, and signal reliability.
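A few of the controls listed above can be sketched directly. The field names, the plausible range, and the duplicate rule are illustrative assumptions:

```python
def validate_reading(reading, recent, plausible_range):
    """Return quality flags for one ingested reading so that weak
    inputs are marked rather than silently trusted. Thresholds and
    field names are placeholders, not a fixed schema."""
    flags = []
    value = reading.get("value")
    if value is None:
        flags.append("missing_value")          # missing-data flag
    else:
        low, high = plausible_range
        if not (low <= value <= high):
            flags.append("implausible_range")  # range plausibility check
    if any(r.get("t") == reading.get("t") and r.get("value") == value
           for r in recent):
        flags.append("duplicate_event")        # duplicate-event detection
    if "device" not in reading:
        flags.append("unknown_source")         # device-source tagging
    return flags

# A hypothetical heart-rate reading far outside any plausible range.
print(validate_reading({"t": 1, "value": 900.0},
                       recent=[], plausible_range=(30, 220)))
# ['implausible_range', 'unknown_source']
```

Flagged readings can still be stored; the point is that their weight in corridor comparison is reduced rather than their existence erased.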

3.4.6 Why Personal Tracking Is the Actual Product

The deeper point is that the product is not the isolated test result. The product is the continuity field built around the person.

A single reading can only say so much. A tracked sequence can show: departure, persistence, acceleration, failed recovery, multi-signal convergence.

That is why personal tracking is not an optional feature added after the science. It is the science in operational form.

Precision-health reviews increasingly make this same shift: the value of digital and patient-generated data lies in repeated real-world capture that makes individualised, proactive, and earlier interpretation possible.

3.4.7 Section Conclusion

Signal ingestion and personal tracking are what turn low-burden sampling into a real prognostic engine. The system must receive multimodal personal data, preserve their temporal and contextual structure, compare them against the user's own corridor, and continuously update the longitudinal record. Without that, repeated measurements remain fragments. With it, they become a usable pre-event detection field.

3.5 From Repeated Readings to Prognosis

Repeated readings become prognostic only when they are interpreted as a temporal pattern rather than as disconnected values. The purpose of the engine is not to collect numbers for their own sake. The purpose is to determine whether those numbers, across time, indicate that the person is remaining within their own equilibrium corridor or progressively leaving it.

That is the turning point of the entire paper.

A repeated reading is useful. A repeated sequence is meaningful. A repeated sequence with persistence, directional movement, and convergence becomes prognostic.

This is consistent with current precision-health and digital-biomarker research, which increasingly treats longitudinal within-person data as the basis for earlier risk detection, trend interpretation, and personalised decision support rather than relying only on isolated cross-sectional measurements.

3.5.1 A Prognostic Reading Is Not a Single Reading

No single low-burden sample should be asked to carry the entire interpretive burden. That would collapse the model back into event testing. The purpose of repeated capture is to allow the system to ask a stronger question: is this deviation repeating in a way that suggests formation rather than fluctuation?

That requires the engine to evaluate at least five temporal properties: magnitude of deviation, persistence across windows, rate of directional change, recovery or non-recovery, convergence with other signal groups.

These are the actual units of prognosis. Longitudinal health-data research uses repeated measures for precisely this reason: to characterise trajectories, detect change over time, and separate structured movement from isolated variance.
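Four of these temporal properties can be sketched over a sequence of corridor deviations. The values and the 0.2 cutoff are assumptions for demonstration; cross-signal convergence is omitted here, since it would compare profiles like this one across channels:

```python
def drift_profile(deviations):
    """Summarise a sequence of corridor deviations into the temporal
    properties the engine evaluates. All cutoffs are illustrative."""
    n = len(deviations)
    magnitude = max(abs(d) for d in deviations)           # worst departure
    persistence = sum(1 for d in deviations if abs(d) > 0.2) / n
    rate = (deviations[-1] - deviations[0]) / (n - 1) if n > 1 else 0.0
    recovered = abs(deviations[-1]) < 0.2                 # back near baseline?
    return {"magnitude": magnitude, "persistence": persistence,
            "rate": rate, "recovered": recovered}

# Small, steadily widening departures: unremarkable individually,
# directional and non-recovering as a sequence.
print(drift_profile([0.1, 0.3, 0.5, 0.7]))
```

No single reading in that example would trigger anything; the profile as a whole carries the prognostic content.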

3.5.2 Direction Matters More Than Snapshot Position

A reading that is technically inside a broad acceptable range may still be prognostically important if it is moving steadily away from the person's own historical baseline. This is why direction is often more informative than position.

A stable but slightly unusual value may be less concerning than a steadily worsening value that still appears "normal" by broad external standards.

So the model must not only ask: where is the signal now?

It must also ask: where is it moving, how fast is it moving, and is it returning?

This is aligned with the broader shift toward longitudinal precision health, where repeated within-person change is used to identify meaningful biological movement before overt endpoint confirmation.

3.5.3 Persistence Converts Suspicion into Structure

The main difference between a weak signal and a strong early signal is persistence.

One deviation may be incidental. Repeated deviation in the same direction is structure.

This means the prognostic engine should remain conservative at first. It does not need to overreact to every soft abnormality. It only needs to retain the event, compare it to subsequent events, and determine whether the pattern is stabilising into a real divergence path.

That is how prognosis becomes disciplined rather than speculative.

Reviews of patient-generated and digital longitudinal health data repeatedly emphasise that repeated observations increase interpretive value because persistence and recurrence carry information that isolated observations do not.

3.5.4 Convergence Creates Escalation Power

A repeated abnormal reading becomes more powerful when other channels begin to move with it.

For example, a blood-based shift may initially remain soft. But if it coincides with altered sleep, fatigue pattern, inflammatory fluctuation, or failed recovery after prior disturbance, the structure strengthens.

This is why prognosis should not be based on one variable alone. It should be based on cross-signal convergence across the personal timeline.

Digital-health and wearable-monitoring literature now routinely supports multi-stream interpretation for more personalised and continuous monitoring, especially when early-stage abnormalities are subtle and distributed rather than concentrated in one measurement alone.

3.5.5 Prognosis Is a Controlled Early Warning State

The endpoint of this sequence is not necessarily immediate diagnosis. The endpoint is an early warning state strong enough to justify escalation.

That is the correct role of the app-centred engine.

It should be able to say: the person is no longer simply fluctuating, the person is showing sustained and directionally meaningful divergence, the divergence is now strong enough to justify further investigation.

This is a more defensible and technically serious claim than pretending the user loop by itself replaces every downstream institutional function. Current digital-health literature commonly places continuous monitoring tools in an early-recognition, risk-stratification, and escalation-support role rather than treating them as isolated final arbiters of disease.

3.5.6 Section Conclusion

Repeated readings become prognosis when they are organised into a directional, persistent, and convergent sequence relative to the person's own baseline corridor. The system does not wait for overt clinical manifestation. It interprets the formation path itself. That is the operational meaning of pre-event health intelligence.

3.6 Chapter Conclusion

This chapter has defined the practical user loop of the proposed system. Low-burden sampling matters because prognosis depends on repeatability. The pinprick model matters because it allows biological capture to become ordinary and user-led. The app matters because it preserves temporal memory, integrates signal streams, and turns repeated readings into a longitudinal sequence. Signal ingestion matters because the system must preserve time, context, and quality rather than merely storing values. Prognosis then emerges when repeated readings show persistence, directional drift, and convergence across channels.

The architecture is now in place for formalisation. The next chapter can move from user operation to system logic by defining the latent states, transition rules, and drift conditions that govern the engine.

Chapter 4. State Space and Drift Logic

The previous chapters established the temporal problem, the personal baseline, and the user loop. This chapter now defines the internal logic of the engine itself.

A prognostic system cannot rely on raw readings alone. It must have a structured way to represent where the person currently sits in relation to their own equilibrium corridor, how deviation is interpreted, and when repeated abnormality becomes strong enough to justify escalation. That requires a state model.

The purpose of this chapter is therefore to define the health-state architecture of the system and the rules by which the engine moves from ordinary fluctuation to structured concern.

The central proposition is this: health deterioration should be modelled as movement across states, not as a binary switch between healthy and diseased.

That is necessary because most meaningful deterioration does not emerge in one jump. It passes through intermediate conditions: slight departure, repeated departure, persistent divergence, and only later overt manifestation. A viable prognostic engine must be able to represent those intermediate conditions without collapsing too early into certainty and without remaining blind until damage is obvious.

So the system must answer five questions: What state is the person in now? How far have they moved from their own corridor? Is the movement repeating? Is it converging across signals? Has it become strong enough to escalate?

Those questions define the transition from user-side data collection to model-side interpretation.

4.1 Purpose

This chapter establishes the formal internal structure of the prognostic engine.

It defines: the state set, the meaning of each state, the observation field read by the engine, the logic of deviation, the rules of persistence, the conditions for escalation.

The point is not merely classification. The point is controlled temporal interpretation.

A strong prognostic model must do two things at once: remain sensitive to early change and avoid false collapse on weak noise.

For that reason, the engine should not treat every abnormal reading as decisive. It must retain ambiguity where necessary, monitor repetition, and only move upward when the pattern becomes sufficiently persistent and directionally coherent.

This is why the state model matters. It gives the system a lawful structure for handling uncertainty without reverting to crude binary logic.

4.1.1 Chapter Function

The function of this chapter is to convert the paper from concept to mechanism.

Chapters 1 to 3 answered why the system is needed and how the user interacts with it.

Chapter 4 answers: how the engine thinks.

4.2 Core State Set

The engine requires a state structure because prognosis is not a one-step judgment. A person does not move directly from stable health to formal disease in a single interpretable jump. There are intermediate conditions in which weak deviation appears, persists, gathers reinforcement, and only then becomes strong enough to justify escalation.

The state set must therefore represent progressive departure from the personal equilibrium corridor without forcing premature certainty.

The proposed core state set is: S = {Stable, Susceptible, Soft Deviation, Persistent Divergence, Escalation Candidate, Clinical Review}

This set is sufficient for the first version of the paper because it captures the full movement from ordinary continuity to formal handover without overcomplicating the early architecture.
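As a minimal illustrative sketch, the state set could be encoded as an ordered enumeration. The class name `HealthState` and the numeric ordering are assumptions for illustration; the ordering simply mirrors the natural state order defined later in this chapter.

```python
from enum import IntEnum

class HealthState(IntEnum):
    """The six-state set S, ordered by severity of departure."""
    STABLE = 0
    SUSCEPTIBLE = 1
    SOFT_DEVIATION = 2
    PERSISTENT_DIVERGENCE = 3
    ESCALATION_CANDIDATE = 4
    CLINICAL_REVIEW = 5
```

Encoding the states as an ordered enumeration lets later logic compare severity directly, for example `HealthState.SOFT_DEVIATION < HealthState.PERSISTENT_DIVERGENCE`.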

4.2.1 Stable

Stable is the resting state of the system.

In this state, observed signals remain within the person's own expected corridor, or any deviations are minor, isolated, and self-correcting. Stability does not mean perfect sameness. It means the person's biological and behavioural signals continue to fluctuate within their own normal operating pattern.

This is the baseline state against which all later movement is judged.

4.2.2 Susceptible

Susceptible is the background vulnerability state.

This state does not mean active deterioration is occurring. It means the person carries some standing exposure, inherited risk, prior event history, or contextual vulnerability that raises the importance of monitoring even in the absence of persistent present drift.

Examples of susceptibility include: family history, known predisposition, prior disease event, chronic exposure pattern, structural lifestyle or environmental burden.

The state is important because risk can exist before active deviation.

4.2.3 Soft Deviation

Soft Deviation is the first active departure state.

A signal or small set of signals has moved away from the person's ordinary corridor, but not yet with enough persistence, magnitude, or convergence to justify strong concern. The key characteristic of this state is that abnormality has appeared, but it remains weakly structured.

This state prevents the engine from making a false binary choice between "nothing" and "serious concern."

4.2.4 Persistent Divergence

Persistent Divergence is the first strongly prognostic state.

Here, abnormality is no longer incidental. A deviation has repeated across enough windows, failed to recover adequately, or continued in a sufficiently consistent direction such that the system can treat it as real structure rather than temporary fluctuation.

This is the state in which prognosis becomes materially meaningful.

The person may still be far from overt disease manifestation, but the system can no longer treat the observed pattern as ordinary noise.

4.2.5 Escalation Candidate

Escalation Candidate is the threshold state immediately preceding handover.

In this state, deviation is not only persistent, but sufficiently reinforced by magnitude, recurrence, or cross-signal convergence that formal attention is now justified. The engine is no longer merely observing structured concern; it is identifying a state strong enough to warrant active review, confirmatory testing, or specialist interpretation.

This is not yet the same as confirmed disease. It is the point at which the risk structure has become too coherent to leave unexamined.

4.2.6 Clinical Review

Clinical Review is the handover state.

At this stage, the prognostic engine has completed its early-warning function. The person is no longer being managed solely inside the monitoring loop. The system recommends or initiates onward investigation, interpretation, or external review.

This state is necessary because the engine is not designed to replace all downstream diagnostic processes. It is designed to detect and escalate early enough that those processes begin sooner and on stronger temporal evidence.

4.2.7 Why This State Set Works

This six-state structure is strong for three reasons.

First, it preserves gradation. The model does not collapse the person into crude healthy/diseased binaries.

Second, it preserves timing. The model can detect and hold early drift before overt pathology.

Third, it preserves discipline. The model does not escalate at the first sign of movement, but only once the deviation becomes stable enough to justify concern.

That balance is essential. A prognostic engine that escalates too easily becomes noisy. A prognostic engine that escalates too late becomes reactive. The state set must sit between those failures.

4.2.8 State Order

The natural order of the states is: Stable → Susceptible → Soft Deviation → Persistent Divergence → Escalation Candidate → Clinical Review.

This does not mean every person passes through every state in a simple linear path. Some may remain susceptible for long periods without active deviation. Some may move from stable to soft deviation and return to stable. Some may cycle between soft deviation and persistent divergence before escalation becomes justified.

The state set therefore defines a lawful space of movement, not a rigid one-direction staircase.
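One way to make that lawful space explicit is an adjacency map of permitted one-step moves. This is a sketch only: the edges shown follow the examples in the paragraph above (recovery from Soft Deviation back to Stable, cycling between Soft Deviation and Persistent Divergence), and the treatment of Susceptible and Clinical Review as, respectively, a background state and a terminal handover state is an assumption of this illustration.

```python
# Hypothetical adjacency map: current state -> states reachable in one step.
ALLOWED_TRANSITIONS = {
    "Stable": {"Stable", "Susceptible", "Soft Deviation"},
    "Susceptible": {"Susceptible", "Stable", "Soft Deviation"},
    "Soft Deviation": {"Soft Deviation", "Stable", "Persistent Divergence"},
    "Persistent Divergence": {"Persistent Divergence", "Soft Deviation",
                              "Escalation Candidate"},
    "Escalation Candidate": {"Escalation Candidate", "Persistent Divergence",
                             "Clinical Review"},
    # Clinical Review is terminal within the monitoring loop in this sketch.
    "Clinical Review": {"Clinical Review"},
}

def is_lawful(current: str, proposed: str) -> bool:
    """Return True when the proposed move lies inside the lawful space."""
    return proposed in ALLOWED_TRANSITIONS.get(current, set())
```

For example, `is_lawful("Soft Deviation", "Stable")` is true (recovery is permitted), while `is_lawful("Stable", "Clinical Review")` is false (no single jump from continuity to handover).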

4.2.9 Section Conclusion

The core state set provides the minimum necessary architecture for a prognostic health engine. It defines the difference between ordinary continuity, background risk, early departure, structured drift, escalation readiness, and formal handover. With that architecture in place, the next step is to define each state in stricter operational terms so the model can distinguish them consistently.

4.3 State Definitions

The core state set must now be defined in operational terms. A state is only useful if the engine can distinguish it from neighbouring states without ambiguity. That means each state must be tied to specific interpretive conditions.

The purpose of this section is to give each state a functional definition based on deviation, persistence, recovery behaviour, and convergence.

4.3.1 Stable

Stable means the person remains within their own equilibrium corridor.

In this state: observed values remain within expected personal range, or near it; minor deviations are brief and self-correcting; recovery to prior pattern occurs without sustained drift; no meaningful cross-signal reinforcement is present.

Stable does not mean static. It means fluctuation remains ordinary for that person.

The engine should therefore keep a person in Stable when variation is present but does not organise into a directional pattern.

4.3.2 Susceptible

Susceptible means background vulnerability is present without sufficient active evidence of present divergence.

In this state: inherited risk, prior history, environmental burden, or behavioural load may be present; current readings do not yet show persistent departure; monitoring priority is elevated relative to a fully ordinary baseline; the system watches more carefully, but does not yet interpret active deterioration.

This state matters because risk can exist before observable drift. The person may not be leaving the corridor yet, but the threshold for attention may be lower because the background field is heavier.

4.3.3 Soft Deviation

Soft Deviation means a measurable departure from the personal corridor has appeared, but not yet with enough strength to justify escalation.

In this state: one or more signals move outside ordinary personal expectation; the deviation may be recent, isolated, or low magnitude; persistence is not yet established; recovery remains possible and not yet disproven; convergence across signal groups is weak or absent.

This is the first active warning state.

The purpose of Soft Deviation is to prevent two errors: ignoring early movement entirely, and overreacting to the first abnormal reading.

The engine therefore holds the signal in view without prematurely collapsing it into a stronger interpretation.

4.3.4 Persistent Divergence

Persistent Divergence means abnormality has repeated or continued long enough to become structured.

In this state: deviation is no longer isolated; abnormality persists across repeated windows; recovery is incomplete, delayed, or absent; direction of change remains consistent enough to suggest real drift; at least some reinforcement from adjacent signal layers may be present.

This is the first state in which the engine can say: the person is no longer merely fluctuating; the person is beginning to leave their corridor in a sustained way.

Persistent Divergence is therefore the true prognostic hinge of the model.

4.3.5 Escalation Candidate

Escalation Candidate means the observed divergence has become strong enough to justify formal attention.

In this state: deviation magnitude is material; persistence is established; recovery failure is evident or increasingly likely; convergence across multiple signals is sufficient; the pattern is coherent enough that continued passive observation alone is no longer appropriate.

This state does not mean confirmed disease. It means the engine has reached the point where monitored abnormality has become actionable abnormality.

The system should enter Escalation Candidate only when the abnormality field is both repeated and reinforced.

4.3.6 Clinical Review

Clinical Review means the system has completed its internal early-warning task and now hands the case outward for higher-order examination.

In this state: escalation conditions have been met; the monitoring loop alone is no longer the proper endpoint; the case should move to confirmatory testing, specialist review, or formal diagnostic interpretation; the app continues to preserve continuity, but the case has crossed beyond routine monitoring.

Clinical Review is therefore not a stronger internal drift state. It is a handover state.

4.3.7 Boundary Logic Between States

The states must also be distinguished by transition boundaries.

The boundary between Stable and Soft Deviation is crossed when variation ceases to look ordinary for that person.

The boundary between Soft Deviation and Persistent Divergence is crossed when deviation repeats, recovery fails, or directional consistency becomes visible.

The boundary between Persistent Divergence and Escalation Candidate is crossed when persistence is joined by sufficient magnitude or multi-signal convergence.

The boundary between Escalation Candidate and Clinical Review is crossed when the system determines that external investigation is now warranted.

This boundary logic is important because the model should not treat neighbouring states as vague labels. They are thresholded interpretive conditions.
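The boundary logic above can be sketched as a single transition function. The threshold names and default values (`theta_d`, `theta_p`, `theta_c`) are placeholders, not calibrated parameters, and the sketch omits the Susceptible background state since it is driven by standing risk rather than present deviation.

```python
def next_state(state: str, deviation: float, persistence: float,
               convergence: float, escalate_externally: bool,
               theta_d: float = 1.5, theta_p: float = 0.5,
               theta_c: float = 0.4) -> str:
    """Sketch of the boundary logic in 4.3.7; thresholds are placeholders.

    deviation   -- aggregate deviation D(t)
    persistence -- proportion of recent windows in breach, P(t)
    convergence -- proportion of abnormal signal groups, C(t)
    """
    if state == "Stable" and deviation > theta_d:
        return "Soft Deviation"              # variation no longer ordinary
    if state == "Soft Deviation":
        if persistence >= theta_p:
            return "Persistent Divergence"   # repetition or failed recovery
        if deviation <= theta_d:
            return "Stable"                  # clean recovery
    if state == "Persistent Divergence" and persistence >= theta_p \
            and convergence >= theta_c:
        return "Escalation Candidate"        # persistence plus reinforcement
    if state == "Escalation Candidate" and escalate_externally:
        return "Clinical Review"             # external investigation warranted
    return state
```

Note that the function only moves one boundary per evaluation, which matches the thresholded, stepwise character of the state ladder.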

4.3.8 Operational Summary

The states can be summarised functionally as follows: Stable: ordinary fluctuation; Susceptible: background vulnerability without active structured drift; Soft Deviation: early abnormality without sufficient persistence; Persistent Divergence: repeated abnormality with real directional structure; Escalation Candidate: strong enough abnormality to justify active review; Clinical Review: onward handover beyond the monitoring loop.

This summary gives the engine a clear ladder of meaning without collapsing into binary logic.

4.3.9 Section Conclusion

The state definitions provide the interpretive backbone of the engine. Each state now carries a distinct meaning tied to deviation, persistence, recovery, and convergence. With these definitions in place, the next step is to define what the engine actually reads: the observation field through which these states are inferred.

4.4 Observation Field

The engine cannot infer state without an observation field. The observation field is the full set of signals through which the person's present condition is read and compared against their own longitudinal corridor.

The purpose of this section is to define what kinds of inputs the engine uses and how those inputs are organised.

The observation field should be broad enough to detect early drift, but disciplined enough to remain interpretable. It should not depend on a single privileged signal. Prognosis becomes stronger when the system can observe partial change across several layers and detect whether those layers are beginning to move together.

4.4.1 Definition

Let the observation field at time t be: \(y(t) = [b(t), w(t), s(t), e(t), h(t), f(t)]\)

Where: b(t) = blood-based micro-signals; w(t) = wearable and physiological signals; s(t) = symptom and self-report signals; e(t) = exposure and behavioural context; h(t) = personal health history and prior event structure; f(t) = family-risk and inherited predisposition field.

This is the minimum working observation structure for the paper.
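A minimal container for the observation field y(t) could look as follows. The class and field names are illustrative assumptions, not a fixed schema; each layer is represented here as a simple mapping from signal name to its latest value.

```python
from dataclasses import dataclass, field

@dataclass
class ObservationField:
    """Sketch of the observation field y(t) = [b, w, s, e, h, f]."""
    t: float                                         # observation time
    blood: dict = field(default_factory=dict)        # b(t)
    wearable: dict = field(default_factory=dict)     # w(t)
    symptoms: dict = field(default_factory=dict)     # s(t)
    exposure: dict = field(default_factory=dict)     # e(t)
    history: dict = field(default_factory=dict)      # h(t)
    family_risk: dict = field(default_factory=dict)  # f(t)

    def layers(self) -> dict:
        """Return the six layers keyed by the symbols used in the text."""
        return {"b": self.blood, "w": self.wearable, "s": self.symptoms,
                "e": self.exposure, "h": self.history, "f": self.family_risk}
```

Keeping the six layers as named fields rather than a flat value list preserves the layer boundaries that the weighting and convergence logic later depends on.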

4.4.2 Blood-Based Micro-Signals

The blood layer is the most direct biological layer in the model.

This includes any repeatable low-burden biomarker field that can be captured through the user loop. It may include: inflammatory markers, metabolic markers, immune-response markers, cell-stress indicators, disease-specific biomarker panels where validated, future analyte classes as the system expands.

The point is not to overname every possible analyte at this stage. The point is to establish that blood provides a recurring biological snapshot that can be compared longitudinally.

In the architecture of the paper, blood is not read once for final truth. It is read repeatedly for drift.

4.4.3 Wearable and Physiological Signals

The wearable layer provides continuity between biological sampling windows.

This may include: heart-rate pattern, heart-rate variability, sleep regularity, temperature trend, movement pattern, respiration-linked features where available.

These signals matter because they extend the temporal density of the observation field. A blood signal may appear weekly or monthly. A wearable signal may appear continuously. The model can therefore use wearables to strengthen detection of persistence, stress response, failed recovery, and cross-signal reinforcement.

4.4.4 Symptom and Self-Report Signals

The self-report layer is not secondary. It captures subjective and experiential information that may precede or accompany measurable biological change.

This may include: fatigue, pain, sleep disruption, appetite shift, mood change, unusual recovery pattern, recurrent minor symptoms, any repeated anomaly the user notices.

The engine should not treat self-report as proof by itself. It should treat it as a meaningful signal class that may converge with other layers.

4.4.5 Exposure and Behavioural Context

No prognostic reading is fully interpretable without context.

The exposure and behavioural field may include: diet disruption, environmental burden, stress load, medication change, alcohol or substance effect, infection exposure, unusual exertion, travel or climate disruption, work-pattern instability.

This layer matters because deviation is not meaningful in abstraction. A changed reading after a known disturbance does not carry the same interpretive weight as the same reading under stable conditions.

So the system must retain contextual memory.

4.4.6 Personal Health History

The engine also requires a backward-looking health structure.

This includes: prior episodes of abnormality, prior diagnoses, prior recovery patterns, previous escalation events, prior drift corridors, past confirmed false alarms, temporal links between exposures and deviations.

This layer allows the engine to interpret whether a present signal is novel, recurrent, seasonal, or structurally related to earlier events.

Without history, the system sees only the present. With history, it sees pattern.

4.4.7 Family-Risk and Inherited Field

The family and inherited layer does not prove current deterioration, but it affects background susceptibility.

This may include: family history of specific disease classes, known inherited predisposition, lineage-linked vulnerability markers, recurring family event patterns where relevant.

This layer primarily informs the Susceptible state and modifies interpretive weight. It does not by itself force escalation. It changes the background field against which present deviation is judged.

4.4.8 Why the Observation Field Must Be Layered

A layered observation field is necessary because early deterioration is often incomplete at first.

One layer may move slightly. Another may remain silent. A third may show soft reinforcement later.

If the system depends on one signal alone, it becomes brittle. It will either miss subtle multi-layer formation or overreact to isolated anomalies.

A layered field is stronger because it allows the engine to detect: partial emergence, repeated soft reinforcement, cross-channel convergence, contextual explanation versus unexplained drift, escalation strength built across time.

That is the right structure for a prognostic model.

4.4.9 Observation Does Not Mean Equal Weight

The six layers do not need identical weight in every use case.

A cancer-oriented surveillance layer may place greater weight on selected blood signals and family-risk structure.

A metabolic model may place heavier weight on behavioural, blood, and wearable layers.

An inflammatory model may rely more strongly on blood, symptoms, recovery pattern, and environmental burden.

So the observation field is fixed at the architectural level, but weighted at the application level.

This is important because the engine must remain general while still allowing disease-specific tuning later.

4.4.10 Section Conclusion

The observation field provides the engine with its perceptual structure. It defines the six input layers through which personal state is inferred: blood, wearables, symptoms, exposure, health history, and inherited risk. Together these layers allow the system to read not just isolated abnormalities, but patterned departure across a living personal timeline.

4.5 Deviation Logic

The engine cannot work with raw readings alone. It must convert observation into deviation, and deviation must be defined relative to the person's own baseline rather than treated as a free-floating abnormality.

The purpose of this section is to define how the system measures departure from the personal equilibrium corridor.

The principle is straightforward: a reading is meaningful not because it is merely different, but because it is different from what is ordinarily expected for that person.

4.5.1 Deviation as Baseline Departure

Let: x_i(t) = observed value of signal i at time t; μ_i(t) = personal baseline mean for signal i; σ_i(t) = personal variance band for signal i; ε = small stabilising constant to prevent collapse where variance is extremely small.

The personal deviation score for signal i is: \(z_i(t) = (x_i(t) - \mu_i(t)) / \max(\sigma_i(t), \varepsilon)\)

This gives the relative displacement of the current signal from that person's own expected corridor.

The purpose of this score is not to make a diagnosis. Its purpose is to state how far the person is from themselves.
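The deviation score translates directly into code. The function name and the default value of the stabilising constant are illustrative choices.

```python
def deviation_score(x: float, mu: float, sigma: float,
                    eps: float = 1e-6) -> float:
    """Personal deviation z_i(t) = (x_i(t) - mu_i(t)) / max(sigma_i(t), eps).

    The eps floor prevents the score from blowing up when a signal's
    personal variance band is extremely narrow, as noted above.
    """
    return (x - mu) / max(sigma, eps)
```

For example, a reading of 7.0 against a personal mean of 5.0 and band of 1.0 gives `deviation_score(7.0, 5.0, 1.0) == 2.0`: the person is two corridor-widths from themselves.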

4.5.2 Why Relative Deviation Matters

A raw number by itself is weak. The same value may be ordinary for one person and unusual for another.

The engine therefore must not ask only: What is the value?

It must ask: How unusual is this value for this person at this stage of their own trajectory?

That is why deviation is measured relative to the corridor.

A person may show meaningful internal departure while still remaining inside broad external population limits. Deviation logic is what allows the engine to capture that early movement.

4.5.3 Magnitude Alone Is Not Enough

A large deviation is important, but deviation magnitude alone is not sufficient for escalation.

The engine must distinguish between: one-off displacement, repeated displacement, escalating displacement, displacement with failed recovery, displacement reinforced by other layers.

So deviation is the first layer of interpretation, not the last.

A high z-score may trigger attention, but not necessarily transition to a stronger state on its own.

4.5.4 Multi-Signal Deviation

Because the system reads several signal classes, it also needs an aggregate deviation measure.

Let the weighted total deviation at time t be: \(D(t) = [\sum w_i |z_i(t)|] / [\sum w_i]\)

Where w_i = importance weight assigned to signal i.

This produces a weighted deviation magnitude across the observation field.

The weighting structure allows the engine to remain general while still adapting to application-specific domains. A metabolic engine, an inflammatory engine, and an oncology-oriented engine may assign different weights to the same general layers.
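The weighted aggregate can be sketched as follows; signal names and weights in the example are hypothetical.

```python
def aggregate_deviation(z: dict, w: dict) -> float:
    """Weighted deviation D(t) = sum(w_i * |z_i(t)|) / sum(w_i).

    z maps signal name -> personal deviation score z_i(t);
    w maps signal name -> importance weight w_i (tuned per application).
    """
    total_w = sum(w[i] for i in z)
    if total_w == 0:
        return 0.0
    return sum(w[i] * abs(z[i]) for i in z) / total_w
```

With equal weights, `aggregate_deviation({"a": 2.0, "b": -1.0}, {"a": 1.0, "b": 1.0})` returns 1.5: the sign of each deviation is discarded here because D(t) measures magnitude; direction is tracked separately, as the next subsection describes.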

4.5.5 Directional Deviation

The engine should not only track magnitude. It should also track direction.

A signal that repeatedly rises in one direction does not carry the same meaning as a signal that oscillates without structure.

So for each signal, the model should preserve whether the deviation is: upward, downward, oscillatory, recovering, non-recovering.

This matters because prognosis is not only about distance from baseline. It is about movement through time.

4.5.6 Drift Versus Disturbance

Deviation logic must also distinguish between temporary disturbance and real drift.

A disturbance may produce a brief spike or dip followed by ordinary recovery.

A drift path shows one or more of the following: repeated deviation in the same direction, widening displacement, slower return toward baseline, recurrence after partial recovery, spread into adjacent signal layers.

So the deviation layer must be read as the beginning of a temporal pattern, not as a terminal conclusion.

4.5.7 Contextual Adjustment

Deviation should not be interpreted in abstraction from context.

The same measured departure may carry different meaning depending on: recent illness, unusual stress load, medication change, injury, travel, disrupted sleep, known sampling artefacts, menstrual or hormonal timing where relevant.

So the raw deviation score must be supplemented by contextual interpretation. Context does not erase deviation. It helps determine whether the deviation should be treated as expected disturbance, ambiguous movement, or unexplained concern.

4.5.8 Baseline Corridor Breach

For operational purposes, the engine should treat a signal as having breached the personal corridor when the deviation exceeds the ordinary tolerated range for that person.

This can be expressed simply as: corridor breach occurs when |z_i(t)| > θ_i, where θ_i = the chosen boundary threshold for signal i.

This does not yet mean escalation. It means the signal has left its expected corridor and now requires monitoring for persistence, recurrence, or convergence.
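The breach rule is a one-line predicate; a per-signal sweep follows naturally. Signal names and thresholds in the example are hypothetical.

```python
def corridor_breach(z: float, theta: float) -> bool:
    """Corridor breach: |z_i(t)| > theta_i for signal i."""
    return abs(z) > theta

def breached_signals(z: dict, theta: dict) -> set:
    """Return the names of signals currently outside their own corridor."""
    return {i for i, zi in z.items() if corridor_breach(zi, theta[i])}
```

For instance, `breached_signals({"crp": 2.4, "hrv": -0.6}, {"crp": 2.0, "hrv": 2.0})` returns `{"crp"}`: only the signal that has left its tolerated range is flagged for persistence monitoring.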

4.5.9 Deviation Hierarchy

The deviation layer can therefore be read in three levels: low deviation (still plausibly ordinary fluctuation), moderate deviation (outside ordinary personal expectation, requires attention), high deviation (strong corridor breach, requires close monitoring and possible reinforcement check).

This hierarchy keeps the model disciplined. It avoids both under-reading and over-reading weak movement.

4.5.10 Section Conclusion

Deviation logic gives the engine its first formal measure of departure from personal equilibrium. Each signal is compared against the person's own baseline corridor, aggregated through weighted multi-signal deviation, and interpreted in light of direction and context. This allows the system to detect that something is moving before deciding whether that movement is persistent enough to matter.

The next step is therefore persistence.

4.6 Persistence Logic

Deviation becomes prognostically meaningful when it persists. Without persistence, the engine cannot reliably distinguish structural change from temporary fluctuation.

The purpose of this section is to define how the system measures whether abnormality is repeating strongly enough across time to justify transition from weak concern to real drift.

The governing principle is simple: a single departure may be noise; repeated departure in the same corridor is structure.

4.6.1 Persistence as Repetition Across Windows

Let the engine observe deviation across a rolling window of length k.

Define: D(t) = aggregate deviation at time t; θ_d = deviation threshold; I(·) = indicator function, equal to 1 when the condition is true and 0 otherwise.

Then persistence at time t can be expressed as: \(P(t) = (1/k) \sum I(D(t-r) > \theta_d)\) for r = 0 to k-1.

This gives the proportion of recent observation windows in which deviation exceeded the defined concern threshold.

So: P(t) near 0 means deviation is mostly absent; P(t) near 1 means deviation is repeatedly present.

That is the simplest persistence measure.
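That measure can be written directly over a rolling window of recent aggregate deviations (the function name is illustrative):

```python
def persistence(recent_D: list, theta_d: float) -> float:
    """P(t) = (1/k) * sum over the last k windows of I(D > theta_d).

    recent_D holds the k most recent aggregate deviation values D(t-r).
    """
    if not recent_D:
        return 0.0
    return sum(1 for d in recent_D if d > theta_d) / len(recent_D)
```

For example, `persistence([2.0, 0.5, 1.8, 2.2], theta_d=1.5)` returns 0.75: deviation exceeded the concern threshold in three of the last four windows.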

4.6.2 Why Persistence Matters More Than Isolated Magnitude

A large isolated deviation may still be caused by disturbance, temporary stress, sampling error, or context-specific fluctuation.

Persistence changes the interpretation because it shows the system is not returning cleanly to baseline.

This is the critical distinction: disturbance = deviation followed by ordinary recovery; persistence = deviation that repeats, lingers, or reappears before recovery completes.

The engine therefore should not escalate on magnitude alone unless the signal is extreme. In most cases, it should ask whether the abnormality survives across enough windows to become temporally credible.

4.6.3 Forms of Persistence

Persistence does not appear in one form only. The engine should recognise at least four persistence patterns: continuous persistence (deviation remains elevated across consecutive windows), intermittent persistence (deviation drops briefly but repeatedly returns), directional persistence (the signal keeps moving in the same direction even if the threshold is crossed gradually), recovery-failure persistence (the signal begins to normalise but never fully returns to the prior corridor before deviating again).

These patterns matter because early deterioration may not always present as a clean uninterrupted line. Sometimes the structure lies in repeated failed recovery rather than steady monotonic worsening.

4.6.4 Persistence Strength

For operational use, persistence can be graded: weak persistence (abnormality appears in a minority of recent windows), moderate persistence (abnormality appears in a substantial portion of recent windows), strong persistence (abnormality dominates the recent window history).

This allows the engine to distinguish: one-off concern, repeated concern, sustained concern.

A useful interpretation rule is: the stronger the persistence, the less likely the system is observing random fluctuation.

4.6.5 Persistence with Recovery Tracking

Persistence should not be measured only by threshold exceedance. It should also consider recovery behaviour.

A signal may technically dip below threshold but still show abnormal recovery if: it returns only partially toward baseline, it returns unusually slowly, it rebounds back into deviation shortly after apparent normalisation.

So the engine should track not just whether deviation is present, but whether the system is capable of restoring equilibrium cleanly.

This matters because a failing recovery pattern is often an early sign that the person is not merely experiencing disturbance, but is beginning to lose regulatory stability.

4.6.6 Weighted Persistence

Because different signal groups may carry different interpretive value, persistence may also be weighted.

Let: P_i(t) = persistence of signal i; w_i = signal weight.

Then weighted persistence can be expressed as: \(P_w(t) = [\sum w_i P_i(t)] / [\sum w_i]\)

This allows the engine to give greater persistence importance to selected biomarker classes or high-value observation layers depending on the disease application.

So the architecture remains general, while the application layer remains tunable.
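The weighted persistence formula can be sketched as follows; signal names and weights in the example are hypothetical.

```python
def weighted_persistence(p: dict, w: dict) -> float:
    """P_w(t) = sum(w_i * P_i(t)) / sum(w_i).

    p maps signal name -> per-signal persistence P_i(t) in [0, 1];
    w maps signal name -> weight w_i.
    """
    total_w = sum(w[i] for i in p)
    if total_w == 0:
        return 0.0
    return sum(w[i] * p[i] for i in p) / total_w
```

A blood layer weighted three times as heavily as a wearable layer, with persistences of 0.8 and 0.2, yields a weighted persistence of 0.65: the dominant biomarker channel pulls the combined judgment toward its own recurrence pattern.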

4.6.7 Persistence Boundary Between States

Persistence is one of the main boundary operators between states.

The move from Soft Deviation to Persistent Divergence occurs when abnormality is no longer isolated and the engine can show sufficient recurrence, duration, or recovery failure.

The move from Persistent Divergence to Escalation Candidate usually requires persistence plus either stronger magnitude, stronger convergence, or both.

So persistence is necessary for structured drift, but not always sufficient on its own for escalation.

4.6.8 Practical Interpretation

In practical terms, persistence means the person is not simply having a bad reading.

It means one of the following is true: the same abnormality keeps returning, the abnormality is not clearing properly, the person keeps drifting in the same direction, the system is failing to restore the prior corridor.

That is the point at which the engine should stop treating the event as weak uncertainty and start treating it as a real temporal pattern.

4.6.9 Section Conclusion

Persistence logic gives the model its ability to distinguish fleeting disturbance from structured deviation. By measuring how often abnormality recurs across recent windows, whether recovery succeeds, and how strongly the deviation remains present, the engine can identify when early movement has become genuine drift.

The next step is to determine whether that drift is reinforced across multiple signal layers.

4.7 Convergence and Escalation

Persistence alone is not always enough to justify escalation. A signal may persist and still remain ambiguous. The engine becomes stronger when persistent deviation is reinforced by movement in other parts of the observation field.

That reinforcement is convergence.

The purpose of this section is to define how the system determines whether multiple signals are beginning to support the same concern and when that combined structure is strong enough to move from monitoring to alert.

The governing principle is: one signal may warn; multiple aligned signals justify escalation.

4.7.1 What Convergence Means

Convergence occurs when abnormality is not confined to one isolated layer, but begins to appear across two or more related layers in a temporally meaningful way.

This may include combinations such as: blood deviation with symptom recurrence, blood deviation with wearable instability, symptom recurrence with failed recovery and exposure pattern, repeated biomarker shift with prior-history similarity, moderate abnormalities across several layers that together form a stronger pattern than any one layer alone.

The point is not that every layer must move at once. The point is that partial deviations begin to support one another.

4.7.2 Why Convergence Matters

A persistent signal can still be misleading if it remains isolated.

Convergence strengthens interpretation because it reduces the likelihood that the engine is responding to a narrow artefact or one-dimensional fluctuation. When several layers move in compatible directions, the pattern becomes more coherent.

So the engine should ask not only: Is deviation present? Is it persistent?

It should also ask: Is the deviation beginning to organise across the observation field?

That question defines the difference between drift that is merely local and drift that is becoming systemically meaningful.

4.7.3 Convergence Function

Let: G = total number of signal groups in the observation field; A_g(t) = indicator that signal group g is abnormal at time t.

Then convergence at time t can be expressed as: \(C(t) = \frac{1}{G}\sum_{g=1}^{G} A_g(t)\)

This gives the proportion of signal groups currently showing abnormality.

So: C(t) near 0 means abnormality is isolated; C(t) at moderate levels means partial cross-layer reinforcement; C(t) at higher levels means broad convergence.

This does not require every group to be abnormal, or even measured, at every time point. It simply measures how distributed the concern has become.

This additive convergence term is retained as the base operational form. In implementation, however, convergence should not be assumed to remain purely linear. Some signal combinations may produce stronger joint meaning than the sum of their separate contributions, especially where deviations are temporally clustered or occur within the same inherited-risk corridor. The implementation layer should therefore test interaction terms between selected signal classes. The additive form remains the first-order model; interaction effects become the calibrated second-order extension.

4.7.4 Weighted Convergence

Not all signal groups carry equal interpretive value.

For some applications, blood and inherited-risk layers may matter more. For others, wearables and recovery pattern may carry greater importance.

So weighted convergence may also be used: \(C_w(t) = \frac{\sum_g w_g A_g(t)}{\sum_g w_g}\), where \(w_g\) is the weight assigned to group g.

This preserves the same logic while allowing disease-specific tuning later.
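The two convergence measures above can be sketched as follows. This is a minimal illustration, not a prescribed implementation; the group names and weight values are assumptions for the example only.

```python
# Sketch of the convergence measures C(t) and C_w(t) from Sections 4.7.3-4.7.4.
# Group names and weights below are illustrative assumptions.

def convergence(abnormal_flags):
    """Unweighted convergence: proportion of signal groups abnormal at time t."""
    if not abnormal_flags:
        return 0.0
    return sum(abnormal_flags.values()) / len(abnormal_flags)

def weighted_convergence(abnormal_flags, weights):
    """Weighted convergence: sum(w_g * A_g) / sum(w_g)."""
    total_weight = sum(weights[g] for g in abnormal_flags)
    if total_weight == 0:
        return 0.0
    return sum(weights[g] * abnormal_flags[g] for g in abnormal_flags) / total_weight

# Example observation field: blood and symptoms abnormal, wearables quiet.
flags = {"blood": 1, "symptoms": 1, "wearables": 0, "exposure": 0}
weights = {"blood": 2.0, "symptoms": 1.0, "wearables": 1.0, "exposure": 0.5}

print(convergence(flags))                         # 0.5: half the groups abnormal
print(round(weighted_convergence(flags, weights), 3))  # 0.667: blood's weight pulls it up
```

The weighted form reduces to the unweighted form when all weights are equal, which is why the paper can treat the additive version as the first-order model.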

4.7.6 Soft Convergence Versus Hard Convergence

The model should distinguish between weak and strong convergence.

Soft convergence means two or more layers show mild or partial support. Hard convergence means several layers show sustained, mutually reinforcing abnormality.

This distinction matters because the engine should be able to remain watchful before it becomes assertive.

So convergence should be read as a graded field: isolated abnormality, weak convergence, moderate convergence, strong convergence.

This allows the engine to preserve nuance rather than forcing premature alerting.
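The graded field can be made concrete with a simple banding function. The cut-points used here (0.25, 0.5, 0.75) are illustrative assumptions that would need disease-specific calibration.

```python
# Minimal grading of the convergence score C(t) into the four bands named above.
# Cut-points are illustrative assumptions, not calibrated values.

def convergence_band(c):
    if c < 0.25:
        return "isolated abnormality"
    if c < 0.5:
        return "weak convergence"
    if c < 0.75:
        return "moderate convergence"
    return "strong convergence"

print(convergence_band(0.1))   # isolated abnormality
print(convergence_band(0.4))   # weak convergence
print(convergence_band(0.6))   # moderate convergence
print(convergence_band(0.9))   # strong convergence
```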

4.7.7 Escalation Is Not Final Diagnosis

Escalation must be understood correctly.

Escalation does not mean the system has proven disease. Escalation means the monitored abnormality is now strong enough that continued passive observation alone is no longer sufficient.

This is the point where the engine moves from watching to recommending active review.

That distinction protects the model from overclaiming while preserving its true strength, which is early structured warning.

4.7.8 Boundary Between Persistent Divergence and Escalation Candidate

The main boundary operator here is joint reinforcement.

A person remains in Persistent Divergence when abnormality is real but still insufficiently reinforced.

A person moves to Escalation Candidate when persistent abnormality is joined by enough magnitude, convergence, or recovery failure that the pattern becomes actionable.

So the state transition can be expressed in principle as: Persistent Divergence → Escalation Candidate when D(t), P(t), and C(t) jointly exceed their operative threshold structure.

This is the main escalation gate in the model.
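The joint threshold structure can be expressed as a single predicate over D(t), P(t), and C(t). The threshold values here are illustrative assumptions; the paper leaves their calibration to the implementation layer.

```python
# Sketch of the escalation gate: Persistent Divergence -> Escalation Candidate
# only when deviation D(t), persistence P(t), and convergence C(t) jointly
# exceed their operative thresholds. Threshold values are illustrative.

def escalation_gate(d, p, c, d_min=1.5, p_min=0.6, c_min=0.5):
    """Return True when all three terms clear their thresholds jointly."""
    return d >= d_min and p >= p_min and c >= c_min

# Strong, persistent deviation that remains isolated: no escalation.
print(escalation_gate(d=2.0, p=0.8, c=0.2))  # False
# The same deviation with cross-layer reinforcement: escalation candidate.
print(escalation_gate(d=2.0, p=0.8, c=0.6))  # True
```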

4.7.9 Section Conclusion

Convergence gives the engine its ability to determine when persistent deviation is no longer merely local or ambiguous. Once abnormality begins to organise across multiple signal layers, the pattern gains structural force. Escalation then occurs not because one reading looked concerning, but because deviation, persistence, and convergence have become jointly strong enough to justify active review.

The remaining safeguard is commit discipline: how the engine avoids collapsing too early on incomplete evidence.

4.8 Commit Conditions

A prognostic engine must not only detect drift. It must also know when not to commit too early.

That is the purpose of commit conditions.

The system may observe deviation. It may observe persistence. It may even observe partial convergence.

But until the evidence is sufficiently separated from competing interpretations, the engine should retain uncertainty rather than collapsing prematurely into a stronger state.

This is one of the main safeguards in the model.

4.8.1 Why Commit Discipline Matters

Without commit discipline, the engine becomes unstable.

It may: overreact to temporary clustering, misread contextual disturbance as structural deterioration, force escalation from incomplete information, generate noisy alerts that reduce trust in the system.

A strong prognostic model therefore needs two capacities at once: sensitivity to early formation, and restraint in the face of incomplete reinforcement.

Commit conditions provide that restraint.

4.8.2 Retaining Competing Interpretations

At any point in time, more than one state may remain plausible.

For example: a person may show Soft Deviation that still plausibly reflects transient disturbance; a person may show Persistent Divergence but without enough convergence for escalation; a person may appear close to escalation, while partial recovery still remains possible.

So the engine should not treat state as a crude yes-or-no label. It should retain a graded belief over possible states.

Let γ_t(j) = probability or confidence assigned to state j at time t.

Then the engine maintains a distribution over states rather than an immediate single-state collapse.

This means the system can say: the person is most likely in one state, but other neighbouring states remain live possibilities.

That is the correct behaviour for an early-warning model.
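The graded-belief idea can be sketched as a normalised distribution over the state field, updated multiplicatively as evidence arrives. The state names follow the chapter; the update rule is an illustrative sketch, not the paper's prescribed inference method.

```python
# Minimal graded-belief structure: the engine keeps a normalised confidence
# gamma_t(j) over states rather than collapsing to a single label.
# The multiplicative update rule is an illustrative assumption.

STATES = ["Stable", "Susceptible", "Soft Deviation",
          "Persistent Divergence", "Escalation Candidate"]

def normalise(belief):
    total = sum(belief.values())
    return {s: v / total for s, v in belief.items()}

def update_belief(belief, evidence_support):
    """Scale each state's belief by how well current evidence supports it."""
    return normalise({s: belief[s] * evidence_support.get(s, 1.0) for s in belief})

belief = normalise({s: 1.0 for s in STATES})   # uninformative start
belief = update_belief(belief, {"Soft Deviation": 3.0, "Persistent Divergence": 2.0})

leading = max(belief, key=belief.get)
print(leading)  # Soft Deviation leads, but neighbouring states remain live
```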

4.8.3 Separation Requirement

Commitment should occur only when the leading interpretation is sufficiently separated from the next strongest alternative.

Let: j* = most likely state at time t; γ_t(j*) = confidence in the leading state; γ_t(k) = confidence in competing state k.

Define the separation measure: \(BF(t) = \frac{\gamma_t(j^*)}{\max_{k \ne j^*} \gamma_t(k)}\)

This expresses how clearly the leading state outruns the nearest competing explanation.

If BF(t) is low, the model should remain cautious. If BF(t) is high, commitment becomes more justified.
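The separation measure can be computed directly from the belief distribution. The commit threshold of 3.0 used here is an illustrative assumption.

```python
# Sketch of BF(t): the ratio of the leading state's confidence to the
# strongest competing state's confidence. Commit threshold is illustrative.

def separation(belief):
    """How clearly the leading state outruns its nearest competitor."""
    ordered = sorted(belief.values(), reverse=True)
    if len(ordered) < 2 or ordered[1] == 0:
        return float("inf")
    return ordered[0] / ordered[1]

belief = {"Soft Deviation": 0.55, "Persistent Divergence": 0.30, "Stable": 0.15}
bf = separation(belief)
print(round(bf, 2))   # 1.83: leading state not yet well separated
print(bf >= 3.0)      # False: remain cautious, do not commit
```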

4.8.5 Soft Hold Versus Hard Commit

The engine should recognise two different internal modes before and after commitment.

Soft hold means: the system is tracking meaningful abnormality, concern is real, neighbouring interpretations remain open, monitoring continues without full escalation.

Hard commit means: the abnormality pattern is sufficiently reinforced, the leading state clearly outruns alternatives, escalation or state transition is now justified.

This distinction is important because many early biological processes are not immediately clean. The engine must be allowed to hold concern without pretending certainty.

4.8.6 Commit Failure and Reversal

Commit conditions must also allow reversal.

A strong model should be able to step back if: convergence weakens, deviation recovers, persistence drops, contextual explanation becomes stronger, the leading state no longer clearly outruns alternatives.

So commitment is not permanent by default. It remains conditional on the pattern continuing to justify it.

This preserves the system's temporal honesty.

The engine should therefore allow: state strengthening, state holding, state de-escalation, return to stability.

That is more realistic than a one-way ladder.
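A reversible state ladder can be sketched as follows. The one-step-at-a-time adjacency rule is an illustrative assumption, not a requirement of the model.

```python
# Sketch of reversible transitions across the state field: strengthen, hold,
# de-escalate, or return to stability. One-step adjacency is an assumption.

STATES = ["Stable", "Susceptible", "Soft Deviation",
          "Persistent Divergence", "Escalation Candidate"]

def step(state, direction):
    """direction: +1 strengthen, 0 hold, -1 de-escalate."""
    i = STATES.index(state) + direction
    return STATES[max(0, min(i, len(STATES) - 1))]

s = "Persistent Divergence"
s = step(s, -1)   # convergence weakened: step back
print(s)          # Soft Deviation
s = step(s, -1)
s = step(s, -1)
print(s)          # Stable: full return is allowed, not a one-way ladder
```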

4.8.7 Escalation Commit Condition

The move into Escalation Candidate should therefore require two layers of control.

First, the joint condition must hold: deviation is strong enough, persistence is strong enough, convergence is strong enough.

Second, the interpretive separation must hold: the escalation interpretation sufficiently outruns weaker neighbouring interpretations.

So the full escalation principle is: Escalate only when the pattern is both jointly reinforced and sufficiently separated from competing explanations.

That is the main commit safeguard in the architecture.
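The two layers of control combine into a single commit predicate: joint reinforcement of deviation, persistence, and convergence, plus interpretive separation. All threshold values here are illustrative assumptions requiring calibration.

```python
# The full escalation principle as one predicate: escalate only when the
# pattern is jointly reinforced AND clearly separated from competing
# interpretations. Thresholds are illustrative assumptions.

def commit_escalation(d, p, c, bf,
                      d_min=1.5, p_min=0.6, c_min=0.5, bf_min=3.0):
    jointly_reinforced = d >= d_min and p >= p_min and c >= c_min
    clearly_separated = bf >= bf_min
    return jointly_reinforced and clearly_separated

# Reinforced but still ambiguous between interpretations: hold.
print(commit_escalation(d=2.0, p=0.8, c=0.6, bf=1.5))  # False
# Reinforced and clearly separated: escalation is justified.
print(commit_escalation(d=2.0, p=0.8, c=0.6, bf=4.0))  # True
```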

4.8.8 Why This Strengthens the Paper

This section matters because it prevents the model from sounding naïve or overstated.

The paper is not proposing a machine that panics at every abnormality. It is proposing a disciplined prognostic engine that: detects early movement, preserves uncertainty where necessary, strengthens concern only when the evidence stabilises, escalates only when justified.

That is a much stronger position.

4.8.9 Section Conclusion

Commit conditions give the engine interpretive discipline. They ensure that the system does not collapse too early on incomplete evidence, but instead retains neighbouring possibilities until the leading state becomes sufficiently reinforced and clearly separated. This makes the model sensitive without becoming unstable, and cautious without becoming blind.

4.9 Chapter Conclusion

This chapter has defined the internal state logic of the prognostic engine.

The model does not treat health as a binary switch between normality and disease. It treats deterioration as movement across a structured state field: Stable → Susceptible → Soft Deviation → Persistent Divergence → Escalation Candidate → Clinical Review.

That movement is inferred through the observation field, measured through deviation from the personal corridor, strengthened through persistence, reinforced through convergence, and controlled through commit conditions.

The chapter therefore establishes the engine's core interpretive rule: early warning becomes actionable only when deviation, persistence, convergence, and state separation become jointly sufficient.

This completes the internal state architecture.

The next chapter should move outward from state logic to data structure by defining the signal layers in greater practical detail.

Chapter 5. Signal Sources and Observation Layers

The previous chapter defined the state logic of the engine. This chapter defines the signal structure through which those states are inferred in practice.

A prognostic model is only as strong as its observation layers. If the signal field is too narrow, early deterioration will be missed. If it is too loose, the model becomes noisy and unstable. The task is therefore to define a layered signal architecture that is broad enough to detect formation, but disciplined enough to remain interpretable.

The central proposition of this chapter is: the engine should not rely on one privileged signal, but on a structured observation field composed of biological, physiological, behavioural, contextual, and historical layers.

That layered design matters because early deterioration is rarely total at the beginning. It appears in fragments, weak reinforcements, partial drift, failed recovery, or recurring anomalies across different domains. A single-layer model will either miss this or overstate it. A multi-layer model can hold partial emergence and detect when it begins to organise.

5.1 Purpose

This chapter establishes the signal architecture of the system.

It defines: the major signal classes, the role of each class, how each class contributes to early drift detection, how different disease applications weight the same architecture differently, how the engine separates primary, secondary, and contextual evidence.

The purpose is not to produce a random inventory of possible inputs. It is to define a lawful observation structure.

5.1.1 Chapter Function

The function of this chapter is to answer: what exactly is the engine looking at when it decides that a person is stable, drifting, or approaching escalation?

5.2 Blood-Based Biological Signals

The blood layer is the primary biological observation layer of the engine. It provides repeatable internal measurements that can be sampled across time and compared against the person's own corridor. In the architecture of this paper, blood is not treated as a one-off truth event. It is treated as a recurring biological signal field.

That distinction matters.

A single blood result may indicate very little on its own. A sequence of blood results, interpreted against personal baseline, can reveal: early deviation, repeated deviation, failed recovery, progressive movement, cross-signal reinforcement with symptoms, wearables, or exposure.

This makes blood a strong anchor layer for prognosis.

5.2.1 Why Blood Sits at the Centre

Blood matters because it provides a direct internal biochemical view of the person's present state. It is closer to biological process than many surface-level observations, yet still compatible with repeated low-burden capture through the user loop.

For the purposes of this model, the blood layer performs four functions: detects internal deviation before overt manifestation, gives repeatable quantitative markers for temporal comparison, supports cross-checking against symptom and wearable data, helps define whether drift is local, systemic, or recovering.

So blood is not the whole engine, but it is one of its strongest internal anchors.

5.2.2 Blood as a Signal Class, Not a Single Test

The paper should not overcommit to one analyte list too early. It is stronger to define blood as a layered class of repeatable biological signals.

These may include: inflammatory markers, metabolic markers, immune-response markers, tissue-stress indicators, endocrine-linked measures, disease-specific biomarker panels where validated.

This keeps the architecture general.

The same blood layer can later be specialised for different application domains: oncology, metabolic drift, inflammatory disease, autoimmune instability, recovery failure, treatment monitoring.

So the model begins at the level of signal class, then later applies disease-specific weighting.

5.2.3 Longitudinal Blood Interpretation

The engine should never treat a blood reading in isolation where a sequence is available.

The correct interpretive order is: compare current blood value to personal baseline, measure deviation from ordinary corridor, test whether that deviation persists, check whether related blood signals are co-moving, check whether extra-blood layers reinforce or weaken the interpretation.

This is what turns blood from laboratory reporting into prognostic structure.

The central question is not: What is the value now?

The stronger question is: How is this value behaving relative to this person's own history?

That is the point at which blood enters the state engine rather than remaining a raw output.

5.2.4 Blood Signal Grouping

For operational clarity, blood signals should be grouped rather than treated as an undifferentiated list.

A practical grouping structure is:

Group A: inflammatory and immune activity – signals associated with ongoing inflammation, abnormal immune behaviour, or repeated immune load.

Group B: metabolic and regulatory function – signals associated with glucose handling, lipid balance, energy regulation, and broader metabolic drift.

Group C: tissue stress and organ strain – signals associated with unusual burden, injury, stress, or breakdown processes.

Group D: disease-specific targeted panels – signals selected for a particular application layer, such as oncology-oriented or autoimmune-oriented surveillance.

This grouping helps the engine do two things: track within-group persistence, and detect between-group convergence. That is stronger than reading every blood variable as a disconnected event.
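The Group A-D structure can be sketched as a simple registry that maps markers to groups and flags group-level abnormality. The marker names are illustrative placeholders, not a prescribed panel.

```python
# Sketch of the blood-signal grouping from Section 5.2.4: markers register
# to groups so the engine can track within-group persistence and
# between-group convergence. Marker names are illustrative placeholders.

BLOOD_GROUPS = {
    "A_inflammatory_immune": ["marker_inflammation_1", "marker_immune_1"],
    "B_metabolic_regulatory": ["marker_glucose", "marker_lipid"],
    "C_tissue_stress":        ["marker_organ_strain"],
    "D_disease_specific":     ["marker_targeted_panel_1"],
}

def group_abnormality(abnormal_markers):
    """A group is flagged when any of its registered markers is abnormal."""
    return {g: int(any(m in abnormal_markers for m in members))
            for g, members in BLOOD_GROUPS.items()}

flags = group_abnormality({"marker_inflammation_1", "marker_glucose"})
print(flags)  # Groups A and B flagged: between-group convergence is 2 of 4
```

The resulting group flags are exactly the A_g(t) indicators consumed by the convergence measure of Chapter 4.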

5.2.5 Personal Blood Corridor

Each repeatable blood signal should have its own personal corridor.

That corridor includes: usual level, ordinary fluctuation band, directional history, recovery behaviour after disturbance, relation to adjacent blood markers.

So the engine is not looking only for "high" or "low." It is looking for whether a marker is: leaving its own corridor, staying outside it, widening away from it, failing to return, dragging related markers with it.

This is the biological version of drift.
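A minimal corridor check for one marker can be built from the person's own history: a usual level plus an ordinary fluctuation band. The band width of two sample standard deviations is an illustrative assumption.

```python
# Sketch of a personal blood corridor: usual level plus an ordinary
# fluctuation band derived from the person's own history. The band width
# (2 standard deviations) is an illustrative assumption.

from statistics import mean, stdev

def corridor(history, width=2.0):
    """Personal corridor: (low, high) around this person's usual level."""
    centre, spread = mean(history), stdev(history)
    return centre - width * spread, centre + width * spread

def outside_corridor(value, history):
    low, high = corridor(history)
    return value < low or value > high

history = [5.1, 5.3, 4.9, 5.2, 5.0, 5.1]   # this person's usual readings
print(outside_corridor(5.2, history))       # False: within ordinary fluctuation
print(outside_corridor(6.4, history))       # True: marker leaving its corridor
```

Note that the check is relative to the person, not the population: 6.4 might be unremarkable for someone else but is drift for this history.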

5.2.6 Blood Layer Strength Versus Blood Layer Limits

The blood layer is strong because it offers direct biological information. But it is not sufficient by itself in every case.

A blood abnormality may be: real and important, transient and contextual, technically noisy, weak unless reinforced by other layers.

That is why the engine must not overread blood in isolation.

The blood layer gains prognostic power when joined to: symptom recurrence, wearable instability, failed recovery, prior-history similarity, inherited susceptibility.

So the correct architecture is not blood-only. It is blood-anchored but cross-layer aware.

5.2.7 Blood in the User Experience Model

Within the user loop, blood has a special role because it is both biologically meaningful and practically repeatable. It is one of the few internal signal classes that can be gathered in a small-volume format and passed into an app-based continuity system.

That makes it ideal for the model proposed in this paper.

The user does not need to understand every biochemical detail. The system does not need to display every raw marker at equal prominence. The engine only needs to determine whether the blood layer remains within the person's corridor or is contributing to a drift pattern.

This keeps the blood layer useful without making the interface unusable.

5.2.8 Section Conclusion

The blood layer is the central biological observation field of the prognostic engine. Its value lies not in one-off test interpretation, but in repeated measurement across time relative to the person's own corridor. By grouping blood signals, tracking within-person deviation, and checking reinforcement across other layers, the engine can use blood to detect early biological drift before overt manifestation.

5.3 Wearable and Physiological Signals

The wearable and physiological layer provides temporal density between biological sampling windows. If the blood layer gives repeated internal checkpoints, the wearable layer gives ongoing continuity. It allows the engine to observe whether the person is remaining stable in daily life or beginning to show repeated physiological disturbance that supports a drift interpretation.

This layer matters because many early changes do not first appear as dramatic biochemical events. They may appear as altered rhythm, failed recovery, sleep disruption, temperature irregularity, autonomic instability, or movement-pattern change. These are not always decisive on their own, but they become powerful when they persist and reinforce other layers.

5.3.1 Why the Wearable Layer Matters

The wearable layer matters for three reasons.

First, it increases observation frequency. Second, it captures recovery and rhythm rather than isolated status. Third, it strengthens cross-layer convergence.

A prognostic engine needs this because deterioration is often temporal before it is categorical. The body may begin to lose regularity before it becomes clinically legible in a narrow institutional frame.

So wearable signals help answer questions such as: Is the person recovering normally? Is physiological rhythm becoming unstable? Is there repeated stress without return to baseline? Are subtle disturbances accumulating between blood samples?

That is their real value.

5.3.2 Core Wearable Signal Classes

The paper should keep the wearable layer broad but structured. The core classes are: heart-rate pattern, heart-rate variability, sleep continuity and fragmentation, movement and inactivity pattern, temperature trend, respiration-linked features where available.

These do not need to be framed as final diagnostic indicators. They function as continuity signals that help the engine detect regulation failure, rhythm disturbance, and multi-day pattern change.

5.3.3 Rhythm as an Early Indicator

One of the strongest contributions of the wearable layer is rhythm analysis.

A person may remain technically functional while showing: reduced sleep stability, altered recovery after exertion, prolonged autonomic stress, irregular movement-rest cycles, abnormal temperature drift.

These are not necessarily disease proofs. But they may indicate that the system is no longer self-regulating in its ordinary way.

That is why rhythm belongs in the engine. It is often the earliest visible form of instability.

5.3.4 Recovery Tracking

Wearables are especially useful for recovery tracking.

The engine should ask: Does the person return to prior physiological range after stress? Is recovery slower than usual? Does instability recur before recovery is complete? Is the baseline rhythm becoming harder to restore?

This is important because early deterioration often appears not only as deviation, but as failure to recover. The body may still oscillate, but the oscillation no longer returns cleanly to its prior corridor.

That is a highly prognostic pattern.
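Recovery failure can be detected with a simple post-stress comparison against the pre-stress level. The recovery window and tolerance used here are illustrative assumptions.

```python
# Sketch of recovery tracking: after a stress event, does the signal return
# near its prior level within an expected window? Window and tolerance
# values are illustrative assumptions.

def recovered(pre_stress_level, post_stress_series, tolerance=0.1, window=3):
    """True if any of the last `window` readings returns near baseline."""
    recent = post_stress_series[-window:]
    return any(abs(x - pre_stress_level) <= tolerance * abs(pre_stress_level)
               for x in recent)

baseline = 60.0  # e.g. a resting heart-rate level before a stressor (illustrative)
print(recovered(baseline, [85, 78, 70, 63, 61]))  # True: clean return
print(recovered(baseline, [85, 80, 78, 77, 76]))  # False: failed recovery
```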

5.3.5 Wearables Are Supportive, Not Sovereign

The paper should remain precise here.

The wearable layer is powerful, but it should not be treated as sovereign by itself. Consumer wearables can be noisy, incomplete, or context-sensitive. Their value lies in continuity and pattern support, not in pretending that one device reading alone should govern the full interpretation.

So the correct role of the wearable layer is: continuity support, recovery monitoring, rhythm instability detection, convergence reinforcement.

That makes it strong without overstating it.

5.3.6 Wearable Corridor Logic

Like the blood layer, the wearable layer should also be interpreted against the person's own corridor.

The engine is not primarily asking whether a sleep score or heart-rate pattern is abnormal for the population. It is asking whether it is abnormal for this person over time.

So the wearable layer should track: usual rhythm, ordinary day-to-day variance, response to routine stressors, response to unusual stressors, recovery speed, recurrence tendency.

This makes wearable data part of the same equilibrium framework rather than a separate gadget stream.

5.3.7 Cross-Layer Role

The wearable layer becomes most powerful when joined to other layers.

Examples include: blood deviation plus poor recovery pattern, symptom recurrence plus temperature irregularity, rising inflammatory signals plus sleep fragmentation, exposure load plus reduced autonomic stability.

In these cases, wearables do not prove the condition. They strengthen the structure of the observed drift.

That is exactly the role they should play in this model.

5.3.8 User Experience Advantage

From a user perspective, wearables also reduce burden because they capture data passively. This matters because a prognostic system should not rely entirely on active user effort. Passive physiological capture helps preserve continuity even when the user is tired, stressed, or not fully engaged.

So the wearable layer improves not only signal density, but adherence stability.

That makes it operationally important.

5.3.9 Section Conclusion

The wearable and physiological layer gives the engine continuity between sampling events. It captures rhythm, recovery, and autonomic stability across ordinary life and helps the system detect early instability that may not yet be obvious in a single biological reading. Its strength lies not in standalone certainty, but in temporal density and cross-layer reinforcement.

5.4 Symptom and Self-Report Signals

The symptom and self-report layer captures what the person experiences before, during, and between measurable deviations. This layer is essential because not all meaningful change appears first in a laboratory value or passive device stream. Some changes appear first as felt disturbance: fatigue, discomfort, altered recovery, unusual recurrence, or subtle functional decline.

The engine should therefore treat self-report as a legitimate signal class, but not as a sovereign one. Its strength lies in pattern, repetition, and convergence with other layers.

5.4.1 Why Self-Report Matters

The person is often the first observer of drift.

Before a marker becomes clearly abnormal, the person may notice: unusual tiredness, altered sleep quality, recurrent pain or discomfort, appetite change, temperature sensitivity, poor recovery after ordinary effort, vague but repeated "not right" states.

These are important because the body does not only communicate through instruments. It also communicates through lived experience.

A prognostic engine that ignores this loses an entire early-warning channel.

5.4.2 Self-Report as Structured Input

The paper should not treat symptom reporting as loose narrative alone. It should be structured enough to become longitudinally usable.

A practical self-report architecture includes: symptom type, onset time, duration, recurrence frequency, intensity band, recovery pattern, relation to other events, user confidence or certainty.

This allows the engine to preserve symptom information as signal rather than anecdote.

The point is not to turn lived experience into sterile abstraction. The point is to make repeated experience trackable across time.
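The self-report fields listed above can be captured in a small structured record. The field names and the intensity banding are illustrative assumptions for the sketch.

```python
# Minimal structured self-report record following the fields of Section 5.4.2.
# Field names and the 1-5 intensity band are illustrative assumptions.

from dataclasses import dataclass

@dataclass
class SymptomReport:
    symptom_type: str
    onset_time: str          # ISO date string, kept simple for the sketch
    duration_hours: float
    recurrence_count: int    # times this symptom type recurred recently
    intensity: int           # banded 1 (mild) to 5 (severe)
    resolved_cleanly: bool
    user_confidence: float   # 0.0-1.0 certainty of the report

report = SymptomReport(
    symptom_type="fatigue", onset_time="2026-04-10", duration_hours=48.0,
    recurrence_count=3, intensity=2, resolved_cleanly=False,
    user_confidence=0.8,
)
# Repetition and failed resolution, not the single complaint, carry the signal:
print(report.recurrence_count >= 2 and not report.resolved_cleanly)  # True
```

Because each report is structured, recurrence frequency and recovery pattern become queryable across time rather than remaining anecdote.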

5.4.3 Why Repetition Matters More Than Single Complaint

A one-off symptom may be noise, context, or transient disturbance. Repeated symptoms of similar type, timing, or intensity carry much greater prognostic value.

So the engine should ask: Does the same complaint recur? Does it recur in the same temporal corridor? Does it intensify? Does it fail to resolve cleanly? Does it appear alongside blood or wearable deviation?

This is where self-report becomes structurally meaningful.

The signal is not just the symptom itself. The signal is the recurrence pattern.

5.4.4 Subjective Does Not Mean Useless

A common institutional weakness is to treat self-report as weak because it is subjective. That is too crude.

Subjective does not mean false. It means first-person.

In a pre-event system, first-person data can be extremely valuable because the system is specifically trying to detect early departure before overt manifestation. At that stage, soft experiential change may appear before stronger external confirmation.

So the correct stance is: self-report is not final proof, but it is valid early evidence.

That is the right balance for this paper.

5.4.5 Symptom Clustering

The symptom layer should also allow clustering.

Examples include: fatigue plus sleep fragmentation, pain plus inflammatory drift, poor recovery plus wearable instability, appetite change plus metabolic deviation, recurrent vague discomfort plus persistent blood-layer change.

A single symptom may remain weak. A symptom cluster, especially when repeated, becomes much stronger.

This is why the engine should not look only at symptom count. It should also look at symptom pattern.

5.4.6 Silence Is Not the Same as Stability

The paper should also remain careful in the opposite direction. Lack of reported symptoms does not automatically mean stability. Some important biological drifts remain subjectively silent for long periods.

So the engine must treat self-report as one layer among several, not as the sole gate of interpretation.

That means: symptoms can strengthen concern, symptom absence does not automatically remove concern, self-report must be read alongside blood, wearables, history, and exposure.

This prevents the model from becoming too dependent on conscious noticing alone.

5.4.7 User Experience Value

From a UX perspective, the self-report layer is important because it gives the person an active role in the system. The user is not just being measured. The user is participating in continuity.

That has two benefits: it improves temporal detail, and it preserves trust, because the person can see that their lived experience matters inside the architecture.

This is particularly important in a system designed to move beyond purely institutional timing.

5.4.8 Section Conclusion

The symptom and self-report layer captures first-person evidence of change across time. Its value lies not in isolated complaint, but in repeated, structured, and convergent experience that supports or clarifies other layers of the observation field. In a prognostic model, this layer helps detect early departure that may still be soft in purely external measures.

5.5 Behavioural and Exposure Signals

The behavioural and exposure layer provides the context in which all other signals must be interpreted. No deviation is fully meaningful in abstraction. A changed reading may reflect genuine emerging deterioration, but it may also reflect stress load, environmental burden, disrupted sleep, medication change, infection exposure, unusual exertion, dietary instability, or other contextual forces.

The engine therefore requires a contextual field.

Without it, the model risks two errors: overreading explainable disturbance as structural drift, and underreading repeated harmful exposure that is actively shaping the drift path.

This layer helps prevent both.

5.5.1 Why Context Matters

A signal is never just a number. It occurs inside conditions.

The same blood deviation may carry very different meaning depending on whether the person has: slept normally or poorly, experienced acute stress, changed medication, travelled, changed diet sharply, encountered infection, worked under unusual pressure, experienced unusual heat, cold, pollution, or disruption.

So the engine must not only ask: What changed?

It must also ask: What surrounded the change?

That is the role of the exposure layer.

5.5.2 Behaviour as a Drift Driver

Behavioural patterns matter because they do not simply explain readings after the fact. They may actively drive the deviation itself.

This includes patterns such as: repeated poor sleep, chronic stress loading, prolonged inactivity, unstable eating rhythm, alcohol or substance burden, overexertion without recovery, irregular medication adherence, repeated disruption of ordinary routine.

These do not guarantee disease. But they can alter the person's corridor, weaken recovery, and amplify other abnormalities.

So behaviour in this model is not background decoration. It is part of the causal field.

5.5.3 Exposure as Environmental Load

The exposure side of the layer captures conditions that may affect biological stability without being purely behavioural.

This may include: infection exposure, allergen burden, pollution load, workplace strain, heat stress or cold stress, travel-related disruption, toxic or inflammatory environment, recurrent environmental conditions linked to prior deviations.

This matters because some drift paths are not driven primarily by internal defect. They are driven by repeated external load acting on the person's system.

The engine should therefore preserve exposure memory across time.

5.5.4 Context Does Not Cancel Deviation

It is important to state this clearly.

Context should not be used to erase concerning deviation too easily.

The fact that a person was stressed or sleep-deprived does not necessarily make the signal unimportant. It may instead show: vulnerability to that stressor, reduced recovery capacity, repeated destabilisation under common conditions, an exposure-linked pathway that is itself meaningful.

So the correct role of context is not to explain away deviation. It is to classify the deviation more accurately.

5.5.5 Repeated Context-Signal Pairing

One of the most useful functions of this layer is repeated pairing.

The engine should track whether particular deviations repeatedly co-occur with: certain behaviours, certain environments, certain time patterns, certain recovery failures.

This helps distinguish: random disturbance, recurring contextual destabilisation, more autonomous unexplained drift.

That distinction is important. If a pattern only appears under one specific burden, the intervention path may differ from a pattern that appears regardless of context.

5.5.6 Context as a Weight Modifier

The behavioural and exposure layer should also modify interpretive weight.

For example: a blood deviation after extreme stress may initially be down-weighted, then re-evaluated for recovery; a mild abnormality repeated under stable conditions may be up-weighted; recurring symptoms under repeated environmental exposure may be treated as a structured pattern rather than isolated complaint.

So context does not simply sit beside the model. It changes how the model reads other layers.
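The down-weight-then-re-evaluate behaviour described above can be sketched as follows. The context tags, discount multipliers, and recovery check are illustrative assumptions, not fixed design values:

```python
# Sketch of context acting as a weight modifier: a deviation observed
# under a known destabilising context is initially down-weighted, then
# re-evaluated for recovery. Tags and multipliers are assumptions.

CONTEXT_DISCOUNT = {"extreme_stress": 0.5, "acute_illness": 0.6}

def contextual_weight(z, contexts, recovered_next_window=None):
    """Interpretive force of a normalised deviation z under context."""
    weight = 1.0
    for tag in contexts:
        weight *= CONTEXT_DISCOUNT.get(tag, 1.0)
    # Re-evaluation step: if the deviation failed to recover once the
    # context had passed, the discount is withdrawn.
    if recovered_next_window is False:
        weight = 1.0
    return abs(z) * weight
```

The key design point is that context changes the reading of the signal without deleting it: the deviation is retained at full magnitude and only its interpretive force is scaled.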

5.5.7 User Experience Importance

From a UX perspective, this layer is also important because it allows the person to record what was happening around a deviation, rather than reducing their experience to abstract readings alone.

That improves trust and interpretability.

The user can see that the system is not asking only: What number came in?

It is also asking: What was occurring in life at the time?

That makes the model more human and more accurate.

5.5.8 Section Conclusion

The behavioural and exposure layer gives the engine contextual intelligence. It allows the model to interpret deviation in light of lifestyle, stress, environment, routine disruption, and recurring external load. This prevents naïve reading of isolated signals and helps distinguish explainable disturbance, context-linked instability, and genuinely unexplained drift.

5.6 Health History and Prior Event Memory

The health-history layer gives the engine temporal depth. Without it, the system sees only current deviation. With it, the system can determine whether the present pattern is novel, recurrent, seasonal, context-linked, partially resolved, or structurally similar to a prior event.

This layer matters because prognosis is not only about what is happening now. It is also about what has happened before and how the person typically responds across time.

5.6.1 Why Prior Event Memory Matters

A present deviation is interpreted differently depending on whether it is: entirely new, a recurrence of a known pattern, a partial return of a previously unresolved issue, a repeat of a context-linked instability, a known false alarm pattern, part of a longer unfinished drift sequence.

So the engine must preserve memory of prior events, not just current values.

Without memory, the system is forced to read every disturbance as if it has no past. That weakens prognosis.

5.6.2 What the History Layer Should Contain

The prior-event layer should include: previous deviations from baseline, prior escalation episodes, prior confirmed diagnoses, prior non-diagnostic but significant anomalies, recovery times, recurrence intervals, known triggers or context associations, prior false-positive patterns, past treatment or intervention responses.

This gives the engine an interpretive archive rather than a flat present-tense feed.

5.6.3 Novelty Versus Recurrence

One of the most important functions of history is distinguishing novelty from recurrence.

A novel event may require cautious interpretation because no prior pattern exists.

A recurrent event may carry stronger weight because the system can compare: time since last event, similarity of signal shape, similarity of context, similarity of recovery behaviour, whether recurrence is worsening or shortening in interval.

This makes the model much stronger than one-off screening because it can ask not only whether something is wrong, but whether the person is re-entering an established deviation corridor.

5.6.4 Recovery Memory

History should also preserve recovery patterns.

This means the engine should know: how quickly the person usually returns to baseline, whether recovery tends to be complete or partial, whether certain deviations tend to leave residual instability, whether each recurrence recovers more slowly than the last.

This matters because worsening recovery is itself a prognostic signal.

A person may show similar deviation magnitude on two occasions, yet the second event may be more serious if recovery is slower, weaker, or incomplete.

So prior-event memory is not only about what happened. It is also about how the system came back.

5.6.5 Prior False Alarms and Pattern Discipline

The history layer also protects the model from noisy escalation.

If the engine can recognise that a particular combination of signals has previously appeared, resolved quickly, failed to converge further, and repeatedly remained non-progressive, then it can read current recurrence with greater discipline.

This does not mean dismissing the event automatically. It means reading it with informed caution.

So memory strengthens both sensitivity and restraint.

5.6.6 Temporal Compression Without History

Without prior-event memory, a system tends to compress time. It sees the current signal as a narrow present event rather than part of a larger trajectory.

That creates three weaknesses: recurrence may be mistaken for novelty, worsening interval may be missed, recovery failure across episodes may go unrecognised.

This is one of the main reasons the history layer belongs inside the prognostic engine itself rather than being left to fragmented recollection.

5.6.7 History as a Weighting Layer

The history layer should also modify interpretive weight.

Examples: a present deviation that closely resembles a previously escalated pattern may be up-weighted; a known benign recurrent fluctuation may initially be down-weighted but still monitored; a shorter recurrence interval than usual may increase concern; repeated incomplete recovery across episodes may strengthen escalation probability.

So health history is not passive background. It actively changes how the present is read.
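The weighting examples above can be expressed as a minimal sketch. The multiplier values, and the reduction of each condition to a boolean input, are assumptions for illustration only:

```python
# Sketch of history-based weight modification for a present deviation,
# following the examples in the text. Multipliers are assumed values.

def history_weight(resembles_escalated, known_benign_recurrence,
                   interval_shorter_than_usual, incomplete_recoveries):
    """Multiplier applied to a present deviation's interpretive weight."""
    w = 1.0
    if resembles_escalated:
        w *= 1.5   # resembles a previously escalated pattern: up-weight
    if known_benign_recurrence:
        w *= 0.6   # known benign fluctuation: down-weight, still monitored
    if interval_shorter_than_usual:
        w *= 1.3   # recurrence interval is shortening: increase concern
    if incomplete_recoveries:
        w *= 1.4   # repeated incomplete recovery across episodes
    return w
```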

5.6.8 User Experience Importance

From the user's perspective, prior-event memory also solves a common institutional failure: having to explain the same pattern repeatedly from scratch.

In this model, the system remembers: what has happened before, how it unfolded, what helped, what did not, whether the pattern is returning.

That makes the app more useful and more truthful to lived continuity.

5.6.9 Section Conclusion

The health-history and prior-event layer gives the engine temporal depth, recovery memory, and recurrence intelligence. It allows present deviation to be interpreted in light of past pattern, rather than as an isolated occurrence. This strengthens the model's ability to distinguish novelty, recurrence, worsening interval, and incomplete resolution across time.

5.7 Family Risk and Inherited Background

The family-risk and inherited layer defines the background susceptibility field of the person. It does not prove present deterioration, and it should not be treated as a diagnosis in advance. Its function is different. It establishes whether the person begins from a heavier prior risk position than would be inferred from current signals alone.

This matters because prognosis is not only about current deviation. It is also about the conditions under which deviation emerges. Family history and inherited predisposition can shift the interpretive meaning of otherwise modest patterns, especially where repeated low-level drift appears in a biologically relevant corridor. Research on hereditary cancer syndromes and broader genomic screening continues to support the practical relevance of inherited-risk identification for earlier surveillance and prevention planning.

5.7.1 Why Inherited Background Matters

Two people may present with the same weak present deviation and yet require different interpretive weight because their background fields are different.

One person may have: no known family pattern, no prior inherited signal, no known lineage-linked recurrence.

Another may have: repeated family occurrence in a related disease class, known inherited predisposition, a lineage pattern of early or recurring onset, prior family clustering around similar trajectories.

The present signal is the same. The background risk is not.

That is why inherited background belongs in the engine.

5.7.2 The Role of This Layer

This layer performs three functions.

First, it informs the Susceptible state. Second, it modifies how strongly present deviation is interpreted. Third, it helps determine how low the system's tolerance should be for repeated weak abnormalities in specific domains.

The key principle is: inherited background raises interpretive vigilance; it does not by itself force escalation.

That distinction keeps the model disciplined.

5.7.3 What the Layer Should Contain

The inherited-background field may include: family history of major disease classes, age of onset in relatives where known, recurrence patterns across generations, known inherited variants or predisposition findings where available, disease clustering within lineage, unresolved but repeated family-pattern concern.

This should be treated as structured background, not as a loose anecdotal note.

The engine does not need complete genetic certainty in every case for this layer to be useful. Even family-pattern recurrence can materially affect surveillance logic. At the same time, when genomic data are available, they can sharpen risk stratification and targeted screening pathways.

5.7.4 Family History Is Not Destiny

The paper should remain precise here.

Inherited background does not mean the event will occur. It means the baseline field of risk is not flat.

That is a crucial distinction because the engine is not designed to convert predisposition into fatalism. It is designed to use predisposition as one weighting layer inside a broader continuous-monitoring architecture.

So the model should reject both errors: ignoring inherited background, and overreading inherited background as present disease.

The correct position is intermediate and structured.

5.7.5 Background Weighting of Present Signals

The main operational use of this layer is weighting.

Examples: a small but persistent blood deviation in a high-risk lineage may carry more importance than the same deviation in a low-risk background; repeated soft symptoms linked to a family-pattern disease corridor may justify earlier reinforcement checks; absence of family history may slightly reduce prior concern, but should never override strong present convergence.

So inherited background changes how the present is read, but it never replaces the present.

5.7.6 Disease-Specific Importance

This layer will not matter equally in every application.

For some domains, inherited background may have modest influence. For others, especially some cancer, cardiovascular, metabolic, or autoimmune pathways, it may have stronger interpretive value.

That means the architecture should keep the layer general, but allow application-specific weighting later.

This is another reason the paper's layered design is strong: it does not assume one universal weighting scheme for every disease pathway.

5.7.7 User Experience Importance

From the user's perspective, this layer is also practical because many people already know part of their family history even where full clinical genetic work has never been completed.

That means the system can begin with ordinary lived knowledge: what ran in the family, what appeared early, what repeated, what was feared but never properly monitored.

Then, where formal inherited-risk data become available, the same layer can be refined rather than rebuilt from scratch.

5.7.8 Section Conclusion

The family-risk and inherited-background layer defines the person's susceptibility field. It does not diagnose present disease, but it changes the interpretive weight of present deviation and helps the engine determine how closely weak abnormalities should be watched in particular domains. It is therefore a background-risk layer, not a sovereign decision layer.

5.8 Signal Hierarchy and Weighting

The engine should not treat all signals as equal at all times. A layered model is only useful if it can distinguish between stronger and weaker evidence, between direct and indirect indicators, and between core and contextual inputs.

That is the purpose of signal hierarchy and weighting.

The question is not only: What signals are present?

The stronger question is: How much interpretive force should each signal carry in this application, at this time, for this person?

5.8.1 Why Hierarchy Is Necessary

Without hierarchy, the model becomes flat.

A flat model creates two failures: weak contextual signals may be given too much force, and strong biological or temporal signals may be diluted by noise.

A prognostic engine must therefore distinguish between: primary signals, secondary signals, contextual modifiers.

This allows the model to remain sensitive without becoming indiscriminate.

5.8.2 Primary Signals

Primary signals are those with the strongest direct relationship to internal biological deviation or repeated physiological instability.

These usually include: blood-based biological markers, strong wearable recovery or rhythm failures, repeated prior-event similarity, disease-specific targeted signals where validated.

Primary signals carry the greatest interpretive weight because they are closest to the core drift process.

They do not automatically determine the outcome by themselves, but they are usually the first drivers of state movement.

5.8.3 Secondary Signals

Secondary signals are signals that may not be decisive alone, but materially strengthen or weaken the interpretation when they align with primary signals.

These often include: structured symptom recurrence, passive wearable instability of lower specificity, repeated self-reported recovery failure, behavioural patterns linked to known destabilisation.

Secondary signals help answer: Is the primary abnormality isolated, or is it beginning to organise into a larger pattern?

Their value lies in reinforcement.

5.8.4 Contextual Modifiers

Contextual modifiers do not usually serve as direct evidence of internal deterioration by themselves. Their role is interpretive.

These include: stress load, sleep disruption, medication change, travel, environmental burden, acute lifestyle disturbance, known sampling artefact conditions.

These signals should not be ignored, but they should not usually govern the engine on their own.

Their main function is to adjust how the engine reads other layers.

5.8.5 Weighting Across Applications

The same architecture must support different disease domains. That means weighting cannot be fixed in exactly the same way for every use case.

For example: an oncology-oriented model may give heavier weight to selected blood panels, inherited risk, and prior-event similarity; a metabolic model may weight blood, behaviour, and wearables more heavily; an inflammatory model may weight blood, symptoms, recovery pattern, and context more strongly; a recovery-monitoring model may place more emphasis on wearables and recurrence timing.

So the hierarchy is stable in principle, but flexible in deployment.

That is one of the strengths of the architecture.

5.8.6 Weighting Across Persons

Weighting should also vary by person, not only by disease class.

A signal that is highly informative for one person may be less informative for another depending on: baseline stability, known susceptibility, prior false-alarm patterns, quality of available data, recurrence history, degree of passive data coverage.

So the engine should be capable of personal weighting adjustment across time.

This prevents the model from being trapped inside a rigid universal rule set.

5.8.7 Static Versus Dynamic Weighting

The paper should distinguish between static and dynamic weighting.

Static weighting means fixed importance assigned by design or application layer.

Dynamic weighting means importance that changes according to present conditions, such as: increasing weight when a signal repeatedly proves predictive, reducing weight when a signal is repeatedly noisy, increasing weight when a signal converges with high-value layers, reducing weight when context strongly explains the deviation.

A strong prognostic engine should support both.

Static weighting provides structure. Dynamic weighting provides adaptation.
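A minimal sketch of dynamic weighting layered on a static base: the weight of a signal drifts up when it repeatedly proves predictive and down when it proves noisy. The learning rate and the clamping bounds are assumed values:

```python
# Sketch of dynamic weight adaptation around a static base weight.
# Learning rate and bounds are illustrative assumptions.

def update_weight(w, proved_predictive, lr=0.1, w_min=0.1, w_max=2.0):
    """Nudge a signal's weight up or down, clamped to a sane range."""
    w += lr if proved_predictive else -lr
    return min(max(w, w_min), w_max)
```

The clamp keeps adaptation bounded, so a run of noisy episodes cannot silence a signal entirely, and a run of predictive ones cannot let it dominate the engine.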

5.8.8 Weighting and Escalation Discipline

Weighting is also one of the system's main safeguards against noisy escalation.

If all inputs are treated equally, the engine may overreact to shallow convergence built from low-value signals.

Hierarchy prevents that by ensuring that escalation is not driven merely by signal count, but by signal significance.

So the system should not ask only: How many layers moved?

It should also ask: Which layers moved, and how much do they matter?

That is the correct escalation discipline.

5.8.9 Practical Weight Structure

A simple practical structure for the paper is: Tier 1 – primary biological and high-confidence temporal signals; Tier 2 – reinforcing symptom and physiological signals; Tier 3 – contextual and explanatory modifiers.

This structure is easy to explain, easy to implement, and strong enough for the chapter.

It also preserves clarity for later code and dataset chapters.
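The tier structure can be sketched as follows. The numeric weights are illustrative assumptions rather than calibrated constants:

```python
# Illustrative tier weights; the numeric values are assumptions for
# demonstration, not calibrated constants.
TIER_WEIGHTS = {
    "tier1": 1.0,   # primary biological / high-confidence temporal signals
    "tier2": 0.5,   # reinforcing symptom and physiological signals
    "tier3": 0.2,   # contextual and explanatory modifiers
}

def weighted_signal_score(signals):
    """Combine normalised deviations by tier, so escalation is driven
    by signal significance rather than raw signal count."""
    return sum(TIER_WEIGHTS[tier] * abs(z) for tier, z in signals)

# Three noisy tier-3 signals carry less force than one strong tier-1 signal.
noisy = weighted_signal_score([("tier3", 1.5), ("tier3", 1.2), ("tier3", 1.4)])
strong = weighted_signal_score([("tier1", 2.0)])
```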

5.8.10 Section Conclusion

Signal hierarchy and weighting give the engine interpretive discipline. They ensure that the model does not treat every input as equivalent, but instead distinguishes direct biological evidence, reinforcing personal evidence, and contextual modifiers. This allows the same layered architecture to support multiple disease domains while remaining personally adaptable and resistant to shallow noise.

5.9 Cross-Layer Reinforcement

Cross-layer reinforcement is the point at which separate signals stop behaving like isolated fragments and begin to form a coherent drift structure. This is one of the engine's most important interpretive mechanisms.

A prognostic system should not depend on one signal becoming extreme before it responds. It should be able to detect when several weaker signals begin to support the same concern across different parts of the observation field.

That is the function of reinforcement.

5.9.1 Why Reinforcement Matters

A single abnormality may remain ambiguous.

It may reflect: temporary disturbance, local instability, measurement noise, context-specific disruption, non-progressive fluctuation.

But when another layer begins to move in a related way, the interpretive force changes.

For example: blood deviation plus recurrent fatigue, blood deviation plus poor recovery pattern, symptom recurrence plus wearable instability, prior-history similarity plus current corridor breach, weak biomarker shift plus inherited-risk background.

None of these combinations proves a diagnosis by itself. But together they create structural reinforcement.

That is exactly what a prognostic engine needs.

5.9.2 Reinforcement Is Stronger Than Signal Count

The paper should make an important distinction here.

Cross-layer reinforcement is not the same as simply counting how many signals are abnormal.

A weak cluster of low-value signals is not necessarily stronger than one high-value biological signal with persistence.

So reinforcement must be read through hierarchy.

The correct question is not: How many things changed?

It is: Are meaningful layers changing in a compatible temporal pattern?

That is a much stronger rule.

5.9.3 Temporal Compatibility

Signals do not reinforce each other merely by coexisting. They reinforce each other when they align in time, direction, or recovery logic.

That means the engine should look for: similar onset corridors, repeated co-occurrence, worsening in parallel, failure to recover together, recurrence across adjacent windows, similar relation to context or exposure.

This is important because a signal that occurs months apart from another unrelated change may not strengthen interpretation. Reinforcement requires temporal compatibility, not mere coexistence.
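A temporal-compatibility check of this kind can be sketched very simply. The fourteen-day compatibility window is an assumed value, not one fixed by the paper:

```python
# Sketch of a temporal-compatibility gate: two deviation episodes only
# reinforce each other when their onsets fall within a shared corridor
# and they move in a compatible direction. Window length is assumed.

COMPAT_WINDOW_DAYS = 14

def temporally_compatible(onset_a, onset_b, same_direction):
    """Onsets in days since baseline; True only for aligned episodes."""
    return abs(onset_a - onset_b) <= COMPAT_WINDOW_DAYS and same_direction

# A blood shift and a recovery failure three days apart, both worsening,
# reinforce; the same pair months apart does not.
```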

5.9.4 Types of Reinforcement

The engine should recognise at least four forms of reinforcement.

Direct reinforcement: two strong layers both move in a compatible way.

Supportive reinforcement: a strong primary signal is supported by weaker secondary layers.

Historical reinforcement: a present pattern resembles a prior known drift corridor.

Contextual reinforcement: a deviation appears repeatedly under similar conditions, showing a stable destabilisation pathway.

These forms allow the engine to interpret reinforcement more intelligently than by simple addition.

5.9.5 Reinforcement Does Not Need Total Agreement

The paper should also remain precise here.

Not every signal layer needs to agree for reinforcement to occur.

A real early-stage process may show: strong blood movement with only mild symptom support, strong symptom recurrence with moderate wearable disruption, inherited background with repeated weak blood deviation and no obvious symptoms yet.

So reinforcement should be treated as partial and graded, not all-or-nothing.

This is one of the reasons the model is stronger than binary screening logic.

5.9.6 Negative Reinforcement and Weakening

The engine should also recognise weakening conditions.

A pattern may initially look reinforced, then lose structure because: recovery succeeds, a contextual explanation becomes stronger, a key primary signal returns to baseline, convergence fails to continue, recurrence does not repeat.

This matters because reinforcement should increase with structure and decrease with dissolution.

That preserves interpretive honesty.

5.9.7 Cross-Layer Reinforcement as Prognostic Strength

The deeper role of reinforcement is this: it allows the engine to act before any single layer becomes overwhelming.

That is one of the central strengths of the paper.

Instead of waiting for a dramatic solitary event, the system reads the formation path through coordinated smaller movements. This is exactly how pre-event prognosis becomes possible.

5.9.8 Section Conclusion

Cross-layer reinforcement is the mechanism by which partial abnormalities become a coherent prognostic structure. It allows the engine to detect when separate layers are beginning to support the same concern across time, direction, and recovery pattern. This is stronger than isolated signal reading and more disciplined than simple signal counting.

5.10 Chapter Conclusion

This chapter has defined the signal architecture of the prognostic engine. The model reads the person through layered observation fields: blood-based biological signals, wearable and physiological continuity signals, symptom and self-report signals, behavioural and exposure context, prior health history, and family-risk background. These layers do not contribute equally. They are organised through hierarchy and weighting, then strengthened through cross-layer reinforcement.

The core conclusion is: the engine does not infer state from one signal alone, but from the structured interaction of multiple signal layers across time.

That completes the observation architecture.

The next chapter should formalise the engine itself.

Chapter 6. Prognostic Engine

The previous chapters established the user loop, the state logic, and the layered observation field. This chapter defines how those parts are processed into an operational engine.

The prognostic engine is the mechanism that takes repeated personal signals, compares them against the person's own corridor, determines whether drift is forming, updates the current state, and decides whether the pattern should remain under observation or move toward escalation.

The central proposition of this chapter is: the engine should not classify disease directly from isolated inputs; it should process layered personal data into controlled drift scores, state updates, and thresholded warning outputs.

That is what makes the system prognostic rather than merely reactive.

6.1 Purpose

This chapter establishes the operational logic of the engine.

It defines: signal intake into the engine, signal normalisation against personal baseline, drift score construction, persistence and convergence integration, state update logic, escalation threshold logic, de-escalation and return-to-stability logic, output structure for the user and onward review.

The point is not to describe a black box. The point is to define a lawful sequence of interpretation.

6.1.1 Chapter Function

The function of this chapter is to answer: how does the system move from layered personal data to a structured prognostic decision?

6.2 Engine Input Layer

The engine input layer is the formal entry point at which personal observations become machine-readable state evidence. Its role is not to interpret the signals in full. Its role is to receive them in a structured form, preserve their time relation, preserve their source class, and pass them into the normalisation and drift logic.

This matters because a prognostic engine cannot operate on raw fragments. It requires ordered input.

6.2.1 Function of the Input Layer

The input layer performs five tasks: receives the incoming signal event, identifies its source layer, attaches timestamp and context, checks minimum validity, routes the signal into the baseline comparison process.

So the input layer is not just a storage pipe. It is the structured intake gate of the engine.

6.2.2 Input Event Structure

Each incoming event should be represented as a structured record.

A minimum event record contains: event identifier, user identifier, timestamp, signal layer, signal name, signal value, signal unit, collection method, quality flag, context tag set.

This gives the engine enough structure to preserve continuity and support later interpretation.

A simple formal representation is: InputEvent = (u, t, L, n, v, q, c)

Where: u = user, t = timestamp, L = signal layer, n = signal name, v = signal value, q = quality status, c = context field.
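A minimal sketch of this event record as a Python dataclass; the concrete field names, types, and the example values are illustrative assumptions:

```python
from dataclasses import dataclass

# Sketch of the InputEvent record (u, t, L, n, v, q, c) described in
# the text. Field names and types are illustrative assumptions.
@dataclass(frozen=True)
class InputEvent:
    user_id: str         # u
    timestamp: float     # t, e.g. Unix epoch seconds
    layer: str           # L, one of the six signal classes
    name: str            # n, signal name, e.g. "crp"
    value: float         # v
    quality: str         # q, e.g. "ok" or "low_confidence"
    context: tuple = ()  # c, context tags such as ("fasting",)

event = InputEvent("user-1", 1_700_000_000.0, "blood", "crp", 3.1,
                   "ok", ("fasting",))
```

Freezing the record reflects the input-layer principle later in the chapter: events are captured and structured at intake, not mutated by later interpretation.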

6.2.3 Input Layers

The engine should accept events from the six major signal classes already defined: blood-based biological inputs, wearable and physiological inputs, symptom and self-report inputs, behavioural and exposure inputs, history-linked updates, family-risk background updates.

This means the input layer is multimodal from the outset.

The engine does not need every layer at every time step. It only needs to preserve whichever layers are present and time-bind them correctly.

6.2.4 Sparse and Dense Inputs

Not all inputs arrive at the same frequency.

Some are sparse: blood samples, updated family history, medical-event entries.

Some are dense: wearables, symptom logs, behavioural records.

The input layer must therefore support mixed cadence.

That means the engine should not assume equal timing intervals. It must preserve the actual sequence of arrival and later allow the model to interpret recurrence, gap length, and clustering properly.

6.2.5 Input Validation

The input layer should apply only basic validation before forwarding the event.

This includes: required-field presence, unit recognition, value plausibility, duplicate detection, missing-quality flags, source-type confirmation.

The point is not to overprocess at intake. The point is to ensure the event is usable and traceable.

A low-quality event should not necessarily be discarded. It may instead be marked with reduced confidence and passed forward with that status attached.
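The intake checks above can be sketched as a single validation function. The field names and plausibility bounds are illustrative assumptions; note that an implausible reading is forwarded with reduced confidence rather than discarded:

```python
# Sketch of minimal intake validation: required fields, duplicates,
# and value plausibility. Bounds and field names are assumptions.

PLAUSIBLE_RANGES = {"resting_hr": (25, 250)}  # illustrative only

def validate_event(event, seen_ids):
    """Return (accept, quality_flag) for a raw event dict."""
    required = {"event_id", "user_id", "timestamp", "name", "value"}
    if not required <= event.keys():
        return False, "rejected_missing_fields"
    if event["event_id"] in seen_ids:
        return False, "rejected_duplicate"
    lo, hi = PLAUSIBLE_RANGES.get(event["name"],
                                  (float("-inf"), float("inf")))
    if not lo <= event["value"] <= hi:
        # Implausible but traceable: keep it, mark it low confidence.
        return True, "low_confidence"
    return True, "ok"
```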

6.2.6 Context Binding

Every input should be able to carry context.

Examples include: fasting or non-fasting state, recent exertion, acute stress, illness context, medication change, disrupted sleep, travel or environmental disruption.

This matters because the same value may carry different interpretive weight under different conditions.

So the engine input layer should preserve context at entry rather than trying to reconstruct it later.

6.2.7 Temporal Ordering

The input layer must preserve event order.

That means the engine should know: what came first, what came next, how far apart the events were, whether several signals arrived in the same window, whether a gap in observation has occurred.

This is essential because prognosis depends on sequence, not just content.

The input layer therefore prepares the conditions for persistence logic, convergence logic, and recovery logic later in the engine.

6.2.8 Input Windows

For operational use, the engine may also group events into rolling windows.

A window can be defined as: a day, a week, a disease-specific interval, or an adaptive interval based on signal density.

Within each window, the engine can summarise: present signals, absent signals, abnormal signals, context burden, recovery state, cross-layer co-occurrence.

This does not replace raw event storage. It gives the engine a usable temporal frame for state updates.
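Rolling-window grouping under mixed cadence can be sketched as follows. The daily window length is an assumed choice; sparse and dense signals simply land in whichever windows they occur:

```python
from collections import defaultdict

# Sketch of grouping events into fixed-length rolling windows (daily
# here, an assumed choice) while preserving mixed cadence.

DAY = 86_400  # seconds

def group_into_windows(events, window_len=DAY):
    """events: iterable of (timestamp, layer, name, value) tuples."""
    windows = defaultdict(list)
    for ts, layer, name, value in events:
        windows[int(ts // window_len)].append((ts, layer, name, value))
    return dict(windows)

events = [(10.0, "wearable", "resting_hr", 61),
          (50_000.0, "symptom", "fatigue", 2),
          (100_000.0, "blood", "crp", 3.1)]
wins = group_into_windows(events)
# The first two events share window 0; the blood draw falls in window 1.
```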

6.2.9 Input-Layer Principle

The governing principle of the input layer is: capture first, structure immediately, interpret later.

This keeps the architecture disciplined.

The input layer should not overclaim meaning. It should make meaning possible.

6.2.10 Section Conclusion

The engine input layer converts personal observations into structured prognostic evidence. It preserves source class, timestamp, context, quality, and sequence so that later stages of the engine can compare signals against personal baseline, detect drift, and update state lawfully. Without this layer, the model would receive disconnected fragments. With it, the model receives ordered temporal evidence.

6.3 Signal Normalisation

The engine cannot compare raw inputs directly unless they are first normalised against the person's own corridor. Normalisation is the stage at which incoming signals stop being isolated values and become interpretable departures from personal equilibrium.

The purpose of this section is to define how each incoming signal is converted into a baseline-relative form that can be used by the drift engine.

6.3.1 Function of Normalisation

Normalisation performs four tasks: aligns each signal to the person's own baseline, accounts for ordinary personal variance, preserves direction of movement, makes different signal types comparable inside one engine.

Without normalisation, the system would be forced to compare unlike quantities directly. With normalisation, the engine can read all signals as forms of relative departure.

6.3.2 Personal Baseline Reference

For each signal i at time t, define: x_i(t) = observed value; μ_i(t) = personal baseline mean; σ_i(t) = personal variance band; ε = small stabilising constant.

Then the normalised signal value is: z_i(t) = (x_i(t) − μ_i(t)) / max(σ_i(t), ε)

This expresses how far the current reading sits from the person's own ordinary corridor.

A positive value indicates upward departure. A negative value indicates downward departure. A value near zero indicates corridor consistency.
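The rule can be written directly in code. The epsilon value is an assumed stabilising constant, and the worked numbers are illustrative:

```python
# Direct sketch of the normalisation rule from the text:
# z_i(t) = (x_i(t) - mu_i(t)) / max(sigma_i(t), eps)

EPS = 1e-6  # small stabilising constant, epsilon in the text

def normalise(x, mu, sigma, eps=EPS):
    """Express a reading as departure from the personal corridor."""
    return (x - mu) / max(sigma, eps)

# A resting heart rate of 66 against a personal baseline of 60 +/- 3
# sits two variance bands above the corridor.
z = normalise(66.0, 60.0, 3.0)  # -> 2.0
```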

6.3.3 Why Normalisation Must Be Personal

The engine is not trying to answer only whether a value is unusual in general. It is trying to answer whether the value is unusual for this person.

That is why the baseline mean and variance must be personal rather than purely population-based.

A modest shift may be highly meaningful for one person and unremarkable for another. Normalisation preserves that distinction.

6.3.4 Layer-Specific Normalisation

Not all signal types should be normalised in exactly the same way.

For example: blood signals may use baseline-relative deviation against recent and long-horizon history; wearable signals may require rhythm-aware or recovery-aware normalisation; symptom signals may use recurrence-normalised scoring rather than biochemical distance; exposure signals may be normalised as burden intensity or disruption load; history updates may function as state modifiers rather than ordinary current-value deviations.

So the engine should keep one general principle and several layer-specific implementations.

The general principle is: all signals must be converted into comparable forms of personal departure.

6.3.5 Baseline Updating

The baseline itself should not be fixed forever.

A person changes across time. Age, season, treatment, recovery, and life pattern may all alter the corridor.

So the engine should update the baseline gradually, but only under conditions that do not contaminate it with active drift.

That means: stable periods should inform baseline updating; strong drift periods should usually be excluded from baseline reset; recovery phases may be tracked separately until new stability is established.

This prevents the engine from normalising pathology into the new ordinary state too quickly.

6.3.6 Direction Preservation

Normalisation should preserve direction as well as magnitude.

This matters because the same absolute distance may carry different meaning depending on whether the signal is: rising, falling, oscillating, recovering, stuck outside the corridor.

So the engine should store both: |z_i(t)| for magnitude, and sign(z_i(t)) for direction.

That allows later stages to distinguish worsening drift from recovering deviation.

6.3.7 Confidence-Aware Normalisation

Where signal quality varies, the normalised value should carry a confidence adjustment.

If the sample quality is weak, the source is incomplete, the context is uncertain, or the reading is technically noisy, then the engine should retain the normalised value but reduce its interpretive force.

So each normalised signal can be paired with a confidence term: n_i(t) = (z_i(t), c_i(t)), where c_i(t) = confidence score.

This lets the engine remain informative without pretending equal trust in every input.

6.3.8 Window-Level Normalisation

When several events arrive inside one rolling window, the engine may compute a window summary for each signal class.

For each window W(t), it may retain: mean normalised deviation, maximum corridor breach, directional consistency, recovery sign, confidence-weighted average.

This gives the later drift engine a cleaner temporal frame while preserving raw-event history underneath.

6.3.9 Normalisation Principle

The governing principle is: convert every incoming signal into a baseline-relative, direction-preserving, confidence-aware representation of personal departure.

That is the foundation on which the rest of the engine operates.

6.3.10 Section Conclusion

Signal normalisation transforms raw personal observations into comparable forms of deviation from the person's own equilibrium corridor. By aligning values to personal baseline, preserving direction, adjusting for confidence, and allowing layer-specific implementations, the engine creates a lawful signal field from which drift can be computed.

6.4 Drift Score Construction

Once signals have been normalised, the engine must combine them into a usable measure of movement away from the person's own equilibrium corridor. That combined measure is the drift score.

The drift score does not diagnose a disease class. It measures how strongly the person is departing from their established baseline in a way that may become prognostically meaningful.

6.4.1 Function of the Drift Score

The drift score performs five tasks: combines multiple normalised signals into one interpretable movement measure, preserves weighting across signal classes, distinguishes weak from strong departure, supports persistence and convergence testing, provides the main quantitative input for state transition.

So the drift score is not a replacement for the full state model. It is the engine's core movement indicator.

6.4.2 Signal-Level Contribution

For each signal i, the engine already holds a normalised deviation z_i(t) and, where needed, an associated confidence term c_i(t).

The simplest weighted signal contribution is: \(d_i(t) = w_i \cdot c_i(t) \cdot |z_i(t)|\)

Where: w_i = signal importance weight; c_i(t) = confidence weight; |z_i(t)| = magnitude of personal deviation.

This gives each signal a contribution proportional to both importance and reliability.

6.4.3 Aggregate Drift Score

The aggregate drift score at time t can then be defined as: \(D(t) = [\sum d_i(t)] / [\sum w_i \cdot c_i(t)]\)

This creates a weighted average deviation across all presently available signals.

So: low D(t) means the person remains close to corridor; moderate D(t) means meaningful departure is emerging; high D(t) means strong current displacement from personal equilibrium.

This is the engine's main quantitative summary of present drift magnitude.
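The per-signal contribution and the aggregate score can be sketched together. The tuple layout and the example weights are illustrative assumptions, not values fixed by the framework.

```python
# Minimal sketch of the weighted aggregate drift score D(t).
def drift_score(signals):
    """signals: list of (z, w, c) tuples -- normalised deviation z_i(t),
    importance weight w_i, and confidence c_i(t) for each signal.
    Returns the confidence- and importance-weighted mean |z|."""
    num = sum(w * c * abs(z) for z, w, c in signals)
    den = sum(w * c for _, w, c in signals)
    return num / den if den > 0 else 0.0

# Three signals: a strong, high-confidence blood marker dominates a weak
# contextual entry and a moderate low-confidence symptom report.
signals = [(2.5, 3.0, 0.9), (0.4, 1.0, 0.8), (1.0, 1.0, 0.5)]
D = drift_score(signals)  # (6.75 + 0.32 + 0.5) / 4.0 = 1.8925
```

Note that normalising by the sum of `w * c` keeps the score comparable across windows in which different subsets of signals happen to be available.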

6.4.4 Why Drift Must Be Weighted

Not all signals should contribute equally.

A strong, repeated biological corridor breach should generally matter more than a single weak contextual modifier.

That is why the drift score must be weighted. Without weighting, the model becomes vulnerable to shallow abnormality spread across low-value signals, while under-reading more direct biological or high-confidence temporal change.

So the drift score must reflect: signal importance, signal quality, disease-application weighting, person-specific predictive value over time.

6.4.5 Directional Drift

Magnitude alone is not enough. The engine must also retain directional structure.

A person whose markers repeatedly rise away from baseline is different from a person whose markers oscillate around baseline without persistent movement.

So the engine should also compute a directional drift component, such as: \(R(t) = [\sum w_i \cdot c_i(t) \cdot \text{sign}(z_i(t)) \cdot |z_i(t)|] / [\sum w_i \cdot c_i(t)]\)

This does not replace the magnitude score. It supplements it.

The engine can then distinguish: strong but mixed fluctuation, strong directional worsening, partial recovery movement, unstable oscillation without clear trend.

That is important for prognosis.
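The directional component is simple to state in code. Since sign(z)·|z| equals z itself, R(t) reduces to the weighted mean of the signed deviations; the contrast with the magnitude score is what the sketch below illustrates.

```python
# Minimal sketch of the directional drift component R(t).
def directional_drift(signals):
    """signals: list of (z, w, c) tuples. Because sign(z) * |z| == z,
    R(t) is the confidence-weighted mean of the signed deviations:
    mixed oscillation cancels, consistent movement accumulates."""
    den = sum(w * c for _, w, c in signals)
    return sum(w * c * z for z, w, c in signals) / den if den > 0 else 0.0

# Equal-and-opposite oscillation largely cancels in R(t)...
mixed = [(2.0, 1.0, 1.0), (-2.0, 1.0, 1.0)]     # R = 0.0
# ...while consistent worsening in one direction does not.
rising = [(2.0, 1.0, 1.0), (1.5, 1.0, 1.0)]     # R = 1.75
```

The magnitude score for `mixed` would still be 2.0, which is exactly why R(t) supplements rather than replaces it.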

6.4.6 Layer-Level Drift

For clarity, the engine should also compute drift by signal layer.

For each layer L, define: \(D_L(t) = [\sum d_i(t)] / [\sum w_i \cdot c_i(t)]\), taken over signals in layer L.

This allows the system to know not only that drift exists, but where it is forming: biological drift, physiological drift, symptom drift, contextual burden, history-linked recurrence, inherited-risk-loaded deviation.

This becomes especially useful later for convergence and user-facing explanation.

6.4.7 Drift Bands

For operational use, the drift score can be grouped into bands.

A simple structure is: Band 0 – corridor-consistent; Band 1 – soft movement; Band 2 – meaningful departure; Band 3 – strong current drift.

This does not replace precise scoring. It provides usable interpretive bands for state logic and display.

The important point is that the bands should be calibrated against personal history and later application-specific validation.

6.4.8 Drift Does Not Equal Persistence

The paper should keep this distinction clear.

A high drift score at one moment does not by itself prove structured concern. It shows present departure.

Persistence answers whether that departure holds. Convergence answers whether it is reinforced across layers. Commit logic answers whether the system should act on it.

So the drift score is necessary, but not sufficient.

6.4.9 Drift Interpretation Principle

The governing principle is: the drift score measures how far the person currently sits from their own ordinary corridor, after weighting for signal importance and confidence.

That is the correct role of the score.

It is a structured summary of present movement, not a final verdict.

6.4.10 Section Conclusion

Drift score construction gives the engine a formal way to convert multiple normalised personal signals into a weighted measure of present departure from equilibrium. By combining signal magnitude, confidence, hierarchy, and direction, the system produces an interpretable movement score that can be passed forward into persistence, convergence, and state update logic.

6.5 Persistence Integration

The drift score measures present departure. Persistence integration determines whether that departure is becoming temporally credible.

This is the stage at which the engine stops asking only: How far is the person from baseline now?

and begins asking: Has that departure lasted, repeated, or failed to resolve strongly enough to count as real drift?

That distinction is critical. A prognostic engine that does not integrate persistence will overread transient instability. A prognostic engine that waits too long for persistence will become reactive. This section defines the middle path.

6.5.1 Function of Persistence Integration

Persistence integration performs five tasks: measures whether drift repeats across time, distinguishes temporary disturbance from structured deviation, tracks failed recovery, strengthens or weakens current drift interpretation, supplies a temporal stability term for state transition.

So the persistence layer does not replace the drift score. It tests whether the drift score deserves temporal trust.

6.5.2 Window Persistence

Let: D(t) = aggregate drift score at time t; θ_d = drift threshold; I(·) = indicator function.

Then the simplest persistence term over the last k windows is: \(P(t) = (1/k) \sum I(D(t-r) > \theta_d)\) for r = 0 to k-1.

This produces a value between 0 and 1.

Values near 0 indicate little sustained drift. Values near 1 indicate repeated threshold breach.

This is the core persistence signal.

6.5.3 Magnitude-Aware Persistence

Threshold counting alone is not always sufficient. Two windows just above threshold do not necessarily carry the same meaning as two windows far above threshold.

So the engine may also compute magnitude-aware persistence: \(P_m(t) = (1/k) \sum \min(D(t-r)/\theta_d, 1)\) for r = 0 to k-1.

This allows stronger repeated drift to count more heavily than weak threshold contact.

The point is not precision for its own sake. The point is to preserve the difference between shallow persistence and strong persistence.
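Both persistence forms can be sketched side by side. The window values and threshold below are invented for illustration; `D_history` is assumed to hold the last k drift scores, oldest first.

```python
# Minimal sketch of window persistence P(t) and its
# magnitude-aware variant P_m(t).
def window_persistence(D_history, theta_d):
    """P(t): fraction of the last k windows whose drift score
    breached the threshold theta_d."""
    k = len(D_history)
    return sum(1 for D in D_history if D > theta_d) / k

def magnitude_persistence(D_history, theta_d):
    """P_m(t): windows at or above threshold saturate at 1, while
    near-threshold windows count proportionally instead of zero."""
    k = len(D_history)
    return sum(min(D / theta_d, 1.0) for D in D_history) / k

history = [0.4, 1.3, 1.1, 1.6]                    # four recent windows
P  = window_persistence(history, theta_d=1.0)     # 3 of 4 breach -> 0.75
Pm = magnitude_persistence(history, theta_d=1.0)  # (0.4+1+1+1)/4 = 0.85
```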

6.5.4 Directional Persistence

The engine should also ask whether repeated drift is moving in a consistent direction.

Let R(t) = directional drift component from the prior section.

Then directional persistence can be assessed by testing whether the sign and relative magnitude of R(t-r) remain stable across recent windows.

This matters because: repeated worsening in one direction is stronger than random oscillation; repeated failed recovery is stronger than simple variance; mixed-direction fluctuation may require caution even when magnitude is elevated.

So persistence should include not only recurrence, but directional consistency.

6.5.5 Recovery Failure as Persistence

Persistence does not only mean "still above threshold." It can also mean "not returning properly."

So the engine should track whether: drift decreases too slowly, a signal partially recovers but does not re-enter the corridor, recovery occurs briefly then the same drift reappears, the person's system no longer restores equilibrium at the expected pace.

This is especially important in early deterioration, where the body may not yet show constant severe abnormality but may show repeated inability to settle.

That pattern is prognostically serious.

6.5.6 Signal-Level Persistence

The engine should not compute persistence only at the aggregate level. It should also retain signal-level and layer-level persistence.

For each signal i, define: \(P_i(t) = (1/k) \sum I(|z_i(t-r)| > \theta_i)\) for r = 0 to k-1.

For each layer L, define a weighted persistence summary: \(P_L(t) = [\sum w_i P_i(t)] / [\sum w_i]\), over signals in layer L.

This helps the engine answer: Is persistence concentrated in one layer? Is it spreading across layers? Is one biologically important signal repeatedly failing while others remain quiet?

That becomes useful later for convergence and explanation.

6.5.7 Persistence Adjustment of Current Drift

The current drift score should be modified by persistence rather than left as a standalone present-tense quantity.

A simple adjusted drift term is: \(D_p(t) = D(t) \cdot [1 + \lambda_p P(t)]\) where λ_p = persistence amplification factor.

This means repeated drift carries more prognostic weight than the same present drift appearing for the first time.

The engine is therefore not only reading "how much movement is there now," but also "how long has movement been forming."

6.5.8 Persistence Bands

For operational clarity, persistence can be grouped into bands: Band 0 – weak or absent persistence; Band 1 – early repetition; Band 2 – moderate sustained recurrence; Band 3 – strong persistence or repeated recovery failure.

These bands are useful for state transitions.

For example: low drift plus low persistence may remain Soft Deviation; moderate drift plus moderate persistence may become Persistent Divergence; strong drift plus strong persistence may support Escalation Candidate, especially if convergence is also present.

6.5.9 Persistence Principle

The governing principle is: persistence makes present deviation temporally credible.

That is the correct role of this layer.

It does not prove final diagnosis. It proves that the deviation is no longer behaving like a brief anomaly.

6.5.10 Section Conclusion

Persistence integration gives the engine its temporal discipline. It tests whether current drift is repeating, worsening, or failing to recover across recent windows, and it strengthens or weakens present concern accordingly. This is the step that converts current abnormality into structured temporal evidence.

6.6 Convergence Integration

Persistence shows that drift is lasting. Convergence integration shows whether that lasting drift is being reinforced across the observation field.

This is the stage at which the engine determines whether abnormality remains narrow and local, or whether it is beginning to organise across multiple layers into a stronger prognostic structure.

The core principle is: persistent drift becomes more actionable when several meaningful layers begin to move in compatible ways.

6.6.1 Function of Convergence Integration

Convergence integration performs five tasks: measures whether multiple signal layers are abnormal together, determines whether those abnormalities align in a meaningful way, strengthens concern when reinforcement appears, prevents overreliance on isolated persistent signals, supplies the cross-layer support term for escalation logic.

So convergence does not replace drift or persistence. It tells the engine whether the existing drift pattern is becoming systemically coherent.

6.6.2 Group-Level Convergence

Let: G = total number of signal groups; A_g(t) = indicator that group g is currently abnormal at time t.

Then the simplest convergence measure is: \(C(t) = [\sum A_g(t)] / G\)

This gives the proportion of groups showing abnormality in the current window.

So: low C(t) means abnormality is mostly isolated; moderate C(t) means partial reinforcement; high C(t) means broad multi-layer agreement.

6.6.3 Weighted Convergence

Not every signal group should contribute equally.

A blood-layer abnormality may deserve more force than a weak contextual disturbance. A prior-event recurrence may deserve more force than one low-confidence symptom entry.

So weighted convergence can be defined as: \(C_w(t) = [\sum w_g \cdot A_g(t)] / [\sum w_g]\) where w_g = weight of group g.

This allows the engine to read reinforcement through hierarchy rather than through raw signal count.
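Weighted convergence can be sketched directly from the definition. The group layout and weights are illustrative; the unweighted C(t) is the special case in which every weight is 1.

```python
# Minimal sketch of weighted convergence C_w(t).
def weighted_convergence(groups):
    """groups: list of (abnormal, w) pairs -- the indicator A_g(t)
    as a bool and the group weight w_g. Returns the weighted share
    of signal groups currently abnormal."""
    num = sum(w for abnormal, w in groups if abnormal)
    den = sum(w for _, w in groups)
    return num / den if den > 0 else 0.0

# Blood layer (weight 3) and wearable layer (weight 2) abnormal;
# symptom and context layers (weight 1 each) quiet.
groups = [(True, 3.0), (True, 2.0), (False, 1.0), (False, 1.0)]
Cw = weighted_convergence(groups)  # 5/7: hierarchy, not raw count
```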

6.6.4 Compatibility Requirement

Signals do not reinforce each other merely because they are abnormal at the same time. They should also be compatible.

Compatibility may include: similar directional movement, similar onset window, shared recovery failure, recurrence in linked temporal sequence, known prior pattern similarity.

So the engine should only count a group fully toward convergence when its abnormality is not merely present, but relevant to the same forming concern.

This prevents false convergence built from unrelated noise.

6.6.5 Partial Convergence

The engine should not require total agreement across all layers.

Early-stage deterioration may show: strong blood drift with soft symptom support, strong symptom recurrence with moderate wearable disruption, prior-history reinforcement with no current overt symptoms, inherited susceptibility with repeated low biological deviation.

So convergence should be graded, not absolute.

A practical graded structure is: Band 0 – isolated abnormality; Band 1 – weak partial reinforcement; Band 2 – moderate multi-layer support; Band 3 – strong cross-layer coherence.

This keeps the model sensitive to early formation without demanding total visibility.

6.6.6 Temporal Convergence

The engine should also measure whether reinforcement holds across adjacent windows rather than appearing only once.

A simple temporal convergence term can be written as: \(C_p(t) = (1/k) \sum C(t-r)\) for r = 0 to k-1.

This gives the mean convergence across recent windows.

That matters because a one-window cluster may still be weak, while repeated cross-layer support is much stronger.

6.6.7 Convergence-Adjusted Drift

The current adjusted drift can then be strengthened by convergence.

If D_p(t) = persistence-adjusted drift and λ_c = convergence amplification factor, then: \(D_c(t) = D_p(t) \cdot [1 + \lambda_c C_p(t)]\)

This means the same present drift receives greater force when multiple relevant layers reinforce it across time.

That is the correct operational role of convergence.
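The two preceding formulas chain naturally. As before, the amplification factor λ_c is an illustrative assumption left to calibration in the text.

```python
# Minimal sketch of temporal convergence C_p(t) and
# convergence-adjusted drift D_c(t).
def temporal_convergence(C_history):
    """C_p(t): mean convergence across the last k windows,
    oldest first."""
    return sum(C_history) / len(C_history)

def convergence_adjusted_drift(D_p, C_p, lam_c=0.5):
    """D_c(t) = D_p(t) * (1 + lambda_c * C_p(t)).
    lam_c is an illustrative amplification factor."""
    return D_p * (1.0 + lam_c * C_p)

Cp = temporal_convergence([0.5, 0.6, 0.7])  # sustained reinforcement -> 0.6
Dc = convergence_adjusted_drift(1.5, Cp)    # 1.5 * 1.3 = 1.95
```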

6.6.8 Convergence and State Transition

Convergence is one of the main boundaries between Persistent Divergence and Escalation Candidate.

A person may remain in Persistent Divergence when: deviation is real, persistence is real, but reinforcement is still too narrow.

A person moves toward Escalation Candidate when: deviation is meaningful, persistence is established, convergence becomes sufficiently strong.

So convergence is the difference between a real but still local concern and a broader actionable one.

6.6.9 Convergence Principle

The governing principle is: convergence gives persistent drift structural reinforcement across the observation field.

That is why it matters.

It does not prove a diagnosis. It shows that the abnormality is no longer confined to one weak interpretive lane.

6.6.10 Section Conclusion

Convergence integration gives the engine its cross-layer intelligence. It determines whether persistent drift is being reinforced across meaningful signal groups, whether those groups are compatible in direction and timing, and whether the pattern is becoming strong enough to justify escalation.

6.7 State Update Operator

The state update operator is the rule by which the engine converts current evidence into a revised health-state position. It is the point where the model stops merely measuring signal behaviour and begins deciding where the person now sits inside the prognostic state field.

The operator must be disciplined. It cannot jump upward on weak evidence, and it cannot remain static when structured deterioration is clearly forming. Its purpose is to absorb drift, persistence, convergence, recovery behaviour, and prior state position into one lawful transition process.

6.7.1 Function of the State Update Operator

The state update operator performs six tasks: receives the current adjusted evidence state, compares it with the person's previous state, determines whether the person should remain, advance, or de-escalate, preserves continuity rather than treating each window as disconnected, prevents unstable oscillation between neighbouring states, prepares the engine for alert logic and outward output.

So the operator is not merely a classifier. It is a continuity-preserving transition rule.

6.7.2 Inputs to the Operator

Let the operator receive the following at time t: X(t) = prior state; D(t) = current drift magnitude; P(t) = current persistence strength; C(t) = current convergence strength; R(t) = recovery or directional condition; H(t) = history-weight or prior-pattern memory term; B(t) = background susceptibility modifier.

The state update rule may then be written in general form as: \(X(t+1) = T[X(t), D(t), P(t), C(t), R(t), H(t), B(t)]\) where T = controlled transition operator.

This is the core formal expression for the engine.

6.7.3 Why Prior State Must Matter

The engine should not decide state from current evidence alone.

A person already in Persistent Divergence should not be treated the same as a person encountering a first weak deviation. Likewise, a person who has just recovered from a prior escalation should not be read as fully equivalent to someone with no recent history.

So the update rule must preserve state continuity.

This prevents the model from behaving like a memoryless threshold machine.

6.7.4 Transition Classes

The operator should support four transition classes: hold (remain in current state), advance (move upward to stronger concern), de-escalate (move downward as structure weakens), reset to stability (return to Stable after sufficient recovery and corridor re-entry).

These four classes are enough for the chapter.

They reflect the fact that prognosis is dynamic. The person is not moving through a one-direction ladder only.

6.7.5 Basic Transition Logic

A simple operational form is: remain Stable when drift is low and persistence is absent; enter Susceptible when background risk is elevated but active drift remains weak; enter Soft Deviation when current drift breaches the personal corridor without sufficient persistence; enter Persistent Divergence when drift and persistence jointly exceed the soft-deviation boundary; enter Escalation Candidate when drift, persistence, and convergence jointly exceed escalation structure; enter Clinical Review when escalation is committed and outward handover is triggered.

This is the high-level ladder.

The engine should then refine this with thresholds and commit safeguards.

6.7.6 Thresholded Transition Form

Let: θ_s = soft-deviation threshold; θ_p = persistent-divergence threshold; θ_e = escalation threshold.

Then a simplified thresholded logic may be stated as:

If D(t) < θ_s and P(t) is low, remain or return toward Stable.

If D(t) ≥ θ_s but P(t) remains weak, move to Soft Deviation.

If D(t) ≥ θ_p and P(t) is moderate or strong, move to Persistent Divergence.

If D(t), P(t), and C(t) jointly exceed θ_e, move toward Escalation Candidate.

If escalation is committed outward, move to Clinical Review.
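The thresholded ladder above can be sketched as a single dispatch function. This is a deliberate simplification: the thresholds are invented placeholders, the persistence and convergence cut-offs are assumptions, and a full operator would also consult prior state X(t), recovery R(t), history H(t), susceptibility B(t), and the Susceptible overlay of 6.7.7.

```python
# Sketch of the thresholded transition ladder (illustrative thresholds;
# the real values belong to calibration, not architecture).
THETA_S, THETA_P, THETA_E = 0.8, 1.2, 1.8

def next_state(D, P, C, escalation_committed=False):
    """Map current drift D, persistence P, and convergence C onto an
    operative state. Checks run top-down from strongest to weakest."""
    if escalation_committed:
        return "Clinical Review"
    if D >= THETA_E and P >= 0.5 and C >= 0.5:
        return "Escalation Candidate"      # joint breach of all three
    if D >= THETA_P and P >= 0.5:
        return "Persistent Divergence"     # drift plus persistence
    if D >= THETA_S:
        return "Soft Deviation"            # corridor breach, weak persistence
    return "Stable"
```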

6.7.7 Susceptibility as Background Overlay

The Susceptible state behaves slightly differently from the main drift ladder.

It may be entered even when current drift remains low, provided background risk is materially elevated.

So susceptibility may function either as a distinct operative state before active deviation, or as a modifier on the interpretation of neighbouring states.

For the paper, it is cleaner to retain it as a distinct state, while also allowing background susceptibility to raise vigilance thresholds and weighting.

This preserves conceptual clarity.

6.7.8 Recovery-Sensitive Transitioning

The operator must also respond to recovery. But downward movement alone is not sufficient to justify de-escalation.

If drift weakens, persistence drops, convergence dissolves, signals re-enter the corridor, and recovery becomes stable across enough windows, then the engine should allow downward movement. But this should occur only when the weakening is sustained and not contradicted by stronger residual concern elsewhere in the field.

So downward movement is lawful only when: recent state reduction persists across the defined return window; corridor re-entry occurs where relevant; no higher-weight signal layer is continuing to intensify; no reinforcement pattern remains strong enough to support the prior elevated state.

A recovery-sensitive transition principle is therefore: if drift, persistence, and convergence fall below their operative state boundaries for a sufficient return window, and no stronger layer continues to support elevated concern, then the state may step downward.

This allows: Escalation Candidate → Persistent Divergence; Persistent Divergence → Soft Deviation; Soft Deviation → Stable, provided the recovery structure is real.

This prevents two failure modes: oscillatory flicker between neighbouring states, and recovery lock, where a brief improvement suppresses escalation even though higher-order concern remains active.

6.7.9 Hysteresis and Oscillation Control

The operator should include hysteresis control so the state does not flicker between neighbouring levels too easily.

This means: upward movement should require slightly stronger evidence than mere threshold touch; downward movement should require sustained weakening, not one soft window.

This stabilises the engine.

6.7.10 State Posterior Form

A stronger form of the operator is probabilistic.

Let γ_t(j) = posterior belief that the person is in state j at time t.

Then the engine may update state by first updating the posterior distribution and then selecting an operative state only after threshold and commit logic are satisfied.

In general form: \(\gamma_{t+1}(j) = U[\gamma_t(j), y(t), D(t), P(t), C(t), R(t), H(t), B(t)]\) where U = posterior update rule and y(t) = the current window's observed signal values.

This is useful because it allows the engine to hold ambiguity before making a hard transition.

6.7.11 Practical Interpretation

In practical terms, the state update operator answers: Has the person meaningfully moved? Has that movement lasted? Has it spread or reinforced? Is it recovering? Is the current state still the best interpretation?

That is the real work of the engine.

6.7.12 Section Conclusion

The state update operator converts current and prior evidence into a revised position inside the prognostic state field. It preserves continuity, supports upward and downward movement, respects recovery, and prevents unstable oscillation.

6.8 Alert and Escalation Thresholds

The state update operator determines where the person sits in the prognostic field. Alert and escalation thresholds determine when that state becomes actionable.

This section defines the boundary between internal monitoring and active warning.

The central principle is: not every abnormality deserves an alert, and not every alert deserves escalation.

A strong prognostic engine must therefore separate three things: observed drift, internal concern, outward action.

6.8.1 Function of Thresholds

The threshold layer performs five tasks: defines when drift is significant enough to matter, defines when persistent drift becomes an alert condition, defines when reinforced drift becomes an escalation condition, prevents overreaction to weak or isolated abnormalities, provides a clear transition from monitoring to outward recommendation.

6.8.2 Alert Threshold Versus Escalation Threshold

The paper should distinguish clearly between alert and escalation.

An alert threshold means: the pattern has become strong enough that the system should actively flag concern inside the user loop.

An escalation threshold means: the pattern has become strong enough that passive monitoring alone is no longer sufficient and onward review is justified.

So the model has two distinct outward-control levels: Alert and Escalation.

6.8.3 Internal Alert Rule

Let: D(t) = current drift magnitude; P(t) = persistence strength; C(t) = convergence strength; θ_a = alert threshold.

Then a basic internal alert rule is: Alert(t) = 1 if D(t) × P(t) × C(t) > θ_a; Alert(t) = 0 otherwise.

This means the engine alerts only when current departure, temporal credibility, and cross-layer reinforcement jointly pass the required level.

6.8.4 Escalation Rule

Let: θ_e = escalation threshold; BF(t) = state separation or commit strength.

Then escalation should require two conditions: Escalate(t) = 1 if D(t) × P(t) × C(t) > θ_e and BF(t) > θ_b; Escalate(t) = 0 otherwise, where θ_b = commit-separation threshold.

This preserves discipline. The engine does not escalate merely because concern exists. It escalates when concern is both strong and sufficiently separated from weaker competing interpretations.
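Both rules can be sketched as predicates. All threshold values below are illustrative assumptions; the text leaves them to calibration.

```python
# Minimal sketch of the internal alert and escalation rules.
def alert(D, P, C, theta_a=0.4):
    """Alert(t): fires only when departure, temporal credibility, and
    cross-layer reinforcement are jointly strong enough."""
    return D * P * C > theta_a

def escalate(D, P, C, BF, theta_e=0.9, theta_b=2.0):
    """Escalate(t): additionally requires the commit-separation term
    BF(t) -- concern must be strong AND clearly separated from weaker
    competing interpretations."""
    return (D * P * C > theta_e) and (BF > theta_b)

# A joint pattern strong enough to alert (product 0.72) but not yet
# strong enough to escalate, regardless of separation strength.
a = alert(1.5, 0.8, 0.6)                 # True
e = escalate(1.5, 0.8, 0.6, BF=3.0)      # False
```

Note one property of the product form: if any one factor is zero the rule never fires, which enforces the "jointly meaningful" requirement but also means single-layer override conditions (6.8.6) must be handled separately.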

6.8.5 Threshold Bands

For practical operation, thresholds should be tiered rather than singular.

A simple band structure is: Band 0 – corridor-consistent, no alert; Band 1 – watch state, continue monitoring; Band 2 – internal alert state; Band 3 – escalation-ready state; Band 4 – committed escalation and outward handover.

6.8.6 Signal-Specific Overrides

The general threshold structure should remain in place, but the engine may allow selected override conditions.

Examples: a very strong single biological abnormality may trigger immediate alert even before full persistence develops; a historically high-risk recurrent pattern may lower the escalation boundary; a weak but repeated drift in a high-susceptibility person may trigger earlier reinforcement checks; a strongly context-explained abnormality may delay escalation despite current corridor breach.

So thresholds are structured, but not stupid. They should be lawful and adjustable, not rigidly blind.

6.8.7 Threshold Calibration

The paper should make clear that thresholds are not fixed in the abstract. They must be calibrated against: personal baseline behaviour, disease-application weighting, prior event structure, acceptable false-alert tolerance, acceptable false-miss tolerance, data-source reliability.

This is important because the right alert boundary in one application may be too sensitive or too blunt in another.

So thresholding belongs partly to architecture and partly to validation.

6.8.8 Alert Fatigue Control

A user-centred system must also protect against alert fatigue.

If the engine alerts too often on weak instability, the user will stop trusting it. If it alerts too rarely, prognosis fails.

So the threshold layer should include anti-fatigue controls such as: requiring persistence before repeated alerting, suppressing duplicate alerts in the same unresolved corridor, increasing alert prominence only when state worsens, distinguishing watch states from action states clearly.

6.8.9 Escalation Pathway

Escalation should not be a vague red warning. It should correspond to a structured handover path.

That may include: stronger user-facing warning, recommendation for repeat targeted sampling, recommendation for external testing, recommendation for clinical or specialist review, export of the recent drift corridor and supporting signal summary.

So escalation is not merely a label. It is a transition into action.

6.8.10 Threshold Principle

The governing principle is: the system should warn only when abnormality becomes jointly meaningful, and escalate only when that meaning becomes actionably coherent.

That is the correct threshold logic for the paper.

6.8.11 Section Conclusion

Alert and escalation thresholds convert internal drift logic into outward control decisions. They distinguish weak concern from active warning, and active warning from justified escalation. By requiring joint reinforcement and commit separation, the engine remains sensitive to early formation without becoming unstable or noisy.

6.9 De-escalation and Recovery Logic

A prognostic engine must not only know when to rise. It must know when to come down.

Without de-escalation logic, the model becomes sticky and punitive. It may continue to hold a person in an elevated concern state even after the structure has weakened, the signals have returned toward corridor, or the pattern has dissolved. That would make the system inaccurate and unusable.

The central principle of this section is: recovery must be interpreted as carefully as deterioration.

6.9.1 Function of De-escalation Logic

De-escalation and recovery logic perform five tasks: determine whether previously abnormal signals are returning toward baseline, distinguish genuine recovery from temporary weakening, control downward movement between states, prevent the engine from staying elevated without justification, preserve memory of the episode even after the state weakens.

So this is not a cosmetic add-on. It is part of the engine's truthfulness.

6.9.2 Why Recovery Must Be Modelled

A person may move from drift toward stability in several ways: rapid clean recovery, gradual recovery, partial recovery, oscillatory weakening, apparent recovery followed by relapse.

These are not equivalent.

A system that treats all downward movement as identical loses important prognostic information. In many cases, the difference between full recovery and partial recovery is itself predictive of future risk.

So the engine must ask: Is the signal truly returning to corridor? Is it returning at the expected rate? Is the recovery holding? Is the person returning to prior baseline, or to a degraded approximation of it?

That is the core of the recovery layer.

6.9.3 Recovery Conditions

A pattern may be treated as recovering when one or more of the following occur: current drift magnitude declines, persistence weakens across recent windows, convergence dissolves, signals re-enter the personal corridor, recovery is sustained across a sufficient return window.

So de-escalation should not be based on one weak reading alone. It should be based on repeated evidence that the drift structure is losing force.

6.9.4 Recovery Score

A simple recovery term can be defined by measuring reduction in adjusted drift across recent windows.

Let: D_c(t) = convergence-adjusted drift; k_r = recovery window length.

Then one simple recovery measure is: \(Q(t) = (1/k_r) \sum_{r=0}^{k_r - 1} \max[D_c(t-r-1) - D_c(t-r), 0]\)

This gives the average recent reduction in adjusted drift.

A higher Q(t) indicates stronger recent improvement. A lower Q(t) indicates weak or absent recovery.

This is only one component. Recovery should also be checked against actual corridor re-entry and persistence weakening.
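A minimal sketch of the recovery term defined above, assuming the adjusted drift series is supplied oldest-first; the function name and argument layout are illustrative.

```python
def recovery_score(d_c, k_r):
    """Average recent reduction in convergence-adjusted drift D_c.

    `d_c` is a time-ordered list of D_c values (oldest first); the final
    entry is D_c(t). Only reductions count; increases contribute zero,
    matching the max[..., 0] term in the paper's Q(t) definition.
    """
    recent = d_c[-(k_r + 1):]                 # k_r + 1 points span k_r steps
    drops = [max(prev - cur, 0.0) for prev, cur in zip(recent, recent[1:])]
    return sum(drops) / k_r
```

A higher return value indicates stronger recent improvement, exactly as the text reads Q(t).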

6.9.5 Corridor Re-Entry

Recovery is stronger when the person's signals return not merely downward, but back into their own expected corridor.

So for each signal i, the engine should test whether |z_i(t)| ≤ θ_i for a sufficient number of recent windows.

This matters because downward movement alone is not enough. A signal can improve while still remaining materially abnormal.

The stronger condition is: deviation weakens, the corridor is re-entered, the re-entry holds. That is genuine recovery.
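The corridor re-entry test above can be sketched as follows; the dictionary layout for signals and corridor half-widths is an assumption for illustration.

```python
def corridor_reentry(z_recent, theta, required_windows):
    """True when |z_i(t)| <= theta_i has held for `required_windows`
    recent windows for every signal i.

    `z_recent` maps each signal to its recent z-scores (oldest first);
    `theta` maps each signal to its personal corridor half-width theta_i.
    """
    for signal, zs in z_recent.items():
        window = zs[-required_windows:]
        if len(window) < required_windows:
            return False                 # not enough evidence of holding yet
        if any(abs(z) > theta[signal] for z in window):
            return False                 # still materially outside corridor
    return True
```

Note that the function returns False for downward movement that has not yet crossed back inside the corridor, which is the stronger condition the section requires.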

6.9.6 Partial Recovery

The engine must also support partial recovery.

A person may show reduced drift, reduced persistence, reduced convergence while still not being fully stable.

In that case, the system should allow downward movement without pretending the episode is over.

For example: Escalation Candidate → Persistent Divergence; Persistent Divergence → Soft Deviation.

This is why de-escalation should be stepwise rather than all-or-nothing.

6.9.7 Relapse Protection

Recovery should be tested for durability.

A common false signal is apparent recovery followed by rapid relapse. The engine should therefore require that weakening holds across a sufficient return window before fully resetting to stability.

This means the model should distinguish: transient easing, and durable recovery. The second is what justifies reset.

6.9.8 De-escalation Rule

A simple de-escalation principle is: move downward only when drift, persistence, and convergence all weaken sufficiently, and that weakening holds for a defined return interval.

In symbolic form: DeEscalate(t) = 1 if D(t) < θ_d,down, P(t) < θ_p,down, C(t) < θ_c,down, and recovery holds across the return window; DeEscalate(t) = 0 otherwise.

The downward thresholds may be lower than upward thresholds to preserve hysteresis and prevent flicker.

De-escalation should therefore be blocked where temporary improvement is contradicted by continued strengthening in a higher-weight signal layer.
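The de-escalation rule of this section, including the blocking condition, can be sketched in one function. The threshold dictionary keys and the boolean inputs are assumptions; in a full engine they would be computed from the persistence, convergence, and layer-weight machinery of earlier sections.

```python
def de_escalate(D, P, C, th_down, return_ok, blocking_layer_strengthening):
    """DeEscalate(t) per 6.9.8: drift D, persistence P, and convergence C
    must all fall below their downward thresholds, recovery must hold
    across the return window (`return_ok`), and no higher-weight signal
    layer may still be strengthening."""
    if blocking_layer_strengthening:
        return 0   # temporary improvement contradicted elsewhere: hold state
    below = D < th_down["d"] and P < th_down["p"] and C < th_down["c"]
    return 1 if (below and return_ok) else 0
```

Setting the entries of `th_down` below the corresponding upward thresholds gives the hysteresis band that prevents flicker between states.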

6.9.9 Return to Stability

The person should only return fully to Stable when: the operative drift structure has dissolved, relevant signals have re-entered corridor, recovery has held long enough to be credible, no strong reinforcing layers remain active.

This protects the system from declaring stability too quickly.

6.9.10 Memory After Recovery

Even after de-escalation, the episode should not vanish from the model.

The history layer should preserve: what happened, how long it lasted, how recovery occurred, whether recovery was complete, whether the pattern later returned.

This matters because prior episodes may shape future interpretation even after apparent resolution.

6.9.11 Section Conclusion

De-escalation and recovery logic give the engine downward intelligence. They ensure that elevated concern states weaken only when the drift structure genuinely dissolves, corridor re-entry occurs, and recovery holds across time. This prevents sticky overclassification while preserving episode memory for future recurrence analysis.

6.10 Output States and User-Facing Results

The engine does not exist only to compute internal state. It must also produce outputs that are intelligible, restrained, and useful to the person using it. A prognostic system fails if its outputs are either too vague to guide action or too aggressive to preserve trust.

The purpose of this section is to define what the system actually returns to the user and how those outputs should relate to the internal state architecture.

The governing principle is: the user should see the condition of their trajectory, not an opaque machine verdict.

6.10.1 Function of the Output Layer

The output layer performs five tasks: translates internal state into usable user-facing language, distinguishes watch states from action states, shows whether the person is stable, drifting, or recovering, preserves continuity across time, prepares structured export for onward review if escalation occurs.

So the output layer is not cosmetic. It is the public surface of the engine.

6.10.2 Internal State Versus User Output

The internal engine may track precise values for: drift magnitude, persistence strength, convergence strength, recovery condition, state posterior, commit separation.

The user, however, should not necessarily be shown all of this raw structure at equal prominence.

The user-facing result should be simpler while remaining truthful.

A clean output structure is: Stable, Watch, Drift Detected, Escalation Recommended, Under Review, Recovering.

This allows the system to preserve nuance without overwhelming the person with internal mechanics.

6.10.3 Output Mapping

The user-facing results can be mapped from the internal state field as follows: Stable → internal state mainly Stable; Watch → Susceptible or very early Soft Deviation; Drift Detected → Soft Deviation or Persistent Divergence; Escalation Recommended → Escalation Candidate; Under Review → Clinical Review; Recovering → downward transition from elevated state with active recovery.

This mapping is strong because it preserves both interpretive clarity and technical continuity.
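The mapping above can be held as a simple lookup with two qualifying flags; the function signature and the `early_soft_deviation` flag are illustrative assumptions, since the full engine would derive these from the state field directly.

```python
# Internal engine states (Chapter 6) -> user-facing outputs (6.10.2).
INTERNAL_TO_OUTPUT = {
    "Stable": "Stable",
    "Susceptible": "Watch",
    "Soft Deviation": "Drift Detected",
    "Persistent Divergence": "Drift Detected",
    "Escalation Candidate": "Escalation Recommended",
    "Clinical Review": "Under Review",
}

def user_facing(internal_state, recovering=False, early_soft_deviation=False):
    """Translate an internal state into the user-facing result."""
    if recovering:
        # Downward transition from an elevated state with active recovery.
        return "Recovering"
    if internal_state == "Soft Deviation" and early_soft_deviation:
        return "Watch"     # very early Soft Deviation maps to Watch
    return INTERNAL_TO_OUTPUT[internal_state]
```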

6.10.4 What the User Should See

For each output state, the user should see: current status, whether the status is improving, stable, or worsening, what signal layers contributed most, whether repeat sampling is needed, whether outside review is advised, whether recovery is holding.

The system should therefore not merely say: something is wrong.

It should say: the system is observing repeated movement in these layers, at this level, with this current trend.

That makes the output actionable.

6.10.5 Output Restraint

The system must avoid using output language that overclaims.

It should not state final diagnosis unless that function exists elsewhere and is properly validated. The prognostic output should remain within its proper scope.

So the language should favour forms such as: your readings remain within your usual range, a small deviation is being watched, repeated drift is being detected, the pattern has strengthened and further review is recommended, recovery is occurring but is not yet complete.

This is stronger than dramatic alert language because it preserves trust and precision.

6.10.6 Trend Display

The user should also be able to see directional change over time.

A useful display layer includes: current state, prior state, movement direction, short timeline of recent windows, whether the pattern is stabilising, worsening, or recovering.

This matters because prognosis is about trajectory. The output should therefore show movement, not only category.

6.10.7 User Action Layer

Each output state should correspond to a practical user instruction.

For example: Stable → continue ordinary monitoring; Watch → repeat routine sampling; Drift Detected → increase attention and repeat sampling within defined interval; Escalation Recommended → initiate targeted follow-up or external review; Under Review → preserve monitoring and share recent corridor data; Recovering → continue monitoring until recovery holds.

This prevents the output layer from becoming abstract.
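The state-to-instruction pairs above can be kept as a plain lookup table; the dictionary name is an assumption, and the wording is taken directly from the list in this section.

```python
# Practical user instruction for each user-facing output state (6.10.7).
ACTION_FOR_OUTPUT = {
    "Stable": "continue ordinary monitoring",
    "Watch": "repeat routine sampling",
    "Drift Detected": "increase attention and repeat sampling within defined interval",
    "Escalation Recommended": "initiate targeted follow-up or external review",
    "Under Review": "preserve monitoring and share recent corridor data",
    "Recovering": "continue monitoring until recovery holds",
}
```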

6.10.8 Export for Review

Where escalation occurs, the engine should be able to produce a structured summary rather than a raw data dump.

A review export should include: recent state trajectory, key contributing signal layers, persistence summary, convergence summary, recovery status, relevant history links, context notes for the recent window.

This makes the engine useful beyond the app itself.
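One way to hold the structured review summary is a small record type; the field names are illustrative mappings of the list above, not a fixed export schema.

```python
from dataclasses import dataclass, field

@dataclass
class ReviewExport:
    """Structured escalation summary per 6.10.8 (field names illustrative)."""
    state_trajectory: list        # recent state trajectory
    contributing_layers: list     # key contributing signal layers
    persistence_summary: str
    convergence_summary: str
    recovery_status: str
    history_links: list = field(default_factory=list)  # relevant history links
    context_notes: str = ""       # context notes for the recent window
```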

6.10.9 Output Principle

The governing principle is: the user should receive a truthful, graded view of their current trajectory, together with the minimum clear action appropriate to that state.

That is the correct role of the output layer.

6.10.10 Section Conclusion

Output states and user-facing results translate the internal state engine into usable human form. They preserve trajectory, trend, and recommended action without overclaiming certainty. This allows the system to remain technically disciplined while still being understandable and useful to the person living inside the monitored timeline.

6.11 Chapter Conclusion

This chapter has defined the operational core of the prognostic engine. The model receives structured personal inputs, normalises them against the individual corridor, constructs weighted drift scores, integrates persistence and convergence, updates state through a continuity-preserving operator, and applies threshold controls for alert, escalation, de-escalation, and recovery. It then returns a graded user-facing result that reflects trajectory rather than isolated event.

The central conclusion is: the engine does not diagnose from a single moment; it computes a controlled interpretation of personal drift across time.

That completes the engine chapter.

The next chapter should apply the architecture to a concrete domain.

Chapter 7. Cancer as an Application Layer

The previous chapters defined the general prognostic architecture. This chapter applies that architecture to cancer.

The argument here is not that all cancers are directly "detectable at birth." The stronger and more defensible claim is different: cancer can be modelled as a monitored drift pathway in which inherited susceptibility, repeated biological sampling, temporal deviation, and multi-layer reinforcement are used to identify pre-event concern earlier than conventional episodic recognition. NCI states that inherited harmful variants account for about 5% to 10% of all cancers and that hereditary cancer syndromes can be identified through genetic testing of blood or saliva. At the same time, current reviews of liquid-biopsy-based multi-cancer early detection describe the field as promising for asymptomatic screening, but still technically and clinically evolving.

The chapter therefore treats cancer not as a special exception to the model, but as one high-risk application layer within it.

7.1 Purpose

This chapter establishes how the prognostic engine can be specialised for cancer-oriented surveillance.

It defines: inherited susceptibility as the opening risk field, repeated blood-based monitoring as the primary biological loop, symptom, wearable, and history layers as reinforcement fields, cancer drift as movement from susceptibility to actionable concern, escalation as targeted review rather than one-step proof.

7.1.1 Chapter Function

The function of this chapter is to show that cancer can be treated as a prognostic monitoring problem rather than only as a late-stage confirmation problem.

That position is consistent with current research directions. NCI's hereditary-cancer resources focus on inherited-risk identification and surveillance relevance, while recent MCED and liquid-biopsy reviews frame early cancer detection as a multi-analyte, multi-step process aimed at asymptomatic populations, not a single definitive one-off event.

7.2 Why Cancer Fits the Drift Model

Cancer fits the drift model because it is rarely a pure moment-event at the level of biology. What institutions often encounter as a diagnosis is usually the later recognition of a process that has already been forming through accumulated change. That makes cancer especially suitable for a prognostic architecture built around susceptibility, repeated biological observation, persistence, convergence, and early escalation.

This does not mean every cancer is silently predictable in the same way. It means the structure of oncological emergence is compatible with the logic already established in this paper: deviation forms, persists, and becomes progressively more legible before formal confirmation. Current early-detection literature supports this framing. Reviews of liquid-biopsy-based multi-cancer early detection describe cancer screening as moving toward multi-analyte, noninvasive, and earlier-stage signal detection in asymptomatic populations, even while emphasising that the field is still technically and clinically evolving.

7.2.1 Cancer Is Not Only a Late Event

The standard institutional picture of cancer is often shaped by the point of diagnosis: imaging, pathology, biopsy, or overt clinical manifestation. But that point is not the whole biological timeline. It is the point at which the process has become sufficiently visible to formal systems.

That makes the same distinction relevant here as in earlier chapters: biological formation, measurable signal emergence, institutional recognition.

Cancer belongs to that three-stage structure as strongly as any other major disease class. Early-detection reviews emphasise that biomarkers for cancer can arise in circulating DNA, RNA, proteins, metabolites, and related analytes before conventional late-stage presentation, which is precisely why noninvasive early detection is an active research field.

7.2.2 Why a Binary Model Is Too Weak

A binary model of cancer says: no cancer or cancer present. That is too weak for prognosis.

A drift model allows a stronger sequence: susceptible background, early unexplained deviation, repeated weak biological anomaly, reinforced multi-layer concern, escalation-ready pattern, formal review.

This does not replace diagnosis. It creates a monitored formation corridor before diagnosis.

That is exactly where the proposed engine has force. It does not need to claim certainty at the earliest stage. It only needs to determine when the person is leaving their own stable corridor in a way that is repeated, biologically relevant, and increasingly coherent.

7.2.3 Cancer Signals Are Layered

Cancer also fits the drift model because its early signal field is not limited to one observation class.

Potential oncology-oriented drift may appear through combinations such as: inherited susceptibility or family clustering; repeated blood-based biomarker deviation (cfDNA, methylation, fragmentomics, or related liquid-biopsy signals); unexplained fatigue, weight change, or symptom recurrence; prior-event similarity; failure of the system to return to ordinary baseline.

Current reviews of emerging cancer biomarkers explicitly discuss multi-analyte and multi-omics approaches, including circulating nucleic acids, proteins, extracellular vesicles, metabolites, and related noninvasive strategies, which supports a layered rather than single-marker view.

7.2.4 Why Repeated Monitoring Matters More in Cancer

Cancer fits repeated monitoring especially well because one-off screening has obvious structural limits.

A one-off screen may miss: weak early-stage signal, intermittent low-level abnormality, a pattern that becomes meaningful only when repeated, a soft signal that needs reinforcement across time.

That is why the drift architecture is stronger than episodic checking alone. It allows the engine to treat cancer-oriented concern as a temporal formation path rather than as a single opportunity for detection.

This is aligned with current MCED discussions, which repeatedly stress both the promise and the limitations of one-time screening and emphasise the importance of sensitivity, specificity, validation, and downstream diagnostic pathways.

7.2.5 Cancer as a High-Risk Application Layer

For this paper, the strongest formulation is therefore: cancer is a high-risk application layer within a general prognostic drift engine.

That is a stronger claim than treating cancer as a unique exception. It means the same architecture already developed in this paper can be tuned for oncological concern by adjusting: the susceptibility field, the weighting of blood-based signals, the threshold for persistence, the role of inherited background, the escalation pathway.

This preserves both generality and seriousness.

7.2.6 Section Conclusion

Cancer fits the drift model because oncological emergence is compatible with a layered, temporal, and reinforcement-based architecture. It is not best understood only as a late institutional event, but as a process that may become partially measurable before formal recognition through inherited risk, repeated biological deviation, and multi-layer convergence. That makes cancer an especially strong application domain for a prognostic engine built around personal baseline and pre-event surveillance.

7.3 Birth-Anchored Susceptibility and Family Risk

The cancer application layer should begin with susceptibility, not with a claim of universal direct cancer detection at birth. The stronger formulation is birth-anchored risk mapping: identify inherited predisposition where present, preserve family-pattern intelligence, and then monitor the person longitudinally for drift that may justify earlier escalation.

That framing is consistent with current cancer genetics literature. NCI states that about 5% to 10% of cancers are caused by inherited harmful genetic changes, and that inheriting a cancer-related genetic change increases risk without guaranteeing that cancer will occur.

7.3.1 Why Birth-Anchoring Matters

Birth-anchoring matters because it gives the engine an opening risk field before any active drift has appeared. If a person begins life with known inherited susceptibility, or with a strong family-pattern burden suggestive of such susceptibility, then the surveillance architecture should not wait for late-stage institutional recognition to begin caring about that fact.

The point is not to label the child as diseased. The point is to initialise a different starting vigilance state.

That is exactly what the Susceptible state in the general engine was designed to do.

7.3.2 Susceptibility Is Not Diagnosis

This chapter must keep the distinction clean.

Inherited risk means: the baseline field is heavier, the threshold for interpretive attention may be lower, repeated weak abnormalities may deserve stronger scrutiny.

Inherited risk does not mean: cancer is already present, escalation is automatic, every later deviation should be overread.

That distinction is supported by NCI's guidance: inherited cancer-related variants raise risk, but do not mean a person will definitely develop cancer.

7.3.3 Family History as a First Risk Instrument

A practical system does not need to wait for full genomic certainty before it can use susceptibility intelligently. Family-pattern information is already useful.

Relevant background structure includes: repeated cancers in close relatives, early age of onset in relatives, clustering of related cancer types, known hereditary cancer syndrome in the family, repeated lineage-specific pattern of recurrence.

This is important because real-world people often know family history before they ever receive formal genomic testing. The app can therefore begin with family-structured susceptibility and refine it later if genomic data become available.

7.3.3A Maternal Lineage Prior

The inherited prior in this model should not be treated as a flat family-history note, nor should it be confined only to the immediate mother. It should be represented as a maternal lineage prior.

This matters because the maternal contribution to offspring is structurally asymmetric across generations. All offspring inherit mitochondrial DNA through the maternal line, which means that maternal bioenergetic and mitochondrial dysfunction pathways may remain relevant across mother, grandmother, great-grandmother, and earlier maternal ancestry. In addition, sons inherit their sole X chromosome from the mother, while daughters inherit one maternal X and one paternal X. This means maternally conveyed X-linked tumour-suppression or repair vulnerability may carry more direct force in males, while still contributing in females.

The maternal lineage prior should therefore be understood as an inherited bias in the prognostic field, not as proof of disease. It shifts the starting vigilance of the model before present-time blood evidence is applied.

In practical terms, the maternal lineage prior may include: immediate maternal history, maternal grandmother history, maternal great-grandmother history, deeper known maternal-line recurrence where available, mitochondrial dysfunction patterns suspected to track through the maternal corridor, maternally conveyed X-linked vulnerability where relevant.

The maternal lineage prior may be written conceptually as: \(H_{\text{mat-line}} = H_{\text{mother}} + H_{\text{mGM}} + H_{\text{gGM}} + H_{\text{deep}}\)

Where: H_mother = immediate maternal contribution; H_mGM = maternal grandmother contribution; H_gGM = maternal great-grandmother contribution; H_deep = deeper known maternal-line contribution.

This form is not intended to claim equal knowledge across all generations. It is intended to give the model a lawful way to register that lineage-linked cancer corridors may accumulate across the maternal line rather than appearing as isolated one-generation events.

The maternal lineage prior therefore acts as a baseline susceptibility field. It does not diagnose. It changes the amount of present-time weak signal required before the system raises a flag.
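A minimal sketch of the maternal lineage prior follows. The generation weights are an assumption added here to reflect the text's caveat that knowledge is not equal across generations; the conceptual form in this section sums the contributions directly.

```python
def maternal_lineage_prior(h_mother, h_mgm, h_ggm, h_deep,
                           weights=(1.0, 0.6, 0.35, 0.2)):
    """H_mat-line as a weighted sum of maternal-line contributions.

    Arguments follow 7.3.3A: mother, maternal grandmother, maternal
    great-grandmother, and deeper known maternal-line contribution.
    The decaying default weights are illustrative, not from the paper.
    """
    contributions = (h_mother, h_mgm, h_ggm, h_deep)
    return sum(w * h for w, h in zip(weights, contributions))
```

With all weights set to 1.0 this reduces to the unweighted conceptual form given above.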

7.3.3B Sex-Conditioned Maternal Inheritance

The maternal lineage prior is not identical in force across male and female offspring.

Its universal component arises from maternal mitochondrial inheritance, which applies across all offspring. Its sex-conditioned component arises from the maternal X-linked contribution, which carries greater exclusivity in sons because they receive only one X chromosome and that X is maternally derived.

For this reason, the inherited prior should be treated as composed of two parts: a universal mitochondrial maternal component, and a sex-conditioned maternal X-linked component.

This means the prior field is shared across sons and daughters in mitochondrial terms, but not identically weighted in X-linked terms.

The practical consequence is that the same weak present-time blood-field lean may deserve different vigilance depending on whether the maternally conveyed corridor is acting through a universally shared mitochondrial pathway or through a more direct sex-conditioned X-linked route.

7.3.4 Genomic Risk Identification from Birth or Early Childhood

The stronger research-facing claim is that genomic newborn or early-childhood screening may identify a subset of children at risk for early-onset cancers and allow surveillance to begin earlier. Population-based genomic newborn screening for paediatric cancer risk is now being studied, and recent paediatric cancer surveillance reporting describes newborn genetic screening as a feasible research direction for spotting risk before disease develops.

So the cancer paper should not say: all cancer can be detected at birth.

It should say: birth or early-childhood genomic risk mapping can initialise lifelong surveillance in the subset of persons carrying actionable inherited predisposition.

That is materially stronger and more defensible.

7.3.5 Susceptibility as a Background Modifier

In the engine, birth-anchored susceptibility should function as a background modifier on later state interpretation.

That means it can: increase surveillance vigilance, lower tolerance for repeated weak oncological drift, raise the weight of selected biological anomalies, shorten the path from weak persistent deviation to targeted reinforcement checks.

But it should not bypass the rest of the engine. Drift, persistence, convergence, and commit discipline still apply.

This keeps the cancer model from collapsing into genetic determinism.

7.3.6 Why This Strengthens the Cancer Chapter

Starting with susceptibility makes the whole chapter stronger for three reasons.

First, it aligns with real inherited-risk science rather than overstating early direct detection.

Second, it creates a lawful opening state for oncology-oriented monitoring.

Third, it gives the cancer application layer a true pre-event anchor: not proof of disease, but proof that some people begin from a materially different surveillance position.

7.3.7 Section Conclusion

Birth-anchored susceptibility and family risk provide the correct starting point for the cancer application layer. They do not diagnose cancer in advance, but they initialise the surveillance field for persons carrying inherited or lineage-linked risk. In practical terms, this means the cancer engine should begin by mapping family history and, where available, inherited predisposition, then use that background to weight later drift rather than replacing later drift altogether.

7.4A Pinprick Blood as a Closure-Proximity Detector

The pinprick blood layer is not used here as a definitive cancer-identification mechanism. Its role is narrower and more operationally realistic. It functions as a closure-proximity detector.

That means the objective is not to determine tissue origin, tumour class, or pathological finality from one micro-sample. The objective is to determine whether the sampled blood field is leaning toward an abnormal closure point strongly enough to justify further checking.

This distinction matters. A pinprick sample may carry only weak, partial, or low-abundance evidence. That does not make it useless. It means the sample should be read as a weighted signal field rather than as a binary yes-or-no test.

7.4A.1 Scope of the Pinprick Layer

The pinprick layer is designed to raise an alarm, not to close the case.

Its job is to detect whether the sampled field shows directional abnormality across one or more weak signal classes, such that the person should move into repeat surveillance or onward review.

So the output of this layer is not: cancer identified, tumour localised, pathology confirmed.

The output is: weak closure-lean detected, repeated closure-lean detected, reinforced closure-lean detected, escalation candidate.

That is the correct role of the capillary blood loop inside the broader prognostic engine.

7.4A.2 Weak Signal Classes in the Pinprick Field

The pinprick sample is treated as a mixed biological field. Within that field, the engine may look for weak deviation in several classes of signal, including: ctDNA-like aberrant signal, fragment distortion, methylation drift, extracellular-vesicle anomaly, protein deviation, broader blood-state or inflammatory disturbance.

These are not assumed to be equally strong, equally mature, or equally recoverable in present-day capillary workflows. Their purpose here is not to claim full molecular sufficiency from one drop of blood. Their purpose is to define the possible weak-signal channels through which abnormal closure tendency may begin to appear.

7.4A.3 Inherited Prior Reference

The pinprick closure model does not begin from a flat field. It operates against the inherited susceptibility prior already defined in Section 7.3, including maternal lineage effects where relevant.

This means the blood-field detector is not reading current weak signal in isolation. It is reading current weak signal against an already structured background of inherited possibility.

7.4A.4 Closure-Flag State Equation

The pinprick layer is best represented as a balance between destabilising weak signals and stabilising or corrective terms.

\(\text{State}(t) = \text{inherited prior} + \alpha_1 f_{\text{ctDNA}}(t) + \alpha_2 f_{\text{frag}}(t) + \alpha_3 f_{\text{meth}}(t) + \alpha_4 f_{\text{EV}}(t) + \alpha_5 f_{\text{prot}}(t) - \beta_1 g_{\text{qual}}(t) - \beta_2 g_{\text{base}}(t) - \beta_3 g_{\text{CHIP}}(t) - \beta_4 g_{\text{context}}(t) - \beta_5 g_{\text{recovery}}(t)\)

In compressed form: State(t) = inherited prior + weighted abnormal signals − weighted correction terms.

This equation does not ask whether one signal alone proves cancer. It asks whether the total weighted field is leaning toward abnormal closure strongly enough to justify a flag.
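The state balance above, together with the flag rule given in 7.4A.6, can be sketched as two small functions. The list-based signal layout and the example weight vectors are assumptions; in the paper's notation, `signals` holds f_ctDNA through f_prot and `corrections` holds g_qual through g_recovery.

```python
def closure_state(prior, signals, corrections, alpha, beta):
    """State(t) = inherited prior + weighted abnormal signals
                  - weighted correction terms (7.4A.4)."""
    load = sum(a * f for a, f in zip(alpha, signals))         # destabilising
    damping = sum(b * g for b, g in zip(beta, corrections))   # stabilising
    return prior + load - damping

def closure_flag(state, theta_f):
    """Flag(t) = 1 if State(t) > theta_f, else 0 (7.4A.6)."""
    return 1 if state > theta_f else 0
```

Because the inherited prior enters the state additively, a person with a heavier prior reaches the same flag boundary with less present-time abnormality, which is the behaviour 7.4A.6 describes.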

7.4A.5 Meaning of Zero

Within this framework, zero does not mean absence of information. Zero means equilibrium balance.

So: State = 0 means the destabilising load is balanced by stabilising and corrective structure; State > 0 means the field is leaning toward concern; State < 0 means the field remains net-stabilised or over-corrected toward baseline.

This is consistent with the wider equilibrium logic already used throughout the model.

7.4A.6 Flag Condition

The pinprick layer should produce a flag when the closure-lean becomes strong enough to cross a threshold.

So: Flag(t) = 1 if State(t) > θ_f; Flag(t) = 0 otherwise.

This threshold is not fixed in abstraction. It depends on application layer, signal maturity, and inherited background. Where the inherited prior is elevated, the system reaches the flag boundary with less additional present-time abnormality.

In practical terms, the effective alert boundary is lower in a person with stronger inherited susceptibility.

7.4A.7 Persistence and Reinforcement

One weak flag is not yet enough for strong interpretation.

The system must then ask whether the closure-lean repeats.

Persistence is therefore defined across recent windows. If the state remains above the flag threshold across repeated samples, the concern gains force.

Then the model asks whether the repeated lean is reinforced across signal classes. For example: ctDNA-like abnormality plus fragment distortion, weak molecular abnormality plus repeated protein deviation, weak blood-field abnormality plus inherited lineage susceptibility, repeated weak abnormality plus failure to return to baseline.

This is the step that turns a one-off weak lean into a meaningful surveillance pattern.

So the logic becomes: one crossing of the threshold = weak flag; repeated crossing = persistent flag; repeated crossing plus multi-signal support = reinforced flag; reinforced flag plus sufficient separation from noise = escalation candidate.

That is the correct operational ladder.
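The ladder above can be expressed as a single classification function. The boolean inputs for multi-signal support and noise separation are assumptions standing in for the reinforcement and commit-separation machinery defined elsewhere in the engine.

```python
def flag_ladder(flags, multi_signal_support, noise_separation,
                persist_windows=3):
    """Operational ladder of 7.4A.7: weak flag -> persistent flag ->
    reinforced flag -> escalation candidate.

    `flags` is the recent Flag(t) history (oldest first, 0/1 values)."""
    if not flags or flags[-1] == 0:
        return "no flag"                       # not currently above threshold
    recent = flags[-persist_windows:]
    if len(recent) < persist_windows or not all(recent):
        return "weak flag"                     # one-off or unsustained crossing
    if not multi_signal_support:
        return "persistent flag"               # repeated crossing only
    if not noise_separation:
        return "reinforced flag"               # repeated + multi-signal support
    return "escalation candidate"              # reinforced + separated from noise
```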

7.4A.8 What the Pinprick Layer Does Not Claim

This section must remain disciplined.

The pinprick closure-flag model does not claim: exact tissue origin, exact cancer class, definitive molecular diagnosis, full replacement of confirmatory workup.

It claims something narrower and more useful: the blood microfield may carry weak distributed evidence of abnormal closure before conventional late recognition, and repeated capillary surveillance can raise an earlier alarm when that field leans consistently enough toward concern.

That is all it needs to do.

7.4 A.9 Operational Consequence

The practical consequence is clear.

If the pinprick layer raises a weak flag, the person repeats surveillance.

If it raises a persistent flag, the person moves into reinforced observation.

If it raises a reinforced flag, the person proceeds to targeted checking or onward review.

This means the pinprick sample is not being overburdened with the task of final diagnosis. It is being used correctly as an early detector of field disturbance.

7.4 A.10 Section Conclusion

The pinprick blood layer should be treated as a closure-proximity detector rather than a terminal diagnostic instrument. Its role is to identify when a small blood sample shows enough weighted abnormal lean to justify further attention. The field is interpreted through a state equation that balances inherited susceptibility, weak abnormal molecular or biochemical signals, and corrective or noise-suppression terms. Zero represents equilibrium. Positive state represents drift toward concern. Repetition and reinforcement then determine whether the signal remains a weak flag or becomes an escalation candidate.

This is the correct way to include pinprick surveillance in the cancer application layer: not as overclaimed certainty, but as an early alarm mechanism within a wider prognostic system.

7.4 B Worked Six-Month Example

To make the pinprick closure-proximity model operational, this section sets out a six-month worked example using fortnightly surveillance windows. The purpose of the example is not to claim that a completed oncology home-testing platform already exists in final deployment form. The purpose is to show how the state logic behaves when weak blood-field evidence is read repeatedly across time.

The example therefore demonstrates the function of the detector, not the final maturity of every assay component.

7.4 B.1 Structure of the Example

The example uses: three illustrative subjects, thirteen fortnightly windows across six months, one inherited-prior term, five weak destabilising blood-field signals, five corrective or stabilising terms, one closure-flag state value, one flag threshold, one persistence rule, one reinforcement rule.

The worked example is designed to show the difference between: ordinary fluctuation, weak persistent closure-lean, temporary false-watch behaviour.

This is necessary because a detector of this kind must show not only when it flags, but when it does not.

7.4 B.2 Subjects

Three illustrative profiles are used.

Subject A: 35-year-old man with no elevated inherited lineage prior and ordinary fluctuation.

Subject B: 40-year-old man with elevated maternal lineage prior and weak persistent closure-lean.

Subject C: 10-year-old girl with temporary disturbance but non-progressive behaviour across the six-month interval.

These subjects are not treated as confirmed pathological cases. They are surveillance examples used to demonstrate state behaviour.

7.4 B.3 Variables Used in the Example

At each fortnightly timepoint, the detector reads the following weak destabilising signal classes: ctDNA-like signal, fragment distortion, methylation drift, extracellular-vesicle anomaly, protein deviation. These are represented as scaled values between 0 and 1 for the purpose of the worked example.

The detector also applies the following corrective or stabilising terms: sample-quality penalty, baseline-fit correction, CHIP or background penalty, context-resolution term, recovery term.

The inherited prior enters as a baseline term before present-time weak signals are added.

7.4 B.4 State Equation Used in the Example

For the worked example, the state is calculated as follows:

State(t) = inherited prior + 0.30 × ctDNA-like(t) + 0.20 × fragment(t) + 0.20 × methylation(t) + 0.15 × EV(t) + 0.15 × protein(t) − 0.20 × quality(t) − 0.20 × baseline(t) − 0.20 × CHIP(t) − 0.20 × context(t) − 0.20 × recovery(t)

For the example, the flag threshold is set at: θ_f = 0.15

So: State > 0.15 = weak flag; repeated state > 0.15 = persistent flag candidate; persistent flag plus reinforcement = escalation candidate.
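The equation above transcribes directly into code. The weights and threshold are the illustrative values given in the text; the dictionary keys are shorthand for the signal and correction classes listed in 7.4 B.3.

```python
# Direct transcription of the 7.4 B.4 worked-example state equation.

THETA_F = 0.15

DESTABILISING = {"ctdna": 0.30, "fragment": 0.20, "methylation": 0.20,
                 "ev": 0.15, "protein": 0.15}
STABILISING = {"quality": 0.20, "baseline": 0.20, "chip": 0.20,
               "context": 0.20, "recovery": 0.20}

def state(inherited_prior, signals, corrections):
    # State(t) = prior + weighted destabilising terms - weighted corrective terms.
    up = sum(w * signals.get(k, 0.0) for k, w in DESTABILISING.items())
    down = sum(w * corrections.get(k, 0.0) for k, w in STABILISING.items())
    return inherited_prior + up - down

# Elevated prior plus two weak signals, partly cancelled by recovery:
state(0.10, {"ctdna": 0.2, "fragment": 0.1}, {"recovery": 0.1})
# approximately 0.16, just above theta_f = 0.15, so a weak flag
```

Missing classes default to zero, so a quiet field returns the inherited prior alone.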

7.4 B.5 Persistence Rule Used in the Example

Persistence is defined over the last three windows.

A persistent flag is treated as present when the state exceeds the flag threshold in at least two of the last three windows.

That gives the model enough memory to distinguish temporary fluctuation from repeated lean.

7.4 B.6 Reinforcement Rule Used in the Example

Reinforcement is treated as present when at least two destabilising signal classes move together above weak concern level while the state remains above the flag threshold across repeated windows.

For the purposes of the worked example, reinforcement is strongest where: ctDNA-like signal and fragment distortion rise together; ctDNA-like signal and methylation drift rise together; protein deviation persists alongside a raised inherited prior; recovery fails to cancel a weak rise across successive windows.

This is sufficient for demonstrating the detector logic.
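A minimal sketch of the reinforcement check follows. The per-signal weak-concern level of 0.10 is an assumption added for illustration; the text does not fix a per-signal value.

```python
# Sketch of the 7.4 B.6 reinforcement rule: at least two destabilising
# signal classes above a weak-concern level while the state stays above
# the flag threshold across the supplied windows.

WEAK_CONCERN = 0.10  # assumed per-signal concern level, not specified in the text
THETA_F = 0.15

def reinforced(signal_history, state_history, theta=THETA_F):
    # signal_history: one dict of per-class values per window.
    if not signal_history:
        return False
    for signals, state_t in zip(signal_history, state_history):
        co_elevated = sum(1 for v in signals.values() if v > WEAK_CONCERN)
        if co_elevated < 2 or state_t <= theta:
            return False
    return True
```

A single co-elevated window is not enough on its own; the caller passes the repeated windows it wants tested, which keeps reinforcement tied to persistence rather than to one sample.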

7.4 B.7 Subject A: Ordinary Fluctuation Without Persistent Lean

Subject A is a 35-year-old man with no elevated inherited lineage prior.

Inherited prior: 0.02

Fortnightly state values across thirteen windows: 0.03, 0.05, 0.07, 0.09, 0.12, 0.11, 0.08, 0.10, 0.13, 0.09, 0.06, 0.08, 0.07

Interpretation: the state never crosses the flag threshold of 0.15; no persistent flag occurs; no reinforced concern corridor forms; ordinary fluctuation remains ordinary fluctuation.

This is an important control case. It shows that the detector does not convert ordinary mild variability into concern merely because the system is observing closely.

Subject A result: weak flag: no; persistent flag: no; reinforced flag: no; escalation candidate: no.

Subject A meaning: The model remains below closure-proximity concern across the six-month interval. This demonstrates that the detector can preserve ordinary stability without artificial inflation.

7.4 B.8 Subject B: Maternal-Lineage Prior with Weak Persistent Closure-Lean

Subject B is a 40-year-old man with elevated maternal lineage prior.

Inherited prior: 0.10

Fortnightly state values across thirteen windows: 0.08, 0.11, 0.14, 0.17, 0.18, 0.19, 0.21, 0.20, 0.23, 0.24, 0.26, 0.27, 0.29

Interpretation: the state first crosses the flag threshold at the fourth window; the state remains above threshold thereafter; at least two of the last three windows are above threshold from the mid-series onward; the repeated above-threshold pattern is therefore persistent; because the rise is not one-dimensional and remains supported by more than one weak signal class, the pattern becomes reinforced.

In this example, the maternal lineage prior matters because the detector does not begin from a flat field. The same weak molecular disturbance in a person with less inherited susceptibility would take longer to cross the same concern boundary.

Subject B result: first weak flag: window 4; persistent flag: established by window 5 under the two-of-three rule; reinforced flag: established by approximately window 8; escalation candidate: late-series candidate if reinforcement and non-recovery hold.

Subject B meaning: This is the model's main demonstration case. It shows how a detector can raise an alarm before final diagnosis, not by overclaiming from one sample, but by recognising repeated weak closure-lean against an inherited prior field.

7.4 B.9 Subject C: Temporary Disturbance Without Progressive Pattern

Subject C is a 10-year-old girl with no progressive six-month drift corridor.

Inherited prior: 0.06

Fortnightly state values across thirteen windows: 0.02, 0.04, 0.07, 0.18, 0.17, 0.09, 0.05, 0.03, 0.04, 0.02, 0.01, 0.03, 0.02

Interpretation: the state crosses the flag threshold briefly at windows 4 and 5; the rise does not hold across the later interval; recovery suppresses the field back toward ordinary range; reinforcement does not remain active; the detector therefore produces a temporary watch pattern rather than a persistent surveillance corridor.

This is the necessary false-watch control case.

Subject C result: weak flag: yes; persistent flag: no enduring persistence; reinforced flag: no; escalation candidate: no.

Subject C meaning: The detector registers that a temporary disturbance occurred, but it does not interpret short-lived rise as a sustained closure-lean. This is essential for preventing overreaction.
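The three trajectories above can be run through the stated threshold and persistence rules directly. Function and field names here are illustrative; the state values are the ones listed in sections 7.4 B.7 to B.9.

```python
# Sketch: applying the 7.4 B.4 threshold and the 7.4 B.5 persistence rule
# to the three illustrative state series.

THETA_F = 0.15

SERIES = {
    "A": [0.03, 0.05, 0.07, 0.09, 0.12, 0.11, 0.08, 0.10, 0.13, 0.09, 0.06, 0.08, 0.07],
    "B": [0.08, 0.11, 0.14, 0.17, 0.18, 0.19, 0.21, 0.20, 0.23, 0.24, 0.26, 0.27, 0.29],
    "C": [0.02, 0.04, 0.07, 0.18, 0.17, 0.09, 0.05, 0.03, 0.04, 0.02, 0.01, 0.03, 0.02],
}

def summarise(states, theta=THETA_F):
    flags = [s > theta for s in states]
    first_flag = flags.index(True) + 1 if any(flags) else None  # 1-indexed window
    persistent_at_end = sum(flags[-3:]) >= 2  # two of the last three windows
    return {"first_flag_window": first_flag,
            "flag_count": sum(flags),
            "persistent_at_end": persistent_at_end}

for name, states in SERIES.items():
    print(name, summarise(states))
```

Subject A never flags, Subject B first flags at window 4 and is still persistent at the end of the series, and Subject C flags only at windows 4 and 5. Note that under the literal two-of-three rule Subject C briefly meets persistence around windows 5 and 6 before recovery dissolves it, which matches the "no enduring persistence" reading above.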

7.4 B.10 Worked Interpretation Table

For clarity, the worked example may be summarised as follows.

Subject | Inherited prior | Threshold crossing | Persistence | Reinforcement | Final interpretation
A | low | none | absent | absent | ordinary fluctuation
B | elevated maternal lineage | repeated | present | present | reinforced closure-lean, onward checking justified
C | modest | brief | not sustained | absent | temporary watch, no escalation

7.4 B.11 What the Worked Example Proves

This worked example proves the operational role of the detector.

It shows that the pinprick closure-proximity model can: remain quiet in ordinary fluctuation, raise an early weak alarm when state crosses the threshold, distinguish persistent lean from brief disturbance, strengthen concern when repeated weak evidence accumulates, incorporate inherited prior without collapsing into deterministic overreading.

That is exactly what the pinprick layer needs to do.

It does not need to localise tumour origin or provide terminal diagnosis. It needs to identify when the blood microfield stops behaving like ordinary fluctuation and starts behaving like a repeated lean toward abnormal closure.

7.4 B.12 Section Conclusion

The six-month worked example shows how the pinprick closure-proximity detector behaves across three distinct surveillance patterns: ordinary fluctuation, persistent weak concern, and temporary false-watch behaviour. The model does not overburden one micro-sample with final diagnostic responsibility. Instead, it treats each sample as one weighted contribution to a repeated state field. That repeated state field then determines whether the person remains within ordinary fluctuation, enters watch status, or moves toward reinforced concern and onward checking.

7.5 Blood-Based Oncology Signals

The blood layer is the central biological loop of the cancer application. Its purpose is not to replace pathology or imaging. Its purpose is to detect repeated biological deviation that may signal oncological concern earlier than episodic recognition alone.

That requires precision in wording.

The paper should not claim that one blood draw proves early cancer. It should claim that repeated blood-based oncology signals, interpreted against personal baseline and background susceptibility, can form an early-warning layer inside a broader surveillance system.

That position is aligned with current liquid-biopsy and multi-cancer early detection (MCED) literature, which increasingly focuses on multi-analyte blood-based detection in asymptomatic or pre-diagnostic settings while still emphasising technical and clinical limits.

7.5.1 Why Blood Is Central in the Cancer Layer

Cancer is especially suited to a blood-based surveillance loop because blood can carry fragments of the developing process before overt presentation.

These may include signals derived from: circulating tumour DNA-related patterns, methylation signatures, fragmentomic patterns, circulating RNA-related patterns, protein and metabolite abnormalities, broader host-response or inflammatory changes.

Recent reviews describe MCED and liquid-biopsy strategies precisely in these terms: multi-analyte blood-based assessment rather than one single universal marker.

So the strength of the blood layer in cancer is not that it contains one magical signal. Its strength is that it can carry multiple weak or moderate early signals that become meaningful when tracked repeatedly.

7.5.2 Blood Signals Must Be Longitudinal

A single oncology-oriented blood signal may be weak, noisy, or ambiguous.

That is why the cancer application must use the same core engine principle as the general model: compare to personal corridor, test for repeated deviation, test for persistence, test for convergence with other layers, escalate only when the pattern becomes actionably coherent.

This is especially important in early cancer detection, because tumour-derived material can be very sparse in asymptomatic or very early-stage disease. Reviews of MCED and early liquid-biopsy approaches repeatedly note that sensitivity depends on tumour burden, stage, analyte type, and assay design.

So the paper should favour repeated oncology drift surveillance over one-off oncological certainty claims.

7.5.3 Signal Classes for the Oncology Blood Layer

For the architecture of the paper, the blood layer should be grouped into classes rather than locked too early to one assay list.

A practical grouping is:

Group O1: inherited-risk-linked baseline context – This is not a blood analyte by itself, but it modifies how the blood layer is read when hereditary susceptibility is known.

Group O2: circulating nucleic-acid patterns – This includes cfDNA-related, ctDNA-related, methylation, fragmentomic, and related sequence-pattern signals where available.

Group O3: protein and metabolite shifts – This includes repeatable blood-based markers that may strengthen concern when they move persistently or in combination with more targeted analytes.

Group O4: host-response and inflammatory patterns – These are less specific, but may provide supportive reinforcement when repeated unexplained deviation aligns with other oncology-oriented signals.

This grouping is stronger than pretending the cancer layer rests on one analyte alone. Current reviews support this multi-analyte structure.

7.5.4 Oncology Blood Signals Need Weighting

Not every blood signal in the cancer layer should carry the same interpretive force.

For example: a validated methylation or fragmentomic abnormality may carry more weight than a general inflammatory fluctuation; repeated unexplained protein-pattern change may gain force when inherited susceptibility is present; weak host-response abnormalities may remain secondary unless reinforced by stronger primary oncology signals.

So the cancer application should use hierarchical weighting inside the blood layer itself.

This keeps the system from overreacting to nonspecific noise while still preserving early sensitivity.

7.5.5 Cancer-Oriented Drift in the Blood Layer

The blood layer should be read for oncology-oriented drift in the same way the general engine reads other domains.

The engine should ask: Is the signal leaving the person's ordinary corridor? Is it doing so repeatedly? Is the direction stable? Is recovery absent? Are related blood groups moving with it? Is there reinforcement from family risk, symptoms, or history?

That is the correct blood-surveillance question.

It is stronger than asking only: Is a cancer test positive right now?

The paper should therefore frame the blood layer as a pre-diagnostic signal field, not as a final diagnostic event.

7.5.6 Why Blood Alone Is Not Enough

The chapter must remain disciplined here.

Blood is central, but blood alone is not enough. Current early-detection literature repeatedly emphasises that even promising MCED and liquid-biopsy strategies still face limits around sensitivity, stage dependence, false positives, false negatives, and the need for downstream diagnostic pathways.

So the oncology blood layer gains real force only when it is joined to: inherited susceptibility, repeated persistence, symptom or recovery abnormalities, prior-event similarity, cross-layer convergence.

This keeps the chapter serious and avoids overclaim.

7.5.7 Why the Pinprick Vision Still Matters

Even if some current oncology assays are not yet realistic in every home-sampling format, the paper's architecture remains strong. The pinprick or low-burden blood loop is the user-experience target: repeated blood access through a personally managed timeline.

The paper should therefore distinguish between: architectural aim (user-owned repeated blood surveillance) and current assay reality (analyte-specific validation and platform maturity vary).

That distinction preserves the long-term design while staying grounded in real data-source constraints.

7.5.8 Section Conclusion

The blood-based oncology layer is the primary biological surveillance field for the cancer application. Its role is not to deliver one-step certainty, but to track repeated deviation in circulating cancer-relevant and host-response signals across time, weighted by susceptibility and reinforced by other layers. That makes it the central biological loop of a pre-event cancer surveillance engine rather than a standalone diagnostic claim.

7.6 Multi-Layer Reinforcement in Cancer Surveillance

The cancer application becomes materially stronger when the blood layer is not read in isolation. Oncology-oriented concern should rise when repeated blood-based deviation is reinforced by other layers in the observation field.

That is the logic of multi-layer reinforcement.

The system is not asking only whether a cancer-relevant blood signal moved. It is asking whether that movement is being supported by background susceptibility, symptom recurrence, recovery failure, prior-pattern similarity, or other compatible deviations across time.

Current early-detection literature supports this multi-signal direction. Reviews of MCED and liquid-biopsy approaches repeatedly describe cancer detection as a multi-analyte, multi-step problem rather than a single isolated marker problem.

7.6.1 Why Reinforcement Matters More in Cancer

Cancer-oriented early signals are often weak at first. A single deviation may remain ambiguous for several reasons: tumour-linked material may be sparse, host-response changes may be nonspecific, early symptoms may be soft or absent, one abnormal sample may not be enough to distinguish drift from disturbance.

That is why reinforcement is essential.

A blood-based deviation becomes more serious when it is joined by one or more of the following: inherited susceptibility or strong family-pattern burden, repeated recurrence of the same blood deviation, symptom recurrence that does not resolve cleanly, unusual fatigue or recovery failure, prior event similarity, repeated cross-window coherence rather than one-off abnormality.

This is the correct cancer-surveillance logic.

7.6.2 Reinforcement Structure in the Cancer Layer

A practical cancer-oriented reinforcement ladder is:

Level 1: isolated blood deviation – A cancer-relevant or host-response signal leaves the personal corridor, but remains weakly structured.

Level 2: persistent blood deviation – The same deviation recurs or fails to recover across windows.

Level 3: blood deviation plus background susceptibility – Inherited risk or strong family clustering raises the interpretive weight of the repeated deviation.

Level 4: blood deviation plus cross-layer support – Symptoms, recovery instability, or prior-history similarity begin to align with the blood pattern.

Level 5: reinforced oncological concern – The multi-layer pattern becomes coherent enough to justify escalation.

This ladder is stronger than asking for one decisive cancer signal at the outset.
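The five levels above can be sketched as cumulative stages. The predicate names are illustrative, and reading the ladder as strictly sequential is a simplifying assumption; a blood deviation outside the personal corridor is taken as already present, which is what makes Level 1 the floor.

```python
# Sketch of the 7.6.2 reinforcement ladder, read here as cumulative stages.

def reinforcement_level(persistent, susceptible, cross_layer, coherent):
    level = 1                          # Level 1: isolated blood deviation
    if persistent:
        level = 2                      # Level 2: persistent blood deviation
    if level >= 2 and susceptible:
        level = 3                      # Level 3: plus background susceptibility
    if level >= 2 and cross_layer:
        level = 4                      # Level 4: plus cross-layer support
    if level >= 4 and coherent:
        level = 5                      # Level 5: reinforced oncological concern
    return level
```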

7.6.3 Compatible Reinforcement

The engine should also require compatibility, not just co-occurrence.

For cancer surveillance, reinforcement is stronger when the layers support the same concern corridor. Examples include: repeated blood deviation plus family history of related cancer class, repeated blood deviation plus unexplained fatigue or weight change, persistent drift plus prior unresolved oncological concern, repeated blood abnormality plus failure to recover to personal baseline.

This compatibility rule matters because unrelated abnormalities should not automatically be treated as oncological reinforcement.

7.6.4 Soft Reinforcement and Hard Reinforcement

The cancer layer should distinguish between soft and hard reinforcement.

Soft reinforcement means: repeated blood deviation with one supporting layer; concern is rising, but ambiguity remains.

Hard reinforcement means: repeated blood deviation with multiple compatible supporting layers; concern is now structurally coherent.

This allows the model to remain watchful before it becomes assertive.

That is especially important in oncology, where premature overclaim would weaken the entire paper.

7.6.5 Symptom Silence Does Not Cancel Reinforcement

The paper should stay precise here.

Cancer-related concern can still strengthen even when symptoms remain minimal or absent. Early-stage or asymptomatic detection research exists precisely because symptom silence does not guarantee biological silence. Reviews of MCED approaches focus on asymptomatic populations for this reason.

So the reinforcement rule should be: symptoms can strengthen concern, symptom absence does not automatically remove concern, the engine must still read persistence, inherited risk, and blood-layer structure.

That keeps the oncology model serious.

7.6.6 Why This Strengthens the Chapter

Multi-layer reinforcement strengthens the cancer chapter because it avoids both weak extremes: blood-only overclaim and symptom-only delay.

Instead, it builds a lawful oncology surveillance structure: susceptibility → repeated blood drift → reinforcement → escalation.

That is the real application of the engine to cancer.

7.6.7 Section Conclusion

Multi-layer reinforcement is what turns oncology-oriented blood drift into a credible surveillance pattern. Repeated blood deviation gains force when it is supported by inherited susceptibility, compatible symptoms, failed recovery, or prior-pattern similarity across time. This allows the engine to detect cancer-related concern earlier than episodic recognition without pretending that one isolated signal is final proof.

7.7 Cancer-Specific Weighting and Thresholding

The general engine uses weighted signals and staged thresholds. The cancer application must tune those weights and thresholds to the realities of oncological surveillance.

This is necessary because oncology is not just another generic drift domain. Early cancer-related signals may be sparse, weak, intermittent, or highly unequal in specificity. A cancer application that weights every layer evenly will either overreact to nonspecific noise or miss repeated weak but meaningful biological departure.

The governing principle is: in cancer surveillance, weighting should favour biologically relevant and inherited-risk-relevant signals, while thresholding should preserve sensitivity to repeated weak drift without collapsing into noise.

7.7.1 Why Cancer Needs Different Weighting

In a general health engine, many layers may contribute comparably.

In a cancer-oriented engine, some layers usually deserve greater force: inherited susceptibility, repeated oncology-relevant blood deviation, prior oncological pattern similarity, persistent unexplained recovery failure.

Other layers remain important, but usually as reinforcement rather than primary drivers: soft symptom recurrence, general wearable instability, broad contextual disruption.

This is consistent with current cancer early-detection work, where multi-analyte blood signals and inherited-risk structure are treated as central, while broader symptoms and context support interpretation rather than replace molecular evidence.

7.7.2 Proposed Weight Hierarchy for the Cancer Layer

A practical hierarchy for the paper is:

Tier 1: primary oncological signals – repeated cancer-relevant blood-based deviation, validated cfDNA, methylation, fragmentomic, or related analyte signals where available, strong inherited-risk background, prior-history similarity to unresolved oncological concern.

Tier 2: reinforcing personal signals – repeated unexplained fatigue, repeated recovery failure, recurrent symptom clusters compatible with the drift corridor, consistent wearable instability linked to recovery loss.

Tier 3: contextual modifiers – stress load, sleep disruption, infection context, medication changes, environmental burden.

This hierarchy keeps the cancer model anchored in stronger biology while still preserving the wider person-centred architecture.

7.7.3 Why Thresholds Must Be Staged

Cancer-oriented surveillance should not rely on one rigid threshold.

If the threshold is too high, early weak drift will be ignored. If it is too low, the user will be flooded with unstable concern.

So the cancer layer needs staged thresholds: watch threshold, reinforcement threshold, escalation threshold.

This is aligned with the current reality of MCED and liquid-biopsy work, where one positive signal is rarely the whole story and downstream confirmation pathways remain necessary.

7.7.4 Lower Tolerance for Repeated Weak Drift in High-Risk Persons

One of the main advantages of the cancer application is that thresholds can change with susceptibility.

For a person with strong inherited risk or family-pattern burden, the system can justify: earlier reinforcement checks, lower tolerance for repeated weak oncology-oriented blood drift, faster transition from watch state to targeted review.

For a person without such background risk, the same weak deviation may remain under observation longer before escalation.

This does not mean two standards of truth. It means two standards of vigilance, which is exactly what a susceptibility layer is for.

7.7.5 Persistence Thresholds May Matter More Than Magnitude Alone

In cancer surveillance, weak persistent drift may be more important than one isolated larger anomaly.

That is because early oncological signals may remain small before they become repeated. Current early-detection literature repeatedly emphasises that signal scarcity in asymptomatic or low-burden disease is a real challenge.

So the cancer layer should weight persistence, repeatability, failure to recover, and cross-layer reinforcement at least as seriously as one-off absolute magnitude.

This makes the cancer model stronger than a simple "positive threshold" framework.

7.7.6 Adaptive Thresholding

The paper should also allow adaptive thresholding.

Cancer-specific thresholds may be adjusted by: inherited-risk level, prior false-alarm history, analyte confidence, signal quality, recurrence interval, age-linked risk corridor, application domain.

This is important because the same threshold may be inappropriate for: a high-risk hereditary-cancer surveillance pathway, a broader lower-risk general surveillance pathway, a previously escalated person under continued monitoring.

So thresholding must be principled, but not rigid.
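A principled-but-not-rigid threshold can be sketched as a base value plus modifiers. The base reuses the worked-example threshold; every modifier name and offset below is an illustrative assumption, not a value specified in the text.

```python
# Sketch of adaptive thresholding (7.7.6): base threshold plus adjustments.

BASE_THETA = 0.15

def adaptive_threshold(inherited_risk="low", prior_false_alarms=0,
                       analyte_confidence=1.0):
    theta = BASE_THETA
    if inherited_risk == "high":
        theta -= 0.03  # lower tolerance for repeated weak drift (section 7.7.4)
    theta += 0.01 * min(prior_false_alarms, 3)  # dampen after prior false alarms
    theta += 0.05 * (1.0 - analyte_confidence)  # demand more from noisier assays
    return theta
```

The direction of each adjustment follows the text: susceptibility lowers the effective boundary, while false-alarm history and weak analyte confidence raise it.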

7.7.7 Escalation Should Require Coherent Strength, Not Panic

The cancer layer should remain disciplined.

Escalation should occur when: repeated oncological drift is present, persistence is established, reinforcement is compatible, competing explanations are sufficiently weaker.

In other words, the cancer application should keep the general engine's commit discipline.

That is what protects the paper from overstatement.

7.7.8 Section Conclusion

Cancer-specific weighting and thresholding allow the general engine to become oncology-sensitive without becoming unstable. The application should weight inherited susceptibility and repeated biologically relevant blood deviation more heavily than broad nonspecific signals, while using staged and adaptive thresholds to detect repeated weak concern earlier in the persons for whom it matters most.

7.8 Escalation Pathways for Oncology-Oriented Review

The cancer application layer must end in a structured escalation pathway, not a vague warning. If the engine identifies reinforced oncological concern, the next step cannot simply be "alarm." It must be a lawful transition from surveillance to targeted review.

The governing principle is: oncology-oriented escalation should move from repeated concern to structured follow-up, not from weak signal to premature certainty.

That position is consistent with current MCED and liquid-biopsy literature, which treats positive or suspicious early-detection signals as requiring downstream localisation, confirmatory workup, and clinical interpretation rather than being final proof by themselves.

7.8.1 Why Escalation Must Be Structured

A prognostic cancer engine is not useful if it only says that something may be wrong. It becomes useful when it can distinguish: continued watch, intensified repeat sampling, targeted external review, formal onward diagnostic pathway.

This matters because oncology-related concern exists on a ladder, not in one instant binary.

A repeated weak blood drift in a susceptible person may justify targeted repeat testing. A strongly reinforced multi-layer oncological pattern may justify urgent specialist review.

The escalation pathway must preserve that difference.

7.8.2 Escalation Stages

A practical oncology-oriented escalation ladder is:

Stage E1: Watch Intensification – The engine identifies repeated weak concern and shortens the next surveillance interval.

Stage E2: Targeted Repeat Sampling – The engine requests confirmatory or more focused repeat sampling within a tighter window.

Stage E3: Structured Oncology Review Recommendation – The engine determines that passive surveillance is no longer sufficient and recommends formal external review.

Stage E4: Clinical Review / Diagnostic Workup – The case exits ordinary monitoring and enters the confirmatory pathway.

This staged structure is stronger than a single generic "high risk" output.
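The ladder can be sketched as an ordered transition. The stage names follow the text; collapsing the trigger into a single joint predicate is an illustrative reading of the four conditions discussed in the next subsection.

```python
# Sketch of the E1-E4 escalation ladder (7.8.2) as an ordered transition.

STAGES = ["E1: watch intensification",
          "E2: targeted repeat sampling",
          "E3: structured oncology review recommendation",
          "E4: clinical review / diagnostic workup"]

def next_stage(current, conditions_met):
    # Advance one rung only while the joint conditions hold; never skip rungs.
    i = STAGES.index(current)
    if conditions_met and i < len(STAGES) - 1:
        return STAGES[i + 1]
    return current
```

Rung-by-rung advancement is the design choice that separates this from a single generic "high risk" output: weak concern can intensify watch without immediately handing the case to a clinical pathway.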

7.8.3 What Triggers Oncology Escalation

The cancer application should escalate when four conditions become jointly strong enough: repeated blood-based oncological deviation, sufficient persistence, compatible reinforcement across other layers, adequate separation from weaker competing explanations.

So the escalation rule remains continuous with the general engine.

The oncology layer does not invent a different logic. It tunes the same logic for a higher-risk application domain.

7.8.4 Repeat Sampling Before Full Escalation

One of the strongest features of the cancer application is that it does not need to jump from weak concern directly to full external handover.

A lawful intermediate step is targeted repeat sampling.

That is important because current early-detection science recognises that many blood-based oncology signals require confirmation, especially in asymptomatic settings where signal burden may be low and false positives or uncertain findings remain possible.

So the pathway can legitimately include: shorter repeat interval, same panel repeated for persistence confirmation, strengthened panel with more specific analytes where available, continued cross-layer tracking while awaiting repeat evidence.

This makes the model disciplined rather than alarmist.

7.8.5 Output to the User During Oncology Escalation

The user-facing output should remain clear and restrained.

Suitable output forms include: "repeated cancer-related drift is being monitored more closely"; "the current pattern warrants repeat targeted sampling"; "the recent pattern has strengthened and external oncology-oriented review is recommended"; "the system is preserving your recent signal timeline for structured review".

This is stronger than panic language and more useful than vague caution.

7.8.6 Review Export for Oncology Pathways

Where oncology-oriented escalation occurs, the system should export a structured summary rather than raw signal fragments.

That review packet should include: recent state trajectory, cancer-layer drift summary, persistence summary, reinforcement summary, inherited-risk background if relevant, recent recovery or non-recovery pattern, major contextual notes, timeline of repeated abnormal windows.

This matters because the value of the engine lies partly in temporal organisation. It can show not just that concern exists, but how it formed.

7.8.7 Escalation Is Still Not Final Proof

The paper should preserve this boundary explicitly.

Even at the escalation stage, the engine is not claiming pathological confirmation. It is claiming that the monitored pattern has become sufficiently coherent that continued personal surveillance alone is no longer the correct endpoint.

That is exactly the right role for an oncology-oriented prognostic system and is aligned with current early-detection pathways, which still require downstream diagnostic confirmation after positive blood-based or multi-analyte concern signals.

7.8.8 Section Conclusion

Escalation pathways for oncology-oriented review should be staged, structured, and restrained. The system should move from watch intensification to targeted repeat sampling, then to formal review only when repeated oncological concern becomes persistent, reinforced, and actionably coherent. This preserves the engine's early-warning strength without overstating its role.

7.9 Limits of the Cancer Application Layer

The cancer application layer is useful only if its limits are stated clearly. A serious prognostic system must define not only what it may detect earlier, but also what it cannot honestly claim.

The governing principle is: the engine can identify structured oncological concern earlier than episodic recognition in some cases, but it does not eliminate biological uncertainty, assay limits, or the need for downstream confirmation.

Current early-detection literature is explicit on this point. Reviews of MCED and liquid-biopsy approaches continue to emphasize challenges around sensitivity in low-burden disease, false positives, false negatives, localization of signal origin, and proof of clinical utility in population screening.

7.9.1 The Engine Does Not Prove Cancer at Birth

The first limit is conceptual.

The paper should not claim that all cancers are directly detectable at birth. What can be birth-anchored is susceptibility mapping in the subset of persons with inherited predisposition, together with the establishment of a lifelong surveillance architecture. NCI states that inherited pathogenic variants account for only a minority of cancers, about 5% to 10% overall.

So the correct claim is: birth may initialise risk; birth does not universally reveal future cancer directly.

That distinction must remain explicit.

7.9.2 Early Blood Signals May Be Sparse

The second limit is technical.

Very early cancer-related blood signals may be extremely weak, intermittent, or absent at certain stages. Reviews of liquid biopsy and MCED repeatedly note that low tumor burden can limit detectability, especially in asymptomatic settings.

That means the engine may sometimes face: weak signal without persistence, weak signal without reinforcement, no usable blood signal despite real underlying process, delayed detectability relative to biological onset.

So the system must be designed for repeated surveillance, not one-step certainty.

7.9.3 Nonspecific Signals Can Mislead

The third limit is specificity.

Some supportive layers used in cancer surveillance, such as inflammation, fatigue, recovery failure, or general physiological instability, are not unique to cancer. They may reflect infection, stress, autoimmune activity, metabolic instability, or other non-oncological processes.

That means: supportive layers can reinforce concern; supportive layers cannot by themselves establish oncological origin.

This is why weighting and compatibility logic are essential in the cancer application.

7.9.4 Signal Origin May Remain Unclear

The fourth limit is localisation.

Even where a cancer-oriented signal is detected, the source of that signal may not be immediately localisable. MCED literature repeatedly identifies localisation of the tissue or organ of origin as a major downstream challenge after blood-based signal detection.

So the engine can reasonably claim: structured concern is rising.

It cannot always immediately claim: this is the precise site.

That is an important operational boundary.

7.9.5 Clinical Utility Still Requires Validation

The fifth limit is evidential.

A surveillance architecture may be conceptually strong and technically plausible, but any claim about real-world cancer outcomes still requires validation against retrospective and prospective datasets. Reviews of MCED and blood-based screening continue to stress that detection performance alone is not enough; impact on downstream care and outcomes must also be shown.

So the paper should distinguish between: architectural validity, analytical validity, clinical validity, clinical utility.

That makes the cancer chapter stronger, not weaker.

7.9.6 Home-Sampling Vision Does Not Mean Every Assay Is Ready

The sixth limit is deployment maturity.

The user-centred vision of repeated pinprick or low-burden blood surveillance is a valid architectural direction. But analyte-specific readiness varies. Not every oncology-relevant assay is currently practical, validated, or robust in every home-sampling format.

So the paper should maintain the distinction between: long-term user experience architecture, and present assay maturity and validation status.

This keeps the system forward-looking without becoming careless.

7.9.7 The Engine Is an Early-Warning Layer, Not the Entire Oncology Pathway

The final limit is role definition.

The engine is not the entire cancer pathway. It is the pre-diagnostic surveillance layer. Its role is to identify when repeated personal data have become sufficiently concerning that continued passive monitoring is no longer enough.

That means the engine is strongest when it does three things well: detects repeated weak concern earlier, preserves the temporal structure of that concern, supports earlier and better-founded escalation.

It does not need to replace imaging, pathology, or downstream oncology review in order to be valuable.

7.9.8 Section Conclusion

The cancer application layer is strongest when framed as a disciplined early-warning architecture. It can support earlier detection of structured oncological concern through susceptibility-aware, repeated, multi-layer surveillance, but it does not prove universal birth detection, does not eliminate sparse-signal problems, and does not remove the need for downstream confirmation and validation. Those limits are not weaknesses of the chapter. They are the conditions that make it serious.

7.10 Chapter Conclusion

This chapter has shown how the general prognostic engine can be specialised for cancer. Cancer fits the model because it is often better understood as a formation path than as a single late recognition event. The correct starting point is birth-anchored susceptibility and family risk, not universal direct detection at birth. The primary biological loop is the blood layer, but that layer gains real strength only when it is interpreted longitudinally and reinforced by other compatible layers. Cancer-specific weighting and thresholding then allow repeated weak concern to be treated seriously without collapsing into noise, and escalation proceeds through structured review rather than one-step certainty.

The central conclusion is: the cancer application layer is best framed as a susceptibility-aware, blood-anchored, multi-layer surveillance system for earlier detection of structured concern, not as an overclaimed universal diagnostic event.

That completes the oncology application chapter.

Chapter 8. Multi-Disease Expansion

The cancer chapter established one high-risk application of the prognostic engine. This chapter shows that the architecture is not limited to oncology. Its stronger value lies in its generality.

The same engine can be extended across multiple disease classes because its core object is not a named disease at the outset. Its core object is structured biological drift across time: deviation from personal baseline, persistence, convergence, recovery failure, and escalation when the pattern becomes actionably coherent. This is consistent with the broader direction of precision-health research, which increasingly uses repeated multimodal data to detect risk and deterioration earlier across several disease domains rather than treating each condition as an isolated screening problem.

8.1 Purpose

This chapter establishes how the same prognostic engine can be applied beyond cancer.

It defines: why the architecture generalises across disease classes, which domains are especially compatible with personal drift surveillance, how the same signal field can be reweighted by disease application, why a unified engine is stronger than building isolated disease-specific tools from scratch.

8.1.1 Chapter Function

The function of this chapter is to show that the engine is best understood as a general prognostic infrastructure with disease-specific application layers rather than as a one-condition device.

That is supported by current trends in digital health and precision health, where repeated biomarker, wearable, EHR, and patient-generated data are being used across multiple disease areas, including metabolic disease, cardiovascular monitoring, neurodegeneration, and chronic inflammatory conditions.

8.2 Why the Engine Generalises

The engine generalises because it is built around process, not around one disease label. Its core logic does not ask first, "Which disease is this?" It asks, "Is the person leaving their own stable corridor in a way that is repeated, reinforced, and failing to recover?"

That question applies across many conditions.

This is why the architecture is transferable. It is based on general features of deterioration that recur in multiple domains: deviation from personal baseline, persistence across time, convergence across signal layers, recovery failure, transition from weak concern to actionable concern.

These are not cancer-specific. They are structural features of biological destabilisation more broadly.

Current precision-health and digital-health research supports exactly this broader direction. Longitudinal multimodal monitoring is increasingly being used across several domains, including metabolic disease, cardiovascular risk, neurodegeneration, and chronic inflammatory conditions, rather than being treated as a one-condition paradigm only.

8.2.1 The Engine Models Drift, Not Disease Names

The strongest reason the model generalises is that it does not require the disease identity to be known at the earliest stage.

Instead, it identifies: that drift is forming, how strong that drift is, whether it is persisting, whether it is spreading across layers, whether it is recovering or worsening.

Only later does the application layer ask which disease corridor is most compatible with the observed pattern.

That sequence is stronger than building a separate fixed model for every possible condition from the beginning.
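The disease-agnostic questions above can be made concrete with a small sketch that scores recent samples against a personal baseline. This is an illustrative assumption-laden example: the z-score corridor, the two-window persistence rule, and the thresholds are demonstration choices, not the engine's specification.

```python
from statistics import mean, stdev

def drift_report(series: list[float], baseline: list[float],
                 z_limit: float = 2.0) -> dict:
    """Describe drift of recent samples relative to a personal baseline corridor."""
    mu, sigma = mean(baseline), stdev(baseline)
    z = [(x - mu) / sigma for x in series]          # deviation in baseline units
    out_of_corridor = [abs(v) > z_limit for v in z]
    return {
        "forming": any(out_of_corridor),            # is drift forming at all?
        "strength": max(abs(v) for v in z),         # how strong is it?
        "persisting": sum(out_of_corridor) >= 2,    # repeated, not a one-off spike
        "worsening": abs(z[-1]) > abs(z[0]),        # recovering or worsening?
    }
```

Note that nothing in this report names a disease: the application layer would only later ask which disease corridor is most compatible with the observed pattern.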

8.2.2 Many Diseases Share the Same Early Logic

Different disease classes may have different biological mechanisms, but many share similar early monitoring logic.

A condition may begin with: small deviation, repeated deviation, incomplete recovery, rising contextual sensitivity, cross-layer reinforcement, eventual escalation.

That pattern appears in many domains, even where the specific analytes and thresholds differ.

For example: metabolic disease may show repeated regulatory drift before overt breakdown; inflammatory disease may show recurrent activation and failed resolution; cardiovascular instability may show altered recovery and physiological stress patterning; neurodegenerative decline may show subtle functional and behavioural drift before formal recognition.

The pathways differ. The surveillance logic remains similar.

8.2.3 A Shared Engine Is Stronger Than Separate Isolated Tools

Building a unified prognostic engine has several advantages over building many isolated disease-specific tools.

First, it preserves the person as one continuous system rather than splitting them into disconnected diagnostic silos.

Second, it allows the same personal baseline and longitudinal memory to serve multiple application layers.

Third, it lets one disease corridor reinforce another when appropriate. For example, metabolic instability, inflammatory load, and recovery failure may all matter across more than one downstream condition.

Fourth, it reduces duplication in data capture, signal handling, and user experience design.

That is one of the strongest architectural features of the paper.

8.2.4 The Shared Observation Field Makes Generalisation Possible

The engine generalises because its observation field is already broad enough to support multiple domains.

It includes: blood-based biological signals, wearable and physiological signals, symptom and self-report signals, behavioural and exposure context, health history and prior event memory, family risk and inherited background.

This field does not belong only to oncology. It can be reweighted for other domains without needing to rebuild the entire system.

That is the key to expansion.

8.2.5 What Changes Across Diseases

The engine generalises at the architectural level, but not by pretending all diseases are the same.

What changes across application layers includes: which blood signals matter most, which wearable features carry the strongest force, how much inherited background matters, what persistence looks like, how quickly escalation should occur, what downstream review pathway is appropriate.

So the engine is shared, but the weighting, thresholds, and escalation pathways are specialised.

That is the correct balance between generality and specificity.
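This shared-engine-with-specialised-layers idea can be expressed as configuration over one scoring function. The weights, window counts, and labels below are purely illustrative assumptions used to show the shape of the design, not recommended values.

```python
# Each application layer reweights the same shared signal field (values illustrative).
APPLICATION_LAYERS = {
    "oncology": {
        "blood_weight": 0.6, "wearable_weight": 0.2, "inherited_weight": 0.2,
        "persistence_windows": 3,        # what persistence looks like here
        "review_pathway": "oncology-oriented external review",
    },
    "metabolic": {
        "blood_weight": 0.4, "wearable_weight": 0.4, "inherited_weight": 0.2,
        "persistence_windows": 5,
        "review_pathway": "metabolic or endocrine evaluation",
    },
}

def weighted_concern(layer: str, blood: float, wearable: float,
                     inherited: float) -> float:
    """Combine layer scores under one application layer's weighting."""
    cfg = APPLICATION_LAYERS[layer]
    return (cfg["blood_weight"] * blood
            + cfg["wearable_weight"] * wearable
            + cfg["inherited_weight"] * inherited)
```

The engine code is shared; only the configuration table grows as new disease applications are added, which is the architectural advantage the section describes.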

8.2.6 Generalisation Preserves User Simplicity

A unified engine also improves the user experience.

The user does not need a separate monitoring architecture for every possible disease class. The same core loop remains in place: sample, track, compare, watch for drift, escalate when needed.

What changes is the interpretation layer behind the system, not the basic user burden.

That is a major practical strength.

8.2.7 Section Conclusion

The engine generalises because it models the structure of deterioration rather than beginning with a fixed disease label. Deviation, persistence, convergence, recovery failure, and escalation are common surveillance features across multiple domains, even when the biological mechanisms differ. This makes the system a general prognostic infrastructure with disease-specific application layers, not a one-condition tool.

8.3 Metabolic Disease and Regulatory Drift

Metabolic disease is one of the clearest application domains for the prognostic engine because it is inherently regulatory and longitudinal. It rarely appears as a single sudden event. It more often develops through repeated instability in glucose handling, energy balance, inflammatory load, recovery, and behavioural rhythm before overt breakdown is formally declared.

That makes metabolic disease especially compatible with a drift architecture.

Current literature supports this direction. Reviews of continuous glucose monitoring in prediabetes and broader non-diabetic use describe CGM as increasingly relevant for early individualized monitoring, behavioural feedback, and progression-risk assessment before overt diabetes develops.

8.3.1 Why Metabolic Disease Fits the Engine

Metabolic deterioration often unfolds as: mild regulatory instability, repeated glycaemic deviation, reduced recovery after food or stress, behavioural amplification, persistence across time, eventual progression into overt disease state.

This sequence aligns directly with the core logic of the engine: deviation from personal corridor, persistence across windows, convergence with other layers, escalation when regulation failure becomes structurally credible.

So metabolic disease does not need a separate conceptual model. It needs application-specific weighting inside the same engine.

8.3.2 Glucose Regulation as a Drift Field

Glucose is one of the strongest examples of why repeated measurement matters more than one-off thresholding.

A person may show meaningful instability in post-meal response, time spent outside personal range, recovery speed back toward baseline, variability across days, and sensitivity to sleep, stress, or behaviour, all before a traditional diagnostic threshold is crossed.

That is why CGM is so relevant to this chapter. Recent reviews argue that CGM has growing potential in prediabetes and early dysglycaemia because it can support earlier individualized intervention and better observation of progression-risk dynamics.
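The personal glucose features listed above can be computed directly from a CGM-style series. This is a hypothetical sketch: the feature names, the personal range, and the sample-count recovery measure are assumptions for illustration, not clinical definitions.

```python
def glucose_drift_features(readings: list[float], personal_low: float,
                           personal_high: float) -> dict:
    """Summarise instability relative to a personal range, not a population cutoff."""
    n = len(readings)
    outside = [r for r in readings if not (personal_low <= r <= personal_high)]
    peak = max(readings)
    # recovery speed: samples needed to return inside range after the peak
    peak_idx = readings.index(peak)
    recovery = next((i for i, r in enumerate(readings[peak_idx:])
                     if personal_low <= r <= personal_high), None)
    return {
        "time_outside_range": len(outside) / n,   # fraction of samples out of range
        "peak_response": peak,
        "recovery_samples": recovery,             # None = no return to range observed
        "day_variability": max(readings) - min(readings),
    }
```

Tracked repeatedly, features like these can drift (slower recovery, higher peaks, more time outside range) while every individual reading still sits below a traditional diagnostic threshold.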

8.3.3 Behaviour and Metabolic Drift

Metabolic disease also fits the engine because behaviour and biology are tightly coupled in this domain.

Relevant reinforcing layers include: meal timing instability, sleep disruption, reduced activity or abnormal exertion response, stress load, repeated poor recovery, weight or appetite change.

This is exactly the kind of cross-layer architecture the engine was designed to handle. A glucose shift alone may remain weak. A glucose shift that recurs with poor sleep, stress sensitivity, and slow recovery becomes a much stronger drift pattern.

8.3.4 Why Personal Baseline Matters in Metabolism

Population thresholds are useful in metabolic screening, but they are often too coarse to capture early loss of personal regulation.

A person may still fall inside broad accepted ranges while already showing: worsening variability, slower return to baseline, increasing peak response, rising sensitivity to ordinary disruption.

That makes the metabolic application a particularly strong example of the paper's baseline-first rule: compare the person to themselves first, then to the population.

8.3.5 Metabolic Weighting Inside the Engine

For the metabolic application, the engine would typically weight more heavily: repeated glucose-related biological deviation, behavioural rhythm and meal-linked context, sleep and wearable recovery pattern, prior regulatory instability, persistence of poor return to baseline.

Inherited background may still matter, but often as a supporting modifier rather than the central axis unless there is strong family-pattern burden.

This is different from the cancer layer, where inherited susceptibility and oncology-oriented blood signals may carry heavier force.

8.3.6 Escalation in the Metabolic Layer

The escalation pathway in metabolic surveillance can also be staged.

It may move from: routine watch, intensified behavioural and glucose tracking, targeted repeat monitoring, structured review of recurring dysregulation, onward metabolic or endocrine evaluation.

That is one of the strengths of the shared architecture: the logic is the same, but the downstream pathway is domain-specific.

8.3.7 Section Conclusion

Metabolic disease is an especially strong fit for the prognostic engine because it is fundamentally a longitudinal regulation problem. Repeated glucose instability, failed recovery, behavioural amplification, and persistence across time can all be captured inside the same deviation–persistence–convergence framework. This makes the metabolic layer one of the clearest demonstrations that the engine generalises beyond oncology.

8.4 Inflammatory and Autoimmune Instability

Inflammatory and autoimmune disease are also strong fits for the prognostic engine because these conditions often unfold through recurrence, fluctuation, incomplete resolution, and flare-like instability rather than through one clean linear event. That makes them especially compatible with a model built around persistence, convergence, and recovery failure.

Current literature supports this direction. Reviews and recent studies describe wearable and remote-monitoring approaches for immune-mediated disease as increasingly able to capture flare-related physiological change, fatigue, inflammatory activity, and repeated instability over time rather than only episodic clinic snapshots. Wearable-derived signals have been reported to predict inflammatory bowel disease flares weeks in advance, and rheumatoid arthritis studies have found physiological changes preceding symptomatic and inflammatory flares.

8.4.1 Why Inflammatory Disease Fits the Engine

Inflammatory and autoimmune disease often follow a pattern such as: low-level instability, repeated inflammatory activation, partial suppression or apparent recovery, recurrence under stress or exposure, incomplete return to baseline, escalation into clinically obvious flare or sustained deterioration.

This is almost a direct expression of the general engine logic: deviation from personal corridor, persistence across windows, recovery failure, cross-layer reinforcement, state transition when the pattern becomes actionably coherent.

That makes inflammatory disease one of the clearest non-oncology use cases for a drift engine.

8.4.2 Flares Are a Drift Problem

A flare is not always a sudden event in the strict sense. In many cases it is the visible phase of a process that has been building through physiological instability, altered recovery, subtle symptoms, or rising inflammatory burden.

That is why flare prediction fits this architecture so well.

The engine can watch for: repeated soft inflammatory biomarker deviation, worsening wearable recovery pattern, symptom recurrence, context-linked destabilisation, repeated failed return to baseline.

This is aligned with current wearable and remote-monitoring work in inflammatory bowel disease and rheumatology, where physiological digital biomarkers and longitudinal monitoring are being used to predict or track flare activity rather than only document it after the fact.

8.4.3 Recovery Failure Is Especially Important Here

Recovery failure matters in every disease domain, but it is especially important in inflammatory and autoimmune pathways.

A person may show: repeated flare-like abnormality, partial improvement, reactivation before full recovery, shortening interval between episodes, rising burden from previously manageable triggers.

That is exactly the kind of pattern the engine is designed to detect.

A model based only on one-off measurement may miss this. A longitudinal drift model can preserve: recurrence interval, duration of instability, recovery slope, relapse after apparent improvement.

That is a major advantage in immune-mediated disease.
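One of those preserved temporal features, the shortening interval between episodes, is easy to illustrate. The episode representation and the strictly-decreasing trend test below are demonstration assumptions; a real implementation would need a noise-tolerant trend measure.

```python
from datetime import date

def intervals_shortening(episode_starts: list[date]) -> bool:
    """True if the gaps between successive flare-like episodes are shrinking."""
    gaps = [(b - a).days for a, b in zip(episode_starts, episode_starts[1:])]
    if len(gaps) < 2:
        return False                      # need at least three episodes to see a trend
    return all(later < earlier for earlier, later in zip(gaps, gaps[1:]))
```

A one-off measurement model has no access to this quantity at all; it exists only because the longitudinal record preserves when each episode began.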

8.4.4 Useful Signal Layers in Inflammatory Disease

For this application, the strongest layers often include: blood-based inflammatory or immune-activity signals, symptom recurrence and fatigue pattern, wearable-derived recovery and physiological instability, exposure and trigger context, prior flare history, treatment-response memory.

Recent reviews also support the relevance of noninvasive wearable biosensors and digital biomarkers for inflammation monitoring, including sweat-based or physiologic measures and remote monitoring in IBD and rheumatologic disease.

8.4.5 Wearables Are Particularly Strong in This Domain

The inflammatory/autoimmune domain is one of the places where wearables may become especially powerful.

Recent reports describe: IBD flares predicted from wearable physiological metrics weeks in advance, longitudinal physiological changes associated with rheumatoid arthritis flares, wearable and remote-monitoring models for immune-mediated inflammatory disease activity.

These findings fit the paper's emphasis on repeated, low-burden, personal monitoring. They show that regulation loss, physiological stress, and flare formation may be detectable as a temporal pattern before or alongside overt clinical escalation.

8.4.6 Weighting in the Inflammatory Layer

Compared with the cancer application, inherited susceptibility may matter differently here, and blood specificity may be lower. So the inflammatory layer often needs stronger weighting on: recurrence pattern, recovery failure, symptom clustering, wearable instability, context-trigger linkage.

This is a good example of why the engine generalises by weighting, not by pretending every disease class uses the same priorities.

8.4.7 Section Conclusion

Inflammatory and autoimmune disease fit the prognostic engine because they are fundamentally recurrence-and-resolution problems. Repeated activation, failed recovery, flare-like patterning, and cross-layer reinforcement can all be tracked within the same deviation–persistence–convergence architecture. That makes this domain one of the clearest demonstrations that the engine can function as a general infrastructure for chronic instability rather than a single-disease detector.

8.5 Cardiovascular and Recovery-State Monitoring

Cardiovascular disease is another strong fit for the prognostic engine because much cardiovascular deterioration is not best understood only as a single catastrophic event. It is often preceded by repeated physiological instability, autonomic imbalance, altered recovery, rhythm disturbance, or progressive loss of resilience before overt clinical recognition. That makes cardiovascular monitoring especially compatible with a drift architecture built around repeated deviation, persistence, and recovery failure.

Current literature supports this direction. Recent reviews on wearable cardiovascular monitoring describe continuous collection of heart rate, rhythm, sleep, activity, and related physiologic measures outside traditional clinical settings, while work on heart-rate variability continues to position autonomic instability as a potentially useful prognostic marker in cardiovascular disease, albeit with methodological limits.

8.5.1 Why Cardiovascular Disease Fits the Engine

Cardiovascular deterioration often unfolds through patterns such as: reduced autonomic stability, altered recovery after ordinary exertion, repeated rhythm irregularity, progressive loss of exercise tolerance, sleep-linked physiological disturbance, escalating burden under stress or load.

This is structurally compatible with the same core engine logic: deviation from personal corridor, persistence across windows, convergence across wearable, symptom, and blood layers, failed recovery, escalation when the pattern becomes actionably coherent.

So cardiovascular monitoring fits the engine not because it mirrors cancer, but because it is also a temporal deterioration problem.

8.5.2 Recovery-State Monitoring Is Central Here

Recovery is especially important in the cardiovascular layer.

A person may appear acceptable at rest and yet show abnormality in: return to baseline heart rate after activity, sustained low heart-rate variability, worsening exertional tolerance, disrupted sleep-related physiological regulation, repeated autonomic stress without full recovery.

That makes recovery-state monitoring one of the strongest cardiovascular uses of the engine. It allows the system to detect not only overt abnormality, but the gradual loss of regulatory resilience that may precede clearer clinical endpoints. Recent reviews on wearable cardiovascular sensing and HRV both support the importance of these kinds of physiological signals, while also noting the need for robust validation and standardization.
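Post-exertion heart-rate recovery is a concrete example of such a measure. The sketch below uses the common convention of the heart-rate drop one minute after exercise ends; the repeated-below-personal-norm flag and its tolerance are illustrative assumptions, not clinical guidance.

```python
def hr_recovery_drop(hr_series_bpm: list[int], samples_per_min: int) -> int:
    """Drop in heart rate from end of exertion to one minute later."""
    end_hr = hr_series_bpm[0]                    # first sample = end of exertion
    one_min_hr = hr_series_bpm[samples_per_min]  # sample one minute later
    return end_hr - one_min_hr

def recovery_flagged(drops: list[int], personal_typical: int,
                     tolerance: int = 5) -> bool:
    """Flag only repeated recovery drops below the personal norm, not one-offs."""
    below = [d for d in drops if d < personal_typical - tolerance]
    return len(below) >= 2
```

The comparison is against the person's own typical drop, consistent with the paper's baseline-first rule: a drop of 20 bpm may be unremarkable for one person and a meaningful loss of resilience for another.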

8.5.3 Wearables Are Particularly Relevant in Cardiovascular Monitoring

The cardiovascular domain is one of the clearest examples of why passive, repeated wearable data matter.

Useful layers include: resting and active heart-rate pattern, rhythm irregularity detection, heart-rate variability, sleep-related recovery pattern, movement and exertion-response pattern, symptom-linked temporal alignment.

A 2026 European Heart Journal review describes wearable devices as transforming cardiovascular medicine through continuous monitoring of physiologic and behavioural measures outside traditional settings.

That is closely aligned with the engine proposed in this paper: a user-held continuity architecture rather than an appointment-only model.

8.5.4 Remote Monitoring Already Points in This Direction

Remote cardiac rehabilitation and wearable-guided monitoring provide a practical example of this logic. Recent JMIR work reported improved exercise capacity and physical activity in coronary artery disease patients using wearable devices and real-time remote monitoring in cardiac rehabilitation settings.

That is not the same as pre-event disease detection, but it supports the broader claim that continuous personal monitoring is already clinically meaningful in cardiovascular care and can support earlier, more individualized interpretation of physiological state.

8.5.5 Weighting in the Cardiovascular Layer

Compared with oncology, the cardiovascular layer would usually weight more heavily: wearable physiological continuity, autonomic and recovery measures, rhythm instability, exertion-response pattern, prior cardiovascular event history.

Blood-based signals may still matter, but often as one part of a larger recovery-state and physiological-stability architecture rather than the sole central axis.

8.5.6 Section Conclusion

Cardiovascular disease fits the prognostic engine because deterioration can often be read as progressive instability in rhythm, autonomic balance, exertional recovery, and physiological resilience before overt clinical events. Repeated wearable monitoring, recovery-state tracking, and cross-layer reinforcement make this domain a strong example of how the engine can extend beyond oncology and metabolism into broader physiological risk surveillance.

8.6 Neurodegenerative and Functional Decline Pathways

Neurodegenerative disease is also well suited to the prognostic engine because decline in this domain is often gradual, distributed, and partially visible before formal diagnosis or obvious functional breakdown. The signal may first appear as subtle change in movement, speech, sleep, cognition, routine behaviour, or recovery rather than as one dramatic event. That makes neurodegeneration especially compatible with a longitudinal, multimodal, personal-baseline architecture.

Recent literature supports this direction. Reviews describe digital biomarkers for neurodegenerative disease as promising for longitudinal monitoring, early detection, and continuous remote assessment using wearables, smartphones, speech, gait, and passive sensing. Parkinson's disease studies and reviews likewise report that digital measures can provide quantitative longitudinal data on motor and non-motor symptoms.

8.6.1 Why Neurodegeneration Fits the Engine

Neurodegenerative and related functional-decline pathways often unfold through: subtle motor drift, sleep and circadian disturbance, speech or communication change, cognitive or behavioural slowing, repeated recovery failure, progressive loss of stability in ordinary daily function.

This maps directly onto the engine: deviation from personal corridor, persistence across time, reinforcement across layers, failed recovery or worsening interval, escalation when the pattern becomes actionably coherent.

That is why this domain belongs naturally inside the model.

8.6.2 Functional Change Often Appears Before Formal Recognition

One of the strongest reasons this domain fits the engine is that early change may be visible in function before it is formally named in clinic. Passive and digital biomarker work in Alzheimer's disease and broader brain-health monitoring increasingly focuses on continuous, low-burden assessment of cognition, behaviour, speech, sleep, movement, and affect because these may reveal meaningful early variability before conventional late recognition.

That makes the paper's baseline-first logic especially relevant here: the question is not only whether a person crosses a population threshold, but whether they are beginning to function differently from themselves.

8.6.3 Useful Signal Layers in Neurodegenerative Monitoring

For this application, the strongest layers may include: gait and movement pattern, tremor, bradykinesia, or motor irregularity where relevant, speech and language pattern, sleep and circadian disruption, passive phone or device interaction pattern, self-reported cognition, fatigue, or functional difficulty, prior decline pattern or family background where relevant.

Recent work in Parkinson's and Alzheimer's disease supports this broad multimodal direction, including wearables for motor monitoring, passive multimodal brain-health data, and digital biomarkers from mobile and wearable devices.

8.6.4 Wearables and Passive Monitoring Are Especially Important Here

This is one of the domains where passive monitoring may be particularly valuable. Neurodegenerative change may be slow and distributed, so high-friction one-off testing can miss the pattern while repeated passive sensing can preserve it.

Reviews on Parkinson's disease and broader neurological wearables describe remote continuous monitoring of motor symptoms, physical activity, gait, and daily-life movement as a promising route toward more realistic longitudinal assessment.

That fits the engine closely: passive capture preserves continuity, continuity reveals drift, drift reveals functional decline earlier than episodic snapshots alone.

8.6.5 The Role of Recovery and Routine Stability

Recovery remains important in this domain as well, though it may look different from metabolic or inflammatory applications.

The engine can ask: Is the person returning to their normal speech, sleep, movement, or cognitive rhythm? Is daily function becoming slower or more fragmented? Is there progressive loss of stability in ordinary routine? Are repeated small failures accumulating into a coherent decline path?

This is particularly important in neurodegeneration because functional deterioration may emerge as gradual loss of smoothness, resilience, and ordinary continuity rather than as a single acute abnormality.
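The recovery question above can be sketched as a simple return-to-baseline check. This is an illustrative sketch only: the metric, tolerance, and numbers are hypothetical placeholders, not part of the model's specification, and "sleep hours" stands in for any routine-stability signal such as gait speed or daily activity rhythm.

```python
# Hypothetical return-to-baseline check. Tolerance and stability window
# are illustrative, not validated parameters.

def recovered(series, baseline, tolerance=0.1, stable_points=3):
    """True if the last `stable_points` readings are all within
    `tolerance` (as a fraction of baseline) of the personal baseline."""
    tail = series[-stable_points:]
    return (len(tail) == stable_points and
            all(abs(x - baseline) <= tolerance * baseline for x in tail))

# A dip in sleep followed by a return toward the personal baseline.
sleep_hours = [7.2, 5.1, 5.4, 6.0, 6.9, 7.1, 7.0]
print(recovered(sleep_hours, baseline=7.0))  # True
```

The point of the sketch is that recovery is judged against the person's own baseline over several consecutive readings, not against a single population cut-off.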

8.6.6 Weighting in the Neurodegenerative Layer

Compared with oncology, the neurodegenerative layer would usually weight more heavily: passive behavioural and motor signals, wearable movement and sleep continuity, speech or interaction-pattern change, repeated functional self-report, prior decline pattern.

Blood-based signals may still matter in some subdomains, but the centre of gravity often shifts toward functional and behavioural continuity.

That is another demonstration of why the engine generalises by weighting rather than by imposing one rigid signal hierarchy across all disease classes.

8.6.7 Section Conclusion

Neurodegenerative and functional-decline pathways fit the prognostic engine because deterioration in this domain is often gradual, multimodal, and visible in repeated functional deviation before formal late recognition. Digital biomarkers, wearables, passive sensing, speech, sleep, and movement data all support the use of a longitudinal personal-baseline architecture for earlier detection of structured concern.

8.7 Shared Architecture, Different Weighting

The engine generalises because its architecture stays the same while its weighting changes by domain.

That is the core design rule.

The system does not need a different conceptual engine for cancer, metabolic disease, inflammatory instability, cardiovascular monitoring, and neurodegenerative decline. It needs one shared structure for: baseline-relative deviation, persistence across time, convergence across layers, recovery or non-recovery, state transition, escalation when the pattern becomes actionably coherent.

What changes is not the engine's logic. What changes is the importance assigned to different layers, signals, and thresholds inside that logic.
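That design rule can be sketched in a few lines: one shared scoring function whose behaviour is tuned per domain purely through a weighting profile. All layer names, weights, and deviation values below are hypothetical placeholders, not a validated model.

```python
# Illustrative sketch: one shared combination rule, domain-specific weights.

def drift_score(layer_deviations, weights):
    """Combine baseline-relative deviations (0..1 per layer) under a
    domain-specific weighting. The rule is shared across domains;
    only the weights differ."""
    total = sum(weights.get(layer, 0.0) * dev
                for layer, dev in layer_deviations.items())
    norm = sum(weights.get(layer, 0.0) for layer in layer_deviations) or 1.0
    return total / norm

# Same engine, two hypothetical weighting profiles.
METABOLIC_WEIGHTS = {"glucose_drift": 0.5, "recovery_after_load": 0.3, "wearable": 0.2}
NEURO_WEIGHTS = {"gait": 0.3, "sleep": 0.25, "speech": 0.25, "self_report": 0.2}

# One person's current deviations, read through each domain lens.
deviations = {"glucose_drift": 0.8, "wearable": 0.4, "gait": 0.1, "sleep": 0.2}
print(round(drift_score(deviations, METABOLIC_WEIGHTS), 2))  # 0.69
print(round(drift_score(deviations, NEURO_WEIGHTS), 2))      # 0.15
```

The same underlying data produce a strong metabolic concern and a weak neurodegenerative one, which is exactly the "same engine, different weighting" behaviour the section describes.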

8.7.1 Why This Matters

This is what prevents the paper from becoming fragmented.

If every disease class required a completely separate framework, the result would be: duplicated architecture, duplicated user experience, duplicated temporal logic, duplicated signal handling, reduced cross-condition insight.

A shared engine avoids that. It preserves one continuity model across the person while allowing domain-specific tuning.

That is stronger both scientifically and practically.

8.7.2 What Stays Shared

Across all applications, the following remain constant: the person is the continuity point, baseline is personal not only population-based, repeated sampling matters more than isolated events, state movement matters more than binary labelling, escalation requires reinforcement not mere noise, recovery must be tracked as carefully as deterioration.

These are the invariant rules of the engine.

8.7.3 What Changes by Application

What changes across applications includes: which blood signals matter most, how strongly wearables are weighted, how important inherited background is, what persistence pattern is most meaningful, how much symptom recurrence should count, what threshold should trigger escalation, what form of review follows escalation.

This is the adjustable layer.

So the architecture is common. The weighting is specific.

8.7.4 Example of Weight Shifts Across Domains

A practical summary looks like this:

Cancer layer: Heavier weight on inherited susceptibility, oncology-oriented blood deviation, and reinforced multi-layer concern.

Metabolic layer: Heavier weight on repeated glucose-related drift, behaviour-linked instability, and recovery to baseline after load.

Inflammatory/autoimmune layer: Heavier weight on flare recurrence, recovery failure, symptom clustering, wearable instability, and trigger-linked context.

Cardiovascular layer: Heavier weight on autonomic pattern, wearable recovery metrics, exertion response, and rhythm instability.

Neurodegenerative layer: Heavier weight on passive functional drift, sleep and movement pattern, speech or behavioural change, and repeated loss of routine stability.

This shows the principle clearly: same engine, different weighting.

8.7.5 Why Shared Architecture Preserves the Human System

The shared-engine approach is also important because real human deterioration does not always stay inside one neat diagnostic box.

A person may show: metabolic instability with inflammatory burden, cardiovascular strain with poor sleep and stress dysregulation, neurofunctional drift with systemic recovery failure.

A fragmented one-disease-at-a-time model can miss that overlap.

A shared architecture can preserve the person as one continuous regulated system, while still allowing different application layers to interpret the same underlying data differently.

That is a major strength of the model.

8.7.6 Weighting Can Evolve Across Time

Weighting should not only vary by disease class. It can also evolve for the same person over time.

For example: a signal that repeatedly predicts deterioration may gain weight, a signal that repeatedly produces noise may lose weight, a disease-specific suspicion may temporarily raise weighting on certain layers, a new family-history revelation may raise the inherited-risk contribution.

This makes the engine adaptive without making it structurally unstable.
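A minimal sketch of that bounded adaptation follows. The step size and bounds are hypothetical; the point is that weights move only in response to resolved outcomes, and hard limits prevent any single signal from destabilising the structure.

```python
# Hypothetical per-person weight adaptation. A signal that repeatedly
# precedes confirmed deterioration gains weight; one that repeatedly
# fires without consequence loses weight. Clamping keeps the engine
# adaptive without becoming structurally unstable.

def update_weight(weight, predicted_event, event_confirmed,
                  step=0.05, lo=0.05, hi=0.6):
    """Nudge a signal's weight after each resolved episode."""
    if predicted_event and event_confirmed:
        weight += step          # useful predictor: gains weight
    elif predicted_event and not event_confirmed:
        weight -= step          # noisy predictor: loses weight
    return max(lo, min(hi, weight))  # hard bounds preserve stability

w = 0.2
for confirmed in [True, True, False, True]:  # two hits, one false alarm, one hit
    w = update_weight(w, predicted_event=True, event_confirmed=confirmed)
print(round(w, 2))  # 0.3
```

Under this rule, a new family-history revelation or a disease-specific suspicion could be expressed as a temporary manual adjustment to the same bounded weights.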

8.7.7 Shared Architecture Preserves UX Simplicity

The same principle also keeps the user experience simple.

The user does not need five separate surveillance systems. The user needs one continuity loop: sample, track, watch, respond.

The interpretation layer can become more complex behind the scenes without increasing burden on the person.

That is important because repeatability and adherence are central to the whole thesis.

8.7.8 Section Conclusion

The engine should be understood as a shared prognostic architecture with different weighting across disease applications. The person, the baseline, the state logic, the persistence model, and the escalation structure remain constant. What changes is the weighting of signals, thresholds, and review pathways according to the disease corridor being modelled. This is the principle that allows one system to support multiple domains without losing coherence.

8.8 Limits of Multi-Disease Expansion

The multi-disease expansion is strongest when its limits are stated clearly. A shared prognostic engine can generalise across disease classes, but generalisation does not mean that all conditions become equally detectable, equally weightable, or equally mature for deployment.

The governing principle is: one architecture can span multiple domains, but each domain still requires its own validation, weighting, signal quality assessment, and escalation logic.

This is consistent with current precision-health and digital-biomarker literature. Reviews across cardiovascular, neurological, inflammatory, and broader digital-health monitoring repeatedly support longitudinal multimodal tracking, but they also emphasise variability in data quality, disease-specific signal relevance, validation maturity, and clinical utility.

8.8.1 Not All Disease Classes Are Equally Observable

Some disease domains lend themselves more easily to repeated personal monitoring than others.

For example: glucose regulation can often be tracked densely and directly, cardiovascular recovery may be visible through wearables, some inflammatory flare patterns may show repeated physiological instability, some cancer-related or neurodegenerative signals may remain sparse, delayed, or less specific at early stages.

So the engine can be shared, but observability still differs by domain. That difference must be acknowledged in the paper.

8.8.2 Shared Logic Does Not Remove Domain Specificity

A common engine does not eliminate the need for disease-specific judgment.

Each application still needs its own: relevant signal set, weighting hierarchy, persistence interpretation, threshold calibration, escalation pathway, validation evidence.

This means the shared architecture is real, but it is not a shortcut around application-layer discipline.

8.8.3 Signal Quality and Data Burden Still Matter

The engine also depends on the quality of its inputs.

If the system receives: sparse data, noisy wearables, inconsistent self-report, unvalidated assay signals, long gaps in observation, then the quality of inference weakens.

So multi-disease expansion is not simply a matter of adding more disease labels. It requires that the underlying signal architecture remain strong enough in each domain to support sound drift interpretation. Reviews of digital biomarkers and patient-generated monitoring repeatedly note this dependency on signal reliability, adherence, and validation.
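One way to express this dependency is a confidence gate on sampling density: inference confidence degrades as recent observations become sparse or stale, and below a floor the engine should hold rather than force a drift call on thin data. The window, threshold, and dates below are illustrative placeholders.

```python
# Hypothetical data-quality gate: confidence scales with how densely
# the recent observation window is sampled.

from datetime import date, timedelta

def inference_confidence(sample_dates, window_days=30, min_samples=6,
                         today=None):
    """Return 0..1 confidence based on sampling density in the recent
    window. Below min_samples, downstream drift inference should be
    held rather than forced."""
    today = today or date.today()
    recent = [d for d in sample_dates if (today - d).days <= window_days]
    return min(1.0, len(recent) / min_samples)

today = date(2026, 4, 18)
dense = [today - timedelta(days=i * 3) for i in range(10)]    # every 3 days
sparse = [today - timedelta(days=45), today - timedelta(days=60)]  # long gap
print(inference_confidence(dense, today=today))   # 1.0
print(inference_confidence(sparse, today=today))  # 0.0
```

A gate like this makes the limit explicit in the architecture: weak inputs reduce the engine's willingness to infer, rather than silently producing weaker inferences.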

8.8.4 Cross-Disease Overlap Can Create Ambiguity

A further limit is overlap.

The same pattern may support more than one concern corridor. For example: fatigue, poor recovery, and physiological instability may appear in inflammatory, metabolic, cardiovascular, or neurodegenerative pathways; broad inflammatory deviation may reinforce several possible interpretations rather than one; behavioural disruption may amplify many domains at once.

So a shared engine must be able to hold ambiguity rather than forcing premature disease naming.

That is not a weakness. It is exactly why the state-based drift architecture is stronger than a premature disease-classifier model.

8.8.5 Clinical Utility Must Still Be Proven Per Domain

Even if the architecture is coherent across domains, usefulness still has to be demonstrated separately.

A model that works well for metabolic regulation may not yet be equally proven for early oncology or neurodegenerative surveillance. Current literature across these areas consistently distinguishes conceptual promise from validated clinical utility.

So the paper should distinguish clearly between architectural generality and domain-specific validation.

That keeps the chapter serious.

8.8.6 Section Conclusion

The multi-disease engine is strongest when framed as a shared prognostic infrastructure with domain-specific limits. The architecture generalises, but observability, signal maturity, weighting, and validation do not generalise automatically at the same speed. Each disease layer still requires its own evidence base, calibration, and escalation discipline. That limit does not weaken the model. It defines the conditions under which it remains rigorous.

8.9 Chapter Conclusion

This chapter has shown that the prognostic engine extends beyond cancer because it is built around the structure of deterioration rather than around one disease label. Metabolic, inflammatory, cardiovascular, and neurodegenerative pathways all fit the model when they are treated as longitudinal drift problems involving deviation, persistence, reinforcement, and recovery failure. What remains shared is the architecture. What changes is the weighting, thresholds, and review pathway for each domain. The result is a general prognostic infrastructure rather than a collection of isolated single-disease tools.

Chapter 9. User Ownership, Trust, and Interface

A prognostic engine does not succeed by mathematics alone. It succeeds only if the person will actually live inside it. That makes user ownership, trust, and interface design structural parts of the model rather than decorative additions.

The central proposition of this chapter is: if the person is the continuity point of the data, then the person must also be the practical owner of the monitoring relationship.

Current literature supports this direction. Reviews of patient-generated health data (PGHD) describe digital systems as most useful when they support continuity, self-management, and clinically meaningful integration, while trust, usability, accessibility, and adherence remain decisive for sustained use.

9.1 Purpose

This chapter establishes the user-side governance and experience architecture of the system.

It defines: why ownership matters in a prognostic model, how trust is built or lost, how the interface should present drift without causing panic or fatigue, how adherence and usability affect scientific validity, how data control, clarity, and continuity should be handled.

9.1.1 Chapter Function

The function of this chapter is to show that the interface is not just where the system appears. It is where the system either survives in lived use or fails.

That is consistent with current digital-health evidence. Usability of home and wearable monitoring devices affects whether remote-monitoring systems work in practice, and broader reviews identify trust, privacy, accessibility, and sustained engagement as major determinants of real-world effectiveness.

9.2 Why User Ownership Is Structural

User ownership is structural because the prognostic engine depends on continuity, and continuity exists first in the life of the person, not in the schedule of the institution. If the person is the continuity point of the data, then the person must also be the practical holder of the monitoring loop.

That is not a philosophical preference. It is an operational requirement.

A system based on repeated sampling, repeated self-observation, repeated wearable capture, and repeated return to baseline cannot function if the user is treated merely as a passive source of extracted data. Patient-generated health-data research increasingly frames people as active participants in longitudinal data production and self-management, not just subjects of occasional institutional measurement.

9.2.1 The Person Holds the Timeline

Institutions encounter episodes. The person lives the sequence.

That difference is the reason ownership matters. A prognostic system is not built around one consultation, one test, or one result. It is built around the person's ongoing biological and functional continuity across days, weeks, months, and years.

So the engine should be designed on the assumption that: the person initiates or permits repeated sampling, the person holds the ongoing timeline, the person can see state movement across time, the person decides when and how to share onward.

Without this, the system falls back into an extraction model rather than a continuity model.

9.2.2 Ownership Improves Data Integrity

Ownership also strengthens data integrity.

When the person can see: what was captured, when it was captured, how it was interpreted, whether it is improving or worsening, what action is being recommended, then the data are less likely to become opaque fragments detached from lived reality.

This matters because early prognostic interpretation often depends on context and continuity. A user who can review and correct the surrounding picture improves the quality of the timeline.

9.2.3 Ownership Is Not the Same as Isolation

The paper should remain precise here.

User ownership does not mean the person is left alone to interpret everything in isolation. It means the person is the primary continuity holder inside the system.

That allows three things at once: personal continuity, selective sharing, structured escalation when required.

So ownership strengthens the system without abolishing onward review.

9.2.4 Why Ownership Supports Adherence

A person is more likely to remain engaged with a system that is visibly theirs.

If the interface behaves like a distant institutional instrument, adherence will weaken. If it behaves like a personal continuity tool that preserves history, explains drift, and supports action, adherence is more likely to survive.

Digital-health literature repeatedly finds that engagement improves when systems support personal relevance, self-management, understandable feedback, and perceived usefulness rather than opaque data extraction alone.

9.2.5 Ownership and Escalation

Ownership also matters during escalation.

When the system moves from watch to action, the person should be able to carry forward: their recent drift timeline, the major contributing layers, the persistence summary, the recovery or non-recovery pattern, the relevant history corridor.

That makes the transition outward more coherent. The person does not arrive at review empty-handed. They arrive with a structured continuity record.
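A hypothetical shape for that structured continuity record is sketched below. All field names are illustrative; the point is that escalation carries a compact, reviewable summary rather than raw unstructured data.

```python
# Illustrative shape of the continuity record a person could carry into
# review. Field names are placeholders, not a defined schema.

from dataclasses import dataclass

@dataclass
class EscalationSummary:
    drift_timeline: list        # recent state movements, newest last
    contributing_layers: list   # layers that drove the concern
    persistence_days: int       # how long the pattern has held
    recovery_status: str        # e.g. "recovering", "plateau", "worsening"
    history_notes: str = ""     # relevant personal or family corridor

summary = EscalationSummary(
    drift_timeline=["watch", "watch", "drift_detected"],
    contributing_layers=["sleep", "gait"],
    persistence_days=21,
    recovery_status="worsening",
)
print(summary.recovery_status)  # worsening
```

Because the record mirrors the engine's own concepts (timeline, layers, persistence, recovery), the person arrives at review with the same structure the system used to reach its conclusion.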

9.2.6 Section Conclusion

User ownership is structural because the prognostic model depends on the person as the holder of the living timeline. The engine can only work properly if the individual remains the practical continuity point for capture, review, and selective onward sharing. That is not an optional UX preference. It is part of the scientific architecture of pre-event monitoring.

9.3 Trust as a Technical Requirement

Trust is not an emotional extra added after the system is built. It is a technical requirement of the system itself. A prognostic engine that is not trusted will not be used consistently, and a system that is not used consistently loses continuity, weakens its baseline, and degrades its own scientific value.

That is why trust belongs inside the architecture.

Current literature supports this directly. Reviews of patient-generated health data and remote monitoring repeatedly identify usability, transparency, privacy, perceived usefulness, and data handling as core determinants of whether people remain engaged with digital health systems over time.

9.3.1 Why Trust Is Technical

A system based on repeated personal participation depends on several things happening reliably: the user continues to sample; the user believes the outputs are meaningful; the user understands what the system is doing; the user feels in control of their data; the user does not feel manipulated, overwhelmed, or ignored.

If any of these break down, adherence falls. Once adherence falls, the data become sparse, the baseline weakens, the state engine becomes less reliable, and the whole prognostic model degrades.

So trust is not merely about comfort. It is about signal continuity and model integrity.

9.3.2 What Builds Trust

Trust in this kind of system is built through five main features: clarity (the user can understand the current state and why it changed), restraint (the system does not overclaim or over-alert), consistency (similar patterns are handled in similar ways), control (the user can see, retain, and choose how to share their data), usefulness (the outputs clearly help the person act earlier or better).

Digital-health and PGHD reviews consistently associate trust and adoption with understandability, perceived benefit, privacy protection, and good usability rather than with raw technical sophistication alone.

9.3.3 Opaque Systems Lose Trust Quickly

An opaque system is especially dangerous in a prognostic context.

If the app produces a warning but cannot show: what changed, whether the pattern is worsening or recovering, which layers contributed most, what action is actually recommended, then the user is left with either confusion or alarm.

That weakens adherence and eventually weakens the data. Reviews of remote-monitoring usability and PGHD integration repeatedly note that poor interpretability, poor workflow fit, and opaque presentation reduce sustained value in practice.

9.3.4 Privacy and Data Handling Are Part of Trust

Trust also depends on whether the user believes their data are being handled properly.

Public-attitudes work from NHS England shows that trust in health-data handling is substantial but conditional, and that security concerns remain important, especially around cyberattack risk.

That means the system must be designed so the user can understand: what is stored, what is local versus shared, what leaves the app and when, who can view the data if escalation occurs, how permissions can be changed.

A person-centred system cannot ask for repeated intimate data while treating privacy as a secondary issue.

9.3.5 Trust Requires Alert Discipline

Trust also depends on how the system behaves under uncertainty.

If the system alerts too often, the user stops believing it. If it stays silent until deterioration is obvious, the user stops needing it.

So trust depends on disciplined alerting: soft concerns should stay soft; action states should be clearly distinguished from watch states; repeat alerts should not feel like duplication without added meaning; worsening should be visible as worsening, not as random churn.

This is one reason the chapter on thresholds matters so much. Alert discipline is part of trust architecture, not only mathematics.

9.3.6 Trust Is Strengthened by User Verifiability

The user should be able to verify the system's recent interpretation against their own visible history.

That means the interface should allow the person to see: recent trend, current state, what moved, whether recovery is occurring, what triggered the alert or recommendation.

This does not require exposing all internal model mechanics. It requires enough visibility that the system's behaviour does not feel arbitrary.

That kind of verifiability is central to sustained use in health technologies because people are more likely to continue with systems that feel understandable and controllable.

9.3.7 Section Conclusion

Trust is a technical requirement because the prognostic engine depends on repeated user participation, and repeated participation depends on clarity, restraint, privacy control, and visible usefulness. A system that cannot sustain trust cannot sustain continuity, and without continuity the science of personal drift detection weakens. Trust therefore belongs inside the architecture itself, not outside it.

9.4 Interface Simplicity and Longitudinal Use

A prognostic engine only works if the interface is simple enough to survive repeated use. This is not a matter of visual taste. It is a condition of data continuity. If the interface is confusing, overburdening, or cognitively noisy, the user will not sustain the loop, and the system will lose the very temporal density it needs in order to function.

That is why interface simplicity is part of the scientific design of the model.

Current research supports this. Reviews of digital-health adherence and patient-generated health data repeatedly identify usability, accessibility, alert burden, scheduling burden, and clarity of presentation as key determinants of sustained engagement. In one 2025 feasibility study of an app-plus-Apple-Watch remote research system, most participants rated the app as highly user-friendly and found the alert frequency and schedule manageable, which illustrates how low-friction design supports long-term participation.

9.4.1 Simplicity Is a Data-Integrity Requirement

In this system, simplicity protects the timeline.

If logging a sample is cumbersome, fewer samples arrive. If reviewing drift is confusing, the user stops checking. If alerts are too frequent or too vague, trust erodes. If navigation is cluttered, adherence drops.

That means interface complexity is not neutral. It directly affects baseline quality, persistence measurement, and the reliability of state updates. Broader digital-health adherence reviews likewise identify usability, perceived effort, and sustained engagement as major influences on long-term adherence.

9.4.2 What the Interface Must Do Well

The interface does not need to expose all internal mechanics. It does need to make the core loop effortless.

At minimum, it must support: easy sample logging, easy symptom or context entry, clear current-state display, visible recent trend, clear distinction between watch, alert, and escalation states, obvious next action.

This is consistent with home-use digital-health evidence showing that usability and satisfaction improve when systems avoid technical overload and present relevant information in manageable form for patients using them outside clinical settings.

9.4.3 The Interface Should Show Trajectory, Not Just Status

A prognostic system is about movement across time, not only about a label at one moment.

So the interface should not merely display: you are in state X.

It should display: where the person is now, where they were recently, whether the pattern is worsening, stabilising, or recovering, which layers contributed most, whether repeat action is needed.

This matters because users are more likely to stay engaged with systems that feel understandable and useful, rather than opaque. PGHD reviews and home-use digital-health usability studies both point toward the value of understandable feedback and practical usefulness.

9.4.4 Low Cognitive Load Matters

The app must reduce cognitive load rather than add to it.

That means: no unnecessary technical language on the main screen, no cluttered dashboards full of equal-priority metrics, no repeated prompts that do not change meaning, no over-detailed graphs where a simple trajectory summary would suffice.

The user should not have to decode the system each time they open it. They should be able to see quickly: current condition, current direction, current need.

Research on digital-health usability and accessibility shows that poor design and inaccessible presentation can materially limit effectiveness, especially in long-term home use.

9.4.5 Longitudinal Use Requires Routine Fit

A useful interface fits into life. It does not demand that life reorganise itself around the app.

That means the monitoring loop should be compatible with ordinary routine: sample entry in seconds, not minutes; passive data capture wherever possible; short symptom/context inputs; reminders that are timely but not oppressive; immediate return to useful summary after data entry.

This is supported by evidence that adherence to digital health technologies depends heavily on burden, routine fit, motivation, and perceived manageability across time.

9.4.6 Interface Simplicity Strengthens Equity

Simplicity also matters because the system should remain usable across different ages, capacities, and levels of technical confidence. A complicated interface may work for highly engaged users and fail for everyone else.

That is one reason usability and accessibility principles matter. A 2025 review of digital-health-app usability against DTAC and NICE-aligned criteria underscores that accessibility and usability should not be treated as optional add-ons in health technologies.

9.4.7 Section Conclusion

Interface simplicity is a condition of longitudinal validity. A prognostic engine requires repeated user participation, and repeated participation depends on low cognitive load, easy logging, clear trajectory display, and routine fit. In practical terms, the interface must make continuity easy enough that the user can stay inside the monitoring loop without friction becoming the reason the data fail.

9.5 Alert Presentation and User Action

An alert is only useful if the user can understand its meaning and respond appropriately. In a prognostic system, alert presentation is therefore not a cosmetic messaging problem. It is part of the control architecture. Poorly presented alerts create panic, fatigue, confusion, or disengagement. Well-structured alerts preserve trust, support action, and protect continuity.

Current evidence supports this. Reviews of digital health alerts and remote monitoring repeatedly show that alert burden, poor prioritisation, and unclear messaging reduce adherence and usefulness, while graded and interpretable feedback improves user engagement and response quality.

9.5.1 Alerts Must Be Graded

The system should not present every abnormality with the same intensity.

A weak watch signal should not look like an urgent escalation. A strong escalation state should not look like a routine reminder.

So the interface should preserve at least four clear alert grades: No concern, Watch, Drift detected, Escalation recommended.

This keeps the signal hierarchy visible to the user and prevents shallow abnormalities from feeling catastrophic.

9.5.2 An Alert Should State Three Things

Every alert should tell the user three things clearly: what changed, how strong the concern is, what action is appropriate now.

Without these three components, the alert becomes noise.

A useful alert is not: something may be wrong.

A useful alert is: a repeated change has been detected in these layers, the current level is this, and the next step is this.

That structure preserves interpretability.

9.5.3 Action Must Match State

Each alert band should map to a specific action level.

For example: Watch → continue routine monitoring; Drift detected → repeat sampling or review recent trend; Escalation recommended → initiate targeted follow-up or share the structured review summary; Under review → continue monitoring while external interpretation proceeds.

This prevents the system from producing warnings without usable consequence.
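The band-to-action mapping above can be sketched directly. The labels and action wording are illustrative, but the invariant is the one the section states: every alert resolves to a concrete next step.

```python
# Hypothetical mapping of alert grades to user actions, mirroring the
# bands described above. Labels and wording are illustrative only.

ALERT_ACTIONS = {
    "no_concern": "no action needed; continue routine",
    "watch": "continue routine monitoring",
    "drift_detected": "repeat sampling and review recent trend",
    "escalation_recommended": "initiate targeted follow-up; share the structured review summary",
    "under_review": "continue monitoring while external interpretation proceeds",
}

def action_for(grade):
    """Every grade maps to a usable consequence. An unknown grade is a
    programming error (KeyError), never something shown to the user."""
    return ALERT_ACTIONS[grade]

print(action_for("drift_detected"))  # repeat sampling and review recent trend
```

Keeping the mapping total (every grade has an action) is what prevents warnings without usable consequence.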

9.5.4 Avoiding Panic Language

The paper should be clear that the interface should not use dramatic or catastrophic wording as a default response to uncertainty.

This is especially important in a system designed to detect pre-event concern rather than late-stage certainty. The output should remain precise and restrained, for example: "your recent readings show repeated deviation from your usual range"; "the pattern is being monitored more closely"; "the current pattern has strengthened and follow-up is recommended"; "recovery is occurring but has not yet stabilised".

This protects trust while still preserving seriousness.

9.5.5 Avoiding Alert Fatigue

Alert fatigue is one of the main UX risks in longitudinal monitoring.

If the system repeats the same alert without new meaning, the user will start ignoring it. Digital-health adherence literature repeatedly identifies burden and repetitive or poorly targeted notifications as a major cause of disengagement.

So the engine should suppress unnecessary repetition and escalate prominence only when one of the following occurs: the state worsens, persistence strengthens, convergence increases, recovery fails, a new action becomes necessary.

That means alerts should change when meaning changes, not simply when time passes.
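That suppression rule can be sketched as a comparison between the previous and current interpretation, with hypothetical state labels: re-notify only when the meaning has changed, never merely because time has passed.

```python
# Illustrative repeat-alert suppression: notify again only when the
# state worsens, persistence strengthens, or recovery fails.

SEVERITY = {"no_concern": 0, "watch": 1,
            "drift_detected": 2, "escalation_recommended": 3}

def should_renotify(prev, curr):
    """prev/curr are dicts with 'state', 'persistence', 'recovering'."""
    if SEVERITY[curr["state"]] > SEVERITY[prev["state"]]:
        return True          # state worsened
    if curr["persistence"] > prev["persistence"]:
        return True          # pattern strengthened
    if prev["recovering"] and not curr["recovering"]:
        return True          # recovery failed
    return False             # same meaning: suppress the repeat

prev = {"state": "watch", "persistence": 2, "recovering": False}
same = {"state": "watch", "persistence": 2, "recovering": False}
worse = {"state": "drift_detected", "persistence": 3, "recovering": False}
print(should_renotify(prev, same))   # False
print(should_renotify(prev, worse))  # True
```

An unchanged interpretation stays silent, which is the behaviour the section names as protection against alert fatigue.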

9.5.6 The User Should Be Able to See Why the Alert Happened

The alert should not feel arbitrary.

The user should be able to open the alert and see: which layer or layers contributed most, whether the issue is new or recurring, whether the pattern is worsening or recovering, what the recommended next step is.

This is part of trust architecture. A system that cannot show why it changed state will lose credibility.

9.5.7 Alerts Should Support Continuity, Not Break It

A bad alert interrupts continuity. A good alert strengthens it.

The alert should draw the user back into the timeline: review recent drift, repeat the next sample, add any missing context, follow the next recommended step.

That keeps the person inside the monitoring relationship rather than pushing them away from it.

9.5.8 Section Conclusion

Alert presentation must be graded, interpretable, and tied to clear user action. The system should distinguish watch states from action states, avoid panic language, reduce duplicate alert burden, and show the user why the state changed. This is how alerting supports trust and continuity rather than undermining them.

9.6 Privacy, Control, and Data Permissioning

A prognostic system asks the user for repeated intimate data across time. That means privacy, control, and permissioning cannot be treated as secondary policy matters. They are part of the operating core of the system.

The governing principle is: the person should remain able to see, control, and selectively release their longitudinal health data, because repeated participation depends on trust in how that data are handled.

Current evidence supports this directly. NHS England's 2024 public attitudes survey reported that 83% of people trust the NHS to keep patient data secure, while cyberattacks were the most common concern; trust was lower for many non-NHS actors. Recent consent-management and public-attitudes work also emphasizes that user control and understandable consent processes increase willingness to share health data.

9.6.1 Privacy Is Not an External Add-On

In this system, privacy failure is system failure.

If the user believes that repeated biological, behavioural, and symptom data may leave their control unpredictably, they are less likely to participate consistently. Once participation weakens, the timeline weakens. Once the timeline weakens, the science weakens.

So privacy is not merely a legal wrapper around the engine. It is one of the conditions that allows the engine to exist at all.

9.6.2 Control Must Be Visible

The user should be able to see clearly: what data are stored, which data remain local, which data are shareable, what has been shared already, who has access now, how to revoke or change permissions.

Control that exists only in hidden settings is weak control. In health-data systems, trust rises when consent and data-sharing choices are explicit, understandable, and revisable. Recent work on secure self-determined health-data sharing and user-driven consent platforms makes this point directly.

9.6.3 Permissioning Should Be Layered

Not every data type needs the same permission rule.

A stronger architecture is layered permissioning. For example: private personal timeline visible only to the user, selected summary sharing for onward review, separate permission for research use, separate permission for clinician or specialist access, separate permission for integrated device feeds or external data imports.

This matters because repeated surveillance produces a rich longitudinal record. Users should not have to choose between total exposure and total isolation.
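The layered model above can be sketched as a per-layer permission table, where each data layer carries its own set of granted scopes rather than a single all-or-nothing consent. The layer names, scope labels, and helper function are illustrative only:

```python
from enum import Enum, auto

class Scope(Enum):
    PRIVATE = auto()        # visible only to the user
    SUMMARY_SHARE = auto()  # structured summary may be released for review
    RESEARCH = auto()       # separate permission for research use
    CLINICIAN = auto()      # named clinician or specialist access
    DEVICE_FEED = auto()    # integrated device feeds or external imports

# Hypothetical permission table: consent is layered, not global.
permissions: dict[str, set[Scope]] = {
    "blood_markers":   {Scope.PRIVATE},
    "wearable_stream": {Scope.PRIVATE, Scope.DEVICE_FEED},
    "symptom_log":     {Scope.PRIVATE, Scope.SUMMARY_SHARE},
}

def may_release(layer: str, scope: Scope) -> bool:
    """A release is lawful only if that layer has granted that scope."""
    return scope in permissions.get(layer, set())
```

Because each layer holds its own scope set, the user can, for instance, share symptom summaries onward while keeping raw blood data fully private, avoiding the total-exposure-versus-total-isolation trap.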

9.6.4 Sharing Should Be Purpose-Bound

Data sharing should also be purpose-bound.

The user should be able to distinguish between: sharing for immediate review of a concerning drift pattern, sharing for ongoing care coordination, sharing for research or model improvement, sharing for device integration or secondary analytics.

This is consistent with recent literature on consent for secondary use of health data in AI and broader health-data governance, which emphasizes that willingness to share depends heavily on understanding purpose, boundaries, and safeguards.

9.6.5 Permissioning Must Support Escalation Without Breaking Ownership

The system still needs a practical escalation path.

So the strongest model is not "never share." It is "share by explicit state transition."

That means when the engine moves into a stronger concern state, the user should be able to authorize export of: recent state trajectory, major contributing signal layers, persistence summary, recovery or non-recovery pattern, relevant history corridor.

In that structure, ownership is preserved because outward review happens through permissioned release, not through silent extraction.
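The "share by explicit state transition" rule can be sketched as a release gate with two conditions: the engine must be in a concern state, and the user must have explicitly authorised the export. The state names and packet fields below are illustrative assumptions, not a fixed export schema:

```python
def build_export(state: str, user_authorized: bool, timeline: dict):
    """Release a bounded review packet only on explicit, state-gated consent.
    Returns None (nothing leaves the device) unless both gates pass."""
    if state not in {"drift_detected", "escalation_recommended"}:
        return None  # no concern state: nothing to release
    if not user_authorized:
        return None  # ownership preserved: no silent extraction
    return {
        "state_trajectory": timeline.get("recent_states", []),
        "contributing_layers": timeline.get("top_layers", []),
        "persistence_summary": timeline.get("persistence"),
        "recovery_pattern": timeline.get("recovery"),
    }
```

The packet is deliberately a summary, not the raw timeline, which keeps outward review consistent with permissioned release rather than wholesale data transfer.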

9.6.6 Trust Depends on Who Receives the Data

Who receives data matters. Public-attitudes work shows much higher trust in the NHS and local health services than in commercial technology companies when it comes to handling patient data.

That means the paper should be explicit: permissioning is not only about the fact of sharing, but about the destination of sharing.

The user should be able to see: the destination, the purpose, the scope, the duration.

That is a stronger privacy architecture than generic acceptance screens.

9.6.7 Security Is Part of Permissioning

Permissioning without security is weak.

NHS England's survey identifies cyberattack as a leading public concern around health-data security. Broader UK digital-health guidance likewise treats data protection, privacy, and cybersecurity as core compliance areas.

So the paper should treat security as part of control design, not as an external technical note.

9.6.8 Section Conclusion

Privacy, control, and permissioning are core parts of the user-ownership architecture. A prognostic engine that depends on repeated intimate data must allow the person to see what is held, decide what is shared, distinguish different sharing purposes, and authorize escalation exports without losing ownership of the timeline. That is how long-term participation remains compatible with trust.

9.7 Adherence, Accessibility, and Real-World Validity

A prognostic engine is only valid in the real world if people can and will keep using it. That makes adherence and accessibility more than implementation details. They are part of the validity condition of the system.

The governing principle is: if repeated participation fails, longitudinal inference weakens, and if the interface excludes people, the engine cannot claim to function as a general personal health architecture.

Current literature supports this directly. Reviews of digital-health adherence and home-monitoring usability repeatedly identify burden, usability, accessibility, device friction, motivation, and routine fit as major determinants of whether remote-monitoring systems actually work outside ideal research settings.

9.7.1 Adherence Is a Scientific Variable

In this model, adherence is not just a user-behaviour issue. It changes the quality of the model itself.

If the user samples regularly, the engine gains: stronger personal baseline, better persistence estimation, better recovery tracking, more reliable convergence logic, stronger escalation discipline.

If the user drops out, delays repeatedly, or only uses the system intermittently, then: the corridor weakens, recurrence patterns become harder to read, false reassurance may increase, false escalation may increase.

So adherence is part of the engine's evidential strength.

9.7.2 Why Accessibility Matters

Accessibility matters because a system cannot claim broad prognostic value if it only works for highly technical, highly motivated, or highly resourced users.

The interface and workflow should therefore support: clear visual presentation, simple language on action screens, low-effort logging, compatibility with different abilities and ages, support for passive capture where possible, usable operation even during fatigue, stress, or cognitive load.

Health-app usability research and DTAC/NICE-aligned review work continue to emphasize accessibility as a core requirement rather than an optional enhancement.

9.7.3 Real-World Use Is Not Clean Laboratory Use

The paper should also be precise about the difference between ideal and real use.

In real life, people will sometimes: forget to log, delay a sample, enter incomplete context, wear devices inconsistently, engage more during concern and less during stability, vary in motivation across time.

So the engine must tolerate imperfect use without collapsing.

This means the architecture should support: sparse intervals, partial missingness, confidence-aware interpretation, re-entry after gaps, reminder logic without punishment.

That is necessary for real-world validity.
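One minimal way to tolerate gaps without punishing re-entry is to decay confidence in the personal baseline as time since the last sample grows, and simply resume from whatever confidence remains when the user returns. The half-life value and decay form here are illustrative assumptions, not calibrated parameters:

```python
def corridor_confidence(days_since_last_sample: float,
                        half_life_days: float = 14.0) -> float:
    """Illustrative decay: confidence in the personal baseline corridor
    halves every `half_life_days` without a new sample. The next sample
    resumes from the remaining confidence; there is no penalty state."""
    return 0.5 ** (days_since_last_sample / half_life_days)
```

A downstream state engine can then require higher signal strength before escalating when corridor confidence is low, which is one way to make interpretation confidence-aware under sparse or interrupted use.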

9.7.4 Adherence Improves When Use Feels Worthwhile

People sustain systems that are visibly useful.

That means the user needs to feel that the app is doing something meaningful with their effort: preserving continuity, showing trend clearly, distinguishing watch from action, helping them act earlier, reducing uncertainty rather than amplifying confusion.

Digital-health adherence studies repeatedly show that perceived usefulness and manageable burden are central to sustained use.

9.7.5 Passive Capture Strengthens Validity

Where possible, passive or low-effort capture improves real-world validity because it reduces reliance on constant manual input.

This is especially useful for: wearable physiological streams, sleep and recovery patterns, movement and routine rhythm, background continuity between active samples.

Passive capture does not remove the need for active user input, but it stabilises the timeline when manual adherence fluctuates.

9.7.6 Accessibility Also Protects Data Equity

If the system is only easy for one class of user, then the resulting data will be biased toward that class.

That weakens both fairness and model generality.

So accessibility is not only about inclusion at the interface level. It also affects what kinds of real-world baseline corridors the engine will actually learn from and support.

9.7.7 Section Conclusion

Adherence and accessibility are part of the real-world validity of the prognostic engine. Repeated use strengthens baseline quality, state interpretation, and recovery tracking; inconsistent use weakens them. The system must therefore be designed for imperfect real life, not only for ideal protocol use, and it must remain accessible enough that continuity is possible across different users, abilities, and circumstances.

9.8 Limits of the UX Layer

The UX layer is structurally important, but it also has limits. A well-designed interface can preserve continuity, trust, and adherence, but it cannot compensate for weak signal quality, poor assay validity, or inadequate application-layer science.

The governing principle is: good interface design strengthens the engine, but it does not substitute for evidential weakness underneath.

9.8.1 Good UX Does Not Create Biological Signal

A simple, elegant interface does not make a weak biomarker strong. If the underlying signal is sparse, noisy, or poorly validated, the interface cannot solve that by presentation alone.

So the paper should keep the hierarchy clear: UX supports use, use supports continuity, continuity supports inference, but inference still depends on genuine signal quality.

9.8.2 High Trust Does Not Mean High Accuracy

A user may trust a system because it is clear and well designed. That trust must not be mistaken for proof that every inference is correct.

So the system should preserve humility in its outputs: watch remains watch, drift remains drift, escalation remains recommendation for further review, not proof of final diagnosis.

This protects the model from becoming persuasive beyond its evidence.

9.8.3 Ownership Does Not Remove Need for Validation

Even where the user owns the timeline and controls sharing, the application layers still require validation. Ownership improves continuity and trust, but it does not remove the need for: assay validation, threshold calibration, disease-specific evaluation, real-world performance testing.

That distinction should remain explicit.

9.8.4 UX Can Also Fail Through Overdesign

The chapter should acknowledge the opposite risk as well.

A system that tries too hard to be reassuring may hide meaningful concern. A system that tries too hard to be detailed may overwhelm the user. A system that tries too hard to engage may create notification fatigue.

So the UX layer must be disciplined, not merely attractive.

9.8.5 Section Conclusion

The UX layer is essential, but it is not sovereign. It can preserve trust, adherence, ownership, and continuity, but it cannot replace biological validity, assay maturity, or disease-specific evidence. Its proper role is to make the real strengths of the engine usable and visible without pretending to create strengths that are not there.

9.9 Chapter Conclusion

This chapter has shown that user ownership, trust, and interface design are structural elements of the prognostic engine. The person must remain the holder of the timeline because continuity exists in life before it exists in institutions. Trust is a technical requirement because repeated use depends on clarity, restraint, privacy control, and visible usefulness. Interface simplicity is necessary for longitudinal use, alert presentation must preserve action without panic, and privacy and permissioning must allow selective onward sharing without loss of ownership. Adherence and accessibility then become part of real-world validity itself.

The central conclusion is: the user layer is not outside the science of prognosis; it is one of the conditions that makes the science operable in lived reality.

Chapter 10. Validation Architecture

A prognostic engine is only conceptually interesting until it is validated. This chapter defines the evidential structure required to move the system from architectural proposal to credible application.

The central proposition is: the engine must be validated as a longitudinal prediction system, not merely as a static classifier.

That distinction matters because the paper is not describing a one-off diagnostic test. It is describing a continuity-based system that reads repeated personal signals across time, updates state, and escalates when patterns become actionably coherent. Validation must therefore assess not only whether the system can detect abnormalities, but whether it can do so earlier, more consistently, and more usefully than episodic alternatives. Current literature on digital biomarkers, predictive models, and clinical AI increasingly stresses analytical validity, clinical validity, prospective evaluation, and real-world implementation rather than retrospective accuracy alone.

10.1 Purpose

This chapter establishes the validation structure of the engine.

It defines: analytical validation of the signal layers, temporal validation of the prognostic model, retrospective and prospective evaluation, domain-specific validation across disease applications, user-side and real-world performance testing, governance and safety requirements for deployment.

10.1.1 Chapter Function

The function of this chapter is to answer: how do we prove that the engine works as a prognostic system rather than merely as an attractive conceptual model?

That question must be answered at several levels. Reviews of AI-based prediction models now explicitly emphasize reporting quality, validation discipline, and the need for prospective implementation evidence, while FDA guidance on digital health technologies for remote data acquisition sets expectations for the reliable use of remote tools in clinical investigations.

10.2 Why Longitudinal Validation Is Different

Longitudinal validation is different because this engine is not a one-time classifier. It is a repeated-measure system that updates over time, tracks trajectory, and changes state as new evidence arrives. That means it cannot be validated adequately by showing only that it separates cases from non-cases at one isolated moment. It must be validated as a temporal prediction architecture. TRIPOD+AI explicitly applies to prediction models used for prognostic, monitoring, and screening purposes, not only single-time diagnostic models, and recent digital-biomarker reviews likewise stress analytical validity, clinical validity, and real-world implementation rather than retrospective accuracy alone.

10.2.1 Static Accuracy Is Not Enough

A static model asks: given this snapshot, what class is most likely?

A longitudinal model must ask: given this sequence, is the person drifting, recovering, or approaching escalation earlier than a static snapshot would reveal?

That difference changes the validation target. The model must be assessed not only on discrimination at one point, but on whether it correctly interprets direction, persistence, convergence, and time-to-action. Recent work on performance measures for predictive AI in medicine argues that evaluation should move beyond narrow headline metrics and consider the full intended prediction task.

10.2.2 Time Is Part of the Outcome

In this engine, time is not incidental metadata. It is part of the outcome structure.

The relevant validation questions include: How early does the engine detect meaningful drift? How often does it alert too early? How often does it alert too late? Does it distinguish transient disturbance from persistent deterioration? Does it recover appropriately when the person returns to corridor?

Those questions do not arise fully in a one-off classifier. They arise because the model is meant to operate across repeated windows and state transitions. That is why longitudinal validation must include time-aware endpoints, not only cross-sectional labels.

10.2.3 The Unit of Validation Is the Trajectory

For a static diagnostic model, the unit of validation is often the isolated case.

For this engine, the stronger unit is the trajectory.

That means validation must examine whether the model reads sequences correctly, such as: stable sequence, soft deviation with recovery, persistent divergence, reinforced escalation path, relapse after apparent recovery.

This is especially important for a system that claims to outperform after-event recognition by detecting formation paths earlier. A model cannot justify that claim unless trajectories themselves are part of the validation design. Digital-biomarker reviews and analytical-validation work increasingly make this point by focusing on continuous measures, repeated capture, and implementation feasibility in real-world datasets.

10.2.4 Longitudinal Validation Needs More Than Discrimination

A static model is often judged heavily by discrimination.

A longitudinal model still needs discrimination, but it also needs: temporal calibration, stability of state transitions, alert burden, false-alarm persistence, missed-event timing, recovery and de-escalation behaviour, usefulness of earlier warning.

This matters because a model can discriminate reasonably well in hindsight and still behave badly over time. It may flicker, over-alert, de-escalate too slowly, or fail to preserve trajectory coherence. Reviews of predictive AI performance and reliability in clinical settings warn against relying on narrow metric summaries without examining whether the model is reliable and useful in the intended workflow.

10.2.5 Remote and Repeated Data Create Extra Validation Demands

Because the engine relies on home sampling, wearables, and other repeated digital inputs, validation must also address the quality of remote data acquisition itself. The FDA's current guidance on digital health technologies for remote data acquisition in clinical investigations emphasizes that remote digital tools used to collect data should be fit for purpose and that sponsors should consider factors such as selection, verification, usability, and data handling for the intended clinical investigation.

That means longitudinal validation for this engine has at least two layers: validation of the signals as remotely acquired measures, and validation of the engine as a temporal interpreter of those measures.

Both are necessary.

10.2.6 Reporting Discipline Also Changes

Because this is a prediction and monitoring architecture, reporting standards should reflect that. TRIPOD+AI was created to improve transparent reporting of clinical prediction models, including those used for prognostic and monitoring purposes, and recent assessments still report inconsistent adherence in applied AI prediction studies.

So longitudinal validation should be reported with enough detail to show: the timeline structure of the data, the updating logic, the prediction horizon, the state definitions, the alert and escalation rules, the real-world data gaps and limitations.

That is part of the evidential architecture, not an editorial extra.

10.2.7 Section Conclusion

Longitudinal validation is different because this engine is designed to interpret evolving personal trajectories rather than classify one-off snapshots. It must therefore be validated on timing, persistence, transition behaviour, calibration, and real-world remote data quality in addition to ordinary predictive discrimination. That is the only validation frame consistent with what the engine actually claims to do.

10.3 Analytical Validation of the Signal Layer

Before the engine can be validated as a prognostic system, its signal layers must be validated as usable inputs. This is analytical validation.

The central question is not yet whether the full model predicts well. The first question is whether the data entering the model are fit for purpose.

That matters especially here because the engine depends on mixed sources: low-burden blood sampling, wearables and passive physiological capture, symptom and self-report input, behavioural and exposure records, history-linked and inherited-risk fields.

Each of these layers brings different strengths and different failure modes. Reviews of digital biomarkers and remote monitoring emphasize exactly this point: analytical validation must address whether the signal can be measured reliably, repeatedly, and meaningfully in the intended context of use.

10.3.1 Why Analytical Validation Comes First

A strong state engine cannot rescue a weak signal layer.

If the incoming signal is unstable, unrepeatable, poorly calibrated, or highly sensitive to uncontrolled variation, then even elegant downstream modelling will inherit that weakness. FDA guidance on remote digital health technologies likewise emphasizes that such tools used in clinical investigations should be fit for purpose, with attention to verification, validation, usability, and data handling.

So analytical validation must come before, or at least alongside, model validation.

10.3.2 What Must Be Validated

For each signal type, analytical validation should examine at least: accuracy (how close the measurement is to the intended quantity), precision (whether repeated measurement under similar conditions is stable), repeatability (whether the same user can obtain similar results across short intervals), robustness (whether the signal survives ordinary real-world variation), completeness (whether enough usable data are actually captured), context sensitivity (whether the signal changes predictably under known conditions), missingness behaviour (how often the signal is absent or unusable).

These are not optional extras. They determine whether the signal can lawfully enter a longitudinal engine.
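Two of the criteria above, repeatability and completeness, have simple quantitative forms that a signal qualification step could compute. This is a minimal sketch using a within-user coefficient of variation; the thresholds that would make a signal "acceptable" are application-specific and not fixed here:

```python
from statistics import mean, stdev

def within_user_cv(repeats: list[float]) -> float:
    """Repeatability as coefficient of variation across short-interval
    repeated measurements from the same user (lower is more repeatable)."""
    return stdev(repeats) / mean(repeats)

def completeness(expected_samples: int, captured_samples: int) -> float:
    """Fraction of scheduled samples that were actually captured and usable."""
    return captured_samples / expected_samples if expected_samples else 0.0
```

Accuracy and robustness require reference measurements and controlled perturbation, so they are not reducible to a one-line statistic, but even these two metrics make "fit for purpose" partially checkable rather than purely rhetorical.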

10.3.3 Blood-Layer Validation

For low-burden blood sampling, validation must address whether the analytes intended for use are reliable in the chosen collection format.

That includes issues such as: capillary versus venous comparability where relevant, sample stability, collection variability, transport or storage effects, assay sensitivity at low signal levels, user-collected sample quality.

Recent microsampling and patient-centric blood collection reviews repeatedly note that analyte-specific validation is essential and that not all biomarkers behave equally well in remote or capillary formats.

So the paper should be explicit: the blood layer is only as strong as the validated assay pathway behind it.

10.3.4 Wearable-Layer Validation

For wearables and passive physiological sensors, validation must examine: device measurement reliability, within-user consistency, sensitivity to motion artefact or non-wear, behaviour across real-world settings, alignment with clinically meaningful constructs, data dropout patterns.

Reviews of wearable monitoring in cardiovascular, inflammatory, and neurological settings consistently stress that signal quality, wear-time compliance, and fit-for-purpose assessment are essential before relying on these data for longitudinal inference.

So wearable validation is not just about whether a device records data. It is about whether the recorded data support the intended prognostic interpretation.

10.3.5 Self-Report and Behavioural Input Validation

Self-report and behavioural inputs cannot be validated in the same way as laboratory signals, but they still require structure.

Validation here should assess: completion rate, temporal consistency, internal coherence, response burden, recall bias sensitivity, whether structured entries correlate meaningfully with other layers over time.

This is important because a repeated symptom input can be highly valuable, but only if the system captures it consistently enough to become interpretable rather than anecdotal.

10.3.6 Fit-for-Purpose Validation

A key principle from current digital health guidance is fit for purpose. The right question is not whether a signal is perfect in the abstract, but whether it is good enough for the role it is supposed to play. FDA's remote-data-acquisition guidance uses this same logic.

In this engine, that means: a low-specificity signal may still be acceptable as a secondary reinforcement layer, a signal used to trigger escalation should usually face a higher validation bar, a contextual modifier may be useful even if it is not highly precise, provided its role remains limited.

This layered validation logic is especially important in a multi-signal architecture.

10.3.7 Signal Qualification by Role

The paper should therefore classify signals by operational role during validation: primary signals (require stronger analytical confidence), secondary reinforcement signals (require useful but not necessarily decisive analytical performance), contextual modifiers (require enough consistency to support interpretation, not to govern it alone).

That is a stronger validation structure than pretending every signal needs identical evidential status.

10.3.8 Analytical Validation Outputs

The output of analytical validation should be a signal qualification table for each application layer, showing: signal name, collection method, intended role, analytical strengths, known limitations, acceptable use conditions, confidence level in deployment.

This becomes the bridge between raw data acquisition and model evaluation.

10.3.9 Section Conclusion

Analytical validation establishes whether the engine's signal layers are fit for purpose before the full prognostic model is judged. Blood assays, wearables, symptom inputs, and contextual data all require role-appropriate validation for accuracy, repeatability, robustness, and usability in real-world collection settings. Without this step, longitudinal model validation rests on unstable ground.

10.4 Retrospective Model Validation

Once the signal layers are shown to be fit for purpose, the next stage is retrospective model validation. This asks whether the engine, when applied to existing longitudinal datasets, can reconstruct meaningful drift paths, distinguish stable from deteriorating trajectories, and identify concern earlier than late recognition alone. Retrospective validation remains a standard first step in predictive-model development, but current guidance is clear that it is not sufficient on its own; TRIPOD+AI and recent reviews both stress transparent reporting and the limits of retrospective-only evidence.

10.4.1 Purpose of Retrospective Validation

The purpose here is not to prove final clinical utility. It is to determine whether the engine behaves coherently on real historical sequences.

It should test whether the model can: track baseline formation and drift, separate transient disturbance from sustained divergence, identify escalation corridors before formal endpoint recognition, de-escalate appropriately when recovery is present, preserve usable state trajectories across time.

That is different from asking only whether the model classifies a final label correctly.

10.4.2 Why Retrospective Validation Still Matters

Retrospective validation is useful because it allows the engine to be tested across large existing datasets before prospective deployment. Datasets such as UK Biobank, All of Us, NHANES, MIMIC-IV, TCGA, and other domain-specific resources can support development and temporal back-testing of application layers, although access conditions and data structure differ by source. Official dataset resources confirm the availability of linked biomarker, record, genomic, and longitudinal data across these infrastructures.

This matters because retrospective testing can reveal: whether the chosen signals carry temporal value, whether thresholds are too loose or too strict, whether the state model flickers or behaves stably, whether the proposed escalation logic is too early, too late, or incoherent.

10.4.3 The Retrospective Unit Should Be the Trajectory

The correct unit of retrospective evaluation is not just the isolated record. It is the trajectory.

That means validation should examine sequences such as: stable baseline with no escalation, weak deviation followed by recovery, persistent divergence without escalation, reinforced divergence followed by escalation, relapse after apparent recovery, late endpoint preceded by earlier detectable drift.

This is consistent with the paper's central claim that the engine interprets formation paths rather than one-off snapshots.

10.4.4 Time-Split Design

Retrospective validation should preserve time order.

A strong design uses earlier portions of the record to estimate baseline and model behaviour, and later portions to test prediction and escalation performance. This avoids contaminating the past with information from the future and is consistent with good practice in temporal validation of predictive models.

So the design should avoid random record shuffling where that would destroy the timeline logic. Instead, it should use: baseline-estimation window, model-development window, holdout future window.

That is more faithful to the intended use of the engine.
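The three-window design above can be sketched as a time-ordered split of one person's record. Each sample is assumed to carry a numeric timestamp field `t`; the boundary values are study choices, not defaults:

```python
def temporal_split(samples: list[dict],
                   t_baseline_end: float,
                   t_dev_end: float):
    """Split a timeline into baseline-estimation, model-development, and
    held-out future windows, preserving time order. No shuffling: nothing
    from a later window can inform an earlier one."""
    baseline = [s for s in samples if s["t"] < t_baseline_end]
    develop  = [s for s in samples if t_baseline_end <= s["t"] < t_dev_end]
    holdout  = [s for s in samples if s["t"] >= t_dev_end]
    return baseline, develop, holdout
```

Because the holdout window lies strictly in the future relative to development, evaluating on it approximates the engine's intended deployed use: predicting forward from an established baseline.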

10.4.5 Core Retrospective Endpoints

The retrospective stage should assess at least: time-to-alert relative to known endpoint or review trigger, false-alert burden, missed-event rate, state-transition stability, de-escalation correctness, trajectory discrimination, calibration of warning strength, lead-time gain versus later recognition baseline.

These are stronger than a single AUC-style summary because the engine is temporal and stateful.

Recent work on evaluating predictive AI in medicine also argues against relying only on narrow headline metrics without regard to the intended clinical use and the consequences of model behaviour over time.
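Two of the endpoints above, lead-time gain and false-alert burden, can be sketched per trajectory. This is a deliberately simplified version: real evaluation would also weight alert timing, severity, and clinical consequence, none of which is modelled here:

```python
def lead_time_days(first_alert_t, event_t: float):
    """Lead-time gain: how far ahead of the known endpoint the first alert
    fired. Returns None when the trajectory was missed (no alert, or the
    first alert arrived only after the event)."""
    if first_alert_t is None or first_alert_t > event_t:
        return None
    return event_t - first_alert_t

def false_alert_burden(alert_times: list[float], event_t) -> int:
    """Simplified burden count: on a trajectory with no endpoint at all,
    every alert counts toward false-alert burden."""
    return len(alert_times) if event_t is None else 0
```

Aggregating these per-trajectory values across a retrospective cohort yields exactly the kind of time-aware summary (median lead time, burden per person-year) that a single AUC cannot express.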

10.4.6 Negative Controls and Hard Cases

Retrospective validation should also include difficult counterexamples.

These should include: trajectories with repeated benign fluctuation, trajectories with context-heavy but non-progressive disturbance, sparse-data users, incomplete recovery that never becomes a major event, overlapping multi-disease patterns, asymptomatic trajectories with weak signal.

This matters because a model that performs only on easy cases will not justify deployment.

10.4.7 Retrospective Validation by Application Layer

Because the engine supports multiple disease domains, retrospective validation should be stratified.

At minimum, separate retrospective evaluation should be reported for: cancer-oriented surveillance, metabolic dysregulation, inflammatory or autoimmune recurrence, cardiovascular recovery instability, neurodegenerative or functional-decline pathways.

This is necessary because signal density, event definition, and lead-time structure differ across domains.

10.4.8 What Retrospective Validation Cannot Prove

The paper should remain explicit here.

Retrospective validation can show whether the engine behaves coherently on historical data. It cannot, by itself, prove: that users will adhere in real life, that alerts improve outcomes, that escalation pathways work operationally, that prospective false-alert burden will remain acceptable.

That is why retrospective validation is necessary but not sufficient. Current reporting guidance and AI-evaluation reviews both make this point.

10.4.9 Section Conclusion

Retrospective model validation is the first full test of whether the engine can interpret real longitudinal trajectories as intended. It should preserve time order, evaluate trajectories rather than isolated records, and measure not only discrimination but timing, stability, alert burden, and lead-time gain. Done properly, it establishes whether the architecture has temporal coherence before prospective deployment is attempted.

10.5 Prospective and Real-World Validation

Retrospective validation shows whether the engine can behave coherently on historical sequences. Prospective and real-world validation ask the harder question: does the system still behave properly when data arrive live, imperfectly, and under ordinary conditions of use?

That step is essential. Current guidance and recent reviews of clinical AI and digital health repeatedly distinguish retrospective model performance from prospective usefulness, implementation quality, and real-world impact. Prospective evaluation remains necessary to test whether a model retains validity outside historical back-testing and whether remote data acquisition works as intended in practice.

10.5.1 Why Prospective Validation Is Different

Prospective validation differs because the model no longer reads a completed past. It must operate under uncertainty as events unfold.

That means the engine must cope with: missing or delayed data, variable adherence, changing context, real-time alert burden, live escalation decisions, imperfect remote measurements.

A system can appear strong retrospectively and still fail prospectively if it over-alerts, underperforms with sparse inputs, or behaves unpredictably when users interact with it in ordinary life. Reviews of predictive AI in medicine and digital biomarkers emphasise exactly this gap between retrospective promise and live deployment reality.

10.5.2 What Prospective Validation Should Test

Prospective studies should test not only whether the model identifies concern, but whether it does so in a way that remains usable and timely.

Core questions include: Does the engine detect drift early enough to matter? Does it preserve acceptable false-alert burden over time? Do users remain engaged with the loop? Does escalation occur at appropriate points? Do recovery and de-escalation behave sensibly in live use? Does the model remain calibrated when real-world inputs are incomplete or noisy?

These are the operational questions retrospective testing cannot fully answer.

10.5.3 Real-World Remote Data Acquisition Must Be Tested Directly

Because this engine depends on home sampling, wearables, symptom entries, and mixed-cadence personal data, prospective validation must include the remote acquisition pathway itself.

FDA guidance on digital health technologies for remote data acquisition in clinical investigations explicitly emphasises fit-for-purpose use, participant usability, data handling, and the suitability of remote tools for the intended investigation.

So prospective validation should assess: whether users can collect the required data reliably, whether device and assay performance remain stable outside ideal settings, whether data gaps break the state engine or are handled safely, whether context capture is sufficient for interpretation.

This is not separate from the model. It is part of the model's real-world validity.

10.5.4 Prospective Design Stages

A strong validation path should be staged.

A practical sequence is:

Stage 1: feasibility study – Tests usability, adherence, data completeness, and basic signal flow.

Stage 2: observational prospective study – Runs the engine without forcing action to assess timing, alert burden, and state behaviour against observed outcomes.

Stage 3: implementation study – Assesses how the system behaves when its outputs are actually used in live workflows.

This staged progression is consistent with broader digital-health evaluation logic, where feasibility, analytical reliability, and implementation are separated rather than collapsed into one claim.

10.5.5 Real-World Endpoints

Prospective validation should include endpoints that reflect the actual purpose of the engine.

These include: adherence over time, usable data completeness, time-to-alert, false-alert rate, time-to-escalation, recovery-state correctness, user response quality, escalation appropriateness, lead-time gain over usual recognition pathway.

Where a downstream review pathway exists, the study can also examine whether the engine improves the organisation and timing of that pathway, not just whether it produces alerts.

10.5.6 Human Factors Are Part of Prospective Validation

Because this is a user-owned monitoring system, human factors must be measured prospectively, not assumed.

That includes: whether users understand outputs, whether alerts feel proportionate, whether privacy controls are trusted, whether the system fits daily routine, whether people continue using it outside initial enthusiasm.

Recent digital-health usability and patient-generated data reviews consistently identify these factors as central to real-world success.

10.5.7 Prospective Validation by Application Layer

As with retrospective testing, prospective validation should not assume that every disease layer matures at the same rate.

A reasonable path is to validate application layers progressively, for example: metabolic and recovery monitoring first, where repeated signals may be denser; inflammatory recurrence next, where flare prediction can be tracked prospectively; higher-complexity oncology applications later, where signal sparsity and downstream confirmation are more demanding.

This keeps the deployment pathway serious and proportionate to signal maturity.

10.5.8 What Prospective Validation Can Establish

Prospective validation can establish whether the system is: usable in live conditions, stable under imperfect data flow, credible as an early-warning engine, calibrated in real-world monitoring, capable of supporting earlier and better-organised escalation.

It still does not automatically prove broad clinical utility across every domain, but it is the necessary bridge from model coherence to operational credibility.

10.5.9 Section Conclusion

Prospective and real-world validation are necessary because this engine is meant to operate live, not only in hindsight. The system must therefore be tested under ordinary conditions of remote sampling, wearable capture, user interaction, incomplete data, and real-time alerting. Only then can the paper support the claim that the engine functions as a true prognostic architecture rather than a retrospective pattern-reading exercise.

10.6 Domain-Specific Validation Pathways

A shared prognostic architecture does not remove the need for domain-specific validation. Each application layer must be validated according to the biology, signal quality, event structure, and escalation pathway relevant to that domain.

The governing principle is: the engine may be common, but evidence must still be specific.

This is consistent with current health-AI and digital-biomarker guidance. Prediction models, remote-monitoring tools, and digital biomarkers are expected to be validated in relation to their intended context of use, not only as abstract general systems.

10.6.1 Why Domain-Specific Validation Is Necessary

Different disease applications do not pose the same validation problem.

They differ in: signal density, baseline stability, detectability of early change, specificity of biomarkers, timing of meaningful escalation, availability of downstream confirmation.

So the same engine cannot be considered equally validated across all domains merely because its core state logic is coherent.

A cancer layer, for example, faces different constraints from a glucose-regulation layer. A neurodegenerative layer may rely more on passive behavioural drift, while an inflammatory layer may rely more on recurrence-and-recovery dynamics.

That is why application-specific validation pathways are required.

10.6.2 Cancer Validation Pathway

The oncology-oriented layer should be validated around: inherited-risk stratification where available, repeated blood-signal behaviour, time-to-concern relative to later diagnostic recognition, false-alert burden in asymptomatic or low-symptom populations, appropriateness of escalation to targeted review.

Because early cancer-related signals may be sparse and localisation may remain uncertain, validation must pay particular attention to: sensitivity in low-burden settings, specificity under repeated sampling, lead-time gain, downstream review yield.

This is exactly where current MCED and early-detection literature places its emphasis.

10.6.3 Metabolic Validation Pathway

The metabolic layer should be validated around: early regulatory drift detection, repeated post-load or longitudinal glycaemic deviation, recovery to baseline after ordinary perturbation, behaviour-linked destabilisation, time-to-progression relative to conventional threshold recognition.

This domain is particularly well suited to dense repeated validation because glucose-related and behavioural signals can often be observed at higher temporal resolution than many other domains. Current work on CGM in prediabetes and related early dysglycaemia supports this kind of validation focus.

10.6.4 Inflammatory and Autoimmune Validation Pathway

The inflammatory layer should be validated around: flare prediction, recurrence interval, failed recovery, symptom and wearable reinforcement, false alarms in noisy inflammatory environments.

Because these conditions often oscillate rather than follow a single monotonic decline, validation should examine: whether the engine distinguishes flare formation from transient disturbance, whether it detects shortening recurrence intervals, whether de-escalation behaves correctly after genuine resolution.

Recent work in IBD and rheumatology supports the relevance of wearable and longitudinal flare-related signal validation.

10.6.5 Cardiovascular Validation Pathway

The cardiovascular layer should be validated around: recovery-state monitoring, exertion-linked instability, autonomic pattern degradation, rhythm-related change, time-to-warning relative to later overt events or formal review.

Because cardiovascular remote monitoring already makes substantial use of wearables and physiological continuity, validation here should pay attention to: passive-data quality, adherence over longer intervals, recovery metrics, alert usefulness in avoiding noisy escalation.

Recent reviews of cardiovascular wearables and remote monitoring support this kind of evaluation focus.

10.6.6 Neurodegenerative and Functional-Decline Validation Pathway

The neurodegenerative layer should be validated around: passive functional drift, change in movement, sleep, speech, or behavioural routine, trajectory stability over longer timescales, time-to-recognition relative to later formal decline identification, burden and interpretability of continuous monitoring.

Because this domain may involve slower and subtler progression, validation must be especially careful about: long-window trajectory analysis, false positives from ordinary behavioural variation, whether passive data genuinely improve early recognition.

Recent digital-biomarker literature in neurodegeneration supports this focus on repeated functional and behavioural monitoring.

10.6.7 Same Metrics, Different Meaning

Some metrics will recur across domains: time-to-alert, false-alert rate, missed-event rate, calibration, adherence, lead-time gain.

But their meaning will differ by application.

For example: in oncology, a false escalation may trigger a costly workup; in metabolism, an early false watch state may be less burdensome; in inflammatory disease, recurrence timing may matter more than one-off severity; in neurodegeneration, long-term trajectory coherence may matter more than short-window sensitivity.

So the validation framework should retain shared metric categories while interpreting them within each domain's actual use context.

10.6.8 Validation Maturity Will Not Be Uniform

The paper should also state plainly that application layers will mature at different speeds.

A realistic path is that some domains validate earlier because: signals are denser, endpoints are clearer, remote measurements are better established.

Other domains may remain more experimental for longer because: early signals are sparse, signal origin is harder to interpret, downstream confirmation is more complex.

That does not weaken the shared engine. It simply defines a staged development path.

10.6.9 Section Conclusion

Domain-specific validation pathways are required because each application layer has different signals, constraints, event structures, and consequences of escalation. The engine can therefore be shared architecturally while still being validated separately for oncology, metabolism, inflammation, cardiovascular monitoring, and neurodegenerative decline. That is the only serious way to claim both generality and rigor.

10.7 Calibration, Error, and Safety Monitoring

A prognostic engine is not sufficient if it only produces plausible alerts. It must also remain calibrated, make errors in a controlled and visible way, and be monitored for safety over time. This is especially important in a longitudinal system, because the harm does not arise only from one wrong prediction. Harm can also arise from repeated false concern, delayed escalation, sticky overclassification, or unstable de-escalation. Current prediction-model guidance and digital-health evaluation work both stress that performance must be assessed beyond simple discrimination, including calibration, real-world reliability, and implementation safety.

10.7.1 Why Calibration Matters

Calibration asks whether the strength of the model's warning corresponds to the actual frequency or seriousness of the outcome it is meant to anticipate.

That matters because a system can discriminate moderately well and still be badly calibrated. In practice, that means: low-level concern may be overstated, high-level concern may be understated, escalation bands may not correspond to real-world risk, the user may lose trust because warning intensity feels unreliable.

For a longitudinal engine, calibration must apply not only to final outcomes, but also to state-transition behaviour and warning tiers. Recent work on evaluating predictive AI in medicine argues that models should be judged in relation to their intended use and not by narrow accuracy summaries alone.

10.7.2 Longitudinal Calibration Is More Than Probability Matching

In this system, calibration is not only about whether a probability estimate matches an endpoint rate. It also concerns whether the engine's internal levels behave proportionately across time.

The engine should therefore be calibrated on at least four fronts: state calibration (whether the assigned state matches the observed trajectory class), alert calibration (whether alert tiers correspond to meaningful later concern), time calibration (whether the model warns too early, too late, or proportionately), recovery calibration (whether de-escalation happens at the right stage of genuine resolution).

That is a broader and more appropriate calibration frame for a monitoring model than a one-time classifier metric.
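One of these fronts, alert calibration, can be checked directly: the observed rate of later-confirmed concern should rise with alert tier and sit near each band's nominal meaning. The sketch below is illustrative; the tier names and the toy history are assumptions.

```python
from collections import defaultdict

def tier_calibration(alerts):
    """Observed rate of later-confirmed concern per alert tier.
    Input: iterable of (tier_name, later_confirmed) pairs from
    historical alerts. A well-calibrated engine shows rates that
    increase monotonically with tier severity."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for tier, confirmed in alerts:
        totals[tier] += 1
        hits[tier] += int(confirmed)
    return {t: hits[t] / totals[t] for t in totals}

# Toy history: 10 watch-tier alerts (2 confirmed), 10 alert-tier (6 confirmed).
history = ([("watch", False)] * 8 + [("watch", True)] * 2
           + [("alert", False)] * 4 + [("alert", True)] * 6)
rates = tier_calibration(history)
```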

10.7.3 Error Must Be Typed, Not Just Counted

The engine's errors should not be treated as one undifferentiated bucket. They need to be typed, because different errors have different implications.

At minimum, the paper should distinguish: false watch (soft concern that does not progress), false escalation (active review recommendation without sufficient underlying deterioration), missed progression (real worsening that was not elevated appropriately), late escalation (eventual escalation that occurred too late for the engine's stated purpose), sticky state error (failure to de-escalate after genuine recovery), flicker error (unstable back-and-forth state movement without real structural change).

This is stronger than just reporting aggregate false positives and false negatives, because it reflects what the engine actually does over time.
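The error taxonomy can be made operational as a small classifier over observed trajectory facts. The rule set below is an illustrative sketch under assumed field names, not the paper's definitive error definitions, and the flicker type (which requires a sequence of states rather than a summary) is omitted here.

```python
def type_error(alerted, escalated, progressed, escalation_lag_days,
               recovered, de_escalated, max_lag_days=30):
    """Assign one typed error category from 10.7.3 to a trajectory summary.
    Returns None when the engine behaved acceptably. max_lag_days is a
    placeholder for the engine's stated timeliness requirement."""
    if escalated and not progressed:
        return "false_escalation"
    if alerted and not escalated and not progressed:
        return "false_watch"
    if progressed and not escalated:
        return "missed_progression"
    if progressed and escalated and escalation_lag_days > max_lag_days:
        return "late_escalation"
    if recovered and not de_escalated:
        return "sticky_state"
    return None

err = type_error(alerted=True, escalated=True, progressed=False,
                 escalation_lag_days=0, recovered=False, de_escalated=False)
```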

10.7.4 Safety Monitoring Must Be Ongoing

Because this is a live monitoring system, safety cannot be assessed once and forgotten. It needs ongoing monitoring after deployment or implementation testing.

Relevant safety signals include: rising false-alert burden, repeated user disengagement after alerts, worsening calibration drift across new populations, overrepresentation of errors in certain subgroups, systematic missed events in sparse-data users, unsafe persistence of elevated states without adequate review.

Recent FDA guidance on digital health technologies for remote data acquisition in clinical investigations emphasises fit-for-purpose use, usability, and appropriate handling of remotely acquired data, which supports the need for ongoing monitoring rather than one-time validation only.

10.7.5 Calibration Drift Must Be Expected

A person-centred engine that operates in changing real-world conditions should assume that calibration drift may occur.

Causes may include: different user populations, changing device characteristics, altered adherence patterns, signal-quality drift, new assay versions, changing context distributions over time.

So the system should be monitored for recalibration need, not treated as permanently stable after first validation. This is consistent with broader model-governance concerns addressed by PROBAST+AI and related prediction-model work, which emphasise quality, risk of bias, and applicability rather than assuming that one successful evaluation generalises indefinitely.

In practical deployment, calibration should therefore test not only overall warning strength but also interaction effects between signal layers and the behaviour of return-window tuning under live data conditions, so that super-additive convergence and recovery-lock failure modes can be identified early.

10.7.6 Safety Monitoring Should Include the User Layer

In this engine, safety is not only biomedical. It is also behavioural and interactional.

A user-facing safety framework should monitor whether: alerts create avoidable panic, repeated low-value notifications cause disengagement, interface burden leads to missing critical follow-up, users misunderstand watch states as diagnosis, privacy or control failures reduce participation.

This is consistent with digital-biomarker and patient-generated-data literature, which repeatedly links usability, trust, and implementation quality to real-world effectiveness and safety.

10.7.7 A Practical Safety Dashboard

For the paper, a useful safety-monitoring structure would track: calibration by state band, false-watch rate, false-escalation rate, missed-progression rate, late-escalation rate, de-escalation failure rate, user adherence after alert, subgroup performance drift, data-quality degradation rate.

This turns safety from a vague ethical statement into an operational part of the engine.
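Such a dashboard could be represented as a periodic snapshot with per-metric limits. The record below is a minimal sketch; the field subset, limit values, and the rule that adherence breaches downward are all illustrative assumptions.

```python
from dataclasses import dataclass

@dataclass
class SafetySnapshot:
    """One reporting-period snapshot of selected 10.7.7 safety metrics.
    All threshold values supplied to breaches() are placeholders."""
    false_watch_rate: float
    false_escalation_rate: float
    missed_progression_rate: float
    late_escalation_rate: float
    de_escalation_failure_rate: float
    post_alert_adherence: float

    def breaches(self, limits):
        """Return names of metrics crossing their configured limit.
        Adherence breaches by falling below; error rates by rising above."""
        out = []
        for name, limit in limits.items():
            value = getattr(self, name)
            worse = (value < limit if name == "post_alert_adherence"
                     else value > limit)
            if worse:
                out.append(name)
        return out

snap = SafetySnapshot(0.15, 0.02, 0.01, 0.03, 0.05, 0.6)
flagged = snap.breaches({"false_watch_rate": 0.10, "post_alert_adherence": 0.7})
```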

10.7.8 Section Conclusion

Calibration, error, and safety monitoring are core parts of the validation architecture. The engine must show not only that it can detect concern, but that its warning levels are proportionate, its errors are typed and monitored, and its live behaviour remains safe and stable under real-world use. That is the standard appropriate to a longitudinal prognostic system rather than a static classifier.

10.8 Governance and Reporting Discipline

A prognostic engine cannot be treated as valid simply because it performs well in one study or because its interface is persuasive. It requires governance and reporting discipline so that its claims, limits, update rules, and validation status remain visible and auditable across time.

The governing principle is: a longitudinal health engine must be governed as an evolving prediction system, with transparent reporting of what it is, what it has been validated for, how it changes, and where its limits remain.

This is consistent with current reporting and appraisal frameworks for clinical prediction models and AI. TRIPOD+AI exists to improve transparent reporting of prediction models, while PROBAST+AI addresses risk of bias and applicability in AI-based prediction studies.

10.8.1 Why Governance Matters Here

This system is not a static assay. It is a layered engine that: ingests repeated personal data, updates state across time, applies thresholds and escalation rules, may adapt through recalibration or application-layer refinement.

That means governance is not optional. Without it, users and evaluators cannot know: what version of the engine is operating, what signal layers are active, which application layers are validated, what thresholds are currently in use, whether the model has changed since prior evaluation.

For a longitudinal system, that opacity would be unacceptable.

10.8.2 Reporting Must Match Intended Use

The paper should state clearly that reporting must follow intended use.

The engine is not just a diagnostic classifier. It is a monitoring and prognostic system. Reporting therefore needs to include: intended context of use, target population, signal layers used, baseline and state definitions, alert and escalation rules, de-escalation and recovery logic, validation design, data gaps and limitations, current validation scope by domain.

That is the only way readers can judge whether the engine is being used within the bounds of its evidence. TRIPOD+AI is directly relevant here because it covers prediction models used for diagnostic and prognostic purposes, including AI-based systems.

10.8.3 Version Control and Change Control

Because the engine may evolve, governance must include explicit version control.

At minimum, each operational version should record: model version, signal configuration, weighting structure, threshold set, calibration date, validation status, approved application domains.

This matters because even small changes in weighting, thresholds, or data handling can materially alter behaviour.

A serious engine cannot behave as though updates are invisible.
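A minimal version record along the lines listed above might look as follows. The field types, the hash-style configuration references, and the example values are assumptions for illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class EngineVersion:
    """Immutable record of one operational engine version (10.8.3).
    Frozen so a released version cannot be silently mutated; any
    change requires issuing a new record."""
    model_version: str
    signal_configuration: str   # e.g. a hash of the signal-config file
    threshold_set: str          # e.g. a hash of the threshold file
    calibration_date: str
    validation_status: str
    approved_domains: tuple

    def allows(self, domain: str) -> bool:
        """Guard against out-of-domain use (see 10.8.6)."""
        return domain in self.approved_domains

v = EngineVersion("1.4.0", "cfg:ab12", "thr:cd34", "2026-03-01",
                  "retrospective", ("metabolic", "inflammatory"))
```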

10.8.4 Domain-Specific Claims Must Be Separated

The system may share one architecture across disease layers, but governance must prevent overgeneralised claims.

That means reports should separate clearly: what is architecturally proposed, what is analytically validated, what is retrospectively validated, what is prospectively validated, what is implemented in real-world use, which disease domains are covered at each level.

This prevents the common failure of presenting the strongest domain-specific evidence as though it applies equally everywhere.

10.8.5 User-Facing Transparency

Governance is not only for researchers or reviewers. The user-facing system should also preserve minimum transparency.

The user should be able to know: what kind of system they are using, what the current output means, whether it is a watch state or escalation state, whether the system is experimental, validated, or domain-limited, what happens to their data if they share it outward.

That is part of trustworthy operation, not a marketing extra.

10.8.6 Bias, Applicability, and Scope Control

Governance must also monitor whether the engine behaves differently across: data-rich and data-sparse users, different ages or health contexts, different device environments, different disease-application layers.

This is where risk-of-bias and applicability frameworks matter. PROBAST+AI was developed precisely because AI prediction models can look strong while still carrying major applicability or bias concerns.

So the governance layer should explicitly track: subgroup performance, missingness effects, calibration drift, application-scope breaches, out-of-domain use.

10.8.7 Incident and Override Logging

A serious prognostic engine should also record when its outputs are overridden, ignored, revised, or implicated in significant errors.

That includes: false-escalation review outcomes, missed-progression review outcomes, major recalibration triggers, user-reported harmful confusion, clinician or reviewer override patterns where applicable.

This creates a learning and accountability loop rather than a silent black box.

10.8.8 Minimum Reporting Package

For the paper, the cleanest governance proposal is a minimum reporting package attached to each application-layer release.

That package should include: engine version, intended use statement, signal list and weighting summary, threshold summary, validation stage, known limitations, safety-monitoring status, last calibration and review date.

This is simple enough to implement and strong enough to make the architecture auditable.

10.8.9 Section Conclusion

Governance and reporting discipline are necessary because this engine is an evolving longitudinal prediction system, not a one-time fixed test. Transparent reporting of version, scope, thresholds, validation stage, and limitations is required to keep the system auditable, trustworthy, and properly bounded to its evidence. That is the only serious basis on which a prognostic health engine can move from concept to credible deployment.

10.9 Chapter Conclusion

This chapter has established the validation architecture required for the prognostic engine. Because the system is longitudinal, it must be validated as a trajectory-reading model rather than a one-time classifier. That requires analytical validation of the signal layers, retrospective testing on real temporal sequences, prospective and real-world validation under live conditions, separate validation pathways for each disease application, and ongoing monitoring of calibration, error, and safety. Governance and reporting discipline then ensure that the engine's scope, changes, and limits remain visible over time.

The central conclusion is: a prognostic engine becomes credible only when its longitudinal logic, signal validity, real-world use, safety, and scope are all validated and reported in a disciplined way.

Chapter 11. Code and Deployment Specification

The previous chapters defined the scientific, user, and validation architecture of the system. This chapter defines how that architecture should be implemented in code and deployed in practice.

The central proposition is: the engine should be implemented as a transparent, modular, state-based system in which signal intake, baseline management, drift computation, state transition, alerting, and export are separated into auditable components.

That matters because the paper is not describing a vague concept. It is describing a buildable engine.

11.1 Purpose

This chapter establishes: the code architecture, the minimum data structures, the operational modules, the state engine interface, the deployment layers, the logging and governance hooks, the separation between core engine and application layers.

11.1.1 Chapter Function

The function of this chapter is to answer: how should the prognostic engine actually be written and deployed so that the theory remains auditable, extensible, and application-specific?

11.2 Design Principles

The codebase should follow six principles: modularity (each major function should live in a distinct component), traceability (every state change should be explainable from stored inputs), configurability (disease-layer weighting and thresholds should not be hard-coded into the core), time-awareness (sequence and window logic must be first-class objects), safety discipline (alerting and escalation should pass through explicit gates), auditability (outputs, overrides, and versioned changes must be recordable).

The core design rule is: the general engine remains stable, while application layers are attached through configuration rather than rewriting the engine itself.

11.3 Core Data Structures

A minimum internal object set is:

11.3.1 Input Event

An input event represents one incoming observation. Fields: event_id, user_id, timestamp, layer, signal_name, value, unit, quality_flag, context_tags, source_type.

A compact representation is: InputEvent = (u, t, L, n, v, q, c, s), with u the user, t the timestamp, L the layer, n the signal name, v the value, q the quality flag, c the context tags, and s the source type.
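A direct sketch of this structure with the full named fields follows; the concrete types and example values are assumptions.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class InputEvent:
    """One incoming observation (11.3.1). The compact tuple form
    (u, t, L, n, v, q, c, s) maps onto the commented fields below."""
    event_id: str
    user_id: str          # u
    timestamp: datetime   # t
    layer: str            # L
    signal_name: str      # n
    value: float          # v
    unit: str
    quality_flag: str     # q
    context_tags: tuple   # c
    source_type: str      # s

e = InputEvent("ev-001", "u-42", datetime(2026, 4, 18, 8, 0),
               "metabolic", "fasting_glucose", 5.4, "mmol/L",
               "ok", ("fasted",), "home_assay")
```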

11.3.2 Baseline Record

A baseline record stores personal corridor information for one signal. Fields: user_id, signal_name, rolling_mean, rolling_variance, last_update_time, confidence_level, stable_window_count.

11.3.3 Window Summary

A window summary stores aggregated information for a defined time window. Fields: user_id, window_start, window_end, signal_summaries, drift_score, persistence_score, convergence_score, recovery_score, state_posterior.

11.3.4 State Record

A state record stores the operative state and surrounding evidence. Fields: user_id, timestamp, prior_state, current_state, state_distribution, drift_band, alert_status, escalation_status, rationale_summary.

11.3.5 Review Export

A review export stores the packet for onward sharing. Fields: user_id, export_time, recent_state_path, contributing_layers, persistence_summary, convergence_summary, recovery_summary, context_summary, version_stamp.

11.4 Engine Modules

The engine should be split into the following modules.

11.4.1 Intake Module

Function: receive incoming signals, validate structure, attach metadata, route to storage and normalisation.

11.4.2 Baseline Module

Function: maintain personal baselines, update corridor estimates during stable periods, preserve confidence in baseline quality, protect against pathological reset during active drift.
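The reset-protection behaviour can be sketched as a guarded exponential update: the corridor moves only when a new value lies inside the current corridor, so sustained drift cannot pull the baseline along with it. The smoothing rate, corridor width, and update rule below are illustrative assumptions, not the engine's specified estimator.

```python
def update_baseline(mean, var, value, alpha=0.05, k=3.0):
    """Update a personal baseline corridor (11.4.2).
    The update is skipped when value lies outside mean +/- k*sd,
    protecting the baseline against pathological reset during
    active drift. Returns (mean, var, was_updated)."""
    sd = var ** 0.5
    if abs(value - mean) > k * sd:
        return mean, var, False          # frozen: treat value as drift
    new_mean = (1 - alpha) * mean + alpha * value
    new_var = (1 - alpha) * var + alpha * (value - new_mean) ** 2
    return new_mean, new_var, True

m, v, updated = update_baseline(5.0, 0.04, 5.1)      # inside corridor
m2, v2, updated2 = update_baseline(5.0, 0.04, 7.5)   # far outside corridor
```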

11.4.3 Normalisation Module

Function: convert raw input into baseline-relative form, preserve direction, attach confidence adjustments, support layer-specific normalisation logic.

11.4.4 Drift Module

Function: compute signal-level deviation, compute layer-level deviation, aggregate weighted drift score, preserve directional drift information.

11.4.5 Persistence Module

Function: compute recurrence across windows, track recovery failure, produce persistence bands, support signal-level and layer-level persistence.

11.4.6 Convergence Module

Function: detect compatible cross-layer abnormality, compute convergence score, preserve weighted reinforcement logic.

11.4.7 State Module

Function: update posterior state distribution, apply transition rules, preserve hysteresis, determine operative state.

11.4.8 Alert Module

Function: apply watch, alert, and escalation thresholds, suppress duplicate low-value alerts, bind user-facing action to state.

11.4.9 Export and Sharing Module

Function: produce user-readable summaries, produce structured escalation exports, apply permissioning rules.

11.4.10 Audit Module

Function: log state changes, log overrides, log recalibration events, preserve version and threshold history.

11.5 State Engine Logic in Code

The core update logic can be expressed as: \(X(t+1) = T[X(t), D(t), P(t), C(t), R(t), H(t), B(t)]\)

In implementation terms, the engine should perform this sequence:

1. receive new events
2. normalise against personal baseline
3. update window summary
4. compute drift
5. compute persistence
6. compute convergence
7. update state distribution
8. apply threshold gates
9. generate user-facing result
10. log the transition
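The ten-step sequence can be sketched as a single update function; the arithmetic inside each step is a deliberately simplified stand-in for the real modules of 11.4, and all thresholds are placeholders.

```python
def engine_step(events, baseline, window, state, thresholds, log):
    """One pass of the 11.5 update loop; step numbers in comments.
    Stub computations stand in for the real modules."""
    normalised = [(e - baseline["mean"]) / baseline["sd"]
                  for e in events]                                  # 1-2
    window = window + normalised                                    # 3
    drift = sum(window) / max(len(window), 1)                       # 4
    persistence = sum(1 for z in window if abs(z) > 1)              # 5
    convergence = 0.0                            # 6: single-layer stub
    score = drift + 0.1 * persistence + convergence                 # 7
    if score > thresholds["escalate"]:                              # 8
        new_state = "escalate"
    elif score > thresholds["watch"]:
        new_state = "watch"
    else:
        new_state = "stable"
    log.append((state, new_state, round(score, 3)))                 # 9-10
    return new_state, window

log = []
state, window = engine_step([5.9, 6.1], {"mean": 5.0, "sd": 0.2},
                            [], "stable",
                            {"watch": 2.0, "escalate": 6.0}, log)
```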

11.6 Application-Layer Configuration

The engine should not embed disease-specific logic for cancer, metabolic, inflammatory, cardiovascular, or neurodegenerative applications directly into the core.

Instead, each application layer should provide a configuration object containing: active signal set, signal weights, layer weights, drift thresholds, persistence thresholds, convergence thresholds, escalation thresholds, review export template, validation status flag.
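A configuration object of this kind might be expressed as a frozen dataclass, so that a layer's thresholds and weights are versionable and immutable at runtime. Field names and example values below are illustrative assumptions, not a fixed schema:

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass(frozen=True)
class ApplicationLayerConfig:
    """One application layer's configuration object.

    The core engine reads these values; it never hard-codes them.
    """
    layer_name: str
    active_signals: List[str]
    signal_weights: Dict[str, float]
    layer_weights: Dict[str, float]
    drift_threshold: float
    persistence_threshold: float
    convergence_threshold: float
    escalation_threshold: float
    review_export_template: str
    validated: bool = False  # validation status flag

# Hypothetical metabolic layer, with invented signal names and weights.
metabolic = ApplicationLayerConfig(
    layer_name="metabolic",
    active_signals=["fasting_glucose", "hba1c", "triglycerides"],
    signal_weights={"fasting_glucose": 1.0, "hba1c": 1.5, "triglycerides": 0.8},
    layer_weights={"blood": 1.0, "wearable": 0.6},
    drift_threshold=1.8,
    persistence_threshold=0.5,
    convergence_threshold=0.3,
    escalation_threshold=2.4,
    review_export_template="metabolic_review_v1",
    validated=False,
)
```

Freezing the dataclass means a running engine cannot mutate its thresholds mid-session; any change must arrive as a new versioned configuration, which keeps threshold history auditable.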

11.7 Deployment Architecture

The deployment stack should remain simple and separable.

11.7.1 Client Layer

The client layer handles: user interface, symptom and context entry, wearable sync, sample logging, results display, permission controls.

11.7.2 Engine Layer

The engine layer handles: intake, normalisation, scoring, state updates, alerting, export generation.

11.7.3 Storage Layer

The storage layer handles: raw events, baseline records, window summaries, state history, audit logs, permission records.

11.7.4 Governance Layer

The governance layer handles: model version control, threshold versioning, validation metadata, safety monitoring, override and incident records.

11.8 Logging, Audit, and Safety Hooks

Every meaningful internal transition should be loggable.

At minimum, the system should log: incoming event batches, baseline updates, state transitions, alert generation, escalation generation, de-escalation events, manual overrides, config changes, recalibration events.

Each log should include: timestamp, engine version, application layer, affected user, relevant threshold set, rationale summary.

This is necessary for: debugging, validation review, safety monitoring, governance compliance.
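One way to meet these logging requirements is an append-only JSON-lines format, where every entry carries the minimum fields listed above. A hedged sketch; field names and event-type labels are assumptions, not a fixed schema:

```python
import json
from dataclasses import dataclass, asdict
from datetime import datetime, timezone

@dataclass
class AuditLogEntry:
    """Minimum fields named in 11.8, serialised one JSON object per
    line so logs stay greppable and append-only."""
    timestamp: str
    engine_version: str
    application_layer: str
    user_id: str
    event_type: str        # e.g. "state_transition", "override", "recalibration"
    threshold_set: str     # version label of the thresholds in force
    rationale_summary: str

def log_line(entry: AuditLogEntry) -> str:
    # sort_keys makes entries byte-stable for diffing and hashing.
    return json.dumps(asdict(entry), sort_keys=True)

entry = AuditLogEntry(
    timestamp=datetime(2026, 4, 18, tzinfo=timezone.utc).isoformat(),
    engine_version="0.1.0",
    application_layer="metabolic",
    user_id="user-001",
    event_type="state_transition",
    threshold_set="thresholds-v3",
    rationale_summary="drift persisted across 4 of 5 windows",
)
```

Because each line is self-describing, a validation reviewer can reconstruct why a transition fired without access to the live engine state.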

11.9 Chapter Conclusion

This chapter has defined the code and deployment structure of the prognostic engine. The system should be implemented as a modular state-based architecture with clear separation between intake, baseline management, signal processing, state updating, alerting, export, and governance. The core engine remains general, while disease-specific behaviour is introduced through configuration, not through rewriting the engine. That makes the system auditable, extensible, and deployable without collapsing the theory into a black box.

Chapter 12. Dataset Register

The engine requires a dataset register because a longitudinal prognostic system cannot be justified by architecture alone. It needs real data sources for baseline construction, retrospective back-testing, analytical signal work, and domain-specific validation.

The purpose of this chapter is to define the public and established data infrastructures that can support development and testing of the engine.

The governing principle is: datasets should be selected according to what part of the engine they validate—signal layer, temporal trajectory, disease application, or real-world monitoring workflow.

12.1 Purpose

This chapter establishes: which datasets support core model development, which datasets support domain-specific application layers, which datasets support remote, longitudinal, or multimodal evaluation, what each dataset can and cannot contribute to the engine.

12.1.1 Chapter Function

The function of this chapter is to answer: what real datasets can support development, back-testing, and validation of the engine, and what role should each one play?

12.2 Core Longitudinal and Multimodal Datasets

12.2.1 UK Biobank

UK Biobank is one of the strongest dataset anchors for this paper because it combines large-scale biomarker data with genetic data and linked healthcare records. Official UK Biobank pages state that its data types include biomarker data, genetic data, healthcare records, questionnaire data, physical measurements, demographic and lifestyle data, and environmental data, and that healthcare records include cancer data among other linked records.

For this engine, UK Biobank is particularly useful for: retrospective baseline and drift analysis, longitudinal health-record linkage, inherited-risk and genetic stratification, cancer, metabolic, cardiovascular, and broader population health applications.

It is especially strong where the paper needs a large-scale multimodal cohort rather than an acute-care or single-disease dataset.

12.2.2 All of Us Research Program

All of Us is another major core dataset because it explicitly combines surveys, electronic health records, physical measurements, genomic analyses, and wearables. Official All of Us sources state that researchers can access data from surveys, genomic analyses, electronic health records, physical measurements, and wearables, and that data from surveys, EHRs, blood and urine tests, and fitness devices may be collected over time.

For this engine, All of Us is especially useful for: multimodal longitudinal modelling, user-centred monitoring architecture, wearable-linked and survey-linked personal trajectories, application-layer prototyping where diverse real-world health data matter.

It is one of the best dataset matches for the paper's broader thesis that prognosis should be personal, continuous, and multimodal.

12.3 Population Biomonitoring and Exposure Datasets

12.3.1 NHANES / CDC Biomonitoring Data

For population-level baseline and exposure context, NHANES-linked CDC biomonitoring data are highly useful. CDC states that the National Exposure Report summarizes biomonitoring measurements made using NHANES samples and that blood, serum, and urine samples are analyzed for environmental chemicals. CDC also notes that these data are nationally representative for the U.S. population.

For the engine, NHANES and CDC biomonitoring resources are useful for: population reference context, exposure-related signal interpretation, environmental and behavioural burden calibration, secondary comparison against broad population distributions.

These datasets are not enough by themselves for the full drift engine, but they are strong for contextual and biomonitoring layers.

12.4 Clinical and High-Frequency Event Datasets

12.4.1 MIMIC-IV

MIMIC-IV is a strong resource for high-frequency clinical event trajectories and real-world hospital-based time series. PhysioNet states that MIMIC-IV is sourced from two in-hospital database systems: a hospital-wide EHR and an ICU-specific clinical information system. PhysioNet also provides related MIMIC-IV waveform and ECG resources that can extend high-frequency physiologic analysis.

For this engine, MIMIC-IV is useful for: acute deterioration modelling, recovery-state and event-trajectory analysis, high-frequency physiologic sequence testing, validating state-transition behaviour in clinically rich settings.

It is less suited to lifelong community monitoring than UK Biobank or All of Us, but it is very useful for testing temporal state logic under dense clinical observation.

12.5 Cancer-Specific Molecular and Proteomic Datasets

12.5.1 TCGA / NCI Cancer Genomics Resources

For the cancer application layer, TCGA remains a major molecular resource. NCI states that TCGA molecularly characterized more than 20,000 primary cancer and matched normal samples spanning 33 cancer types.

For this paper, TCGA is useful for: cancer-application signal design, subtype-aware oncology modelling, molecular pattern exploration, informing cancer-layer feature selection.

It is not, by itself, a full longitudinal personal-monitoring dataset, but it is highly relevant for cancer application-layer structure.

12.5.2 CPTAC / Related Proteomic Resources

Cancer proteomic resources, including CPTAC-linked infrastructure, are useful for extending the cancer layer beyond nucleic-acid patterns into protein-level interpretation. This supports the chapter's multi-analyte logic for the oncology application.

For this engine, proteomic resources are useful for: cancer-layer feature enrichment, multi-analyte surveillance design, reinforcement across molecular classes.

These resources are better treated as application-layer enrichment datasets rather than universal baseline datasets.

12.6 Recommended Use by Chapter and Validation Stage

A clean mapping for the paper is:

Chapters 1–3: Concept and User Loop – Use All of Us, UK Biobank. These best support the claim that longitudinal, multimodal, and personal data architectures already exist in large-scale form.

Chapters 4–6: State Logic and Engine Development – Use UK Biobank, All of Us, MIMIC-IV. These support both general longitudinal structure and higher-frequency sequence testing.

Chapter 7: Cancer Application Layer – Use UK Biobank, TCGA, CPTAC or related proteomic resources. These support inherited risk, healthcare-record linkage, and cancer-specific molecular layer design.

Chapter 8: Multi-Disease Expansion – Use All of Us, UK Biobank, NHANES / CDC Biomonitoring, MIMIC-IV. These support broader disease-domain coverage, population context, and real temporal trajectories.

Chapter 10: Validation Architecture – Use UK Biobank, All of Us, MIMIC-IV, application-specific datasets per domain. These are strongest for retrospective and staged prospective design planning.
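For development tooling, the mapping above could also be held in machine-readable form so that back-testing scripts can resolve which datasets a given stage draws on. A hypothetical encoding; the stage keys and the `datasets_for` helper are illustrative:

```python
# Machine-readable form of the chapter-to-dataset mapping above.
DATASET_REGISTER = {
    "ch1-3_concept_user_loop": ["All of Us", "UK Biobank"],
    "ch4-6_state_logic": ["UK Biobank", "All of Us", "MIMIC-IV"],
    "ch7_cancer_layer": ["UK Biobank", "TCGA", "CPTAC"],
    "ch8_multi_disease": ["All of Us", "UK Biobank", "NHANES/CDC", "MIMIC-IV"],
    "ch10_validation": ["UK Biobank", "All of Us", "MIMIC-IV"],
}

def datasets_for(stage: str) -> list[str]:
    """Return the registered datasets for a development stage, or []."""
    return DATASET_REGISTER.get(stage, [])
```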

12.7 Chapter Conclusion

This chapter has defined the dataset register for the prognostic engine. UK Biobank and All of Us are the strongest general anchors because they provide multimodal, longitudinal, and linked health data. NHANES and CDC biomonitoring resources support population and exposure context. MIMIC-IV supports dense temporal event modelling in clinical settings. Cancer-specific molecular and proteomic datasets support the oncology application layer. Together, these resources provide the empirical base for development, back-testing, and staged validation of the engine, while also making clear that no single dataset is sufficient for the full architecture.

Chapter 13. Limits, Error, and Disclosure Boundary

A system of this kind is only credible if it states its own boundary conditions clearly. The paper has built a longitudinal prognostic architecture for pre-event health monitoring, but that architecture does not justify unlimited claims. It must define what it does, what it does not do, how error is handled, and where the disclosure line sits between early warning and overstatement.

The central proposition of this chapter is: the engine should be presented as a disciplined pre-event surveillance and escalation system, not as an unrestricted diagnostic authority.

That boundary is consistent with current prediction-model and digital-health guidance. TRIPOD+AI explicitly covers prediction models used for diagnostic, prognostic, monitoring, and screening purposes, which reinforces the need to state intended use precisely rather than blur categories. FDA guidance on digital health technologies for remote data acquisition likewise emphasizes fit-for-purpose use and appropriate handling of remotely acquired data in the intended context.

13.1 Purpose

This chapter establishes: the outer limits of the engine's claims, the main error classes that must be disclosed, the distinction between surveillance, alerting, and diagnosis, the conditions under which outputs should and should not be interpreted strongly, the proper disclosure boundary for publication and deployment.

13.1.1 Chapter Function

The function of this chapter is to prevent the engine from being misunderstood.

It answers: what exactly is being claimed here, and where must the claim stop?

13.2 Scope of the Claim

The strongest legitimate claim of the paper is this: the engine can organise repeated personal data into a longitudinal early-warning system that detects structured drift earlier than episodic recognition alone in at least some domains, subject to signal quality, validation status, and application-layer maturity.

That is a serious claim. It is also bounded.

The paper should not claim that the engine universally detects all disease from first principles, that it eliminates uncertainty, or that it replaces all downstream diagnostic pathways.

13.2.1 What the Engine Is

The engine is: a personal baseline system, a repeated-measure drift interpreter, a state-transition monitor, an alert and escalation architecture, a user-owned continuity model.

It is strongest when used as a surveillance and organisation layer for earlier concern detection.

13.2.2 What the Engine Is Not

The engine is not: a universal one-step diagnostic oracle, a replacement for every external confirmatory pathway, proof that all diseases are equally visible early, proof that every alert corresponds to a named condition, proof that inherited risk equals present disease.

These exclusions are not weaknesses. They are the limits that keep the model serious.

13.2.3 Domain Limits Still Apply

The scope of the claim also depends on application layer.

For example: inherited cancer susceptibility may be identifiable in a subset of persons, but NCI states that only about 5% to 10% of all cancers are due to inherited harmful genetic changes, so universal birth-level direct cancer detection is not a serious claim.

Digital biomarkers and remote-monitoring layers may support long-term, user-friendly monitoring, but current literature still emphasizes challenges in moving many such biomarkers into reliable routine use.

So the scope of the claim must remain calibrated to the maturity of each signal layer and disease application.

13.3 Error Classes and Failure Modes

A serious disclosure boundary requires explicit error classes.

The engine can fail in several ways: false watch (the system monitors a pattern that does not progress), false escalation (the system recommends review without sufficient later support), missed progression (the system under-reads a real forming concern), late escalation (the system escalates, but too late for its stated purpose), sticky classification (the system remains elevated after true recovery), flicker instability (the system oscillates between neighbouring states without real structural change), domain misassignment (the engine detects drift but attributes the concern corridor too narrowly or too early).

These errors should not be hidden behind overall performance summaries. They are part of the model's real operating boundary.
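To keep these error classes countable in validation reports and audit logs, they can be given stable codes. A minimal sketch; the enum name and value strings are assumptions:

```python
from enum import Enum

class FailureMode(str, Enum):
    """The seven disclosed error classes from 13.3, as stable codes so
    audit logs and validation reports can count them consistently."""
    FALSE_WATCH = "false_watch"
    FALSE_ESCALATION = "false_escalation"
    MISSED_PROGRESSION = "missed_progression"
    LATE_ESCALATION = "late_escalation"
    STICKY_CLASSIFICATION = "sticky_classification"
    FLICKER_INSTABILITY = "flicker_instability"
    DOMAIN_MISASSIGNMENT = "domain_misassignment"
```

Reporting error counts per `FailureMode` rather than as one blended accuracy figure is exactly the disclosure discipline this section argues for.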

13.3.1 Signal Failure Modes

The engine can also fail because its inputs fail.

Examples include: weak or unvalidated remote assays, noisy wearable data, incomplete self-report, missing context, sparse adherence, calibration drift over time.

FDA guidance on remote digital health technologies and recent analytical-validation literature both support the point that fit-for-purpose signal performance is a prerequisite for trustworthy downstream interpretation.

13.4 What the Engine Does Not Prove

This section should be explicit, because many misunderstandings arise when monitoring systems are overread as diagnostic finality.

The engine does not prove: that a named disease is present from one weak signal, that a high-risk inherited background means inevitable disease, that every domain can be monitored equally well from home, that remote digital biomarkers are mature to the same degree across all applications, that a positive surveillance pattern identifies exact tissue origin or exact pathology without downstream confirmation.

This is particularly important in the cancer application. Early-detection and inherited-risk science support surveillance and risk mapping, but they do not justify blanket claims of universal direct cancer detection from birth.

13.4.1 Surveillance Is Not the Same as Diagnosis

The paper should preserve this distinction all the way through: surveillance = repeated monitoring for structured concern; alert = the pattern has become meaningful enough to flag; escalation = the pattern has become strong enough to justify active review; diagnosis = downstream confirmation beyond the surveillance layer.

That boundary is part of the scientific honesty of the paper.
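This four-level distinction can be encoded so that the boundary is machine-enforceable: the engine may move a user up to escalation, while diagnosis remains external to it. A sketch with illustrative names:

```python
from enum import IntEnum

class ConcernLevel(IntEnum):
    """Ordered so comparisons encode the boundary from 13.4.1."""
    SURVEILLANCE = 1   # repeated monitoring for structured concern
    ALERT = 2          # the pattern has become meaningful enough to flag
    ESCALATION = 3     # strong enough to justify active review
    DIAGNOSIS = 4      # downstream confirmation only; never engine-asserted

# The highest level the engine itself is ever permitted to output.
ENGINE_CEILING = ConcernLevel.ESCALATION

def engine_may_output(level: ConcernLevel) -> bool:
    return level <= ENGINE_CEILING
```

Guarding outputs with a check of this kind turns the scientific boundary into a structural invariant rather than a documentation promise.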

13.5 Disclosure Boundary in User-Facing Operation

The user-facing system must disclose enough to preserve trust without overstating certainty.

That means outputs should state: current state, current trend, main contributing layers, whether the state is watch, drift, escalation, or recovery, what action is appropriate now, whether the relevant application layer is fully validated or still limited in scope.

The app should not speak as though uncertainty has disappeared.

Suitable language remains: repeated deviation is being monitored, drift has strengthened, escalation is recommended, recovery is occurring but not yet stable.

This is stronger than either silence or alarmism.

13.5.1 Disclosure of Validation Status

Where application layers are at different maturity levels, the system should disclose that directly.

For example: validated for metabolic drift monitoring, under limited evaluation for oncology-oriented surveillance, not yet validated for autonomous disease naming.

That is important because shared architecture does not mean shared maturity.

13.6 Disclosure Boundary in Scientific Reporting

The scientific paper should also separate clearly: architectural proposal, analytical validation, retrospective validation, prospective validation, implemented use.

TRIPOD+AI exists precisely to improve transparent reporting of model development, validation, and updating across diagnostic, prognostic, monitoring, and screening applications.

So the paper should not collapse conceptual elegance into evidential completion.

13.6.1 Minimum Disclosure Package

For each application layer, the minimum disclosure package should include: intended use, active signal layers, validation stage, main thresholds or threshold logic, known failure modes, data-source limitations, whether outputs are surveillance-only or linked to a live escalation pathway.

This keeps the model publishable without becoming inflated.
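The package could be represented as a simple structured record so that no application layer ships without its limits stated. Field names and the `is_publishable` rule below are illustrative assumptions:

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class DisclosurePackage:
    """Minimum disclosure fields from 13.6.1 for one application layer."""
    intended_use: str
    active_signal_layers: List[str]
    validation_stage: str          # e.g. "analytical", "retrospective", "prospective"
    threshold_logic: Dict[str, float]
    known_failure_modes: List[str]
    data_source_limitations: List[str]
    surveillance_only: bool = True # False only if a live escalation pathway exists

    def is_publishable(self) -> bool:
        # A package that declares no failure modes or no data limits is
        # suspicious, not strong: empty disclosure blocks publication.
        return bool(self.known_failure_modes) and bool(self.data_source_limitations)
```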

13.7 Final Chapter Conclusion

This chapter has set the final boundary of the paper. The engine should be presented as a disciplined longitudinal surveillance and escalation architecture that detects structured drift from repeated personal data and supports earlier, better-organised concern recognition. It should not be presented as a universal diagnostic authority, nor as proof that all disease classes are equally detectable from the earliest stage. Its claims must remain bounded by signal validity, domain-specific validation, and the distinction between surveillance, alerting, escalation, and diagnosis.

Appendix A. Core Prognostic Engine Code

A.1 Purpose

This appendix provides the reference implementation for the general prognostic engine described in Chapters 4 to 6 and Chapter 11.

It includes: data structures, baseline handling, signal normalisation, drift scoring, persistence logic, convergence logic, recovery logic, state update logic, alert and escalation logic, audit logging, review export logic.

The implementation is written in Python for clarity and portability.

A.2 File Structure

appendix_a_core_engine/

├── schema.py
├── baseline.py
├── normalization.py
├── drift.py
├── persistence.py
├── convergence.py
├── recovery.py
├── state_engine.py
├── alerts.py
├── audit.py
├── exports.py
└── engine.py

A.3 schema.py

from __future__ import annotations
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional, Any
from datetime import datetime

class SignalLayer(str, Enum):
    BLOOD = "blood"
    WEARABLE = "wearable"
    SYMPTOM = "symptom"
    EXPOSURE = "exposure"
    HISTORY = "history"
    FAMILY = "family"

class HealthState(str, Enum):
    STABLE = "Stable"
    SUSCEPTIBLE = "Susceptible"
    SOFT_DEVIATION = "SoftDeviation"
    PERSISTENT_DIVERGENCE = "PersistentDivergence"
    ESCALATION_CANDIDATE = "EscalationCandidate"
    CLINICAL_REVIEW = "ClinicalReview"

class UserFacingState(str, Enum):
    STABLE = "Stable"
    WATCH = "Watch"
    DRIFT_DETECTED = "DriftDetected"
    ESCALATION_RECOMMENDED = "EscalationRecommended"
    UNDER_REVIEW = "UnderReview"
    RECOVERING = "Recovering"

@dataclass
class InputEvent:
    event_id: str
    user_id: str
    timestamp: datetime
    layer: SignalLayer
    signal_name: str
    value: float
    unit: str = ""
    quality_flag: float = 1.0
    context_tags: Dict[str, Any] = field(default_factory=dict)
    source_type: str = "manual"

@dataclass
class BaselineRecord:
    user_id: str
    signal_name: str
    rolling_mean: float
    rolling_variance: float
    last_update_time: datetime
    confidence_level: float = 1.0
    stable_window_count: int = 0
    observations_seen: int = 0

@dataclass
class NormalizedSignal:
    user_id: str
    timestamp: datetime
    layer: SignalLayer
    signal_name: str
    raw_value: float
    z_score: float
    direction: int
    confidence: float
    context_tags: Dict[str, Any] = field(default_factory=dict)

@dataclass
class WindowSummary:
    user_id: str
    window_start: datetime
    window_end: datetime
    normalized_signals: List[NormalizedSignal]
    layer_scores: Dict[str, float] = field(default_factory=dict)
    drift_score: float = 0.0
    directional_drift_score: float = 0.0
    persistence_score: float = 0.0
    convergence_score: float = 0.0
    recovery_score: float = 0.0
    state_distribution: Dict[str, float] = field(default_factory=dict)

@dataclass
class StateRecord:
    user_id: str
    timestamp: datetime
    prior_state: HealthState
    current_state: HealthState
    state_distribution: Dict[str, float]
    drift_band: int
    alert_status: bool
    escalation_status: bool
    recovering: bool = False
    rationale_summary: str = ""

@dataclass
class ReviewExport:
    user_id: str
    export_time: datetime
    recent_state_path: List[str]
    contributing_layers: Dict[str, float]
    persistence_summary: float
    convergence_summary: float
    recovery_summary: float
    context_summary: Dict[str, Any]
    version_stamp: str

A.4 baseline.py

from __future__ import annotations
from dataclasses import replace
from typing import Dict
from datetime import datetime
from schema import BaselineRecord

EPS = 1e-9

class BaselineManager:
    """
    Maintains rolling personal baselines.
    Baselines should only be updated aggressively during stable periods.
    """
    def __init__(self, alpha: float = 0.05):
        self.alpha = alpha
        self.records: Dict[tuple[str, str], BaselineRecord] = {}

    def get(self, user_id: str, signal_name: str) -> BaselineRecord | None:
        return self.records.get((user_id, signal_name))

    def bootstrap(
        self,
        user_id: str,
        signal_name: str,
        initial_mean: float,
        initial_variance: float,
        timestamp: datetime,
        confidence_level: float = 0.5,
    ) -> BaselineRecord:
        record = BaselineRecord(
            user_id=user_id,
            signal_name=signal_name,
            rolling_mean=initial_mean,
            rolling_variance=max(initial_variance, EPS),
            last_update_time=timestamp,
            confidence_level=confidence_level,
            stable_window_count=0,
            observations_seen=1,
        )
        self.records[(user_id, signal_name)] = record
        return record

    def update_stable(
        self,
        user_id: str,
        signal_name: str,
        value: float,
        timestamp: datetime,
    ) -> BaselineRecord:
        key = (user_id, signal_name)
        record = self.records.get(key)
        if record is None:
            return self.bootstrap(
                user_id=user_id,
                signal_name=signal_name,
                initial_mean=value,
                initial_variance=1.0,
                timestamp=timestamp,
                confidence_level=0.3,
            )
        alpha = self.alpha
        old_mean = record.rolling_mean
        new_mean = (1 - alpha) * old_mean + alpha * value
        squared_dev = (value - new_mean) ** 2
        new_var = max((1 - alpha) * record.rolling_variance + alpha * squared_dev, EPS)
        updated = replace(
            record,
            rolling_mean=new_mean,
            rolling_variance=new_var,
            last_update_time=timestamp,
            stable_window_count=record.stable_window_count + 1,
            observations_seen=record.observations_seen + 1,
            confidence_level=min(1.0, record.confidence_level + 0.01),
        )
        self.records[key] = updated
        return updated

    def freeze(self, user_id: str, signal_name: str) -> None:
        """
        Placeholder for future baseline freeze logic.
        Useful when strong drift should not contaminate the ordinary corridor.
        """
        _ = (user_id, signal_name)

A.5 normalization.py

from __future__ import annotations
from typing import Iterable, List
from math import sqrt
from schema import InputEvent, NormalizedSignal
from baseline import BaselineManager, EPS

class SignalNormalizer:
    def __init__(self, baseline_manager: BaselineManager):
        self.baseline_manager = baseline_manager

    def normalize_event(self, event: InputEvent) -> NormalizedSignal:
        record = self.baseline_manager.get(event.user_id, event.signal_name)
        if record is None:
            record = self.baseline_manager.bootstrap(
                user_id=event.user_id,
                signal_name=event.signal_name,
                initial_mean=event.value,
                initial_variance=1.0,
                timestamp=event.timestamp,
                confidence_level=0.2,
            )
        sigma = max(sqrt(record.rolling_variance), EPS)
        z_score = (event.value - record.rolling_mean) / sigma
        if z_score > 0:
            direction = 1
        elif z_score < 0:
            direction = -1
        else:
            direction = 0
        confidence = max(0.0, min(1.0, event.quality_flag * record.confidence_level))
        return NormalizedSignal(
            user_id=event.user_id,
            timestamp=event.timestamp,
            layer=event.layer,
            signal_name=event.signal_name,
            raw_value=event.value,
            z_score=z_score,
            direction=direction,
            confidence=confidence,
            context_tags=event.context_tags,
        )

    def normalize_events(self, events: Iterable[InputEvent]) -> List[NormalizedSignal]:
        return [self.normalize_event(e) for e in events]

A.6 drift.py

from __future__ import annotations
from collections import defaultdict
from typing import Dict, List
from schema import NormalizedSignal

EPS = 1e-9

class DriftCalculator:
    def __init__(self, signal_weights: Dict[str, float], layer_weights: Dict[str, float]):
        self.signal_weights = signal_weights
        self.layer_weights = layer_weights

    def compute_signal_contribution(self, signal: NormalizedSignal) -> float:
        weight = self.signal_weights.get(signal.signal_name, 1.0)
        return weight * signal.confidence * abs(signal.z_score)

    def compute_layer_scores(self, signals: List[NormalizedSignal]) -> Dict[str, float]:
        grouped_scores = defaultdict(list)
        grouped_weights = defaultdict(list)
        for s in signals:
            layer = s.layer.value
            signal_weight = self.signal_weights.get(s.signal_name, 1.0) * s.confidence
            grouped_scores[layer].append(abs(s.z_score) * signal_weight)
            grouped_weights[layer].append(signal_weight)
        result: Dict[str, float] = {}
        for layer, scores in grouped_scores.items():
            weight_sum = sum(grouped_weights[layer])
            result[layer] = sum(scores) / max(weight_sum, EPS)
        return result

    def compute_drift(self, signals: List[NormalizedSignal]) -> tuple[float, float, Dict[str, float]]:
        weighted_sum = 0.0
        directional_sum = 0.0
        total_weight = 0.0
        for s in signals:
            w = self.signal_weights.get(s.signal_name, 1.0) * s.confidence
            weighted_sum += abs(s.z_score) * w
            directional_sum += s.direction * abs(s.z_score) * w
            total_weight += w
        drift_score = weighted_sum / max(total_weight, EPS)
        directional_drift_score = directional_sum / max(total_weight, EPS)
        layer_scores = self.compute_layer_scores(signals)
        return drift_score, directional_drift_score, layer_scores

    @staticmethod
    def drift_band(drift_score: float) -> int:
        if drift_score < 1.0:
            return 0
        if drift_score < 2.0:
            return 1
        if drift_score < 3.0:
            return 2
        return 3
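As a standalone numeric check of the weighted aggregation in `compute_drift` above, the same arithmetic can be reproduced on two example signals (the signal values here are invented for illustration):

```python
# Signals as (z_score, direction, weight * confidence) triples, mirroring
# the loop body of DriftCalculator.compute_drift.
signals = [
    ( 2.0,  1, 1.0),   # e.g. an inflammatory marker elevated, full confidence
    (-1.0, -1, 0.5),   # e.g. a wearable signal suppressed, lower confidence
]

weighted_sum = sum(abs(z) * w for z, _, w in signals)       # 2.0 + 0.5 = 2.5
directional  = sum(d * abs(z) * w for z, d, w in signals)   # 2.0 - 0.5 = 1.5
total_weight = sum(w for _, _, w in signals)                # 1.5

drift_score = weighted_sum / total_weight                   # 2.5 / 1.5
directional_drift = directional / total_weight              # 1.5 / 1.5 = 1.0
```

Note that the absolute drift score exceeds the directional score whenever signals deviate in opposite directions, which is why the engine preserves both values rather than only the magnitude.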

A.7 persistence.py

from __future__ import annotations
from typing import List
from schema import WindowSummary

class PersistenceCalculator:
    def __init__(self, drift_threshold: float = 1.8):
        self.drift_threshold = drift_threshold

    def compute(self, windows: List[WindowSummary], lookback: int = 5) -> float:
        if not windows:
            return 0.0
        recent = windows[-lookback:]
        hits = sum(1 for w in recent if w.drift_score > self.drift_threshold)
        return hits / len(recent)

    def compute_magnitude_aware(self, windows: List[WindowSummary], lookback: int = 5) -> float:
        if not windows:
            return 0.0
        recent = windows[-lookback:]
        values = [min(w.drift_score / self.drift_threshold, 1.0) for w in recent]
        return sum(values) / len(values)

A.8 convergence.py

from __future__ import annotations
from typing import Dict, List
from schema import WindowSummary

class ConvergenceCalculator:
    def __init__(self, layer_abnormal_threshold: float = 1.5, layer_weights: Dict[str, float] | None = None):
        self.layer_abnormal_threshold = layer_abnormal_threshold
        self.layer_weights = layer_weights or {}

    def compute(self, window: WindowSummary) -> float:
        if not window.layer_scores:
            return 0.0
        abnormal = 0
        for _, score in window.layer_scores.items():
            if score >= self.layer_abnormal_threshold:
                abnormal += 1
        return abnormal / max(len(window.layer_scores), 1)

    def compute_weighted(self, window: WindowSummary) -> float:
        if not window.layer_scores:
            return 0.0
        total_weight = 0.0
        abnormal_weight = 0.0
        for layer, score in window.layer_scores.items():
            w = self.layer_weights.get(layer, 1.0)
            total_weight += w
            if score >= self.layer_abnormal_threshold:
                abnormal_weight += w
        return abnormal_weight / max(total_weight, 1e-9)

    def temporal_convergence(self, windows: List[WindowSummary], lookback: int = 5) -> float:
        if not windows:
            return 0.0
        recent = windows[-lookback:]
        values = [self.compute(w) for w in recent]
        return sum(values) / len(values)

A.9 recovery.py

from __future__ import annotations
from typing import List
from schema import WindowSummary

class RecoveryCalculator:
    def compute(self, windows: List[WindowSummary], lookback: int = 4) -> float:
        if len(windows) < 2:
            return 0.0
        recent = windows[-(lookback + 1):]
        gains = []
        for i in range(1, len(recent)):
            previous = recent[i - 1].drift_score
            current = recent[i].drift_score
            gains.append(max(previous - current, 0.0))
        if not gains:
            return 0.0
        return sum(gains) / len(gains)

    def is_recovering(self, windows: List[WindowSummary], min_recovery_score: float = 0.2) -> bool:
        return self.compute(windows) >= min_recovery_score

A.10 state_engine.py

from __future__ import annotations
from typing import Dict, List
from schema import HealthState, WindowSummary, StateRecord
from drift import DriftCalculator

class StateEngine:
    def __init__(
        self,
        soft_threshold: float = 1.0,
        persistent_threshold: float = 1.8,
        escalation_threshold: float = 2.4,
        convergence_threshold: float = 0.30,
        persistence_threshold: float = 0.50,
        commit_threshold: float = 1.30,
    ):
        self.soft_threshold = soft_threshold
        self.persistent_threshold = persistent_threshold
        self.escalation_threshold = escalation_threshold
        self.convergence_threshold = convergence_threshold
        self.persistence_threshold = persistence_threshold
        self.commit_threshold = commit_threshold

    def update_distribution(
        self,
        drift: float,
        persistence: float,
        convergence: float,
        recovery: float,
        susceptibility: float,
    ) -> Dict[str, float]:
        dist = {
            HealthState.STABLE.value: 0.0,
            HealthState.SUSCEPTIBLE.value: 0.0,
            HealthState.SOFT_DEVIATION.value: 0.0,
            HealthState.PERSISTENT_DIVERGENCE.value: 0.0,
            HealthState.ESCALATION_CANDIDATE.value: 0.0,
            HealthState.CLINICAL_REVIEW.value: 0.0,
        }
        if drift < self.soft_threshold and persistence < 0.2:
            dist[HealthState.STABLE.value] = 0.7
            dist[HealthState.SUSCEPTIBLE.value] = 0.3 * susceptibility
        elif drift >= self.soft_threshold and persistence < self.persistence_threshold:
            dist[HealthState.SOFT_DEVIATION.value] = 0.6
            dist[HealthState.SUSCEPTIBLE.value] = 0.2 * susceptibility
            dist[HealthState.STABLE.value] = 0.2
        elif drift >= self.persistent_threshold and persistence >= self.persistence_threshold:
            dist[HealthState.PERSISTENT_DIVERGENCE.value] = 0.7
            dist[HealthState.SOFT_DEVIATION.value] = 0.2
            dist[HealthState.SUSCEPTIBLE.value] = 0.1 * susceptibility
        if (
            drift >= self.escalation_threshold
            and persistence >= self.persistence_threshold
            and convergence >= self.convergence_threshold
        ):
            dist[HealthState.ESCALATION_CANDIDATE.value] = max(
                dist[HealthState.ESCALATION_CANDIDATE.value], 0.8
            )
            dist[HealthState.PERSISTENT_DIVERGENCE.value] = max(
                dist[HealthState.PERSISTENT_DIVERGENCE.value], 0.15
            )
            dist[HealthState.SOFT_DEVIATION.value] = 0.05
        if recovery > 0.3:
            dist[HealthState.SOFT_DEVIATION.value] += 0.1
            dist[HealthState.STABLE.value] += 0.1
        # Normalise to a probability distribution. If no branch above fired
        # (for example, drift between the soft and persistent thresholds with
        # high persistence), the all-zero distribution is returned unchanged
        # and select_state falls back to the first state key.
        total = sum(dist.values())
        if total > 0:
            dist = {k: v / total for k, v in dist.items()}
        return dist

    def commit_ratio(self, distribution: Dict[str, float]) -> float:
        # Ratio of the top state probability to the runner-up; a large
        # sentinel value is returned when there is no meaningful runner-up.
        ordered = sorted(distribution.values(), reverse=True)
        if len(ordered) < 2:
            return 999.0
        top1, top2 = ordered[0], ordered[1]
        if top2 == 0:
            return 999.0
        return top1 / top2

    def select_state(self, distribution: Dict[str, float]) -> HealthState:
        top_state = max(distribution, key=distribution.get)
        return HealthState(top_state)

    def should_escalate(self, drift: float, persistence: float, convergence: float, commit_ratio: float) -> bool:
        return (
            drift >= self.escalation_threshold
            and persistence >= self.persistence_threshold
            and convergence >= self.convergence_threshold
            and commit_ratio >= self.commit_threshold
        )
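
To make the commitment logic concrete, the snippet below restates `commit_ratio` as a standalone function for illustration only (the real method lives in state_engine.py above) and applies it to two hand-written distributions:

```python
def commit_ratio(distribution):
    # Ratio of the top state probability to the runner-up, as in StateEngine.
    ordered = sorted(distribution.values(), reverse=True)
    if len(ordered) < 2 or ordered[1] == 0:
        return 999.0  # sentinel: no meaningful runner-up
    return ordered[0] / ordered[1]

# A distribution dominated by one state commits strongly...
confident = {"Stable": 0.125, "SoftDeviation": 0.125, "PersistentDivergence": 0.75}
# ...while a near-tie stays below the default commit threshold of 1.30.
ambiguous = {"SoftDeviation": 0.5, "PersistentDivergence": 0.4375, "Stable": 0.0625}

print(commit_ratio(confident))   # → 6.0
print(commit_ratio(ambiguous))   # ≈ 1.14, below the 1.30 commit threshold
```

A high commit ratio is what allows `should_escalate` to treat the operative state as settled rather than contested.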

A.11 alerts.py

from __future__ import annotations
from schema import HealthState, UserFacingState

class AlertMapper:
    def map_state(self, state: HealthState, recovering: bool = False) -> UserFacingState:
        if recovering:
            return UserFacingState.RECOVERING
        if state == HealthState.STABLE:
            return UserFacingState.STABLE
        if state in {HealthState.SUSCEPTIBLE, HealthState.SOFT_DEVIATION}:
            return UserFacingState.WATCH
        if state == HealthState.PERSISTENT_DIVERGENCE:
            return UserFacingState.DRIFT_DETECTED
        if state == HealthState.ESCALATION_CANDIDATE:
            return UserFacingState.ESCALATION_RECOMMENDED
        return UserFacingState.UNDER_REVIEW

    def render_message(self, user_state: UserFacingState) -> str:
        if user_state == UserFacingState.STABLE:
            return "Your recent readings remain within your usual range."
        if user_state == UserFacingState.WATCH:
            return "A small deviation is being monitored."
        if user_state == UserFacingState.DRIFT_DETECTED:
            return "Repeated drift is being detected across your recent readings."
        if user_state == UserFacingState.ESCALATION_RECOMMENDED:
            return "The pattern has strengthened and follow-up is recommended."
        if user_state == UserFacingState.UNDER_REVIEW:
            return "Your recent pattern is under structured review."
        return "Recovery is occurring but is not yet fully stable."

A.12 audit.py

from __future__ import annotations
from typing import Any, Dict, List
from datetime import datetime
from schema import WindowSummary

class AuditLogger:
    def __init__(self):
        self.logs: List[Dict[str, Any]] = []

    def log_transition(
        self,
        user_id: str,
        window: WindowSummary,
        state_distribution: Dict[str, float],
        operative_state: str,
        alert_status: bool,
        escalation_status: bool,
        version: str,
    ) -> None:
        self.logs.append(
            {
                "timestamp": datetime.utcnow().isoformat(),
                "user_id": user_id,
                "window_start": window.window_start.isoformat(),
                "window_end": window.window_end.isoformat(),
                "drift_score": window.drift_score,
                "persistence_score": window.persistence_score,
                "convergence_score": window.convergence_score,
                "recovery_score": window.recovery_score,
                "state_distribution": state_distribution,
                "operative_state": operative_state,
                "alert_status": alert_status,
                "escalation_status": escalation_status,
                "engine_version": version,
            }
        )

A.13 exports.py

from __future__ import annotations
from typing import List, Dict, Any
from datetime import datetime
from schema import ReviewExport, WindowSummary, StateRecord

class ExportBuilder:
    def build_review_export(
        self,
        user_id: str,
        windows: List[WindowSummary],
        states: List[StateRecord],
        version_stamp: str,
    ) -> ReviewExport:
        if not windows:
            raise ValueError("cannot build a review export without at least one window")
        recent_states = [s.current_state.value for s in states[-5:]]
        latest = windows[-1]
        context_summary: Dict[str, Any] = {}
        for signal in latest.normalized_signals:
            for k, v in signal.context_tags.items():
                context_summary[k] = v
        return ReviewExport(
            user_id=user_id,
            export_time=datetime.utcnow(),
            recent_state_path=recent_states,
            contributing_layers=latest.layer_scores,
            persistence_summary=latest.persistence_score,
            convergence_summary=latest.convergence_score,
            recovery_summary=latest.recovery_score,
            context_summary=context_summary,
            version_stamp=version_stamp,
        )

A.14 engine.py

from __future__ import annotations
from collections import defaultdict
from datetime import datetime, timedelta
from typing import Dict, List
from schema import (
    InputEvent,
    WindowSummary,
    StateRecord,
    HealthState,
)
from baseline import BaselineManager
from normalization import SignalNormalizer
from drift import DriftCalculator
from persistence import PersistenceCalculator
from convergence import ConvergenceCalculator
from recovery import RecoveryCalculator
from state_engine import StateEngine
from alerts import AlertMapper
from audit import AuditLogger

class PrognosticEngine:
    def __init__(
        self,
        signal_weights: Dict[str, float],
        layer_weights: Dict[str, float],
        version: str = "0.1.0",
    ):
        self.version = version
        self.baselines = BaselineManager()
        self.normalizer = SignalNormalizer(self.baselines)
        self.drift_calc = DriftCalculator(signal_weights, layer_weights)
        self.persistence_calc = PersistenceCalculator()
        self.convergence_calc = ConvergenceCalculator(layer_weights=layer_weights)
        self.recovery_calc = RecoveryCalculator()
        self.state_engine = StateEngine()
        self.alert_mapper = AlertMapper()
        self.audit = AuditLogger()
        self.user_windows: Dict[str, List[WindowSummary]] = defaultdict(list)
        self.user_states: Dict[str, List[StateRecord]] = defaultdict(list)

    def build_window(self, user_id: str, signals, window_start: datetime, window_end: datetime) -> WindowSummary:
        drift_score, directional_drift_score, layer_scores = self.drift_calc.compute_drift(signals)
        window = WindowSummary(
            user_id=user_id,
            window_start=window_start,
            window_end=window_end,
            normalized_signals=signals,
            layer_scores=layer_scores,
            drift_score=drift_score,
            directional_drift_score=directional_drift_score,
        )
        prior_windows = self.user_windows[user_id] + [window]
        window.persistence_score = self.persistence_calc.compute(prior_windows)
        window.convergence_score = self.convergence_calc.temporal_convergence(prior_windows)
        window.recovery_score = self.recovery_calc.compute(prior_windows)
        return window

    def process_window(
        self,
        user_id: str,
        events: List[InputEvent],
        window_start: datetime,
        window_end: datetime,
        susceptibility: float = 0.0,
    ):
        normalized = self.normalizer.normalize_events(events)
        window = self.build_window(user_id, normalized, window_start, window_end)
        distribution = self.state_engine.update_distribution(
            drift=window.drift_score,
            persistence=window.persistence_score,
            convergence=window.convergence_score,
            recovery=window.recovery_score,
            susceptibility=susceptibility,
        )
        window.state_distribution = distribution
        commit_ratio = self.state_engine.commit_ratio(distribution)
        operative_state = self.state_engine.select_state(distribution)
        escalation = self.state_engine.should_escalate(
            drift=window.drift_score,
            persistence=window.persistence_score,
            convergence=window.convergence_score,
            commit_ratio=commit_ratio,
        )
        recovering = self.recovery_calc.is_recovering(self.user_windows[user_id] + [window])
        prior_state = (
            self.user_states[user_id][-1].current_state
            if self.user_states[user_id]
            else HealthState.STABLE
        )
        record = StateRecord(
            user_id=user_id,
            timestamp=window_end,
            prior_state=prior_state,
            current_state=operative_state,
            state_distribution=distribution,
            drift_band=self.drift_calc.drift_band(window.drift_score),
            alert_status=operative_state != HealthState.STABLE,
            escalation_status=escalation,
            recovering=recovering,
            rationale_summary=f"drift={window.drift_score:.2f}, persistence={window.persistence_score:.2f}, convergence={window.convergence_score:.2f}",
        )
        self.user_windows[user_id].append(window)
        self.user_states[user_id].append(record)
        self.audit.log_transition(
            user_id=user_id,
            window=window,
            state_distribution=distribution,
            operative_state=operative_state.value,
            alert_status=record.alert_status,
            escalation_status=record.escalation_status,
            version=self.version,
        )
        user_state = self.alert_mapper.map_state(operative_state, recovering=recovering)
        message = self.alert_mapper.render_message(user_state)
        return {
            "user_id": user_id,
            "window": window,
            "state_record": record,
            "user_state": user_state.value,
            "message": message,
        }

Appendix B. Trajectory Simulation Suite

B.1 Purpose

This appendix provides the simulation framework needed for Chapters 4, 6, 7, 8, 10, and 13.

It supports generation of: stable trajectories, soft deviation trajectories, persistent divergence trajectories, escalation trajectories, recovery trajectories, relapse trajectories, sparse-data and missingness scenarios, false-alarm corridors, and multi-layer disease-specific synthetic cases.

B.2 File Structure

appendix_b_simulations/

├── synthetic_users.py
├── trajectory_generators.py
├── disease_profiles.py
├── missingness.py
├── run_simulations.py
└── plot_utils.py

B.3 synthetic_users.py

from __future__ import annotations
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class SyntheticUserProfile:
    user_id: str
    baseline_means: Dict[str, float]
    baseline_stds: Dict[str, float]
    susceptibility: float = 0.0
    family_risk: float = 0.0
    disease_profile: str = "general"

B.4 disease_profiles.py

from __future__ import annotations

DISEASE_PROFILES = {
    "general": {
        "signals": ["crp", "hrv", "fatigue_score", "sleep_frag", "stress_load"],
        "signal_weights": {
            "crp": 1.4,
            "hrv": 1.2,
            "fatigue_score": 1.0,
            "sleep_frag": 1.0,
            "stress_load": 0.8,
        },
    },
    "oncology": {
        "signals": ["methylation_score", "fragment_score", "fatigue_score", "weight_change", "crp"],
        "signal_weights": {
            "methylation_score": 2.0,
            "fragment_score": 1.8,
            "fatigue_score": 0.9,
            "weight_change": 0.8,
            "crp": 0.7,
        },
    },
    "metabolic": {
        "signals": ["glucose_var", "glucose_peak", "sleep_frag", "stress_load", "activity_drop"],
        "signal_weights": {
            "glucose_var": 2.0,
            "glucose_peak": 1.8,
            "sleep_frag": 1.1,
            "stress_load": 1.0,
            "activity_drop": 0.9,
        },
    },
    "inflammatory": {
        "signals": ["crp", "fatigue_score", "hrv", "symptom_cluster", "sleep_frag"],
        "signal_weights": {
            "crp": 1.8,
            "fatigue_score": 1.2,
            "hrv": 1.0,
            "symptom_cluster": 1.2,
            "sleep_frag": 0.8,
        },
    },
}

B.5 trajectory_generators.py

from __future__ import annotations
import random
from datetime import datetime, timedelta
from typing import Dict, List
from schema import InputEvent, SignalLayer
from synthetic_users import SyntheticUserProfile

def sample_normal(mean: float, std: float) -> float:
    return random.gauss(mean, std)

def generate_stable_trajectory(
    user: SyntheticUserProfile,
    signal_to_layer: Dict[str, SignalLayer],
    start: datetime,
    windows: int,
    step_days: int = 7,
) -> List[List[InputEvent]]:
    result: List[List[InputEvent]] = []
    for w in range(windows):
        t = start + timedelta(days=w * step_days)
        batch = []
        for signal, mean in user.baseline_means.items():
            std = user.baseline_stds.get(signal, 1.0)
            value = sample_normal(mean, std)
            batch.append(
                InputEvent(
                    event_id=f"{user.user_id}_stable_{w}_{signal}",
                    user_id=user.user_id,
                    timestamp=t,
                    layer=signal_to_layer[signal],
                    signal_name=signal,
                    value=value,
                    quality_flag=1.0,
                )
            )
        result.append(batch)
    return result

def generate_soft_deviation_trajectory(
    user: SyntheticUserProfile,
    signal_to_layer: Dict[str, SignalLayer],
    start: datetime,
    windows: int,
    deviating_signals: List[str],
    deviation_strength: float = 1.5,
    step_days: int = 7,
) -> List[List[InputEvent]]:
    result = generate_stable_trajectory(user, signal_to_layer, start, windows, step_days)
    for idx in range(max(2, windows // 3), min(windows, (windows // 3) + 2)):
        for event in result[idx]:
            if event.signal_name in deviating_signals:
                event.value += deviation_strength * user.baseline_stds.get(event.signal_name, 1.0)
    return result

def generate_persistent_divergence_trajectory(
    user: SyntheticUserProfile,
    signal_to_layer: Dict[str, SignalLayer],
    start: datetime,
    windows: int,
    deviating_signals: List[str],
    step_growth: float = 0.7,
    step_days: int = 7,
) -> List[List[InputEvent]]:
    result = generate_stable_trajectory(user, signal_to_layer, start, windows, step_days)
    start_idx = windows // 3
    for idx in range(start_idx, windows):
        multiplier = (idx - start_idx + 1) * step_growth
        for event in result[idx]:
            if event.signal_name in deviating_signals:
                event.value += multiplier * user.baseline_stds.get(event.signal_name, 1.0)
    return result

def generate_recovery_trajectory(
    user: SyntheticUserProfile,
    signal_to_layer: Dict[str, SignalLayer],
    start: datetime,
    windows: int,
    deviating_signals: List[str],
    peak_strength: float = 3.0,
    step_days: int = 7,
) -> List[List[InputEvent]]:
    result = generate_stable_trajectory(user, signal_to_layer, start, windows, step_days)
    rise_start = windows // 4
    peak = windows // 2
    for idx in range(rise_start, peak):
        multiplier = (idx - rise_start + 1) * (peak_strength / max(peak - rise_start, 1))
        for event in result[idx]:
            if event.signal_name in deviating_signals:
                event.value += multiplier * user.baseline_stds.get(event.signal_name, 1.0)
    for idx in range(peak, windows):
        multiplier = max(0.0, peak_strength - (idx - peak + 1) * (peak_strength / max(windows - peak, 1)))
        for event in result[idx]:
            if event.signal_name in deviating_signals:
                event.value += multiplier * user.baseline_stds.get(event.signal_name, 1.0)
    return result

def generate_relapse_trajectory(
    user: SyntheticUserProfile,
    signal_to_layer: Dict[str, SignalLayer],
    start: datetime,
    windows: int,
    deviating_signals: List[str],
    step_days: int = 7,
) -> List[List[InputEvent]]:
    result = generate_recovery_trajectory(
        user=user,
        signal_to_layer=signal_to_layer,
        start=start,
        windows=windows,
        deviating_signals=deviating_signals,
        peak_strength=2.5,
        step_days=step_days,
    )
    relapse_start = int(windows * 0.75)
    for idx in range(relapse_start, windows):
        multiplier = (idx - relapse_start + 1) * 1.2
        for event in result[idx]:
            if event.signal_name in deviating_signals:
                event.value += multiplier * user.baseline_stds.get(event.signal_name, 1.0)
    return result

B.6 missingness.py

from __future__ import annotations
import random
from typing import List
from schema import InputEvent

def apply_random_missingness(batches: List[List[InputEvent]], missing_rate: float = 0.10) -> List[List[InputEvent]]:
    output = []
    for batch in batches:
        kept = [event for event in batch if random.random() > missing_rate]
        output.append(kept)
    return output

def apply_sparse_user_pattern(batches: List[List[InputEvent]], keep_every_n: int = 2) -> List[List[InputEvent]]:
    output = []
    for idx, batch in enumerate(batches):
        if idx % keep_every_n == 0:
            output.append(batch)
        else:
            output.append([])
    return output

B.7 run_simulations.py

from __future__ import annotations
from datetime import datetime, timedelta
from schema import SignalLayer
from engine import PrognosticEngine
from synthetic_users import SyntheticUserProfile
from trajectory_generators import (
    generate_stable_trajectory,
    generate_soft_deviation_trajectory,
    generate_persistent_divergence_trajectory,
    generate_recovery_trajectory,
    generate_relapse_trajectory,
)
from missingness import apply_random_missingness

def demo_simulation():
    signal_to_layer = {
        "crp": SignalLayer.BLOOD,
        "hrv": SignalLayer.WEARABLE,
        "fatigue_score": SignalLayer.SYMPTOM,
        "sleep_frag": SignalLayer.WEARABLE,
        "stress_load": SignalLayer.EXPOSURE,
    }
    signal_weights = {
        "crp": 1.4,
        "hrv": 1.2,
        "fatigue_score": 1.0,
        "sleep_frag": 1.0,
        "stress_load": 0.8,
    }
    layer_weights = {
        "blood": 1.5,
        "wearable": 1.2,
        "symptom": 1.0,
        "exposure": 0.8,
        "history": 0.7,
        "family": 0.6,
    }
    engine = PrognosticEngine(signal_weights=signal_weights, layer_weights=layer_weights)
    user = SyntheticUserProfile(
        user_id="user_001",
        baseline_means={
            "crp": 1.2,
            "hrv": 55.0,
            "fatigue_score": 1.0,
            "sleep_frag": 2.0,
            "stress_load": 2.5,
        },
        baseline_stds={
            "crp": 0.3,
            "hrv": 5.0,
            "fatigue_score": 0.4,
            "sleep_frag": 0.6,
            "stress_load": 0.7,
        },
        susceptibility=0.2,
    )
    start = datetime(2026, 1, 1)
    batches = generate_persistent_divergence_trajectory(
        user=user,
        signal_to_layer=signal_to_layer,
        start=start,
        windows=16,
        deviating_signals=["crp", "fatigue_score", "sleep_frag"],
    )
    batches = apply_random_missingness(batches, missing_rate=0.05)
    outputs = []
    for idx, batch in enumerate(batches):
        window_start = start + timedelta(days=idx * 7)
        window_end = window_start + timedelta(days=6)
        out = engine.process_window(
            user_id=user.user_id,
            events=batch,
            window_start=window_start,
            window_end=window_end,
            susceptibility=user.susceptibility,
        )
        outputs.append(out)
    return outputs, engine

if __name__ == "__main__":
    outputs, engine = demo_simulation()
    for out in outputs:
        state = out["state_record"].current_state.value
        print(out["window"].window_end.date(), state, out["message"])

B.8 Example Simulation Scenarios by Chapter

B.8.1 Chapter 4 Scenarios

Use: stable trajectory, soft deviation, persistent divergence, escalation candidate, recovery and relapse. Purpose: show state transitions, boundary logic, persistence and convergence effects.

B.8.2 Chapter 6 Scenarios

Use: random fluctuation versus directional drift, high-drift low-persistence, low-drift high-persistence, strong convergence with moderate drift, de-escalation after genuine recovery. Purpose: show engine behaviour under competing conditions.

B.8.3 Chapter 7 Oncology Scenarios

Use: inherited susceptibility with weak repeated blood drift, blood-only weak anomaly, reinforced oncology drift, sparse-signal false positive corridor, escalation after multi-layer reinforcement.

B.8.4 Chapter 8 Multi-Disease Scenarios

Use: metabolic regulatory drift, inflammatory flare recurrence, cardiovascular recovery failure, neurofunctional slow decline.

B.8.5 Chapter 13 Failure Scenarios

Use: false watch, false escalation, missed progression, sticky elevated state, flickering state path.
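
The "flickering state path" failure named above can be checked mechanically. The helper below is a hypothetical sketch, not part of the engine: it flags a state sequence whose transition rate exceeds a tolerance (the complement of the `state_stability` metric in Appendix E):

```python
from typing import List

def is_flickering(states: List[str], max_transition_rate: float = 0.5) -> bool:
    """Flag a state path that alternates more often than tolerated."""
    if len(states) < 2:
        return False
    # Count consecutive pairs whose state changed.
    transitions = sum(1 for a, b in zip(states, states[1:]) if a != b)
    return transitions / (len(states) - 1) > max_transition_rate

print(is_flickering(["Stable", "Watch", "Stable", "Watch", "Stable"]))  # True
print(is_flickering(["Stable", "Stable", "Watch", "Watch", "Watch"]))   # False
```

In the failure experiments, trajectories that trip this check indicate thresholds set too close to the noise floor.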

Appendix C. Disease-Layer Configuration Files

C.1 Purpose

These configuration files isolate disease-specific weighting and threshold logic from the general engine.
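
As a sketch of how these dictionaries might be consumed, the helper below (hypothetical; `split_config` is not part of the Appendix A engine code) separates a config into the weight dictionaries taken by `PrognosticEngine` and keyword arguments for `StateEngine`, assuming the short threshold keys map onto the constructor parameter names from A.10:

```python
from typing import Any, Dict, Tuple

# Maps the short threshold keys used in the configs onto the
# StateEngine constructor parameter names from Appendix A.10.
THRESHOLD_PARAMS = {
    "soft": "soft_threshold",
    "persistent": "persistent_threshold",
    "escalation": "escalation_threshold",
    "convergence": "convergence_threshold",
    "persistence": "persistence_threshold",
    "commit": "commit_threshold",
}

def split_config(config: Dict[str, Any]) -> Tuple[Dict[str, float], Dict[str, float], Dict[str, float]]:
    state_kwargs = {
        THRESHOLD_PARAMS[key]: value
        for key, value in config["thresholds"].items()
    }
    return config["signal_weights"], config["layer_weights"], state_kwargs
```

A caller could then construct `PrognosticEngine(signal_weights=sw, layer_weights=lw)` and a matching `StateEngine(**state_kwargs)` from a single disease config.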

C.2 oncology_config.py

ONCOLOGY_CONFIG = {
    "name": "oncology",
    "signals": [
        "methylation_score",
        "fragment_score",
        "fatigue_score",
        "weight_change",
        "family_risk_score",
    ],
    "signal_weights": {
        "methylation_score": 2.2,
        "fragment_score": 2.0,
        "fatigue_score": 0.9,
        "weight_change": 0.8,
        "family_risk_score": 1.4,
    },
    "layer_weights": {
        "blood": 1.8,
        "wearable": 0.9,
        "symptom": 0.8,
        "exposure": 0.5,
        "history": 1.1,
        "family": 1.5,
    },
    "thresholds": {
        "soft": 1.0,
        "persistent": 1.7,
        "escalation": 2.3,
        "convergence": 0.25,
        "persistence": 0.45,
        "commit": 1.25,
    },
    "validation_stage": "conceptual_or_retrospective",
}

C.3 metabolic_config.py

METABOLIC_CONFIG = {
    "name": "metabolic",
    "signals": [
        "glucose_var",
        "glucose_peak",
        "sleep_frag",
        "stress_load",
        "activity_drop",
    ],
    "signal_weights": {
        "glucose_var": 2.0,
        "glucose_peak": 1.8,
        "sleep_frag": 1.1,
        "stress_load": 1.0,
        "activity_drop": 0.9,
    },
    "layer_weights": {
        "blood": 1.7,
        "wearable": 1.3,
        "symptom": 0.8,
        "exposure": 1.1,
        "history": 1.0,
        "family": 0.7,
    },
    "thresholds": {
        "soft": 0.9,
        "persistent": 1.5,
        "escalation": 2.1,
        "convergence": 0.30,
        "persistence": 0.40,
        "commit": 1.20,
    },
    "validation_stage": "strong_candidate_for_early_validation",
}

C.4 inflammatory_config.py

INFLAMMATORY_CONFIG = {
    "name": "inflammatory",
    "signals": [
        "crp",
        "fatigue_score",
        "symptom_cluster",
        "sleep_frag",
        "hrv",
    ],
    "signal_weights": {
        "crp": 1.9,
        "fatigue_score": 1.1,
        "symptom_cluster": 1.3,
        "sleep_frag": 0.8,
        "hrv": 1.0,
    },
    "layer_weights": {
        "blood": 1.5,
        "wearable": 1.1,
        "symptom": 1.2,
        "exposure": 0.9,
        "history": 1.2,
        "family": 0.6,
    },
    "thresholds": {
        "soft": 0.95,
        "persistent": 1.6,
        "escalation": 2.2,
        "convergence": 0.28,
        "persistence": 0.50,
        "commit": 1.22,
    },
    "validation_stage": "retrospective_and_prospective_candidate",
}

C.5 cardiovascular_config.py

CARDIOVASCULAR_CONFIG = {
    "name": "cardiovascular",
    "signals": [
        "hr_rest",
        "hrv",
        "recovery_time",
        "activity_drop",
        "sleep_frag",
    ],
    "signal_weights": {
        "hr_rest": 1.4,
        "hrv": 1.7,
        "recovery_time": 1.8,
        "activity_drop": 1.0,
        "sleep_frag": 0.9,
    },
    "layer_weights": {
        "blood": 0.9,
        "wearable": 1.8,
        "symptom": 0.8,
        "exposure": 0.9,
        "history": 1.1,
        "family": 0.7,
    },
    "thresholds": {
        "soft": 0.9,
        "persistent": 1.5,
        "escalation": 2.0,
        "convergence": 0.30,
        "persistence": 0.45,
        "commit": 1.20,
    },
    "validation_stage": "good_remote_monitoring_candidate",
}

C.6 neuro_config.py

NEURO_CONFIG = {
    "name": "neurodegenerative",
    "signals": [
        "gait_var",
        "sleep_frag",
        "speech_change",
        "interaction_slowing",
        "fatigue_score",
    ],
    "signal_weights": {
        "gait_var": 1.7,
        "sleep_frag": 1.0,
        "speech_change": 1.6,
        "interaction_slowing": 1.4,
        "fatigue_score": 0.8,
    },
    "layer_weights": {
        "blood": 0.7,
        "wearable": 1.5,
        "symptom": 1.0,
        "exposure": 0.7,
        "history": 1.2,
        "family": 0.8,
    },
    "thresholds": {
        "soft": 0.85,
        "persistent": 1.4,
        "escalation": 2.0,
        "convergence": 0.32,
        "persistence": 0.50,
        "commit": 1.25,
    },
    "validation_stage": "slower_longitudinal_validation_needed",
}

Appendix D. Dataset Interfaces and Preprocessing

D.1 Purpose

This appendix provides preprocessing scaffolds for the real datasets used in Chapters 10 and 12.

D.2 preprocessing.py

from __future__ import annotations
import pandas as pd
from typing import List, Dict

def standardize_columns(df: pd.DataFrame, mapping: Dict[str, str]) -> pd.DataFrame:
    return df.rename(columns=mapping).copy()

def coerce_timestamps(df: pd.DataFrame, time_col: str) -> pd.DataFrame:
    df = df.copy()
    df[time_col] = pd.to_datetime(df[time_col], errors="coerce")
    return df.dropna(subset=[time_col])

def create_time_windows(
    df: pd.DataFrame,
    user_col: str,
    time_col: str,
    freq: str = "7D",
) -> pd.DataFrame:
    df = df.copy()
    df = df.sort_values([user_col, time_col])
    # Fixed-frequency floor anchored to the epoch, not to each user's first
    # observation; no per-user grouping is needed for this operation.
    df["window_start"] = df[time_col].dt.floor(freq)
    return df

def compute_signal_summary(
    df: pd.DataFrame,
    user_col: str,
    signal_col: str,
    value_col: str,
    window_col: str,
) -> pd.DataFrame:
    grouped = (
        df.groupby([user_col, window_col, signal_col])[value_col]
        .agg(["mean", "std", "count", "min", "max"])
        .reset_index()
    )
    return grouped
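
A minimal end-to-end run of the four steps above on a toy frame (column names are illustrative; the operations are inlined rather than imported, so the snippet is self-contained):

```python
import pandas as pd

raw = pd.DataFrame({
    "pid": ["u1", "u1", "u1", "u1"],
    "ts": ["2026-01-01", "2026-01-03", "2026-01-09", "not-a-date"],
    "sig": ["crp", "crp", "crp", "crp"],
    "val": [1.0, 1.2, 2.0, 9.9],
})

# standardize_columns: map raw names onto the canonical schema.
df = raw.rename(columns={"pid": "user_id", "ts": "timestamp",
                         "sig": "signal", "val": "value"}).copy()
# coerce_timestamps: unparseable rows become NaT and are dropped.
df["timestamp"] = pd.to_datetime(df["timestamp"], errors="coerce")
df = df.dropna(subset=["timestamp"])
# create_time_windows: epoch-anchored 7-day windows.
df = df.sort_values(["user_id", "timestamp"])
df["window_start"] = df["timestamp"].dt.floor("7D")
# compute_signal_summary: per-user, per-window, per-signal aggregates.
summary = (
    df.groupby(["user_id", "window_start", "signal"])["value"]
    .agg(["mean", "std", "count", "min", "max"])
    .reset_index()
)
print(summary[["window_start", "mean", "count"]])  # two window rows; the bad row is gone
```

The first two readings share a window, the third lands in the next, and the malformed timestamp is silently excluded rather than corrupting a window mean.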

D.3 cohort_split.py

from __future__ import annotations
import pandas as pd

def temporal_split(
    df: pd.DataFrame,
    time_col: str,
    train_end,
    valid_end,
):
    df = df.copy()
    train = df[df[time_col] <= train_end]
    valid = df[(df[time_col] > train_end) & (df[time_col] <= valid_end)]
    test = df[df[time_col] > valid_end]
    return train, valid, test
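
Because drift detection is longitudinal, splits must be temporal rather than random; a shuffled split would leak future windows into training. Inlining `temporal_split` for a self-contained run:

```python
import pandas as pd

# Four monthly observations for one user.
df = pd.DataFrame({
    "ts": pd.to_datetime(["2026-01-05", "2026-02-10", "2026-03-15", "2026-04-20"]),
    "value": [1, 2, 3, 4],
})

# Strictly time-ordered partitions, no shuffling.
train_end = pd.Timestamp("2026-02-28")
valid_end = pd.Timestamp("2026-03-31")
train = df[df["ts"] <= train_end]
valid = df[(df["ts"] > train_end) & (df["ts"] <= valid_end)]
test = df[df["ts"] > valid_end]

print(len(train), len(valid), len(test))  # 2 1 1
```

Every evaluation row is strictly later than every training row, which is the property the validation scripts in Appendix E depend on.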

Appendix E. Validation and Calibration Scripts

E.1 trajectory_validation.py

from __future__ import annotations
from typing import List
from schema import StateRecord

def lead_time_gain(alert_times, event_times):
    values = []
    for a, e in zip(alert_times, event_times):
        if a is not None and e is not None:
            values.append((e - a).days)
    return sum(values) / len(values) if values else None

def state_stability(records: List[StateRecord]) -> float:
    if len(records) < 2:
        return 1.0
    transitions = 0
    for i in range(1, len(records)):
        if records[i].current_state != records[i - 1].current_state:
            transitions += 1
    return 1.0 - transitions / (len(records) - 1)

def false_escalation_rate(records: List[StateRecord], confirmed_flags: List[bool]) -> float:
    escalations = 0
    false_escalations = 0
    for r, ok in zip(records, confirmed_flags):
        if r.escalation_status:
            escalations += 1
            if not ok:
                false_escalations += 1
    return false_escalations / escalations if escalations else 0.0
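
A quick standalone check of two of these metrics, using a minimal stand-in record instead of `schema.StateRecord` and the same formulas inlined for illustration:

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class Rec:
    # Minimal stand-in for schema.StateRecord, just enough for the metrics.
    current_state: str
    escalation_status: bool = False

def lead_time_gain(alert_times, event_times):
    # Same formula as above: mean days between alert and confirmed event.
    values = [(e - a).days for a, e in zip(alert_times, event_times)
              if a is not None and e is not None]
    return sum(values) / len(values) if values else None

def state_stability(records):
    # Same formula as above: 1 minus the transition rate.
    if len(records) < 2:
        return 1.0
    transitions = sum(
        1 for prev, cur in zip(records, records[1:])
        if prev.current_state != cur.current_state
    )
    return 1.0 - transitions / (len(records) - 1)

alerts = [date(2026, 3, 1), None, date(2026, 5, 10)]
events = [date(2026, 3, 15), date(2026, 4, 1), date(2026, 5, 24)]
print(lead_time_gain(alerts, events))  # (14 + 14) / 2 = 14.0

path = [Rec("Stable"), Rec("Stable"), Rec("Watch"), Rec("Watch"), Rec("Stable")]
print(state_stability(path))  # 2 transitions over 4 steps -> 0.5
```

Pairs with a missing alert or event time are excluded from the lead-time mean rather than counted as zero gain.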

E.2 calibration.py

from __future__ import annotations
import pandas as pd

def calibration_table(scores, outcomes, bins=10):
    df = pd.DataFrame({"score": scores, "outcome": outcomes})
    df["bin"] = pd.qcut(df["score"], q=bins, duplicates="drop")
    table = (
        df.groupby("bin", observed=True)
        .agg(mean_score=("score", "mean"), observed_rate=("outcome", "mean"), n=("outcome", "size"))
        .reset_index()
    )
    return table

Appendix F. User Output and Export Logic

F.1 user_output.py

from __future__ import annotations
from typing import Dict, Any

def render_user_summary(result: Dict[str, Any]) -> Dict[str, Any]:
    window = result["window"]
    record = result["state_record"]
    top_layers = sorted(window.layer_scores.items(), key=lambda kv: kv[1], reverse=True)[:3]
    return {
        "current_state": result["user_state"],
        "message": result["message"],
        "drift_score": round(window.drift_score, 2),
        "persistence_score": round(window.persistence_score, 2),
        "convergence_score": round(window.convergence_score, 2),
        "recovering": record.recovering,
        "top_contributing_layers": top_layers,
        "next_action": next_action_from_state(result["user_state"]),
    }

def next_action_from_state(user_state: str) -> str:
    if user_state == "Stable":
        return "Continue ordinary monitoring."
    if user_state == "Watch":
        return "Repeat routine monitoring and review recent context."
    if user_state == "DriftDetected":
        return "Repeat sampling within the next scheduled interval."
    if user_state == "EscalationRecommended":
        return "Initiate targeted follow-up and prepare review export."
    if user_state == "UnderReview":
        return "Continue monitoring while external review proceeds."
    return "Continue monitoring until recovery stabilises."
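A minimal driving sketch for the renderer above. The window and state-record objects are hypothetical SimpleNamespace stand-ins for the Appendix A classes, carrying only the attributes the renderer reads; the layer names, scores, and message are illustrative. Both functions are repeated from F.1 so the sketch runs standalone.

```python
from __future__ import annotations
from types import SimpleNamespace
from typing import Any, Dict

# Both functions are repeated from F.1 so the sketch runs standalone.
def next_action_from_state(user_state: str) -> str:
    if user_state == "Stable":
        return "Continue ordinary monitoring."
    if user_state == "Watch":
        return "Repeat routine monitoring and review recent context."
    if user_state == "DriftDetected":
        return "Repeat sampling within the next scheduled interval."
    if user_state == "EscalationRecommended":
        return "Initiate targeted follow-up and prepare review export."
    if user_state == "UnderReview":
        return "Continue monitoring while external review proceeds."
    return "Continue monitoring until recovery stabilises."

def render_user_summary(result: Dict[str, Any]) -> Dict[str, Any]:
    window = result["window"]
    record = result["state_record"]
    top_layers = sorted(window.layer_scores.items(), key=lambda kv: kv[1], reverse=True)[:3]
    return {
        "current_state": result["user_state"],
        "message": result["message"],
        "drift_score": round(window.drift_score, 2),
        "persistence_score": round(window.persistence_score, 2),
        "convergence_score": round(window.convergence_score, 2),
        "recovering": record.recovering,
        "top_contributing_layers": top_layers,
        "next_action": next_action_from_state(result["user_state"]),
    }

# Stand-ins for the engine's window and state-record objects.
window = SimpleNamespace(
    layer_scores={"blood": 0.8, "sleep": 0.4, "wearable": 0.6, "context": 0.2},
    drift_score=0.612,
    persistence_score=0.45,
    convergence_score=0.30,
)
record = SimpleNamespace(recovering=False)
result = {
    "window": window,
    "state_record": record,
    "user_state": "DriftDetected",
    "message": "Sustained departure from the personal baseline corridor.",
}
summary = render_user_summary(result)
print(summary["next_action"])
```

The summary truncates to the three highest-scoring layers and rounds scores to two decimals, which keeps the user-facing payload small and stable across exports.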

Appendix G. Recommended Appendix-to-Chapter Mapping

G.1 Mapping

Chapters 4 to 6 – Use Appendix A and Appendix B.

Chapter 7 – Use Appendix B oncology scenarios and Appendix C oncology config.

Chapter 8 – Use Appendix B domain simulations and Appendix C disease configs.

Chapter 9 – Use Appendix F user-state rendering and export logic.

Chapter 10 – Use Appendix D and Appendix E.

Chapter 11 – Use Appendix A, C, D, E, and F.

Chapter 12 – Use Appendix D dataset interfaces.

Chapter 13 – Use Appendix B failure scenarios and Appendix E error analysis.

Appendix H. Minimum Simulation Experiments Required for the Paper

H.1 Core Experiments

  1. Stable baseline experiment – Show that the engine remains in Stable under ordinary fluctuation.
  2. Soft deviation experiment – Show that weak short-lived departure does not over-escalate.
  3. Persistent divergence experiment – Show transition from Soft Deviation to Persistent Divergence.
  4. Convergence experiment – Show that multi-layer reinforcement increases escalation probability.
  5. Recovery experiment – Show downward state movement after genuine corridor re-entry.
  6. Relapse experiment – Show that recovery followed by renewed drift is preserved in history.
  7. Missingness experiment – Show engine performance under sparse-data and random-missingness conditions.
  8. False alarm experiment – Show that context-heavy non-progressive noise does not become repeated high-escalation output.
  9. Oncology-specific experiment – Inherited susceptibility plus weak repeated blood deviation plus reinforcement.
  10. Metabolic-specific experiment – Repeated glucose drift plus sleep and stress amplification.
  11. Inflammatory-specific experiment – Flare recurrence plus failed recovery.
  12. Cardiovascular-specific experiment – Recovery-state deterioration plus wearable instability.
  13. Neurofunctional-specific experiment – Slow functional drift through passive signals.
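Experiments 1 and 3 above can be sketched with a toy statistic. The check below is an illustrative stand-in, not the Appendix A engine: it flags samples whose z-score against a fixed 20-sample personal baseline exceeds a threshold, then compares a stable series against one with slow persistent drift. All parameters (set point 100, noise scale 2, drift slope 0.5) are arbitrary.

```python
import random
import statistics

def drift_flags(series, baseline_n=20, z_threshold=3.0):
    # Toy stand-in for the drift engine: flag samples that leave the
    # personal baseline corridor by more than z_threshold standard deviations.
    baseline = series[:baseline_n]
    mu = statistics.fmean(baseline)
    sigma = statistics.stdev(baseline)
    return [abs(x - mu) / sigma > z_threshold for x in series[baseline_n:]]

random.seed(7)
# Experiment 1 (stable baseline): ordinary fluctuation around a set point.
stable = [random.gauss(100, 2) for _ in range(60)]
# Experiment 3 (persistent divergence): slow upward drift after the baseline window.
drifting = [random.gauss(100, 2) for _ in range(20)] \
    + [random.gauss(100 + 0.5 * i, 2) for i in range(40)]

stable_alarms = sum(drift_flags(stable))
drift_alarms = sum(drift_flags(drifting))
print(f"stable alarms: {stable_alarms}, drift alarms: {drift_alarms}")
```

Under these settings the drifting series should accumulate far more flags than the stable one; a fuller harness would replace the toy z-score with the engine's state machine and score the transitions listed above.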

Appendix I. Reference Link Register

This appendix lists the full URLs for all abbreviated, shortened, or named web references used throughout the document.

Short form / label used in document – Full URL

National Institutes of Health – https://www.nih.gov/
NIH health information – https://www.nih.gov/health-information
NIH clinical research trials – https://www.nih.gov/health-information/nih-clinical-research-trials-you
National Cancer Institute – https://www.cancer.gov/
Cancer.gov – https://www.cancer.gov/
NCI research – https://www.cancer.gov/research
NCI genetics / hereditary cancer – https://www.cancer.gov/about-cancer/causes-prevention/genetics
NCI what is cancer – https://www.cancer.gov/about-cancer/understanding/what-is-cancer
NCI cancer types – https://www.cancer.gov/types
U.S. Food and Drug Administration – https://www.fda.gov/
FDA digital health technologies guidance – https://www.fda.gov/regulatory-information/search-fda-guidance-documents/digital-health-technologies-remote-data-acquisition-clinical-investigations
FDA medical devices – https://www.fda.gov/medical-devices
FDA drugs – https://www.fda.gov/drugs
National Library of Medicine – https://www.nlm.nih.gov/
PMC – https://pmc.ncbi.nlm.nih.gov/
PubMed – https://pubmed.ncbi.nlm.nih.gov/
All of Us Research Program – https://allofus.nih.gov/
All of Us program overview – https://allofus.nih.gov/article/program-overview
All of Us protocol – https://allofus.nih.gov/article/all-us-research-program-protocol
CDC – https://www.cdc.gov/
CDC biomonitoring – https://www.cdc.gov/biomonitoring/
CDC National Exposure Report – https://www.cdc.gov/biomonitoring/resources/national-exposure-report.html
NHANES – https://www.cdc.gov/nchs/nhanes/
UK Biobank – https://www.ukbiobank.ac.uk/
PhysioNet – https://physionet.org/
MIMIC-IV – https://physionet.org/content/mimiciv/3.1/
NHS England Digital – https://digital.nhs.uk/
NHS England public attitudes to data – https://digital.nhs.uk/data-and-information/keeping-data-safe-and-benefitting-the-public/public-attitudes-to-data-in-the-nhs-and-social-care
TRIPOD Statement – https://www.tripod-statement.org/
TRIPOD+AI – https://www.tripod-statement.org/ai
PROBAST+AI – https://www.probast.org/
BMJ – https://www.bmj.com/
Oxford Academic / OUP Academic – https://academic.oup.com/
ScienceDirect – https://www.sciencedirect.com/
Frontiers – https://www.frontiersin.org/
Nature – https://www.nature.com/
JMIR – https://www.jmir.org/
JMIR Medical Informatics – https://medinform.jmir.org/
Gastroenterology / Gastrojournal – https://www.gastrojournal.org/
MDPI – https://www.mdpi.com/
ASCO Publications / ASC Publications – https://ascopubs.org/
Alzheimer's & Dementia journals – https://alz-journals.onlinelibrary.wiley.com/journal/15525279

The NIH, NCI, FDA, All of Us, NLM, and PMC URLs above are official pages, current at the time of writing.

Appendix J. Short Integrative Closing Note for the Paper

The appendices are not ancillary. They are the executable body of the thesis.

The main chapters establish: the timing failure of after-event medicine, the personal baseline principle, the user-owned monitoring loop, the state architecture, the multi-layer observation field, the prognostic engine, domain-specific application layers, validation, governance, and limits.

The appendices then provide: executable engine code, simulation environments, disease-layer configurations, preprocessing interfaces, validation scripts, and user-output logic.

That combination gives the paper both theoretical coherence and technical implementability.