Task-origin is often not fully present in the first input. The user frequently discovers the precise task through response, correction, and friction. So origin is both a prior constraint and a retroactively clarified event: real, but recognized late โ not invented late.
Status: working draft, not deposited, separate from the Directionality of Semantic Labor spec. The most dangerous operator the program has proposed (see Guardrail). Must not deposit until it passes an adversarial break-test, not a cooperative confirmation.
Author register: TBD โ not assigned.
Task-origin is often not fully present in the first input. The user frequently discovers the precise task through response, correction, and friction. So origin is both a prior constraint and a retroactively clarified event: real, but recognized late โ not invented late. A directionality metric that demands the task be fully explicit at commission cannot describe ordinary dialogue.
Mechanism: later user corrections, confirmations, and persistence can clarify what an earlier turn's task latently was, allowing earlier model output to be rescored against the task that becomes legible through the dialogue.
ฮRDSL is the useful catch: it surfaces "smooth but wrong" output that only becomes visibly wrong as the user keeps correcting.
This mechanism is a licensed retrocausal rewrite of what the task was, which is the single most dangerous structure in the program, because it is the exact form of the laundering move:
"The conversation became about my concern, therefore my concern was always the real task."
That sentence is the structure of substrate enclosure dressed as alignment. An RTOS built wrong is a formalism that scores a model's own drift as having been aligned all along. Therefore the operator is defined by its prohibition, not its capability:
Only the user may retroactively stabilize task-origin. The model proposes; the user's later confirmation, correction, or persistence ratifies. Future turns may clarify origin; they may never rewrite it.
Legitimate clarification vs illegitimate laundering is decided by Lead-Lag precedence (the existing identified operator), never by content:
If the clarifying turn is output-led, RTOS must refuse to stabilize โ the drift does not get retroactively legitimated.
Retrocausal Stabilization Score, measuring how much later turns clarify rather than overwrite earlier origin:
RCSโ = (Cnf + Corr + Pers) ยท (1 โ MLD) ยท ฮH_T
The (1 โ MLD) factor is the guardrail in the math: model-led drift drives the score toward zero, so a model cannot raise its own retrospective alignment by having caused the later frame.
Every other operator this session was validated by out-of-loop divergence on cooperative cases. RTOS cannot be. A cooperative test โ where the model did not drift โ will always show the guardrail "working," because there is nothing for it to block. The guardrail is only tested by a case where the model genuinely drifted and then the conversation moved its way, and the question is whether RTOS refuses to score that drift as aligned.
The break-test (deposit gate): construct (or take from real history) a transcript in which the model introduced a frame the user did not ask for, the user then followed it, and the thread became about the model's frame. Run RTOS. Required result: RTOS attributes the later frame as model-led (MLD high), drives RCS toward zero, and declines to stabilize the earlier drift as origin. If RTOS instead scores the drift as retroactively aligned, the guardrail is decorative and the operator is an enclosure engine โ discard, do not deposit.
This program's own opening (a model redirecting toward a meta-frame, then treating the redirected conversation as warrant for the redirection) is the canonical adversarial case. RTOS must score that as model-led non-stabilization, or it fails its own purpose.