The keystone
The Modeled Wear Trajectory replaces remaining-life.
A clean scalar such as "14,000 hours left" plugs directly into depreciation and lease-residual
formulas. That convenience is the trap. A scalar reads as a warranty, the data does not support
it per unit, and a false point estimate is the easiest thing for an adversarial reviewer to
break with a single early failure. So the field name and the scalar are retired.
In its place is the Modeled Wear Trajectory: a projected path of wear as a function of
cumulative duty, measured in GPU-hours, thermal cycles, and calendar time rather than assumed.
At every horizon it carries a calibrated band, and the certificate foregrounds the lower
predictive bound, because a lender prices against the downside, not the median. It answers how
this unit's wear is trending and how much margin is plausibly left under its observed duty, and
it answers as a distribution, not a promise.
The per-unit engine
The wear signals that matter most are monotone: spare rows are consumed and never returned,
pages are retired and never un-retired, uncorrectable-ECC events accumulate. Monotone
degradation is exactly the regime the prognostics literature models with Gamma and
Inverse-Gaussian processes, both yielding, through their first-passage time to a fixed
threshold, a full distribution over time-to-threshold. The cleanest instance is spare-row
depletion on Hopper, where the threshold is the fixed and known 512-row cap and the wear signal
is a deterministic read. This is the reframing that turns the H100 data gap from a fatal
weakness into a methodological feature: for a Gold unit monitored from birth the process is fit
and updated online on that unit's own history and the band is tight; for a Silver unit seen once
the trajectory uses a population-prior drift with unit-level random effects and the band is
correspondingly wider. The loss of anchoring shows up honestly as a wider band, never as a
quietly less reliable point estimate.
The band: a coverage guarantee, not a hope
A band is only worth putting in a covenant if it covers reality at the rate it claims. We do
not rely on the parametric intervals a model emits. We calibrate the band with conformal
prediction, and for the censored, time-to-event setting GPU wear lives in we use conformalized
survival analysis, after Candes, Lei, and Ren, 2023, which wraps any survival predictor to
produce a calibrated, covariate-dependent lower predictive bound with finite-sample coverage
under Type-I right-censoring. It reports the number an underwriter actually wants, a lower bound
at a stated coverage such as 90%, the guarantee is distribution-free and does not depend on the
degradation model being correctly specified, and it is built for the censoring and survivorship
that sink naive survival estimates. If the wrapped model is poor the band widens to keep its
promise; it does not silently mis-cover.
Transfer error as a coverage-inflation term
Applying a model calibrated on one generation or cooling regime to an asset of another is, in
the conformal framework, a covariate shift. The residual under-coverage that remains when a
Hopper-calibrated model is validated on a discrete-H100 fleet is the transfer error, reported
as the band inflation needed to restore nominal coverage on the target generation. It stops
being a vague hedge and becomes a measured quantity with units. This also makes the moat
quantitative and self-liquidating: as Voltry's own fleet accumulates generation-matched
records, the covariate shift shrinks and the inflation falls, on precisely the generations the
public literature cannot yet support. Because it is printed on every certificate, a reader can
watch it shrink.
What the trajectory will and will not say
It will not say: "this H100 has 14,000 hours of useful life remaining." It will say: under
observed duty of n GPU-hours and k thermal cycles, modeled wear reaches the defined threshold
at a projected horizon with a 90% lower predictive bound of X; calibration is generation-matched
or transferred with a reported inflation of Y; provenance is born-on or reconstructed with chain
gaps; exposure is assessed or Not Assessed; methodology version and calibration snapshot are
stamped for replay. A buyer will forgive the uncertainty in that statement. A buyer will not
forgive a precise number that turns out to be wrong. That asymmetry is the entire argument for
the change.