Except for the fact that both functions increase, cumulative hazard estimate is nothing like the estimate\( \widehat{F} = 1- S\). In fact the cumulative hazard estimate can exceed 1.0.
Below I plot the estimated failure function and two different estimates of the cumulative hazard function. The first is the one Stata generates. It is the Nelson-Aalen estimate, shown on page 300 of the Manual Entry for sts. The N-A estimate the finite sample version of the definition:
\[
\Lambda_1(t) = \int_0^t \lambda(t)dt
\]
A second estimate can be based on the mathematical relationship of the cumulative hazard function to the Survival curve
\[
\Lambda_2(t)= -\textrm{log}(1-S(t))
\]
where the Kaplan-Meier estimate \(\widehat{S}\) is substituted for \(S\). The graph below shows that the two estimates are very close.
Code:
webuse catheter, clear stset time infect sts gen cumhaz1 = na km = s label var cumhaz1 "Cum Haz:Nelson-Aalen" gen cumhaz2 = -log(km) label var cumhaz2 "Cum Haz:-log(s)" gen cumfail = 1 - km plot cumhaz1 cumhaz2 cumfail _t sort _t label var cumfail "Cumulative Failure Probability" #delim; twoway connect cumhaz1 cumhaz2 cumfail _t, c(stairstep stairstep) title("KM Failure & Two Cum Hazard Estimates") saving(g01, replace); graph use g01 graph export graph.png
Comment