Except for the fact that both functions increase, cumulative hazard estimate is nothing like the estimateˆF=1−S. In fact the cumulative hazard estimate can exceed 1.0.
Below I plot the estimated failure function and two different estimates of the cumulative hazard function. The first is the one Stata generates. It is the Nelson-Aalen estimate, shown on page 300 of the Manual Entry for sts. The N-A estimate the finite sample version of the definition:
Λ1(t)=∫t0λ(t)dt
A second estimate can be based on the mathematical relationship of the cumulative hazard function to the Survival curve
Λ2(t)=−log(1−S(t))
where the Kaplan-Meier estimate ˆS is substituted for S. The graph below shows that the two estimates are very close.
Code:
webuse catheter, clear stset time infect sts gen cumhaz1 = na km = s label var cumhaz1 "Cum Haz:Nelson-Aalen" gen cumhaz2 = -log(km) label var cumhaz2 "Cum Haz:-log(s)" gen cumfail = 1 - km plot cumhaz1 cumhaz2 cumfail _t sort _t label var cumfail "Cumulative Failure Probability" #delim; twoway connect cumhaz1 cumhaz2 cumfail _t, c(stairstep stairstep) title("KM Failure & Two Cum Hazard Estimates") saving(g01, replace); graph use g01 graph export graph.png
Comment