Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • survival analysis_time span_missing values

    Dear all,
    I am an epidemiologist, and I will need some help to do survival models because of missing values. This is a retrospective approach based on past collected data from different governmental programs (inconsistency in data sampling). I have a dataset with several mercury exposure for participants at different times (value and month of sampling), their year of birth/death and some sociodemographic covariate that do not vary over time.

    I tried a time span approach. The value column time end it the year mercury was measured. (sampling started in 1970)
    Ex. Id 1 born in 1960, first data value in 1978
    id Time start Time end Value Age sampling_start Age sampling_end Death status
    1 1978 1978 5.6 18 18 0
    1 1978 1983 4.3 18 23 0
    1 1983 1990 3.2 23 30 0
    1 1990 1997 . 30 37 1
    2 1980 1980 7.6 30 30 0
    2 1980 1985 2.6 30 35 0
    Etc. 1985 1996 . 35 46 1
    None of the participants had a sampling of mercury the year of their death, then I put a missing value as value. I suppose it is not the good thing. The issues death=1 is always related to a missing value of mercury.
    My objective is to do a cox model to test whether Hg is related to death along other covariates non time varying (ex. Sex). Also I was thinking of using “Age sampling_end” instead of “time end” as I would like to do a survival curve with a x axis on age of survival..
    stset age_sampling_end, failure(Event==1) id(id_) time0(age_sampling_start). Thank you for helping me. Aline .
    Last edited by Aline Philibert; 16 Jun 2024, 08:33.
Working...
X