Hi all,
I am new to the field of biostatistics from MD neurology, trying to run analyses by myself without being mentored at all which is terribly hard to navigate. I really hope the community can provide some insights.
I am investigating the effect of air pollution (continuous variable) measured at baseline on cognitive performances over time (cognition is measured as a composite score at baseline and then annually). Visit is 0, 1, 2 or 3 for baseline and then year 1, year 2 and year 3. I was thinking I could do this:
mixed cognition i.visit##c.pollution age sex education || id: visit residuals(ar 1, t(visit))
When I look at raw mean cognitive scores in my population before running any models, there is a slightly increase at year 1 (learning effect I guess) then a decrease then a slightly increase again.
I don't know if I should consider visit as 0, 1, 2 and 3 here or c.visit instead of i.visit? Convert in months 0, 12, 24 and 36? I noticed that in some papers, instead of having visit as the 'time' variable and age (age at baseline) as a covariate and their potential interaction (as it's not the same thing an increase in a year at age 60 than age 80 years old), they used 'age' as the 'time' variable (I imagine by incrementing age every year)? How would you recommend to proceed? Should I add a quadratic term on visit or age? I will of course compare models with lr tests or AIC/BIC etc but I wanted to make sure I start in the right direction.
I know this is a very basic question but any help you might provide would be so incredibly valuable to me.
Thank you so much,
Cheers,
Pierre M.
I am new to the field of biostatistics from MD neurology, trying to run analyses by myself without being mentored at all which is terribly hard to navigate. I really hope the community can provide some insights.
I am investigating the effect of air pollution (continuous variable) measured at baseline on cognitive performances over time (cognition is measured as a composite score at baseline and then annually). Visit is 0, 1, 2 or 3 for baseline and then year 1, year 2 and year 3. I was thinking I could do this:
mixed cognition i.visit##c.pollution age sex education || id: visit residuals(ar 1, t(visit))
When I look at raw mean cognitive scores in my population before running any models, there is a slightly increase at year 1 (learning effect I guess) then a decrease then a slightly increase again.
I don't know if I should consider visit as 0, 1, 2 and 3 here or c.visit instead of i.visit? Convert in months 0, 12, 24 and 36? I noticed that in some papers, instead of having visit as the 'time' variable and age (age at baseline) as a covariate and their potential interaction (as it's not the same thing an increase in a year at age 60 than age 80 years old), they used 'age' as the 'time' variable (I imagine by incrementing age every year)? How would you recommend to proceed? Should I add a quadratic term on visit or age? I will of course compare models with lr tests or AIC/BIC etc but I wanted to make sure I start in the right direction.
I know this is a very basic question but any help you might provide would be so incredibly valuable to me.
Thank you so much,
Cheers,
Pierre M.
Comment