Hi everyone,
I am working on a paper looking at the mental health hospitalisations of mother of children placed in foster care 2 years before and 2 after placement, aiming to assess if mothers are more likely to increase mh hospitalisation after child removal, and if this differences are higher than other group. I am comparing three groups: children placed in foster care (care group N=1150), children never placed and never in contact with child protection services (no_contact group N=8500), and children who had a contact with child protection services but were not placed (contact group N=4150). Groups were matched and the age at placement of the care group children were used as dummy placement dates for the comparison groups. The outcome variable is binary (mhdiag) coded 1 if the the mother had a mh hospitalisation and 0 otherwise, and 'before_after' codes 0 if the hospitalisation was before placement and 1 if it was after placement.
I am using a GEE model and a modified passion approach to estimate RRs instead of ORs. I would like to include an offset to account for the population at risk in each group. However, given that my group variable is highly correlated with the log of the population variable (lpop) my RRs go from 7 in the model without offset to 55 when including the offset. I also tried to include lpop as a covariate but stata removes it from the model because of collinearity.
Model :
xtgee mhdiag ib3.group##i.before_after , family(poisson) link(log) corr(independent) offset(lpop) vce(robust) eform
My questions are:
a) Am I using correctly the offset?
b) Can I use the offset option with a binary outcome, or it is only for count?
c) Is there any other way to account for the differences in population between groups?
d) Is the exposure variable ‘group’ already accounting for the population difference so there is no need to include an offset?
Thanks for your help!
I am working on a paper looking at the mental health hospitalisations of mother of children placed in foster care 2 years before and 2 after placement, aiming to assess if mothers are more likely to increase mh hospitalisation after child removal, and if this differences are higher than other group. I am comparing three groups: children placed in foster care (care group N=1150), children never placed and never in contact with child protection services (no_contact group N=8500), and children who had a contact with child protection services but were not placed (contact group N=4150). Groups were matched and the age at placement of the care group children were used as dummy placement dates for the comparison groups. The outcome variable is binary (mhdiag) coded 1 if the the mother had a mh hospitalisation and 0 otherwise, and 'before_after' codes 0 if the hospitalisation was before placement and 1 if it was after placement.
I am using a GEE model and a modified passion approach to estimate RRs instead of ORs. I would like to include an offset to account for the population at risk in each group. However, given that my group variable is highly correlated with the log of the population variable (lpop) my RRs go from 7 in the model without offset to 55 when including the offset. I also tried to include lpop as a covariate but stata removes it from the model because of collinearity.
Model :
xtgee mhdiag ib3.group##i.before_after , family(poisson) link(log) corr(independent) offset(lpop) vce(robust) eform
My questions are:
a) Am I using correctly the offset?
b) Can I use the offset option with a binary outcome, or it is only for count?
c) Is there any other way to account for the differences in population between groups?
d) Is the exposure variable ‘group’ already accounting for the population difference so there is no need to include an offset?
Thanks for your help!
Comment