This post has to do with estimating relative risk using "glm" for common outcomes in cohort studies as mentioned by the UCLA Statistical Consulting Group. According to this post (found below) you can use Poisson regression with robust error variance to obtain the relative risk in survey datasets where the outcome of interest is NOT rare. Poisson regression can be done to directly estimate the prevalence ratios of interest, as the odds ratio can overestimate the risk ratio when the outcome of interest is common (Behrens et al., 2004).
I am currently working on a new project that uses data from the National Health Interview Survey to observe the relationship between diabetes and certain risk factors. In order to account for the complex survey sampling and sample weights I am using the "svyset" command and then the "svy" command followed by "subpop" to establish the variance estimation and sampling weights before running my commands. Since my outcome, diabetes, is not rare in my sample population, I would like to use Poisson regression with robust error variance to obtain the relative risk.
Unfortunately, some standard options are not allowed with the "svy" prefix, with "vce(robust)" being one of them. I cannot set "vce" to "(robust)" because "svy" is already using the variance estimation and sampling weights identified by "svyset". I was wondering if anyone knew of a way to obtain the relative risk when using national survey data and sample weights.
Should I use regular logistic regression to obtain the odds ratios? Should I continue with Poisson regression without setting the "vce" to "(robust)" and use this to obtain the relative risk? Or should I use another completely different approach? I have been looking through the literature and most studies simply use regular logistic regression. Any advice would be greatly appreciated.
I am currently working on a new project that uses data from the National Health Interview Survey to observe the relationship between diabetes and certain risk factors. In order to account for the complex survey sampling and sample weights I am using the "svyset" command and then the "svy" command followed by "subpop" to establish the variance estimation and sampling weights before running my commands. Since my outcome, diabetes, is not rare in my sample population, I would like to use Poisson regression with robust error variance to obtain the relative risk.
Unfortunately, some standard options are not allowed with the "svy" prefix, with "vce(robust)" being one of them. I cannot set "vce" to "(robust)" because "svy" is already using the variance estimation and sampling weights identified by "svyset". I was wondering if anyone knew of a way to obtain the relative risk when using national survey data and sample weights.
Should I use regular logistic regression to obtain the odds ratios? Should I continue with Poisson regression without setting the "vce" to "(robust)" and use this to obtain the relative risk? Or should I use another completely different approach? I have been looking through the literature and most studies simply use regular logistic regression. Any advice would be greatly appreciated.
Comment