Greetings.
I have 2 questions regarding calculation of kappa in Stata 13.
I am trying to calculate inter-rater reliability using Cohen's kappa statistic. Each of the two variables holds a score ranging from 1 to 5.
The underlying data come from a survey, specifically a Demographic and Health Survey, and include sampling weights. For other analyses of these data, I am using svy commands or aweights.
1. Is there a way to calculate kappa that accounts for the sampling weights? Each observation does not necessarily represent one person, so it seems to me that observations with a higher sampling weight should influence the kappa statistic more than those with a lower sampling weight.
2. I want to calculate a confidence interval for the kappa statistic. Since the data are from a population-based survey with clustered responses, I chose to bootstrap the calculation, drawing 20 replicate samples of 200 observations each, with replacement. Which of these two commands is correct? Alternatively, what is the difference between the two commands?
bootstrap r(kappa), reps(20) size(200): kap a b
bootstrap r(kappa), reps(20) size(200) cluster(psu): kap a b
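To make question 1 concrete, here is a rough sketch of the kind of weighted calculation I have in mind. It is only an illustration, not something I know to be an accepted method: it takes the usual formula, kappa = (p_o - p_e)/(1 - p_e), and replaces the unweighted proportions with weighted ones. The variable name wt is a placeholder for the sampling weight variable, and I am assuming a and b are coded as the integers 1 to 5.

```stata
* Sketch only: Cohen's kappa computed from weighted proportions.
* Assumptions: a and b are coded 1-5; wt is a placeholder name for the sampling weight.
tempvar touse agree ak bk
tempname po pe pa pb

* Use only observations with both ratings and a weight
generate byte `touse' = !missing(a, b, wt)

* Weighted observed agreement: weighted share of observations with a == b
generate byte `agree' = (a == b) if `touse'
summarize `agree' [aweight = wt], meanonly
scalar `po' = r(mean)

* Weighted chance agreement: sum over categories of the product of weighted marginals
scalar `pe' = 0
forvalues k = 1/5 {
    generate byte `ak' = (a == `k') if `touse'
    generate byte `bk' = (b == `k') if `touse'
    summarize `ak' [aweight = wt], meanonly
    scalar `pa' = r(mean)
    summarize `bk' [aweight = wt], meanonly
    scalar `pb' = r(mean)
    scalar `pe' = `pe' + `pa' * `pb'
    drop `ak' `bk'
}

display "weighted kappa = " (`po' - `pe') / (1 - `pe')
```

If every observation had the same weight, I would expect this to reproduce the kappa reported by kap a b. It gives a point estimate only and says nothing about the standard error under the survey design.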
Thank you.