Dear all,
I want to run regressions with clustered data. My problem is that I have only about 30 clusters. In addition, my independent variable is a ratio between 0 and 1 and 40% of the observations take a value equal to 0 or 1. Therefore, I use the fractional response model, which is nonlinear. When I use the following command, the significance of my coefficient estimate is pretty bad.
glm y x1 x2 x3 ... x8, fam(bin) link(logit) cluster (district)
Clearly, I should deal with the few-cluster problem. Cameron, Gelbach and Miller (2008) suggest that a wild cluster bootstrap procedure is best for the case of few clusters, or alternatively, use a T distribution rather than the standard normal for the Wald statistic. However, I am not sure how to implement it using Stata. I found that there used to be a user-written program cgmwildboot.ado, which has been mentioned in some papers. Is it written by Professor Judson Caskey? I could not open the website he provided, probably because I am located in China now and there is always some internet block by the government. I am wondering if anyone could kindly provide me with the cgmwildboot.ado file (and other related help files) or instruct me how to adjust the results to a T distribution in Stata? The default of the regression command is normal distribution. Since my case is not a linear model, can I directly apply these procedures? Thank you very much for your help!
Best,
April
I want to run regressions with clustered data. My problem is that I have only about 30 clusters. In addition, my independent variable is a ratio between 0 and 1 and 40% of the observations take a value equal to 0 or 1. Therefore, I use the fractional response model, which is nonlinear. When I use the following command, the significance of my coefficient estimate is pretty bad.
glm y x1 x2 x3 ... x8, fam(bin) link(logit) cluster (district)
Clearly, I should deal with the few-cluster problem. Cameron, Gelbach and Miller (2008) suggest that a wild cluster bootstrap procedure is best for the case of few clusters, or alternatively, use a T distribution rather than the standard normal for the Wald statistic. However, I am not sure how to implement it using Stata. I found that there used to be a user-written program cgmwildboot.ado, which has been mentioned in some papers. Is it written by Professor Judson Caskey? I could not open the website he provided, probably because I am located in China now and there is always some internet block by the government. I am wondering if anyone could kindly provide me with the cgmwildboot.ado file (and other related help files) or instruct me how to adjust the results to a T distribution in Stata? The default of the regression command is normal distribution. Since my case is not a linear model, can I directly apply these procedures? Thank you very much for your help!
Best,
April
Comment