Hello,
I am unsure of the appropriate counts model to use in the case of an overdispersed response variable, with an unbalanced, panel structure data set. The unit variable is electoral constituency, and the time variable is financial years. I considered using a negative binomial panel regression with -xtnbreg- but have read (in the Rabe-Hesketh and Skrondal text on multilevel modelling) that this is a poor option for panel data because the coefficients are difficult to interpret. As my main variable of interest is an indicator variable, and as the effects of the are clearer to me on examining fitted values for them (mainly for margin of victory), this does not seem that much of a concern for me. They recommend random-intercept poisson model with a sandwich estimator for the standard errors, which I don't think I can do because I use stata on university computers and can not use gllamm.
I am trying to see the effects of many electoral variables including turnout, margin of victory, legislators' membership of the ruling coalition (coded as an indicator variable), and the main variable of interest, partisan affiliation with a higher level of government (again coded as an indicator variable), on the number of infrastructure projects initiated by legislators in a state legislative assembly in India.
I am using Stata 14.
I have included a summary of the response variable below:
Projects
Mean 38.87229
Largest Std. Dev. 31.73026
Variance 1006.81
Skewness 5.646863
Kurtosis 92.75536
I would be very grateful for any suggestions on how to deal with this, potential alternative models and robustness checks. Thank You!
Edit: tags
I am unsure of the appropriate counts model to use in the case of an overdispersed response variable, with an unbalanced, panel structure data set. The unit variable is electoral constituency, and the time variable is financial years. I considered using a negative binomial panel regression with -xtnbreg- but have read (in the Rabe-Hesketh and Skrondal text on multilevel modelling) that this is a poor option for panel data because the coefficients are difficult to interpret. As my main variable of interest is an indicator variable, and as the effects of the are clearer to me on examining fitted values for them (mainly for margin of victory), this does not seem that much of a concern for me. They recommend random-intercept poisson model with a sandwich estimator for the standard errors, which I don't think I can do because I use stata on university computers and can not use gllamm.
I am trying to see the effects of many electoral variables including turnout, margin of victory, legislators' membership of the ruling coalition (coded as an indicator variable), and the main variable of interest, partisan affiliation with a higher level of government (again coded as an indicator variable), on the number of infrastructure projects initiated by legislators in a state legislative assembly in India.
I am using Stata 14.
I have included a summary of the response variable below:
Projects
Mean 38.87229
Largest Std. Dev. 31.73026
Variance 1006.81
Skewness 5.646863
Kurtosis 92.75536
I would be very grateful for any suggestions on how to deal with this, potential alternative models and robustness checks. Thank You!
Edit: tags
Comment