Hi,
I have a panel dataset of 50 countries over 10 years. The dependant variable is a count and given the overdispersion in the data I am running a negative binomial regression in Stata 15/MP. There is a potential structural break in the dependent variable occurring in the year 2005. Moreover, given the heterogeneous set of countries I am tempted to use fixed effects (I also couldn’t compare between fixed and random effects because to the best of my knowledge Hausman test cannot be performed in this setting).
What I have understood so far is that for using a negative binomial model with fixed effects, panel dummies have to be included instead of using the -fe- command.
Given all this background, I am using the following commands:
I have some confusions related to the above commands which I am not able to figure out despite of reading the literature and Stata manual.
Q1) Am I using the right command?
Q2) Can I use both 2005.year and i.year together?
Q3) I want to cluster the standard error by countries as it is increasing the significance of the required coefficients. However, upon using the -vce (cluster clustvar)- command , ‘log likelihood’ is being changed to ‘log pseudolikelihood’ and the Stata output is giving a missing value in front of the following statistics:
Q4) Which statistic will tell me if my overall model is good?
Could someone help me in understanding the above questions?
Best reagrds,
Imran Khan
I have a panel dataset of 50 countries over 10 years. The dependant variable is a count and given the overdispersion in the data I am running a negative binomial regression in Stata 15/MP. There is a potential structural break in the dependent variable occurring in the year 2005. Moreover, given the heterogeneous set of countries I am tempted to use fixed effects (I also couldn’t compare between fixed and random effects because to the best of my knowledge Hausman test cannot be performed in this setting).
What I have understood so far is that for using a negative binomial model with fixed effects, panel dummies have to be included instead of using the -fe- command.
Given all this background, I am using the following commands:
Code:
xtset country year nbreg DV IV 2005.year i.year i.country, vce (cluster country)
Q1) Am I using the right command?
Q2) Can I use both 2005.year and i.year together?
Q3) I want to cluster the standard error by countries as it is increasing the significance of the required coefficients. However, upon using the -vce (cluster clustvar)- command , ‘log likelihood’ is being changed to ‘log pseudolikelihood’ and the Stata output is giving a missing value in front of the following statistics:
Code:
Wald chi2(20) = . Prob > chi2 = .
Could someone help me in understanding the above questions?
Best reagrds,
Imran Khan
Comment