Hi everyone,
i've been away for a while getting settled into new position. I sincerely apologize. I have an issue with some estimations and would like to share it with you and see if I can understand what is driving it. To start I am using Stata version 13.1 in a MacBook Pro running Mac OS 10.10.5, although I believe that has very little to do with the issue. I now describe the nature of the data: unbalanced panel data of household mortgage information and demographics, with a total of 7,183 observations, 994 groups (clusters) and from 2 to 8 observations per group (this is where I'm thinking the problem lies but I want to see if you agree). I have tried running xtprobit and xtlogit with random effects and robust errors, and I get the following error:
My understanding is that the vce(robust) option is the same as calculating a variance covariance matrix that if you're using vce(cluster panelvar), where panelvar is the variable that identifies the groups (clusters) for the panel dataset. This is from the [XT] part of the reference manual in the mehtods and formulas section for xtprobit:
A similar explanation can be found for xtlogit estimation of the robust standard errors when estimating random effects in the manual. My understanding is then that the robust standard error estimator tries to be robust to heteroskedasticity across panels and within-panel serial correlation. Since there aren't so many data points within the panels (min 2, max 8) I can see how it can be very difficult to account for any serial correlation within the panels. Is this the reason why the esitmator is having a problem to come up with the sandwich estimator in this case? Could there be another problem?
Furthermore, cmp (user-written command available on SSC) provides robust and clustered standard errors when doing a random effects estimation. My doubt here is that cmp is not really setup to take into account the time correlation component of a panel, so I wonder if those estimates are still quite not robust to serial correlation within the panels. Which then brings the question of whether should I actually worry about standard errors that are robust to serial correlation when I don't have that many observations in some panels (minimum 2), and hence still use cmp's estimates to do inference better than an xtprobit random effects estimation without robust errors.
Thanks for any light you can throw on this, I appreciate it.
i've been away for a while getting settled into new position. I sincerely apologize. I have an issue with some estimations and would like to share it with you and see if I can understand what is driving it. To start I am using Stata version 13.1 in a MacBook Pro running Mac OS 10.10.5, although I believe that has very little to do with the issue. I now describe the nature of the data: unbalanced panel data of household mortgage information and demographics, with a total of 7,183 observations, 994 groups (clusters) and from 2 to 8 observations per group (this is where I'm thinking the problem lies but I want to see if you agree). I have tried running xtprobit and xtlogit with random effects and robust errors, and I get the following error:
Code:
calculation of robust standard errors failed r(198);
Specifying vce(robust) or vce(cluster clustvar) causes the Huber/White/sandwich VCE estimator to be calculated for the coefficients estimated in this regression....
Clustering on the panel variable produces a consistent VCE estimator when the disturbances are not identically distributed over the panels or there is serial correlation in it.
The cluster–robust VCE estimator requires that there are many clusters and the disturbances are uncorrelated across the clusters. The panel variable must be nested within the cluster variable because of the within-panel correlation that is generally induced by the random-effects transform when there is heteroskedasticity or within-panel serial correlation in the idiosyncratic errors.
Clustering on the panel variable produces a consistent VCE estimator when the disturbances are not identically distributed over the panels or there is serial correlation in it.
The cluster–robust VCE estimator requires that there are many clusters and the disturbances are uncorrelated across the clusters. The panel variable must be nested within the cluster variable because of the within-panel correlation that is generally induced by the random-effects transform when there is heteroskedasticity or within-panel serial correlation in the idiosyncratic errors.
Furthermore, cmp (user-written command available on SSC) provides robust and clustered standard errors when doing a random effects estimation. My doubt here is that cmp is not really setup to take into account the time correlation component of a panel, so I wonder if those estimates are still quite not robust to serial correlation within the panels. Which then brings the question of whether should I actually worry about standard errors that are robust to serial correlation when I don't have that many observations in some panels (minimum 2), and hence still use cmp's estimates to do inference better than an xtprobit random effects estimation without robust errors.
Thanks for any light you can throw on this, I appreciate it.
Comment