Hi everyone,
I am using reghdfe from SSC in Stata 18 to run an IV regression on a data set with around 10M records. I included about 800 fixed effects and used two-way clustering for clustered standard errors. Below are the output for my first stage regression. It looks like I failed to reject the H_0 that my instrument is underidentified, but I can reject the H_0 for weak instrument. These results seem to be inconsistent, and I hope if anyone could help me to interpret them.
Thank you in advance for your help!
Summary results for first-stage regressions
-------------------------------------------
(Underid) (Weak id)
Variable | F( 1, 43) P-val | SW Chi-sq( 1) P-val | SW F( 1, 43)
1.endg | 65.52 0.0000 | 67.04 0.0000 | 65.52
NB: first-stage test statistics cluster-robust
Stock-Yogo weak ID F test critical values for single endogenous regressor:
10% maximal IV size 16.38
15% maximal IV size 8.96
20% maximal IV size 6.66
25% maximal IV size 5.53
Source: Stock-Yogo (2005). Reproduced by permission.
NB: Critical values are for i.i.d. errors only.
Underidentification test
Ho: matrix of reduced form coefficients has rank=K1-1 (underidentified)
Ha: matrix has rank=K1 (identified)
Kleibergen-Paap rk LM statistic Chi-sq(1)=1.44 P-val=0.2302
Weak identification test
Ho: equation is weakly identified
Cragg-Donald Wald F statistic 1322.26
Kleibergen-Paap Wald rk F statistic 65.52
Stock-Yogo weak ID test critical values for K1=1 and L1=1:
10% maximal IV size 16.38
15% maximal IV size 8.96
20% maximal IV size 6.66
25% maximal IV size 5.53
Source: Stock-Yogo (2005). Reproduced by permission.
NB: Critical values are for Cragg-Donald F statistic and i.i.d. errors.
Weak-instrument-robust inference
Tests of joint significance of endogenous regressors B1 in main equation
Ho: B1=0 and orthogonality conditions are valid
Anderson-Rubin Wald test F(1,43)= 11.21 P-val=0.0017
Anderson-Rubin Wald test Chi-sq(1)= 11.47 P-val=0.0007
Stock-Wright LM S statistic Chi-sq(1)= . P-val= .
NB: Underidentification, weak identification and weak-identification-robust
test statistics cluster-robust
Number of clusters (1) N_clust1 = 44
Number of clusters (2) N_clust2 = 418
Number of observations N = 11001968
Number of regressors K = 110
Number of endogenous regressors K1 = 1
Number of instruments L = 110
Number of excluded instruments L1 = 1
I am using reghdfe from SSC in Stata 18 to run an IV regression on a data set with around 10M records. I included about 800 fixed effects and used two-way clustering for clustered standard errors. Below are the output for my first stage regression. It looks like I failed to reject the H_0 that my instrument is underidentified, but I can reject the H_0 for weak instrument. These results seem to be inconsistent, and I hope if anyone could help me to interpret them.
Thank you in advance for your help!
Summary results for first-stage regressions
-------------------------------------------
(Underid) (Weak id)
Variable | F( 1, 43) P-val | SW Chi-sq( 1) P-val | SW F( 1, 43)
1.endg | 65.52 0.0000 | 67.04 0.0000 | 65.52
NB: first-stage test statistics cluster-robust
Stock-Yogo weak ID F test critical values for single endogenous regressor:
10% maximal IV size 16.38
15% maximal IV size 8.96
20% maximal IV size 6.66
25% maximal IV size 5.53
Source: Stock-Yogo (2005). Reproduced by permission.
NB: Critical values are for i.i.d. errors only.
Underidentification test
Ho: matrix of reduced form coefficients has rank=K1-1 (underidentified)
Ha: matrix has rank=K1 (identified)
Kleibergen-Paap rk LM statistic Chi-sq(1)=1.44 P-val=0.2302
Weak identification test
Ho: equation is weakly identified
Cragg-Donald Wald F statistic 1322.26
Kleibergen-Paap Wald rk F statistic 65.52
Stock-Yogo weak ID test critical values for K1=1 and L1=1:
10% maximal IV size 16.38
15% maximal IV size 8.96
20% maximal IV size 6.66
25% maximal IV size 5.53
Source: Stock-Yogo (2005). Reproduced by permission.
NB: Critical values are for Cragg-Donald F statistic and i.i.d. errors.
Weak-instrument-robust inference
Tests of joint significance of endogenous regressors B1 in main equation
Ho: B1=0 and orthogonality conditions are valid
Anderson-Rubin Wald test F(1,43)= 11.21 P-val=0.0017
Anderson-Rubin Wald test Chi-sq(1)= 11.47 P-val=0.0007
Stock-Wright LM S statistic Chi-sq(1)= . P-val= .
NB: Underidentification, weak identification and weak-identification-robust
test statistics cluster-robust
Number of clusters (1) N_clust1 = 44
Number of clusters (2) N_clust2 = 418
Number of observations N = 11001968
Number of regressors K = 110
Number of endogenous regressors K1 = 1
Number of instruments L = 110
Number of excluded instruments L1 = 1
Comment