Dear Statalisters,
After reading through many posts, I would like to run a 2sls regression with the first stage being Poisson and the second a regular linear regression. This is because my endogenous variable is bounded between 0 and 1 (a percentage) which looks like a Poisson distribution (mass close to zero, no observations get to 1 but a few extremes get close). I have unbalanced firm panel data with t= 10 and a few hundred thousand firms. The variable of interest, y2, is at the industry level, as is the instrumental variable, z, while the outcome in the second stage is firm-level. Would the following be a correct way to implement the strategy and also get correct errors estimated? If so, would the predicted coefficient be interpreted as normally for linear regression?
I appreciate any help!
After reading through many posts, I would like to run a 2sls regression with the first stage being Poisson and the second a regular linear regression. This is because my endogenous variable is bounded between 0 and 1 (a percentage) which looks like a Poisson distribution (mass close to zero, no observations get to 1 but a few extremes get close). I have unbalanced firm panel data with t= 10 and a few hundred thousand firms. The variable of interest, y2, is at the industry level, as is the instrumental variable, z, while the outcome in the second stage is firm-level. Would the following be a correct way to implement the strategy and also get correct errors estimated? If so, would the predicted coefficient be interpreted as normally for linear regression?
Code:
ppmlhdfe y2 x1 x2 z, absorb(firm year) cluster(industry) predict y2_hat, xb reghdfe ln_y y2_hat x1 x2, absorb(firm year) cluster(industry)
Comment