Hi!
I look at a binary DV for 139 companies over 8 years. Thus, I have panel data. I have three IV, of which one is also a binary variable, and several control variables. As it is the first time that I look at a binary DV I have a few questions regarding pre-tests and the general approach and I am just insecure about the correct procedure. My approach so far:
Next, I want to include industry, country and year fixed effects. Is there a certain test that I have to do to decide whether these effects need to be included or do I just include them as common sense to reduce the risk for an omitted variable bias and reducing endogeneity concerns? If there is a test, which one and how do I apply it?
Once that is all clarfied and I have my final regression equation I would use Mc Fadden to look at the model fit and the "margins, dydx(IV) atmeans" command for interpreation correct?
Can anyone help me wioth my open questions? Any feedback on my procedure? Am I missing out an important step?
Thanks in advance!
Franka
I look at a binary DV for 139 companies over 8 years. Thus, I have panel data. I have three IV, of which one is also a binary variable, and several control variables. As it is the first time that I look at a binary DV I have a few questions regarding pre-tests and the general approach and I am just insecure about the correct procedure. My approach so far:
- Classical data cleaning / preperation including transformation to panel data via xtset id year
- Descriptive statistics including correlation and multicollinarity check (VIF). VIF below 10 -->no multicollinarity. Two of the IV are medium correlated and I may sepearte them into two equations.
- Heteroscedasticity check confirming heteroscedasticity in my sample. Thus, I thought I have to include robust standard errors via adding vce (robust) at the end of my regression equation.
Next, I want to include industry, country and year fixed effects. Is there a certain test that I have to do to decide whether these effects need to be included or do I just include them as common sense to reduce the risk for an omitted variable bias and reducing endogeneity concerns? If there is a test, which one and how do I apply it?
Once that is all clarfied and I have my final regression equation I would use Mc Fadden to look at the model fit and the "margins, dydx(IV) atmeans" command for interpreation correct?
Can anyone help me wioth my open questions? Any feedback on my procedure? Am I missing out an important step?
Thanks in advance!
Franka
Comment