Dear All,
I am running a regression in which I have reasons to believe that the dependent variable could be potentially affected by the end of cold war i.e., 1990. I have a panel data of 70 countries across different regions from 1980-2015.
Given this background, I have two questions:
Q1) Which of the following strategy is the right one to introduce the dummy variable for Cold War.
Strategy I
Strategy II
Strategy III
DV stands for dependent variable and IV represent a set of independent variables.
I am struggling to differentiate between the logic of above three regressions. I am getting very different results from each of these. In short, which one is most appropriate to evaluate the influence of the end of Cold War (Year 1990) on the dependent variable?
Q2) As I have 70 countries in my data across 4 geographical regions. I want to control the heterogeneity in terms of countries. The idea behind introducing geographical dummies is three folds: First, I want to examine if the key or main IV is affecting the DV differently in each region. Second, if geographical location of the countries get some favourable treatment in terms of DV. Third, if geographical location of the countries get some favourable treatment in terms of IV.
Strategy I
Strategy II
Strategy III
I dropped one dummy to avoid dummy variable trap from Strategy I and II. Now, my confusion is again to choose the right strategy as well as its interpretation. The above 3 regressions produced very different results. How to choose between them?
Moreover, what does the sign and coefficient on the dummy tell us?
Looking forward to your response.
Best regards,
Imran Khan
I am running a regression in which I have reasons to believe that the dependent variable could be potentially affected by the end of cold war i.e., 1990. I have a panel data of 70 countries across different regions from 1980-2015.
Given this background, I have two questions:
Q1) Which of the following strategy is the right one to introduce the dummy variable for Cold War.
Strategy I
Code:
reg DV IV 1990.year
Code:
generate CWDummy = 0 replace CWDummy =1 if (Year>1990) reg DV IV CWDummy
Code:
generate CWDummy = 0 replace CWDummy =1 if (Year<1990) reg DV IV CWDummy
I am struggling to differentiate between the logic of above three regressions. I am getting very different results from each of these. In short, which one is most appropriate to evaluate the influence of the end of Cold War (Year 1990) on the dependent variable?
Q2) As I have 70 countries in my data across 4 geographical regions. I want to control the heterogeneity in terms of countries. The idea behind introducing geographical dummies is three folds: First, I want to examine if the key or main IV is affecting the DV differently in each region. Second, if geographical location of the countries get some favourable treatment in terms of DV. Third, if geographical location of the countries get some favourable treatment in terms of IV.
Strategy I
Code:
generate d1= 0 generate d2= 0 generate d3= 0 generate d4= 0 replace d1 =1 if (region ==1) replace d2 =2 if (region ==2) replace d3 =3 if (region ==3) replace d4 =4 if (region ==4) reg DV IV d1 d2 d3 reg DV IV d2 d3 d4
Code:
reg DV IV i.region
Code:
reg DV IV if region==1 reg DV IV if region==2 reg DV IV if region==3 reg DV IV if region==4
Moreover, what does the sign and coefficient on the dummy tell us?
Looking forward to your response.
Best regards,
Imran Khan
Comment