Dear all,
I have a simple First-Difference model with two time periods. I have two independent variables that capture average temperature and precipitation and a dependent variable on how many migrants a household has.
I create variables that capture the differences between the two periods. I use a Binary dependent variable (M): 1 if household increased its number of migrants, 0 else. And then I run a simple OLS, i.e. limited dependent variable model: reg M c.T c.P if URBAN==0, noconstant vce(cluster DISTRICT)
I would like to predict the total number of households in the population that increased their number of migrants as a result first of a) temperature variation b) precipitation variation c) both.
I know I need to use sampling weights.
But otherwise I am not sure how to do that.
My idea was the following:
Is this a correct approach? How can I do it separately for the two variables?
I have a simple First-Difference model with two time periods. I have two independent variables that capture average temperature and precipitation and a dependent variable on how many migrants a household has.
I create variables that capture the differences between the two periods. I use a Binary dependent variable (M): 1 if household increased its number of migrants, 0 else. And then I run a simple OLS, i.e. limited dependent variable model: reg M c.T c.P if URBAN==0, noconstant vce(cluster DISTRICT)
I would like to predict the total number of households in the population that increased their number of migrants as a result first of a) temperature variation b) precipitation variation c) both.
I know I need to use sampling weights.
But otherwise I am not sure how to do that.
My idea was the following:
- run the model
- predict the probabilities
- generate a variable by multiplying the probability with the total number of migrants
- total of the variable should be the nr. of displaced people in response to P and T
Is this a correct approach? How can I do it separately for the two variables?
Comment