Hello.
I am working with a panel dataset of 3 years (2012, 2015, 2018). I am trying to see the effects of health shock on household consumption. So, the main regression includes consumption as the dependent variable. However, other regression models that I consider would include different dependent variables such as loans, assets, savings etc. I want to conduct the analysis in the overall sample but also want to do it for different quintiles.
Concern: How to divide the panel data into quintiles based on the consumption expenditure. The panel is balanced.
1) I ran this code:
I think the quintiles are then assigned based on 2012 consumption. Is this the right approach?
2) The reason I am not using xtqreg is because I believe this creates quintiles based on the dependent variable. Please correct me if I am wrong. So, in the regression of savings on health, quintiles will be based on savings, not consumption (which I want).
3) Whether estimating the poverty line based on the CBN method to divide households into poor and nonpoor using the baseline data 2012 and then constructing the panel for each subsample would be wise? Advice appreciated.
I am working with a panel dataset of 3 years (2012, 2015, 2018). I am trying to see the effects of health shock on household consumption. So, the main regression includes consumption as the dependent variable. However, other regression models that I consider would include different dependent variables such as loans, assets, savings etc. I want to conduct the analysis in the overall sample but also want to do it for different quintiles.
Concern: How to divide the panel data into quintiles based on the consumption expenditure. The panel is balanced.
1) I ran this code:
Code:
xtset id year egen tag=tag(id) xtile group=total_consumption if tag, nq(3) bysort id (tag) : replace group = group[_N]
2) The reason I am not using xtqreg is because I believe this creates quintiles based on the dependent variable. Please correct me if I am wrong. So, in the regression of savings on health, quintiles will be based on savings, not consumption (which I want).
3) Whether estimating the poverty line based on the CBN method to divide households into poor and nonpoor using the baseline data 2012 and then constructing the panel for each subsample would be wise? Advice appreciated.
Comment