Dear all,
I am working on a data set of social safety net benefits for the elderly. Here, observation type (obs_type) '1' refers to an elderly person receiving the benefit (beneficiary) and '0' refers to an elderly person not receiving the benefit (non-beneficiary).
I am trying to find all possible pairs in which a better-off elderly person is receiving the benefit even though there are worse-off non-beneficiaries in the same district. For example, the test in District 101 is whether any comparison pairs exist in which a non-beneficiary is worse off than a beneficiary. A non-beneficiary is considered 'worse off' than a beneficiary when the non-beneficiary is worse off on at least three of the following criteria. Also, I have more observations of non-beneficiaries than of beneficiaries.
Criterion 1: age – an older person gets priority
Criterion 2: income – a person with lower income gets priority
Criterion 3: land_size – a person with a smaller land size gets priority
Criterion 4: family_status – a person living alone gets priority (can be converted into a binary variable)
Criterion 5: health_status – a person who cannot work gets priority (can be converted into a binary variable)
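To make the comparison rule concrete, here is a minimal sketch of the logic, written in Python rather than Stata purely to pin down what I mean. It assumes that `lives_alone` and `cannot_work` are 0/1 recodes of family_status and health_status, and that a tie on a criterion does not count as 'worse off'. The function names and the example values are hypothetical, not taken from my data.

```python
# Count the criteria on which a non-beneficiary is worse off than a beneficiary.
# Assumptions: lives_alone and cannot_work are 0/1 recodes of family_status and
# health_status; a tie on any criterion does not count as "worse off".

def criteria_worse_off(nonben, ben):
    """Return the number of criteria (0 to 5) that favour the non-beneficiary."""
    return sum([
        nonben["age"] > ben["age"],                  # older
        nonben["income"] < ben["income"],            # lower income
        nonben["land_size"] < ben["land_size"],      # smaller land size
        nonben["lives_alone"] > ben["lives_alone"],  # lives alone, beneficiary does not
        nonben["cannot_work"] > ben["cannot_work"],  # cannot work, beneficiary can
    ])


def is_worse_off(nonben, ben, threshold=3):
    """Apply the 'at least three criteria' rule."""
    return criteria_worse_off(nonben, ben) >= threshold


# Made-up illustrative values
ben = {"age": 65, "income": 500, "land_size": 2.0, "lives_alone": 0, "cannot_work": 0}
nonben = {"age": 72, "income": 300, "land_size": 2.0, "lives_alone": 1, "cannot_work": 0}
print(criteria_worse_off(nonben, ben))  # 3 (age, income, lives_alone)
```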
I'm thinking of creating two new variables: 1) 'paired_id', holding the id of the non-beneficiary found to be 'worse off' than the compared beneficiary (for all possible pairs), and 2) 'rank', ranking all possible pairs by the number of criteria on which the non-beneficiary is worse off. If no worse-off non-beneficiary is found, the cell will remain empty. How do I create these new variables?
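The output I have in mind can be sketched as follows, again in Python rather than Stata and only as an illustration of the intended result. It assumes each record carries `id`, `district`, `obs_type` and the five criteria (with `lives_alone` and `cannot_work` as assumed 0/1 recodes), that pairs are formed only within a district, and that rank 1 goes to the pair with the most criteria, with ties sharing a rank. The names `n_worse` and `worse_off_pairs` are hypothetical.

```python
from itertools import product


def count_worse(nb, b):
    """Number of criteria on which non-beneficiary nb is worse off than beneficiary b."""
    return sum([
        nb["age"] > b["age"],
        nb["income"] < b["income"],
        nb["land_size"] < b["land_size"],
        nb["lives_alone"] > b["lives_alone"],
        nb["cannot_work"] > b["cannot_work"],
    ])


def worse_off_pairs(rows, threshold=3):
    """Return one record per (beneficiary, worse-off non-beneficiary) pair."""
    pairs = []
    for d in sorted({r["district"] for r in rows}):
        bens = [r for r in rows if r["district"] == d and r["obs_type"] == 1]
        nonbens = [r for r in rows if r["district"] == d and r["obs_type"] == 0]
        found = []
        # Every beneficiary is compared with every non-beneficiary in the district
        for b, nb in product(bens, nonbens):
            n = count_worse(nb, b)
            if n >= threshold:
                found.append({"district": d, "id": b["id"],
                              "paired_id": nb["id"], "n_worse": n})
        # Rank within district: 1 = most criteria; tied pairs share a rank
        for p in found:
            p["rank"] = 1 + sum(q["n_worse"] > p["n_worse"] for q in found)
        pairs.extend(found)
    return pairs
```

A beneficiary with no worse-off non-beneficiary simply produces no pair, which corresponds to the empty cell described above.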
Please find a sample dataset attached. I am using Stata 15.
Thank you,
Kumar