Dear forum members,
First of all, thank you for taking the time and maybe being able to help me.
I am facing the following challenge: I am trying to replicate and extend the paper "Estimating the effect of smoking on birth outcomes using a matched panel data set" by J.Abrevaya (2006). Data on births is available for the US through NCHS (I am using '99 - '04) for a single year. Since there is no unique identifier for each mother (e.g. Soc. security number) in the data, Abrevaya implements a "matching algorithm" in order to identify mothers with several births throughout the years.
Frankly speaking, I do not know how to write the algorithm. I have prepared the data for every single year (seperate cross-sectional data sets) and would like to write an algorithm such as that the data from the single years is merged into a new panel data set given some specific matching criteria.
In a first algorithm by Abrevaya, the matching criteria are "Mother's state of birth", "Mother's race", "Child's state of birth", "Child's county of birth", and "Child's city of birth".
Is it possible to merge the data from the single years into a new panel data set, given the matching criteria? Or should I merge all the data into a (very large) new dataset and apply the matching criteria in order to sort out the data I don't want to use (and then again, how could I do that?) ?
Thank you very much in advance for your help and comments, I am very much looking forward to them.
Best,
Max
First of all, thank you for taking the time and maybe being able to help me.
I am facing the following challenge: I am trying to replicate and extend the paper "Estimating the effect of smoking on birth outcomes using a matched panel data set" by J.Abrevaya (2006). Data on births is available for the US through NCHS (I am using '99 - '04) for a single year. Since there is no unique identifier for each mother (e.g. Soc. security number) in the data, Abrevaya implements a "matching algorithm" in order to identify mothers with several births throughout the years.
Frankly speaking, I do not know how to write the algorithm. I have prepared the data for every single year (seperate cross-sectional data sets) and would like to write an algorithm such as that the data from the single years is merged into a new panel data set given some specific matching criteria.
In a first algorithm by Abrevaya, the matching criteria are "Mother's state of birth", "Mother's race", "Child's state of birth", "Child's county of birth", and "Child's city of birth".
Is it possible to merge the data from the single years into a new panel data set, given the matching criteria? Or should I merge all the data into a (very large) new dataset and apply the matching criteria in order to sort out the data I don't want to use (and then again, how could I do that?) ?
Thank you very much in advance for your help and comments, I am very much looking forward to them.
Best,
Max
Comment