Hi!
I am doing a difference-in-difference analysis on how academic achievement (measured by a standardised test score) is affected by getting a diagnosis. My dataset is very big and I have information on background characteristics measured at birth, mental health and test scores for different school years. Individuals can be measured from 1 to 7 times depending on when they leave school.
To do my DiD I am comparing children who have a diagnosis before the 2nd grade to children who receive a diagnosis between the 2nd and 4th grade. I have test scores on their reading tests in the 2nd and 4th grade. I have reorganised my data to only include children who have both test scores. My model is:
π¦ππ‘=πΌ0+πΌ1Latediagπ+πΌ2Grade4π‘+πΏ(Latediag*Grade4)ππ‘+ π₯β²ππ‘π½+πππ‘,
where y is a standardised test score, Latediag is 1 if the child receives a diagnosis between the 2nd and 4th grade (=0 otherwise), Grade4 is equal to one if the child is in grade 4 (0= otherwise) and X is a set of background characteristics and year dummies.
Right now I am estimating the model with OLS, but I am wondering if I should use the fixed effects instead. The reason for my wondering is that one child could have his measures of the 2nd and 4th grade tests in 2010 and 2012 and another child in 2013 and 2015. If I had measures from just two time periods e.g. 2010 and 2012 then I would use OLS and I know it would give me the same result as a fixed effects estimation. Is this also the case when I have two time periods with different years?
Best regards,
Elinor
I am doing a difference-in-difference analysis on how academic achievement (measured by a standardised test score) is affected by getting a diagnosis. My dataset is very big and I have information on background characteristics measured at birth, mental health and test scores for different school years. Individuals can be measured from 1 to 7 times depending on when they leave school.
To do my DiD I am comparing children who have a diagnosis before the 2nd grade to children who receive a diagnosis between the 2nd and 4th grade. I have test scores on their reading tests in the 2nd and 4th grade. I have reorganised my data to only include children who have both test scores. My model is:
π¦ππ‘=πΌ0+πΌ1Latediagπ+πΌ2Grade4π‘+πΏ(Latediag*Grade4)ππ‘+ π₯β²ππ‘π½+πππ‘,
where y is a standardised test score, Latediag is 1 if the child receives a diagnosis between the 2nd and 4th grade (=0 otherwise), Grade4 is equal to one if the child is in grade 4 (0= otherwise) and X is a set of background characteristics and year dummies.
Right now I am estimating the model with OLS, but I am wondering if I should use the fixed effects instead. The reason for my wondering is that one child could have his measures of the 2nd and 4th grade tests in 2010 and 2012 and another child in 2013 and 2015. If I had measures from just two time periods e.g. 2010 and 2012 then I would use OLS and I know it would give me the same result as a fixed effects estimation. Is this also the case when I have two time periods with different years?
Best regards,
Elinor
Comment