Dear Statalisters and Sebastian Kripfganz ,
I have a panel data which consist of weekly observations of income and monthly observations of depression score for 800 individuals. I would like to estimate a dynamic GMM using depression_score as dependent variable and lag of dependent variable and income as independent variables. However, I observe these variables at different frequency (irregular time interval, time spacing) and I have missing values. For example, individual 1 has two mental health interviews at week 5 (wave 1) and week 10 (wave 2) while individual 2 has the same interview at week 3 (wave 1) and week 7 (wave 2). Thus, I could not decide how to define time in xtset for xtdpdgmm (week or wave?). I can keep only non missing depression_score and use week as a time variable. Then, would xtdpdgmm take the difference between time elapsed between two interview for different individuals? Individual 1 has 5 weeks gap between Y and L.Y while individual 2 has 4 weeks gap. Would it be a problem for the estimation?
Or should I use wave instead? This keeps time lag between Y and L.Y same for all individuals (week 3 and 5 in wave 1, week 7 and 10 in wave 2). Do you have any suggestions?
Thanks in advance!
Best regards,
I have a panel data which consist of weekly observations of income and monthly observations of depression score for 800 individuals. I would like to estimate a dynamic GMM using depression_score as dependent variable and lag of dependent variable and income as independent variables. However, I observe these variables at different frequency (irregular time interval, time spacing) and I have missing values. For example, individual 1 has two mental health interviews at week 5 (wave 1) and week 10 (wave 2) while individual 2 has the same interview at week 3 (wave 1) and week 7 (wave 2). Thus, I could not decide how to define time in xtset for xtdpdgmm (week or wave?). I can keep only non missing depression_score and use week as a time variable. Then, would xtdpdgmm take the difference between time elapsed between two interview for different individuals? Individual 1 has 5 weeks gap between Y and L.Y while individual 2 has 4 weeks gap. Would it be a problem for the estimation?
Or should I use wave instead? This keeps time lag between Y and L.Y same for all individuals (week 3 and 5 in wave 1, week 7 and 10 in wave 2). Do you have any suggestions?
Code:
input float(id week wave) double income double float depression_score 1 1 1 100 . 1 2 1 . . 1 3 1 50 . 1 4 1 . . 1 5 1 60 12 1 6 2 . . 1 7 2 . . 1 8 2 80 . 1 9 2 . . 1 10 2 100 10 2 1 1 . . 2 2 1 50 . 2 3 1 90 8 2 4 1 . . 2 5 1 60 . 2 6 2 . . 2 7 2 100 12 2 8 2 . . 2 9 2 . . 2 10 2 . . end
Best regards,
Comment