Hi:
I am new to this forum so I apologize in advance for any naivety or lack of understanding of rules and regulations. I am also new to survival/hazard rate modeling though not to econometrics in general.
My problem is as follows. I am analyzing a survival model that examines the issue of how many months newly hired employees remain before they quit. I am using a logistic regression model. There are approximately 14,000 employees hired at different times during the study period (about 13 years). I have some non-time-varying covariates (NTVC) that are fixed; e.g., age at hire, gender, etc. and some that vary TVC); e.g., wages. The data are arranged such that there is a single observation for each time period (month) that the employee is “At risk” so we have a total of approximately 700,000 observations (employee-months). Each employee hired appears in the data once. The model looks like this:
Logit(Y(it))=A*TVC(it) + B*NTVC(it) + C*month1_monthT,
Where Y is 1 for quitting and zero elsewhere; A, B and C are parameter vectors of appropriate dimension; month1_monthT is a set of dummy variables for each possible value of t; i and t denote the individual and the number of months the individual has been at risk so far, respectively, so if he/she is in their 4th month of employment since being hired, t=4, and so on. The Stata code is simply something like this:
logit Y m1-m144 TVC NTVC.
My question is whether their needs to be an adjustment of the standard errors produced by the standard logit command. According to Paul Allison’s book “Survival Analysis Using SAS” pp. 246-7 and Allison (1982), "Discrete-Time Methods for the Analysis of Event Histories", Sociological Methodology, Vol. 13 (1982) pp. 61-98, see:
http://statisticalhorizons.com/wp-co...lison.SM82.pdf,
there is no need to adjust the standard errors.
However, according to Tyler Shumway, “Forecasting Bankruptcy More Accurately: A Simple Hazard Model”, The Journal of Business, Vol. 74, No. 1 (January 2001), pp. 101-124:
http://www.rcg.ch/resources/Forecast...ard_Model1.pdf,
it is necessary to adjust the degrees of freedom to approximately equal the number employees in the model (14,000) rather than the number of employee months (700,000). This is to be done by dividing the 700,000 by the average number of employee months (in my case that is about 51).
To adjust or not-adjust the standard errors output from the logit command? That is the question. Do I adjust the Stata output’s standard errors (essentially multiplying them by the square root of 51, in my example) or just leave them as they are?
Thanks,
Michael
I am new to this forum so I apologize in advance for any naivety or lack of understanding of rules and regulations. I am also new to survival/hazard rate modeling though not to econometrics in general.
My problem is as follows. I am analyzing a survival model that examines the issue of how many months newly hired employees remain before they quit. I am using a logistic regression model. There are approximately 14,000 employees hired at different times during the study period (about 13 years). I have some non-time-varying covariates (NTVC) that are fixed; e.g., age at hire, gender, etc. and some that vary TVC); e.g., wages. The data are arranged such that there is a single observation for each time period (month) that the employee is “At risk” so we have a total of approximately 700,000 observations (employee-months). Each employee hired appears in the data once. The model looks like this:
Logit(Y(it))=A*TVC(it) + B*NTVC(it) + C*month1_monthT,
Where Y is 1 for quitting and zero elsewhere; A, B and C are parameter vectors of appropriate dimension; month1_monthT is a set of dummy variables for each possible value of t; i and t denote the individual and the number of months the individual has been at risk so far, respectively, so if he/she is in their 4th month of employment since being hired, t=4, and so on. The Stata code is simply something like this:
logit Y m1-m144 TVC NTVC.
My question is whether their needs to be an adjustment of the standard errors produced by the standard logit command. According to Paul Allison’s book “Survival Analysis Using SAS” pp. 246-7 and Allison (1982), "Discrete-Time Methods for the Analysis of Event Histories", Sociological Methodology, Vol. 13 (1982) pp. 61-98, see:
http://statisticalhorizons.com/wp-co...lison.SM82.pdf,
there is no need to adjust the standard errors.
However, according to Tyler Shumway, “Forecasting Bankruptcy More Accurately: A Simple Hazard Model”, The Journal of Business, Vol. 74, No. 1 (January 2001), pp. 101-124:
http://www.rcg.ch/resources/Forecast...ard_Model1.pdf,
it is necessary to adjust the degrees of freedom to approximately equal the number employees in the model (14,000) rather than the number of employee months (700,000). This is to be done by dividing the 700,000 by the average number of employee months (in my case that is about 51).
To adjust or not-adjust the standard errors output from the logit command? That is the question. Do I adjust the Stata output’s standard errors (essentially multiplying them by the square root of 51, in my example) or just leave them as they are?
Thanks,
Michael
Comment