Hi All,
When running a simple regression of a continuous dependent variable on a binary variable, I checked the distribution of my residuals using the following two commands (as suggested by https://www.statology.org/qq-plots-stata/):
predict resid_varname_w, residuals
qnorm resid_varname_w
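For context, here is a minimal sketch of the full sequence around those two commands. The names depvar_w (the winsorized dependent variable) and binary_x (the 0/1 regressor) are placeholders for illustration, not my actual variable names, and the two extra plots at the end are just additional views of the same residuals:

```stata
* Placeholder names: depvar_w = winsorized outcome, binary_x = 0/1 regressor
regress depvar_w binary_x

* Residuals from the regression just estimated
predict resid_varname_w, residuals

* Q-Q plot of the residuals against the normal distribution
qnorm resid_varname_w

* Optional extra views: histogram with a normal overlay,
* and residuals plotted against fitted values
histogram resid_varname_w, normal
rvfplot
```

Note that rvfplot has to be run while the regress results are still the active estimates.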
My variable is winsorized, which I think explains the results at the tail ends. I tried various transformations (e.g. taking the square root), but this did not improve the result much. I have this problem in four regressions, as shown below:

All four dependent variables have means between -0.5 and 3.5 and standard deviations between 0.2 and 3. I would really appreciate any suggestions on how to improve the results.
Cheers,
Aron