Hi all,
I'm currently trying to appraise my model and would like to:
1. Test the accuracy of its predictions
2. Test the goodness of fit of the model
The dependent variable in my model has just 63 positive observations and over 5000 negative observations and I have adopted a rare events regression.
When using estat class I find that none of observations are positively classified, so none of the positive observations are predicted correctly.
data:image/s3,"s3://crabby-images/9c4f3/9c4f32c6d11c460bd39bcc25e7df45bc470028ea" alt=""
Is this just a feature of the imbalanced data set where the majority class bias skews the models ability to predict correctly? Or is there an argument for changing the Pr(D) >= .5 cut-off.
I also tried an roc curve using lroc and get an area under ROC curve of 0.8057 but I am unsure how this is interpreted.
Is there a goodness of fit test that is most suited to imbalanced data sets?
Would really appreciate any help, thanks !
I'm currently trying to appraise my model and would like to:
1. Test the accuracy of its predictions
2. Test the goodness of fit of the model
The dependent variable in my model has just 63 positive observations and over 5000 negative observations and I have adopted a rare events regression.
When using estat class I find that none of observations are positively classified, so none of the positive observations are predicted correctly.
Is this just a feature of the imbalanced data set where the majority class bias skews the models ability to predict correctly? Or is there an argument for changing the Pr(D) >= .5 cut-off.
I also tried an roc curve using lroc and get an area under ROC curve of 0.8057 but I am unsure how this is interpreted.
Is there a goodness of fit test that is most suited to imbalanced data sets?
Would really appreciate any help, thanks !
Comment