Backward elimination begins with the largest model and eliminates variables one-by-one until we are satisfied that all remaining variables are important to the model. How to choose p- values for variable with more than two level? For example with binary variable and continuous variable we get only one p-value. But variable with more than 2 levels we get 2 or more p- values.
1) which p-value should be used to remove for categorical variable with levels - highest or lowest?
2) Should one remove the whole categorical variable with kevels based on one p-value or just remove level with highest/lowest p- value. But then problem arises with the number of observations- on removing levels regression analysis removes all the observations related that level?
1) which p-value should be used to remove for categorical variable with levels - highest or lowest?
2) Should one remove the whole categorical variable with kevels based on one p-value or just remove level with highest/lowest p- value. But then problem arises with the number of observations- on removing levels regression analysis removes all the observations related that level?
Comment