I am running a logistic regression on whether gender influences whether an undergraduate invests or not.
Gender takes the form of dummy variable: 0 = female; male = 1
Invest => yes = 1/ no = 0
I have included other explanatory variables: year group; finance module taken; income; whether someone claimed freeschool meals (freeschool)
I then want to run a conditional logistic regression, for male and females:
here I input the command to find female investors:
logit invest1 i.gen1 i.finance1 i.year1 i.income1 i.freeschool1 if (gen1==0), robust
But this flags up the message:
"note: 5.year1 != 0 predicts success perfectly;
5.year1 omitted and 2 obs not used.
note: 5.income1 != 0 predicts failure perfectly;
5.income1 omitted and 1 obs not used.
note: 0.gen1 omitted because of collinearity."
Running the next conditional logistic regression for male investors:
logit invest1 i.gen1 i.finance1 i.year1 i.income1 i.freeschool1 if (gen1==1), robust
and this comes up with the message:
note: 2.year1 != 0 predicts success perfectly;
2.year1 omitted and 2 obs not used.
I read in other forums that the "firthlogit" can overcome issues of perfect predictions, however when I input this, stata states this command is unrecognized?
1. How do I overcome the collinearity issues in the 1st conditional regression?
2. How do I overcome the issue of the perfect prediction?
Note: my sample size is very small of 105 obs and so splitting the conditional regression makes the sample even smaller.
Gender takes the form of dummy variable: 0 = female; male = 1
Invest => yes = 1/ no = 0
I have included other explanatory variables: year group; finance module taken; income; whether someone claimed freeschool meals (freeschool)
I then want to run a conditional logistic regression, for male and females:
here I input the command to find female investors:
logit invest1 i.gen1 i.finance1 i.year1 i.income1 i.freeschool1 if (gen1==0), robust
But this flags up the message:
"note: 5.year1 != 0 predicts success perfectly;
5.year1 omitted and 2 obs not used.
note: 5.income1 != 0 predicts failure perfectly;
5.income1 omitted and 1 obs not used.
note: 0.gen1 omitted because of collinearity."
Running the next conditional logistic regression for male investors:
logit invest1 i.gen1 i.finance1 i.year1 i.income1 i.freeschool1 if (gen1==1), robust
and this comes up with the message:
note: 2.year1 != 0 predicts success perfectly;
2.year1 omitted and 2 obs not used.
I read in other forums that the "firthlogit" can overcome issues of perfect predictions, however when I input this, stata states this command is unrecognized?
1. How do I overcome the collinearity issues in the 1st conditional regression?
2. How do I overcome the issue of the perfect prediction?
Note: my sample size is very small of 105 obs and so splitting the conditional regression makes the sample even smaller.
Comment