Dear statalist users
My aim:
I´m running a logistic regression with an interaction term (EducationLevel##EthnicGroup).
Dependent variable is participation in training. I want to compare marginal effects of participation between the two ethnic groups by educational level (low, middle, high).
.
My Sample:
My problem is that one of the ethnic groups (EthnicGroup1) is rather large (n=6212) so that all educational levels show an even amount of events (see table group1) whereas the second group (EthnicGroup2) is small (n=315) and the lower educational level shows only 13 events (see table group2).
My Question:
Should I consider using estimation for rare data (i.e. Firth Method)?
Thank you for your help
Sara
My aim:
I´m running a logistic regression with an interaction term (EducationLevel##EthnicGroup).
Dependent variable is participation in training. I want to compare marginal effects of participation between the two ethnic groups by educational level (low, middle, high).
Code:
logistic i.EducationLevel##i.EthnicGroup .margins, dydx(EthnicGroup)at(EducationLevel=(1(1)3))
My Sample:
My problem is that one of the ethnic groups (EthnicGroup1) is rather large (n=6212) so that all educational levels show an even amount of events (see table group1) whereas the second group (EthnicGroup2) is small (n=315) and the lower educational level shows only 13 events (see table group2).
Code:
. tab Education Participation Participation in training Education Level (group1) 0 1 Total ------------------+----------------------+---------- low 836 304 1140 middle 2041 1278 3319 high 895 865 1760 ------------------+----------------------+---------- Total 3772 2447 6212 . tab Education Participation Participation in training Education Level (group2) 0 1 Total ------------------+----------------------+---------- low 91 13 104 middle 112 25 137 high 47 27 74 ------------------+----------------------+---------- Total 250 65 315
My Question:
Should I consider using estimation for rare data (i.e. Firth Method)?
Thank you for your help
Sara
Comment