Dear forum,
I have two questions that I am struggling with. One is about what model I should use in Stata and two is about outlier problems with each model.
I'm a bit of a beginner but I will try to explain it to the best of my ability.
I compiled a dataset of around a 130 CEO successions of about 120 different companies. I'm wanting to test the relationship between post-succession ROA and CEO type in moderation of board composition. So, in short a moderation relationship.
DV: post-succession ROA (continuous)
IV1: CEO type (nominal with 3 types)
IV2: Board composition (nominal with 2 types)
I have several other control variables,
- Year (nominal)
- Board size (continuous)
- pre-succession ROA (continuous)
- Industry SIC 2-digit (nominal)
My first question is whether I should be doing a two-way ANOVA or a hierarchical multiple linear regression, or perhaps another model?
I tried to do the two-way ANOVA but got stuck on the outliers assumption. I could not decide if I need to remove my outliers or not, and do not know how transforming my data will affect my research. I installed the extremes tool to identify my outliers. I also used it in combination with the scatter command to assess the outliers better. I do not know if my outliers are significant and struggle on removing them or not, or let alone dealing with them.
For the hierarchical regression, I think this is simply a multiple linear regression in which I test three different equations. The first is the DV, IV1 and IV2, the second includes the interaction between IV1 and IV2, the third includes the controls. If I am wrong, please correct me.
The dwstat command brings me 2.1 so i believe to fulfill the independence assumption.
Linearity is tested for using a twoway scatter with lfit.
Here I am also wondering about outliers as some IVs or controls show widely deviating results, making me wonder if I can tell that linearity exists. Also, linearity is automatically met for categorical dummies right? I read this somewhere and wonder if it's true.
Thank you for your time and hope you can help me
I have two questions that I am struggling with. One is about what model I should use in Stata and two is about outlier problems with each model.
I'm a bit of a beginner but I will try to explain it to the best of my ability.
I compiled a dataset of around a 130 CEO successions of about 120 different companies. I'm wanting to test the relationship between post-succession ROA and CEO type in moderation of board composition. So, in short a moderation relationship.
DV: post-succession ROA (continuous)
IV1: CEO type (nominal with 3 types)
IV2: Board composition (nominal with 2 types)
I have several other control variables,
- Year (nominal)
- Board size (continuous)
- pre-succession ROA (continuous)
- Industry SIC 2-digit (nominal)
My first question is whether I should be doing a two-way ANOVA or a hierarchical multiple linear regression, or perhaps another model?
I tried to do the two-way ANOVA but got stuck on the outliers assumption. I could not decide if I need to remove my outliers or not, and do not know how transforming my data will affect my research. I installed the extremes tool to identify my outliers. I also used it in combination with the scatter command to assess the outliers better. I do not know if my outliers are significant and struggle on removing them or not, or let alone dealing with them.
For the hierarchical regression, I think this is simply a multiple linear regression in which I test three different equations. The first is the DV, IV1 and IV2, the second includes the interaction between IV1 and IV2, the third includes the controls. If I am wrong, please correct me.
The dwstat command brings me 2.1 so i believe to fulfill the independence assumption.
Linearity is tested for using a twoway scatter with lfit.
Here I am also wondering about outliers as some IVs or controls show widely deviating results, making me wonder if I can tell that linearity exists. Also, linearity is automatically met for categorical dummies right? I read this somewhere and wonder if it's true.
Thank you for your time and hope you can help me

Comment