Hello Statalist community,
I hope all of you are fine.
I am running a panel data analysis to study the effect of some bank-specific variables on NPL (non-performing loans listed on banks' balance sheets). I am carrying out Fixed effect and random effects estimation by using the command xtreg with 109 cross-sectional units adopting the vce (cluster id) specification to deal with the estimation biases that may result.
However, I would like to carry out some post-estimation analysis and better "shape" my dataset, by which I mean get rid of some outliers in my cross-sectional units.
I have read papers and previously published discussions here on the forum and it emerges that there is no clear consensus about how to detect outliers and crucial is the type of data are you working with in order to select once you discovered an outlier how to deal with it. The "sensibility" of the researcher is of great importance as well.
My request is the following:
What Stata commands are considered the most appropriate to detect outliers?
Scatterplots between two variables (independent and dependent)? Summary statistics showing quartile values?
I know maybe considering the nature of my request you may be of limited help, but still, I would like to hear different opinions.
I must say that I took notes of some cross-sectional units that for a given variable present a diverging trend compared to "that" is prevalent and dominant among the other cross-section units. I did this in excel.
I don't think scatterplots can be very helpful to allow me to detect "different" or "suspicious" trends (it does not matter how we name it as long as we both agree that present a trait of divergence compared to the main observed path).
For this reason, what Stata commands and graphs would you suggest I should use? The purpose is to get some hints and pieces of advice as I am not expecting any "right" answers.
Thanks to everybody,
Greetings,
I hope all of you are fine.
I am running a panel data analysis to study the effect of some bank-specific variables on NPL (non-performing loans listed on banks' balance sheets). I am carrying out Fixed effect and random effects estimation by using the command xtreg with 109 cross-sectional units adopting the vce (cluster id) specification to deal with the estimation biases that may result.
However, I would like to carry out some post-estimation analysis and better "shape" my dataset, by which I mean get rid of some outliers in my cross-sectional units.
I have read papers and previously published discussions here on the forum and it emerges that there is no clear consensus about how to detect outliers and crucial is the type of data are you working with in order to select once you discovered an outlier how to deal with it. The "sensibility" of the researcher is of great importance as well.
My request is the following:
What Stata commands are considered the most appropriate to detect outliers?
Scatterplots between two variables (independent and dependent)? Summary statistics showing quartile values?
I know maybe considering the nature of my request you may be of limited help, but still, I would like to hear different opinions.
I must say that I took notes of some cross-sectional units that for a given variable present a diverging trend compared to "that" is prevalent and dominant among the other cross-section units. I did this in excel.
I don't think scatterplots can be very helpful to allow me to detect "different" or "suspicious" trends (it does not matter how we name it as long as we both agree that present a trait of divergence compared to the main observed path).
For this reason, what Stata commands and graphs would you suggest I should use? The purpose is to get some hints and pieces of advice as I am not expecting any "right" answers.
Thanks to everybody,
Greetings,
Comment