Hello,
I have an unbalanced panel data set where my variables are not normally distributed. At the same time I have a variable with outliers. These outliers have values that definitely do not make sense and from my point of view represent input errors in the data that I cannot correct afterwards. I want to identify the outliers and then exclude them from my calculation. Due to the fact that my varaibles are not normally distributed, I cannot use many common methods to identify and handle outliers. Thus, I have tried the methods "median of absolute differences (mad) and "double mad". However, from my point of view, this excluded too many cases that are not outliers. Are there other methods that I can use here?
I use Stata 14.2 and here is also some information about the variable called "sales":
tabstat sales

qnorm sales

graph box sales

Thanks for the support.
I have an unbalanced panel data set where my variables are not normally distributed. At the same time I have a variable with outliers. These outliers have values that definitely do not make sense and from my point of view represent input errors in the data that I cannot correct afterwards. I want to identify the outliers and then exclude them from my calculation. Due to the fact that my varaibles are not normally distributed, I cannot use many common methods to identify and handle outliers. Thus, I have tried the methods "median of absolute differences (mad) and "double mad". However, from my point of view, this excluded too many cases that are not outliers. Are there other methods that I can use here?
I use Stata 14.2 and here is also some information about the variable called "sales":
tabstat sales
qnorm sales
graph box sales
Thanks for the support.
Comment