I want to perform a panel data analysis. I have a dependent variable with values between 0 and 0.61. The 2,360 observations are for ~130 countries and take place between 2003-2020.
The Skewness of the data stands at 3.075139. Meaning the data is not normally distributed. What is the best way, in your opinion, to deal with this violation of assumption for a regression analysis?
I removed observations from the 99 percentile, and the Skewness decreased to 1.3. The results of the regression stayed roughly the same.
I am, however, not sure if observation deletion is justified. Should I prefer some other way to normalize the data (log+1transformation, root transformation, etc.)?
If yes, what method would you recommend?
The Skewness of the data stands at 3.075139. Meaning the data is not normally distributed. What is the best way, in your opinion, to deal with this violation of assumption for a regression analysis?
I removed observations from the 99 percentile, and the Skewness decreased to 1.3. The results of the regression stayed roughly the same.
I am, however, not sure if observation deletion is justified. Should I prefer some other way to normalize the data (log+1transformation, root transformation, etc.)?
If yes, what method would you recommend?
Comment