Hi everybody,
I'm trying to clean up my dataset so I can actually start analyzing the data. I started off by using [tab var1] to create an overview and then [replace var1 =. if var1 .....] to remove the 'wrong' observations from my summary statistics. However, for variables with many values I cant use the [tab] command to create an overview and see the 'wrong' outliers. Could anyone please show me a method to clean those variables with many values.
For example, (sum output)
Variable | Obs Mean Std. Dev. Min Max
-------------+------------------------------------------------------------------------
curcd | 0
at | 436197 15494.29 98727.65 0 3771200
bkvlps | 381156 9665.382 569540.6 -1.38e+07 9.47e+07
ceq | 435607 2661.754 10210.6 -136332 284434
csho | 413093 233.7273 40434.87 0 2.60e+07
-------------+------------------------------------------------------------------------
(at = asset total, bkvlps = book value per share, ceq = common equity total, csho = common shares outstanding)
I made the min / max observations that seem quite off to me bold. So, my question is how I can overview those (many value) variables and how can I clean them up.
Thanks a lot for helping me!
I'm trying to clean up my dataset so I can actually start analyzing the data. I started off by using [tab var1] to create an overview and then [replace var1 =. if var1 .....] to remove the 'wrong' observations from my summary statistics. However, for variables with many values I cant use the [tab] command to create an overview and see the 'wrong' outliers. Could anyone please show me a method to clean those variables with many values.
For example, (sum output)
Variable | Obs Mean Std. Dev. Min Max
-------------+------------------------------------------------------------------------
curcd | 0
at | 436197 15494.29 98727.65 0 3771200
bkvlps | 381156 9665.382 569540.6 -1.38e+07 9.47e+07
ceq | 435607 2661.754 10210.6 -136332 284434
csho | 413093 233.7273 40434.87 0 2.60e+07
-------------+------------------------------------------------------------------------
(at = asset total, bkvlps = book value per share, ceq = common equity total, csho = common shares outstanding)
I made the min / max observations that seem quite off to me bold. So, my question is how I can overview those (many value) variables and how can I clean them up.
Thanks a lot for helping me!
Comment