There are a bunch of datasets, e.g. world values survey, alcohol usage report etc which present good amounts of data over long periods of time.
However, these aren't really panel datasets (In some cases, they can't be, e.g. many ageing reports) - i.e. the entities surveyed aren't the same over the various "waves". These are pooled cross sections.
So typical longitudinal models such as fixed effects don't make sense.
In such cases, is Pooled OLS the best way to analyze the data in terms of relationship between variables, and if so, what sort of causality can be claimed, assuming each wave grabbed random, representative samples....
Also, if Pooled OLS is the best tool for the job, how does one run a pooled OLS in Stata? Run regressions in a loop for every year and report average coefficients? (What about SE etc?) Is there a specific command for this? Or simply something like
would serve the purpose?
However, these aren't really panel datasets (In some cases, they can't be, e.g. many ageing reports) - i.e. the entities surveyed aren't the same over the various "waves". These are pooled cross sections.
So typical longitudinal models such as fixed effects don't make sense.
In such cases, is Pooled OLS the best way to analyze the data in terms of relationship between variables, and if so, what sort of causality can be claimed, assuming each wave grabbed random, representative samples....
Also, if Pooled OLS is the best tool for the job, how does one run a pooled OLS in Stata? Run regressions in a loop for every year and report average coefficients? (What about SE etc?) Is there a specific command for this? Or simply something like
Code:
reg y x i.year, vce(cluster country)
Comment