Correct for multiple testing

Marry Lee

Join Date: Nov 2020

Posts: 184
#1

Correct for multiple testing

06 Aug 2024, 12:41

Dear statalisters,

I learned about multiple testing only recently, and it seems like a serious problem.

I am studying the effect of exposure to an event on different health outcomes.
I have many dependent variables (different health outcomes) regressed on a vector of 1 key variable and other covariates, using different specifications (different fixed effects)

I have a model such as:

Code:

Y1 = betaX fixed_effects_11 Y1 = betaX fixed_effects_22 Y1 = betaX fixed_effects_33 Y2 = betaX fixed_effects_11 Y2 = betaX fixed_effects_22 Y2 = betaX fixed_effects_33

If statistical significance is the critiria for deciding on conclusions,
Can we ignore that we have many covariates in the same regression, if we only focus on one key independent variable, thus within each model we are testing only one hypothesis? Is this logic right?

If we assume that each health outcome is of interest on its own, can we ignore correction for multiple testing?

Any applied economics papers who corrected for having different models please?

Other than the Stata command

Code:

wyoung

Is there a Stata command that corrects for having different models with different dependent variables?

Thank you

References:
Jones, D., D. Molitor, and J. Reif. "What Do Workplace Wellness Programs Do? Evidence from the Illinois Workplace Wellness Study." Quarterly Journal of Economics, November 2019, 134(4): 1747-1791.
Tags: None
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17444
#2

07 Aug 2024, 02:47

Marry:
welcome to this forum.
Have you taken a look at -mvreg- and -mvreg postestimation- entries in Stata .pdf manual?

Kind regards,
Carlo
(StataNow 18.5)
1 like
Comment
Maxence Morlet

Join Date: Mar 2021

Posts: 609
#3

07 Aug 2024, 02:48

Dear Marry, in addition to Carlo's helpful response, multiple hypothesis testing adjustment might be of interest. I suggest taking a look at rwolf2 from ssc install.
2 likes
Comment
Marry Lee

Join Date: Nov 2020

Posts: 184
#4

07 Aug 2024, 08:45

Carlo Lazzaro
Thank you so much for your answer.
You are saying that since dependent variables are health outcomes that may be correlated, they may be modeled in a multivariate model.
If I understand well, since the health outcomes are correlated then I am almost testing the same hypothesis: the effect of exposure to the event is associated with better health / worse health.
Multivariate analysis will maximize power while holding the type I error rate at alpha level.
Am I getting this right?
Comment
Marry Lee

Join Date: Nov 2020

Posts: 184
#5

07 Aug 2024, 09:40

Thank you Maxence Morlet For this very interesting command.

So rwolf2 allows to correct for having different dependent variables in my analysis.
I just need to include all regressions from main analysis and heterogeneity and probably mechanisms analysis at the same time and have the adjusted p values. Did I understand this right?

Can you please explain why this command may be better and not another for correction? as I want to justify my choice of one or the other commands.
Comment
Julian Reif

Join Date: Dec 2018

Posts: 39
#6

07 Aug 2024, 10:56

The Github respository for `wyoung` provides several examples for how to use that command in a regression setting. David McKenzie provides a nice overview of different multiple hypothesis testing adjustments in his blog post. I have not used `rwolf2`, but am not aware of a reason to prefer `wyoung` or `rwolf2` over the other.

Associate Professor of Finance and Economics
University of Illinois
www.julianreif.com
2 likes
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17444
#7

07 Aug 2024, 11:10

Marry:
I would go -mvreg- as it allows you to estimate the between-equation covariances.
Unfortunately, I'm not familiar with the community-contributed modules suggested by Maxence and Julian.

Kind regards,
Carlo
(StataNow 18.5)
1 like
Comment
Marry Lee

Join Date: Nov 2020

Posts: 184
#8

07 Aug 2024, 12:27

Julian Reif Thank you so much for your confirmation and for the link to the blog. It was really helpful.
Best
Comment
George Ford

Join Date: Aug 2014

Posts: 2782
#9

07 Aug 2024, 12:58

I'd read up on it thoroughly to make sure you need the correction in your specific case.

I believe it to be the case when you estimate the coefficients simultaneously, no adjustment is required. If you estimate the models separately, then you would apply it.

mvreg is a simultaneous estimation approach. But, all the X's (including FE) must be the same in mvreg. sureg allows different Xs and could estimate 6 models.

Not clear why you have different fixed effects, unless you are looking for differences in the X coef across different levels of aggregation of the FE. If the FE are at different levels of aggregation, there are other tests to determine whether higher aggregation is legitimate (Wooldridge/Papke). I written some code to run that test and posted it on Statalist before.
2 likes
Comment
Marry Lee

Join Date: Nov 2020

Posts: 184
#10

08 Aug 2024, 04:12

Hi George Ford
Thank you for your answer
So you say that if I use mvreg (with the same Xs for all models), there is no need for correction? right?

Last edited by Marry Lee; 08 Aug 2024, 04:17.
Comment
George Ford

Join Date: Aug 2014

Posts: 2782
#11

08 Aug 2024, 08:15

I believe that is correct.
1 like
Comment
paulvonhippel

Join Date: Apr 2014

Posts: 488
#12

04 Nov 2024, 11:33

I don't think this is correct. -mvreg- does not correct for multiple tests. For example the commands

Code:

sysuse auto, clear mvreg headroom trunk turn = price mpg displ gear_ratio length weight

return exactly the same p values as if you ran three separate regressions:

Code:

reg headroom price mpg displ gear_ratio length weight reg trunk price mpg displ gear_ratio length weight reg turn price mpg displ gear_ratio length weight

If you would correct the p values of the 3 separate regressions, then you should also correct the p values returned by mvreg....
1 like
Comment

Announcement

Correct for multiple testing

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment