Interact dummy of dependent variable with independent variable

高佳

Join Date: Jan 2016

Posts: 74
#1

Interact dummy of dependent variable with independent variable

08 Jan 2025, 20:08

Hi all,

I have a resarch setting in which we have strong theoretical support that the effect of X (variable of interest) on Y (the dependent variable) is conditional on threshhold of Y. For instance, we speculate that the effect of X on Y depends on whether Y is positive or negative according to the prospect theory. Per my own knowledge, the most intuitive research design is the following:

Code:

Y ~ X + Z + X* Z + u

In which

Code:

Z = 1 if Y > 0, Z = 0 otherwise

.

The model will be estimated by OLS. I can mathmatically prove that when E[u|X] = 0 holds, then E[u|X*Z] = 0 holds (check the link here). I have two questions regarding this approach:

1, Can I use this research design judging from the causal inference perspective or more generally, the econometrics perspective?
2, Is there any research papers that use such a design?

Last edited by 高佳; 08 Jan 2025, 20:38.
Tags: None
George Ford

Join Date: Aug 2014

Posts: 2932
#2

09 Jan 2025, 16:07

Y = XB + e, so you are conditioning on e. There will be endogeneity.

Do you have anything exogenous to work with? Maybe the lag of Y would work.
Comment
高佳

Join Date: Jan 2016

Posts: 74
#3

09 Jan 2025, 19:10

Originally posted by George Ford View Post

Y = XB + e, so you are conditioning on e. There will be endogeneity.

Do you have anything exogenous to work with? Maybe the lag of Y would work.

Hi George, thanks for your comments. I'm not quite following what you are saying. Are you suggesting that even if X itself is exogenous, X*Z would still be endegenous?

It is hard to find something exogenous in this case becaus Z is a demterministic function of Y.
Comment
George Ford

Join Date: Aug 2014

Posts: 2932
#4

10 Jan 2025, 09:00

You need Y to be independent of e, which it is not.
Comment
高佳

Join Date: Jan 2016

Posts: 74
#5

11 Jan 2025, 19:05

Originally posted by George Ford View Post

You need Y to be independent of e, which it is not.

I doubt that wil ever be satisfied.
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2078
#6

13 Jan 2025, 13:41

What you're doing is the same as estimating two different linear models -- assuming you're including an overall intercept. One model is for Y > 0 and the other for Y < 0. It's unusual to do that because what sense does it make to hold Z fixed and then change X when changing X might make Y go from negative to positive? So I think you need to rethink what you want to learn.
Comment
高佳

Join Date: Jan 2016

Posts: 74
#7

13 Jan 2025, 20:50

Originally posted by Jeff Wooldridge View Post

What you're doing is the same as estimating two different linear models -- assuming you're including an overall intercept. One model is for Y > 0 and the other for Y < 0. It's unusual to do that because what sense does it make to hold Z fixed and then change X when changing X might make Y go from negative to positive? So I think you need to rethink what you want to learn.

Dear Dr. Wooldridge,

Thank you so much for your insights. This is surely a problem if X itself could explain the variance of Y or could predict the direction of Y. But in our specific research setting, we have strong theoretical argument that X should not affect the direction of Y. Instead, X should only explian Y depends on the direction of Y. That's why we are interested in this model specification in the first place. Shall there be any other model specification that could help us to investiagte this question, we are more than happy to look into it.

Nonetheless, we are aware of this issue you mentioned and test these possibilities by running the following two models:

Code:

Y= X + u

Code:

Pr(Y=1) = X + e

In either model, we find no statistically significant effect of X on Y.

Given the theoretical argument and empirical evidence that X could not predict the direction of Y, could we use the model specification in the original post to investigate the research question we have?
Comment
George Ford

Join Date: Aug 2014

Posts: 2932
#8

14 Jan 2025, 11:51

If you have other Xs, I wonder if semipar would work.
Comment
高佳

Join Date: Jan 2016

Posts: 74
#9

15 Jan 2025, 05:19

Originally posted by George Ford View Post

If you have other Xs, I wonder if semipar would work.

Thanks, but I I'm not quite following, what is semipar?
Comment
George Ford

Join Date: Aug 2014

Posts: 2932
#10

15 Jan 2025, 08:49

Code:

sysuse auto, clear semipar mpg foreign, nonpar(weight)

at a minimum, this will give some idea about whether there's a difference across the range of YX.

Last edited by George Ford; 15 Jan 2025, 08:52.
Comment
Rich Goldstein

Join Date: Mar 2014

Posts: 4397
#11

15 Jan 2025, 09:37

note that -semipar- is user-written and needs to be installed; use -search- to find and install
Comment
George Ford

Join Date: Aug 2014

Posts: 2932
#12

15 Jan 2025, 09:51

A finite mixture model (fmm) might be a solution.
Comment

Announcement

Interact dummy of dependent variable with independent variable

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment