How to obtain "generalised residuals" in control function approach?

Kushneel Prakash

Join Date: Nov 2018

Posts: 13
#1

How to obtain "generalised residuals" in control function approach?

09 Jan 2022, 21:02

I am in the midst of running "control function approach" where in the first stage I am using xtlogit command (endogenous variable is binary) and xtreg command (outcome variable is continuous) in the second stage to deal with endogeneity. I need to obtain predicted "generalised residuals" after running first stage regression and use that as an additional variable in the second stage. I have not yet been able to figure out a way to do it in Stata. Looking for directions from this group on how I can achieve this using Stata code. Thanks.
Tags: control function approach, endogeneity, panel data
George Ford

Join Date: Aug 2014

Posts: 3035
#2

11 Jan 2022, 18:46

I think the "score" option on predict gets you the GR.
Comment
Clemence Kieny

Join Date: Dec 2021

Posts: 6
#3

13 Jan 2022, 07:02

Hi,

I am doing something similar albeit without panel data and using a probit first stage.

I have computed the GR using 3 different codes that reassuringly appear to give the exact same results.

Z is my instrument.

In the second stage I just include the GRs along wit my control variables.

************************************************** ************************************************** ************
** Method 1 **

Code:

probit EEV Z $controls predict gr1, score

** Method 2 **
Based on : https://www.statalist.org/forums/for...ith-panel-data

Code:

probit EEV Z $controls predict probitxb, xb gen pdfprobit = normalden(probitxb) gen cdfprobit = normal(probitxb) gen lamda = pdfprobit/cdfprobit gen pdfprobit_n = normalden(-probitxb) gen cdfprobit_n = normprob(-probitxb) gen lamda_n = pdfprobit_n/cdfprobit_n gen gr2 = EEV*lamda - (1 - EEV)*lamda_n

** Method 3 **
Based on https://www.stata.com/statalist/arch.../msg00650.html

Code:

probit EEV Z $controls predict xb, xb gen gr3 = cond(EEV == 1, normalden(xb)/normal(xb), -normalden(xb)/(1-normal(xb)))

************************************************** ************************************************** ************
I don't understand why Method 1 gives the same results as 2 and 3 (which are obviously the same), since it's supposed to report the "first derivative of the log likelihood with respect to xb", but it apparently works.

george, do you happen to have an explanation as to why the predict, score command actually computes the GR?
Comment
George Ford

Join Date: Aug 2014

Posts: 3035
#4

14 Jan 2022, 16:10

Because it is useful, I guess.
Comment
Kushneel Prakash

Join Date: Nov 2018

Posts: 13
#5

17 Jan 2022, 01:20

Thanks George Ford and Clemence Kieny. I note with a panel dataset, method 2 and method 3 to compute generalised residual also does give exactly same numbers. Thank you for sharing this.

But method 1 does not work as predict score is only available after xtlogit..., fixed effects. And in my case, running xtlogit with fixed effects drops lot of observations, so I am going with random effects. Could there be a way to obtain generalised residuals after xtlogit fixed effects while keeping all observations?

What I have: xtlogit a b c $control, fe

where a is my endogenous variable that takes a value of 0/1. b and c are my IV's which also takes a value of 0/1.
I get an error that states:
note: multiple positive outcomes within groups encountered.
note: 14,250 groups (125,319 obs) omitted because of all positive or all negative outcomes.

Any more insights into my problem would help.
Comment
George Ford

Join Date: Aug 2014

Posts: 3035
#6

17 Jan 2022, 13:00

Sounds like the outcomes=1 (or 0) for all or nearly all observations within your fixed effect. Is the FE a natural interpretation? Could you use a higher level of fixed effect?
Comment
Kushneel Prakash

Join Date: Nov 2018

Posts: 13
#7

17 Jan 2022, 18:10

Hi George Ford. Thanks for your reply. My outcome variable is at individual level and so is my fixed effect. I do not think I can use a higher level of fixed effect.
Comment
George Ford

Join Date: Aug 2014

Posts: 3035
#8

19 Jan 2022, 10:41

Look at this, at p. 13-15. It explains how to handle dichotomous first stage for CF.
I'm curious if you exclude the fixed effects from the first stage and still produce a valid CF (still consistent?). Sounds like a question for Wooldridge (among others) who frequents Statalist.

HTML Code:

https://www.irp.wisc.edu/newsevents/workshops/appliedmicroeconometrics/participants/slides/Slides_14.pdf
Comment
Kushneel Prakash

Join Date: Nov 2018

Posts: 13
#9

20 Jan 2022, 23:15

Thanks George Ford. Indeed. Hoping Jeff Wooldridge could shed some light on this; where we want to run 1st stage panel FE model with endogenous binary outcome and then in 2nd stage, we have a outcome that is continuous. How to implement control function approach for this and generate the associated generalised residual that is to be used in this method.
Comment
Eva Boonaert

Join Date: Oct 2019

Posts: 26
#10

05 Dec 2024, 06:30

Dear Kushneel Prakash, have you found a solution to your problem? I have a similar problem with generating many missing values after generating the generalized residuals via:

Code:

predict u2h_fe, score
Comment

Announcement

How to obtain "generalised residuals" in control function approach?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment