extremely high f statistic

Daisy Lee

Join Date: Dec 2023

Posts: 5
#1


07 Jul 2024, 00:09

Hello.

I'm currently using the ivreghdfe command.

Below are the equations that I set up for the regressions.

Code:

y = a + b₁x1 + b₂c1 + b₃x1*c1 + u

Code:

y = a + b₁x2 + b₂c2 + b₃x2*c2 + u

(Here, x1 and x2, and likewise c1 and c2, measure similar concepts applied to different groups.
For instance, x1 is the income inequality index among men in a region, and x2 is the income inequality index among women in a region.)

Since x1 and x2 are endogenous, I use z1 and z2 as instrumental variables for x1 and x2, respectively.

So I used the commands below.

Code:

ivreghdfe y c.c1 (c.x1 c.x1#c.c1 = c.z1 c.z1#c.c1), absorb(region year) cluster(region) first savefirst

Code:

ivreghdfe y c.c2 (c.x2 c.x2#c.c2 = c.z2 c.z2#c.c2), absorb(region year) cluster(region) first savefirst
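For reference, here is a sketch of the first-stage system that I understand the first command to imply. The two endogenous terms (x1 and x1#c1) are each regressed on both excluded instruments (z1 and z1#c1), the included control c1, and the absorbed region and year effects. The π coefficients, the fixed-effect symbols μ and τ, and the errors v are my own notation, not part of the output. The second command gives the same system with x2, c2, and z2.

```latex
% First-stage equations implied by the first ivreghdfe command (assumed notation):
% r indexes region, t indexes year; \mu and \tau are the absorbed fixed effects.
\begin{align}
x_{1,rt} &= \pi_{11}\, z_{1,rt} + \pi_{12}\,(z_{1,rt} \times c_{1,rt}) + \pi_{13}\, c_{1,rt}
            + \mu^{(1)}_{r} + \tau^{(1)}_{t} + v_{1,rt} \\
x_{1,rt} \times c_{1,rt} &= \pi_{21}\, z_{1,rt} + \pi_{22}\,(z_{1,rt} \times c_{1,rt}) + \pi_{23}\, c_{1,rt}
            + \mu^{(2)}_{r} + \tau^{(2)}_{t} + v_{2,rt}
\end{align}
```

So there are two first-stage regressions per command, and each one has its own F statistic for the excluded instruments.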

I got the following first-stage results.

(results for x1)

(results for x2)

The F statistic in the first regression (15392.21) looks implausibly high.
When I drop the c1#x1 term, the F statistic of the excluded instruments in the first regression is 50.67.
(And when I drop the c2#x2 term, the F statistic in the second regression is 8.71.)

So what causes this extremely high F statistic?
If the problem is due to the c1#z1 interaction term, how can I fix it?
(Both x1 and the x1#c1 interaction term are necessary in my equation, so neither of them can be excluded.)

Last edited by Daisy Lee; 07 Jul 2024, 00:26. Reason: f-stat