Hi Stata,
I am using Stata/MP (8-core) on a Mac with 16 GB of RAM and running a regression with FEs. The reghdfe command looks like this:
reghdfe IHS ib7.group1event ib7.group2event ib7.group3event ib7.group4event ib7.group5event ib7.group6event ib7.group7event ib7.group8event ib7.group9event [aw=population], absorb(i.week i.group i.state) cluster(state)
The data I am using has 20 million observations. I have managed to compress it to about 300 MB. However, reghdfe runs out of memory and my system asks me to quit Stata (I have tried the options poolsize(1) and compact, and it still runs out of memory). Even when I run reghdfe on 1/10 of the data (about 2 million observations), it takes about 10 minutes to finish. But when I run the same regression with the reg command on the full sample, it returns results in 30 seconds. Any intuition about what is going on here? I thought reghdfe was designed to make regressions with many FEs faster, but that doesn't seem to be the case here. I appreciate any thoughts.