Dear Statalist Community,
I am currently working with the Heckman selection model in Stata and facing a challenging situation. My model includes a substantial number of two-way fixed effects, leading to a total of over 4,000 variables. Additionally, the dataset I am working with contains more than 85,000 observations. Due to this high dimensionality, I am unable to obtain results from Stata.
Could anyone kindly provide guidance or suggestions on how to handle such a large-scale model effectively? Specifically, I am looking for advice on:
1. Strategies to manage a large number of variables in the context of the Heckman model.
2. Approaches to dealing with large datasets while ensuring the robustness and validity of the model.
Any insights, tips, or references to relevant resources would be greatly appreciated. Thank you in advance for your time and assistance.
Sincerely,
Leona
I am currently working with the Heckman selection model in Stata and facing a challenging situation. My model includes a substantial number of two-way fixed effects, leading to a total of over 4,000 variables. Additionally, the dataset I am working with contains more than 85,000 observations. Due to this high dimensionality, I am unable to obtain results from Stata.
Could anyone kindly provide guidance or suggestions on how to handle such a large-scale model effectively? Specifically, I am looking for advice on:
1. Strategies to manage a large number of variables in the context of the Heckman model.
2. Approaches to dealing with large datasets while ensuring the robustness and validity of the model.
Any insights, tips, or references to relevant resources would be greatly appreciated. Thank you in advance for your time and assistance.
Sincerely,
Leona
Comment