Dear Statalist community,
I am currently working on a research project that involves estimating a Poisson Pseudo-Maximum Likelihood (PPML) model in Stata. My dataset is structured in a panel dyadic format, representing origin-to-destination firm migration.
Considering the origin municipality as "i" and the destination as "j," to apply the most recommended level of fixed effects in a three-way framework (e.g., Yotov et al., 2016), namely pair fixed effects, one needs to have explanatory variables that vary at the "ij,t" level. Contrary to what is common in trade literature, I do not have such variables. Instead, I aim to estimate a variable in differences, i.e., the difference between the time required to access motorways at the destination and that at the origin. For example:
delta_X_itj = x_jt - x_it
However, upon estimating the model using
, the result yields zeros (omitted). My intuition suggests that this outcome makes sense as my variable is after all a linear combination of both "jt" and "it" variables, potentially leading it to be absorbed by the fixed effects.
So, the command is:
Where tt_mtw_ar_diff is the variable of interest; log_distance_rn_rdist is the distance by road (time-variant) between origin I and destination j
However, upon running the
command, which incorporates the correction proposed by Weidner and Zylkin (2021), I obtain a negative coefficient (different from zero). In other words, it seems peculiar to me that with this correction, the variable of interest can be estimated, whereas without it, it cannot. Is this an expected outcome? Am I doing something wrong?

Thank you for your help and insights.
Kind Regards,
Mauricio.
I am currently working on a research project that involves estimating a Poisson Pseudo-Maximum Likelihood (PPML) model in Stata. My dataset is structured in a panel dyadic format, representing origin-to-destination firm migration.
Considering the origin municipality as "i" and the destination as "j," to apply the most recommended level of fixed effects in a three-way framework (e.g., Yotov et al., 2016), namely pair fixed effects, one needs to have explanatory variables that vary at the "ij,t" level. Contrary to what is common in trade literature, I do not have such variables. Instead, I aim to estimate a variable in differences, i.e., the difference between the time required to access motorways at the destination and that at the origin. For example:
delta_X_itj = x_jt - x_it
However, upon estimating the model using
Code:
ppmlhdfe
So, the command is:
Code:
ppmlhdfe y tt_mtw_ar_diff /// log_distance_rn_rdist control1_diff control2_diff control3_diff control4_diff , absorb(origin#year destination#ano origin#destination) vce(cluster orgin#destination) d predict lambda matrix beta = e(b) ppml_fe_bias y tt_mtw_ar_diff, i(ldest) j(ldest_destino) t(ano) lambda(lambda) beta(beta)
However, upon running the
Code:
ppml_fe_bias
Thank you for your help and insights.
Kind Regards,
Mauricio.
Comment