Dear Statalists,
I am currently struggling for validating the parallel assumption for the diff-in-diff analysis.
I am trying to investigate the effect of PTAs on the initiation of trade disputes. The dependent variable is the number of trade disputes between countries i and j for a given year. The dependent variable contains 84% of zeros since there isn't any trade dispute between many country pairs.
Enlighted by this thread, I am therefore using ppmlhdfe. My command is ppmlhdfe no_ad_cases pta gdp_growth3_o gdp_growth3_d lag1_lnrer lag1_lnimp, a(year id) cluster(id) irr nolog
where pta is the variable of interest and pta=1 if a country-pair enters into a PTA since year t, otherwise it is 0. In other words, the treatment group is the country-pair has a PTA and the controls are these country-pairs without any PTA during my sample period. I have included control variables such as bilateral trade, real exchange rate, GDP growth rate, country-pair fixed effect, and year fixed effects. The data is unbalanced, N=30,515 and T=40. Due to countries entering into PTA at a different time, e.g. US-Canada formed NAFTA in 1994 but Korea-Chile implemented their PTA in 2004, I am in the staggered diff-in-diff world.
I would like to do a placebo test, i.e., use the data the came before the treatment went into effect. The procedure is as follows:
I am aware that the dynamic diff-in-diff including lags and leads and also several new commands such as "eventstudyinteract" developed by Sun and Abraham (2020) can verify the parallel assumption. But I am NOT sure whether eventstudyinteract could accommodate Poisson model.I also tried include both leads and lags. Below is what the graph looks like, which does not look like the parallel assumption holds. Because ideally, we would like to have the Incident rate ratio (IRR) above 1 but insignificant before the shock and IRR smaller than 1 and significant after the shock.

I would greatly appreciate it if everyone can guide me on how to verify the parallel assumption in the count model setup as I am really clueless.
Regards,
Bingzi
I am currently struggling for validating the parallel assumption for the diff-in-diff analysis.
I am trying to investigate the effect of PTAs on the initiation of trade disputes. The dependent variable is the number of trade disputes between countries i and j for a given year. The dependent variable contains 84% of zeros since there isn't any trade dispute between many country pairs.
Enlighted by this thread, I am therefore using ppmlhdfe. My command is ppmlhdfe no_ad_cases pta gdp_growth3_o gdp_growth3_d lag1_lnrer lag1_lnimp, a(year id) cluster(id) irr nolog
where pta is the variable of interest and pta=1 if a country-pair enters into a PTA since year t, otherwise it is 0. In other words, the treatment group is the country-pair has a PTA and the controls are these country-pairs without any PTA during my sample period. I have included control variables such as bilateral trade, real exchange rate, GDP growth rate, country-pair fixed effect, and year fixed effects. The data is unbalanced, N=30,515 and T=40. Due to countries entering into PTA at a different time, e.g. US-Canada formed NAFTA in 1994 but Korea-Chile implemented their PTA in 2004, I am in the staggered diff-in-diff world.
I would like to do a placebo test, i.e., use the data the came before the treatment went into effect. The procedure is as follows:
- Use only the data that came before the treatment went into effect.
- Pick a fake treatment period.
- Estimate the same difference-in-differences model you were planning to use (for example Y=αt+αg+β1Treated+ε), but create the Treated variable as equal to 1 if you’re in the treated group and after the fake treatment date you picked.
- If you find an “effect” for that treatment date where there really shouldn’t be one, that’s evidence that there’s something wrong with your design, which may imply a violation of parallel trends.
I am aware that the dynamic diff-in-diff including lags and leads and also several new commands such as "eventstudyinteract" developed by Sun and Abraham (2020) can verify the parallel assumption. But I am NOT sure whether eventstudyinteract could accommodate Poisson model.I also tried include both leads and lags. Below is what the graph looks like, which does not look like the parallel assumption holds. Because ideally, we would like to have the Incident rate ratio (IRR) above 1 but insignificant before the shock and IRR smaller than 1 and significant after the shock.
I would greatly appreciate it if everyone can guide me on how to verify the parallel assumption in the count model setup as I am really clueless.
Regards,
Bingzi
Comment