You are not logged in. You can browse but not post. Login or Register by clicking 'Login or Register' at the top-right of this page. For more information on Statalist, see the FAQ.
I apologize in advance if this question comes off as naive, as I am only a current high school student getting into econometrics. Using your PPMLHDFE function with year fixed effects, I am investigating how recipient country factors affects zero inflated absoloute continuous positive amounts of foriegn aid given by donor countries. From reading your previous responses to this thread, I have come to the folowing conclusions:
1. Log-log: Interpret the coefficient dirrectly as a percentage change when the regressor is logged
2. Log-linear: calculate (e^b-1)*100% when the regressor is a index or dummy variable
However, I do not understand exactly why interpreting log-log and log-linear requires different sets of calculations. Aren't they both inherent in coefficient values, given the status of the PPML log-link function and regressors? To illustrate, these are my results:
My interpretation is as follows:
1. For oresMetalsReal which is logged, a 1% increase in the regressor is associated with a 1.7% increase in the dependent variable for each 10% increase in the independent variable
2. For Taiwan which is a dummy binary indicator of 1 or 0, I calculate (e^-1.8-1)*100%=-84% decrease in the dependent variable
3. For polStability which is a discrete index between -2.5 and 2.5, I calculate (e^.934)*100% = 154% increase in the dependent variable for each unit increase in the independent variable
Are these interpretations correct as they differ drastically in magnitude. Could you also elaborate on why the calculations/interpretations of coefficients for semi-elasticity and elasticity is different please Further, I have read your previous comments here: https://www.statalist.org/forums/for...ative-binomial. regarding the disutility of calculating partial effects for poisson regressions with fixed effects. To confirm, is it due to the nonlinearity of GLM models and Incidental Paramter Problems with year fixed effects?
Could you also comment on reporting dirrectly the incident rate ratios (exp(b)) for regressors in the context of a multiplicative model (positive when greater than 1, negative when less than 1). Is this applicable for both log and non-logged covariates?
I start by noting that I am not the author of the command ppmlhdfe; as detailed in the help file, the authors are Sergio Correia, Paulo Gimaraes and Tom Zylkin.
Anyway, your interpretation of the coefficients appears to be correct. Just to be clear, in 1, a 1% increase in oresMetalsReal is associated with a 0.176% increase in the expectation of the dependent variable. The reason why the interpretation is different for variables that are logged is exactly because of that: if the variable enters the model in a different way, its effect takes a different form.
Finally, indeed the computation of partial effects can be affected by the IPP and therefore it is better to avoid it (and it is generally not needed). I am afraid I do not work with IRRs so I will not comment on that.
Thank you for that explanation. I also have an additional question. Referring to your comments here (https://www.statalist.org/forums/for...uated-at-means), I estimate a secondary probit model with year dummies to consider initial selection.
1. I have 542 total observations with 48 entities (countries) in each time period (year). There are 22 years. Is this problematic?
2. How would I interpret the coefficients of said probit model? Is it with marginal effects? I've included my results below. If you could provide the calculations/explanations/code for interpreting continuous logged variables and index or dummy variables, that would be wonderful.
3. Could you recommend some statistical diagnostics/code I could perform on the PPML and probit model to check for model robustness and fit? My understanding is that traditional diagnostics don't work on non-linear models
It is a secondary model that estimates the dichotomous yes/no outcome of if a country is eligible for aid allocation or not. Given that, how would I proceed with my interpretation/diagnostics?
Dear Professor Joao Santos Silva
I am attempting to run a ppml regression with variables exports gdp commonofficial language distance common trade agreement
I am employing this data using observations from 2022 for 167 countries
I have used the ffg command ppml exports gdp commonofficial language distance common trade agreement
Considering that this model has fixed effects is it enough to only use the ppml command?
May you also assist me with commands for the Park test and the Ramsey RESET test.
I suggest you use the ppmlhdfe command instead of ppml. The results should be exactly the same, but ppmlhdfe is much faster.
I would not perform the Park test as it is not needed. For the RESET, please see the example here; it is for the poisson command, but for ppmlhdfe is not very different.
Hello! In relation to the interpretation, where I have exports (in levels), as dependent variable, and log of tariffs as independent variable. How do we interpret the coefficient of tariffs, when PPML estimator is used?
1% increase in tariffs lead to -xx% decrease in exports OR
1 percentage point increase in tariffs lead to -xx% decrease in exports
The coefficients on logged regressors are elasticities and there is no need to transform those. For regressors not in logs, the semi-elasticity is given by 100*(exp(beta) - 1)%. This is negative for negative beta and positive for positive beta; is is also approximately equal to 100*(beta)% for beta close to zero.
About #2, note that it should be (e^(-0.4)-1)*100 = -0.33%.
I was just reading through this thread and I'm a bit confused. I have learned in my econometrics classes that in a regression with the dependent variable in levels, the dummy coefficient doesn't have to be transformed with (exp(beta)-1)*100%. The coefficient just indicates the difference in the dependent variable between the two groups that are separated with it in the unit of the dependent variable. The transformation would only be necessary if our dependent variable was in log (which we don't do with ppml). Also, the coefficients of logged regressors don't represent elasticities (as would be the case with a logged DV), but only semi-elasticities.
Maybe I'm just confusing things but that's how I learned it in class - could you shed some more light on this?
What you learned applies to linear models, whereas PPML estimates an exponential model. Therefore, the interpretation of PPML estimates is exactly as if the dependent variable was in logs.
If I am not mistaken, (e^-0,4-1)*100 = -0.33%, so the effect is negative. More generally, the coefficient and the effect always have the same sign because e^0-1=0.
Comment