One of my friends has run a system GMM model and the result looks as follows. I have a confusion about the result. He applied unit root test and then found all the variables to be I(1). So he applied the system GMM on the first differenced data.
QS1:Since system GMM instruments endogenous variables by their lagged level and first differences, I am just wondering whether it is right to apply the system GMM on first differenced variables (rather than on raw data) just because we found that all variables are I(1). As per as my limited knowledge, we are to run system GMM on raw data, not on first differenced data even though we find all the variables to be I(1). Please correct me if I am wrong.
QS2: Do you think the models below are correct? In the result, ΔCOMPL means first differenced of an independent variable COMPL, ΔCOMPL(-1) means first differenced variable of lagged value of that independent variable (used as instrument). But shouldn't we have only one independent variable here out of COMPL? It should be ΔCOMPL(-1), right? Should we have both ΔCOMPL and ΔCOMPL(-1) ? Is there anything wrong in the command he applied? I applied system GMM myself once, but my knowledge is very limited but I saw some papers on world development (including this one Prichard, W., Salardi, P., & Segal, P. (2018). Taxation, non-tax revenue and democracy: New evidence using new cross-country data. World Development, 109, 295-312) and they have only one variable (either the variable is instrumented by the lagged value or if the variable is not endogenous, then the level value is given). So, should we have both ΔCOMPL and ΔCOMPL(-1) here? (ignoring the fact that he applied the model on first differenced data, although my gut says that it should be applied on raw data)
Table: System GMM result (dependent variable: GINI)
Note: Here, Δ does not come from the system GMM command, it is written because all the variables were found to be I(1) and then he converted all the raw data to first differenced before running the system GMM command.
QS1:Since system GMM instruments endogenous variables by their lagged level and first differences, I am just wondering whether it is right to apply the system GMM on first differenced variables (rather than on raw data) just because we found that all variables are I(1). As per as my limited knowledge, we are to run system GMM on raw data, not on first differenced data even though we find all the variables to be I(1). Please correct me if I am wrong.
QS2: Do you think the models below are correct? In the result, ΔCOMPL means first differenced of an independent variable COMPL, ΔCOMPL(-1) means first differenced variable of lagged value of that independent variable (used as instrument). But shouldn't we have only one independent variable here out of COMPL? It should be ΔCOMPL(-1), right? Should we have both ΔCOMPL and ΔCOMPL(-1) ? Is there anything wrong in the command he applied? I applied system GMM myself once, but my knowledge is very limited but I saw some papers on world development (including this one Prichard, W., Salardi, P., & Segal, P. (2018). Taxation, non-tax revenue and democracy: New evidence using new cross-country data. World Development, 109, 295-312) and they have only one variable (either the variable is instrumented by the lagged value or if the variable is not endogenous, then the level value is given). So, should we have both ΔCOMPL and ΔCOMPL(-1) here? (ignoring the fact that he applied the model on first differenced data, although my gut says that it should be applied on raw data)
Table: System GMM result (dependent variable: GINI)
ΔGini(-1) | -0.294*** |
ΔCOMPL | -0.063*** |
ΔCOMPL(-1) | -0.043** |
Comment