Dear members,
I have a question regarding interpreting fixed effect OLS (FEOLS) estimates for a log-modulus (neglog) transformed dependent variable.
For my thesis, I employ a large panel data set and my general estimation strategy follows usual FEOLS as in:
Yijt = Xijt + aij + bt + eijt
whereby Y is my dependent variable, X refers to a vector of independent variables, a are the fixed effects for ij (basically a territory dyad), and b the fixed effects for years, and e the error term.
Now, since my dependent variable has many extreme values both below and above 0, and because (in the best case) I would like to interpret percentage (or at least relative) changes, I have transformed it via a log-modulus (or also called neglog) transformation in this way:
Yb = sign(Y) * ln(|Y|)
There are no 0s in Y and most values are way above |10000|. I have attached a histogram of Yb.
I have learned that if I had a ln-transformed independent variable and a ln-transformed dependent variable, I can interpret the estimate as approximate percentage change.
There is not much out there on log-modulus transformed variables in regression analyses but as far as I understood, this interpretation is not possible anymore, since:
Regarding the second point: If I were to select only dyads (ij groups) from my data set (while being aware of selection bias here) which are polarity-fixed (meaning not changing signs throughout time), could I then interpret an estimate of a ln-transformed X on my Yb again as approximate percentage change? Because what I would be left with in Yb is just kind of a mean of both ln-transformed values and ln-transformed values times -1 (adequately representing an oppositely directed effect)?
I wondered if year fixed effects would create difficulties (since they include both polarities) but then I am thinking I could build the same model by including year dummies, and clearly the estimate would not be bothered by it and just reflect the value under the condition of netting out year effects?
Can anyone help me out, if there is a substantial misinterpretation from my side or if you would agree with this line of arguing? Can I interpret the estimate as approximate percentage change for the case described?
I feel at least slightly adrift with this issue...
Would appreciate any help I can get! (:
I have a question regarding interpreting fixed effect OLS (FEOLS) estimates for a log-modulus (neglog) transformed dependent variable.
For my thesis, I employ a large panel data set and my general estimation strategy follows usual FEOLS as in:
Yijt = Xijt + aij + bt + eijt
whereby Y is my dependent variable, X refers to a vector of independent variables, a are the fixed effects for ij (basically a territory dyad), and b the fixed effects for years, and e the error term.
Now, since my dependent variable has many extreme values both below and above 0, and because (in the best case) I would like to interpret percentage (or at least relative) changes, I have transformed it via a log-modulus (or also called neglog) transformation in this way:
Yb = sign(Y) * ln(|Y|)
There are no 0s in Y and most values are way above |10000|. I have attached a histogram of Yb.
I have learned that if I had a ln-transformed independent variable and a ln-transformed dependent variable, I can interpret the estimate as approximate percentage change.
There is not much out there on log-modulus transformed variables in regression analyses but as far as I understood, this interpretation is not possible anymore, since:
- a negative value could come from an original positive value (between 0 and 1) or an original negative value, thus it is arbritrary,
- a change in shift of polarity (from below 0 to above 0) may not be interpreted anymore as percentage change (and I believe even regular percentage changes of shifts from negative to positive numbers are rather difficult).
Regarding the second point: If I were to select only dyads (ij groups) from my data set (while being aware of selection bias here) which are polarity-fixed (meaning not changing signs throughout time), could I then interpret an estimate of a ln-transformed X on my Yb again as approximate percentage change? Because what I would be left with in Yb is just kind of a mean of both ln-transformed values and ln-transformed values times -1 (adequately representing an oppositely directed effect)?
I wondered if year fixed effects would create difficulties (since they include both polarities) but then I am thinking I could build the same model by including year dummies, and clearly the estimate would not be bothered by it and just reflect the value under the condition of netting out year effects?
Can anyone help me out, if there is a substantial misinterpretation from my side or if you would agree with this line of arguing? Can I interpret the estimate as approximate percentage change for the case described?
I feel at least slightly adrift with this issue...
Would appreciate any help I can get! (:
Comment