Panel data regression

Francesco Del Monte

Join Date: May 2022

Posts: 1
#1


02 May 2022, 06:54

Dear All,

I am a beginner working with Stata for academic purposes and I need help carrying out a panel data regression.

1) I have a sample of S&P 500 companies with which I need to investigate the direct effect of a variable x1 on y. This is fairly straightforward, as a simple regression already gives clear results for this relationship. However, I would like to use the fe or re option instead. In this first case, the Hausman test indicates that fe should be used, yet I am wondering whether it is possible to finalize the analysis with one fe model and one re model (I need re in the second part).

2) Problems arise when I introduce the second hypothesis I am investigating, namely whether the baseline effect differs across the three industries I am considering. To test it, I generated a variable that takes the value 1, 2 or 3 depending on the industry in which the company operates. The industry repeats within each company, as my panel runs from 1999 to 2013, so it is clearly a time-invariant variable. For this reason, I guess a random-effects regression would be more suitable here, as opposed to hypothesis 1. This is why I am running the regression with i.varname to indicate that the variable is categorical. But when I run the regression I have trouble interpreting the results table (e.g. should the P>|z| value be higher or lower than 5% for the variable in question to be statistically significant? Or is there another way to check the overall validity and statistical significance of the model per se?). Also, do I need to use vce(cluster varname) in the command or not?

3) I am writing commands as follows:
- First model dependent variable + control variables --> xtreg y c1 c2 c3, fe/re
- Second model dependent variable + independent variable + control variables --> xtreg y x1 c1 c2 c3, fe/re
- Third model dependent variable + independent variable + moderator + control variables --> xtreg y x1 i.x2 c1 c2 c3, re

Do you think the structure of these commands is right for generating these models or should I write them in some other way?
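
For reference, the three models listed above might be written out as in the sketch below. The panel and time identifiers id and year are hypothetical names, not taken from the post, and the choice of fe or re in each line is only illustrative. Note that entering i.x2 on its own shifts the intercept by industry; letting the effect of x1 differ by industry would require an interaction such as c.x1##i.x2.

```stata
* Assumed names: id (company) and year are hypothetical panel/time identifiers
xtset id year

* Model 1: dependent variable + controls
xtreg y c1 c2 c3, fe

* Model 2: add the independent variable
xtreg y x1 c1 c2 c3, fe

* Model 3: allow the slope of x1 to differ by industry (x2 = 1, 2, 3)
xtreg y c.x1##i.x2 c1 c2 c3, re vce(cluster id)

* Joint test that the slope of x1 is the same in all three industries
testparm c.x1#i.x2
```

At the conventional 5% level, a p-value below 0.05 from testparm would suggest that the effect of x1 is not the same across the three industries.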

Thank you in advance and hope someone can help me quite urgently.

Kind Regards,
Francesco
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17673
#2

02 May 2022, 07:14

Francesco:
welcome to this forum.
1) your first (trivial) model is surely misspecified (a single predictor is obviously not enough to capture the data generating process you're investigating). In addition, I do not follow you as far as the regression model with regressand + control variables is concerned (why are the predictors missing here?)
2) while -regress- is not the first choice when you're dealing with a panel dataset (BTW: how many panels is your dataset composed of?), you're right in reporting that -xtreg,fe- wipes out (as expected) time-invariant predictors.
3) -hausman- tells you that -fe- is the way to go. If that is the case, and you want a coefficient for the time-invariant predictor(s), you may want to consider the Mundlak correction (just type -search mundlak- from within Stata to spot and install it);
4) Please also note that, if -fe- is the way to go, -re- estimates are unreliable (because -re- is not consistent in this scenario).
5) you do not tell interested listers whether you've already checked your regression model for heteroskedasticity, serial correlation, endogeneity and appropriateness of the functional form of the regressand.
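
As a rough illustration of points 3) and 4), the -hausman- comparison and a hand-rolled version of the Mundlak approach might look like the sketch below. It assumes the data are already -xtset- with hypothetical identifiers id and year, and the mean_* variables are names made up for this example; it does not use the community-contributed command mentioned above.

```stata
* Assumed: data already xtset; id is a hypothetical company identifier
* -hausman- needs default standard errors, so no vce(cluster) here
xtreg y x1 c1 c2 c3, fe
estimates store fe
xtreg y x1 c1 c2 c3, re
estimates store re
hausman fe re

* Mundlak by hand: add the panel means of the time-varying regressors to -re-
foreach v of varlist x1 c1 c2 c3 {
    bysort id: egen mean_`v' = mean(`v')
}
xtreg y x1 i.x2 c1 c2 c3 mean_x1 mean_c1 mean_c2 mean_c3, re vce(cluster id)

* Joint test on the panel means; rejecting the null points towards -fe-
testparm mean_x1 mean_c1 mean_c2 mean_c3
```

The panel means here are computed over all available observations; if the estimation sample drops rows because of missing values, they should arguably be recomputed on that sample only.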
As an aside, blunt as it may seem, please note that urgency is the poster's concern only (especially when you should have a supervisor/teacher/professor to follow up with you during the research your dissertation will be based upon).

Kind regards,
Carlo
(Stata 19.0)