Hi All,
I am struggling with STATA as a newbie undergrad trying to finish my dissertation looking at the 'Financial impact of ESG on 81 clubs in the English Football League.' As ESG in football is quite recent, I only have a panel of two years (I know not ideal but was hoping for a third year's worth of data which never came). I am starting with a pooled OLS and then move on to looking at FE to control for heterogeneity. I have two dependent variables - revenue and profit and loss, and independent variables Governance, Financial Sustainability, Environment, and Fan engagement with an aggregated ESG score based on those four components. I am first looking at the aggregated ESG score on its own as these variables quite intuitively have high levels of colinearity so I don't include multiple independent variables in my baseline model. I have My initial code looks something like this, where there is an interaction dummy term to determine league specific impacts and attendence and leagueposition are control variables.
// Step 1: OLS regression with interaction term
reg pretaxpnl fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore
// Step 2: Fixed Effects (FE) regression
xtreg pretaxpnl fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, fe
// Step 3: Random Effects (RE) regression
xtreg pretaxpnl fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, re
// Step 1: OLS regression with interaction term
reg revenue fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore
// Step 2: Fixed Effects (FE) regression
xtreg revenue fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, fe
// Step 3: Random Effects (RE) regression
xtreg revenue fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, re
HAUSMAN
My questions are as follows:
- Do I set up tests for heteroscedasticity/autocorrelation before I run my findings (Breusch Pagan, F test, etc.) to know whether to use cluster or robust SE?
- When I run this code, I get R2 values of around 0.5 but super high p values so basically no conclusions can be made from my model - don't know how to get around this.
- As a newbie, just looking for a general bit of help with standard approaches to FE and pooled OLS and what tests to use.
Thank you.
I am struggling with STATA as a newbie undergrad trying to finish my dissertation looking at the 'Financial impact of ESG on 81 clubs in the English Football League.' As ESG in football is quite recent, I only have a panel of two years (I know not ideal but was hoping for a third year's worth of data which never came). I am starting with a pooled OLS and then move on to looking at FE to control for heterogeneity. I have two dependent variables - revenue and profit and loss, and independent variables Governance, Financial Sustainability, Environment, and Fan engagement with an aggregated ESG score based on those four components. I am first looking at the aggregated ESG score on its own as these variables quite intuitively have high levels of colinearity so I don't include multiple independent variables in my baseline model. I have My initial code looks something like this, where there is an interaction dummy term to determine league specific impacts and attendence and leagueposition are control variables.
// Step 1: OLS regression with interaction term
reg pretaxpnl fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore
// Step 2: Fixed Effects (FE) regression
xtreg pretaxpnl fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, fe
// Step 3: Random Effects (RE) regression
xtreg pretaxpnl fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, re
// Step 1: OLS regression with interaction term
reg revenue fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore
// Step 2: Fixed Effects (FE) regression
xtreg revenue fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, fe
// Step 3: Random Effects (RE) regression
xtreg revenue fairgameesgscore averageattendence leagueposition c.leaguedummy#c.fairgameesgscore, re
HAUSMAN
My questions are as follows:
- Do I set up tests for heteroscedasticity/autocorrelation before I run my findings (Breusch Pagan, F test, etc.) to know whether to use cluster or robust SE?
- When I run this code, I get R2 values of around 0.5 but super high p values so basically no conclusions can be made from my model - don't know how to get around this.
- As a newbie, just looking for a general bit of help with standard approaches to FE and pooled OLS and what tests to use.
Thank you.
Comment