Hi everyone,
I'm investigating the relationship of the gender diversity in partnerships of audit firms (GDR) on the audit quality (abs_ModDACC). When I do a two-way scatterplot this is what I get (see attachement). Audit quality is measured by absolute discretionary accruals and the lower the accruals are the higher the audit quality. GDR is measured by proportion female/total, but I still need to create a better variable that when this ratio is 0.5 (50%), the new variable will be 1 (100%, because 50/50 is most diverse). However, this is not perse the problem for now so I won't go deeper into this. The code I used for the scatter:
I cannot really find a similar scatter on the internet, so how can I interpret this scatterplot? I've also tried to run a regression, but the gender diversity ratio is always insignificant. I actually don't know whether I do this correctly. First I did this, because I have panel data:
I'm not sure that when you want to cluster at industry level, you should group based on the CIK code (firm code) or SIC2 (industry code). Below are some codes I've tried to run the regression. I tried to cluster standard errors and tried to add industry fixed effects (SIC2) and year fixed effects (Year). All the other variables are my control variables.
Prior literature shows that most of the time industry and year fixed effects are used, and clustered standard errors at client or audit firm level. I think I have to run a fixed effects model because I have panel data.
If I need to add more information so that someone can help me to elaborate on this, please let me know! I'm really struggling with it...
Thanks in advance and kind regards,
Josephine
I'm investigating the relationship of the gender diversity in partnerships of audit firms (GDR) on the audit quality (abs_ModDACC). When I do a two-way scatterplot this is what I get (see attachement). Audit quality is measured by absolute discretionary accruals and the lower the accruals are the higher the audit quality. GDR is measured by proportion female/total, but I still need to create a better variable that when this ratio is 0.5 (50%), the new variable will be 1 (100%, because 50/50 is most diverse). However, this is not perse the problem for now so I won't go deeper into this. The code I used for the scatter:
Code:
twoway(scatter abs_ModDACC GDR)
Code:
egen id = group(CIK) xtset id Year, yearly order id
Code:
areg abs_ModDACC i.Year GDR Leverage Restatement TotalAccruals ICW GEOSEG BUSSEG Busyness Loss SalesGrowth MTB lnCFO specialist Big4 lnAuditFees lnNonAuditFees lnClientOfficeSize, absorb(SIC2) vce(cluster SIC2) areg abs_ModDACC GDR Leverage Restatement TotalAccruals ICW GEOSEG BUSSEG Busyness Loss SalesGrowth MTB lnCFO specialist Big4 lnAuditFees lnNonAuditFees lnClientOfficeSize, absorb(SIC2) cluster (SIC2) regress abs_ModDACC i.Year i.SIC2 GDR Leverage Restatement TotalAccruals ICW GEOSEG BUSSEG Busyness Loss SalesGrowth MTB lnCFO specialist Big4 lnAuditFees lnNonAuditFees lnClientOfficeSize, robust regress abs_ModDACC i.Year i.SIC2 GDR Leverage Restatement TotalAccruals ICW GEOSEG BUSSEG Busyness Loss SalesGrowth MTB lnCFO specialist Big4 lnAuditFees lnNonAuditFees lnClientOfficeSize regress abs_ModDACC i.Year GDR Leverage Restatement TotalAccruals ICW GEOSEG BUSSEG Busyness Loss SalesGrowth MTB lnCFO specialist Big4 lnAuditFees lnNonAuditFees lnClientOfficeSize xtreg abs_ModDACC GDR Leverage Restatement TotalAccruals ICW GEOSEG BUSSEG Busyness Loss SalesGrowth MTB lnCFO specialist Big4 lnAuditFees lnNonAuditFees lnClientOfficeSize, fe vce(cluster CIK)
If I need to add more information so that someone can help me to elaborate on this, please let me know! I'm really struggling with it...
Thanks in advance and kind regards,
Josephine
Comment