Dear all,
I am working on a dataset consisting of 214 thousand firms in Switzerland. My dependent variable, bOptingOut, is a binary variable which equals 1 if a firm chooses not to have their financial statements audited and 0 otherwise (meaning they have their financial statements audited). Basically, I am trying to assess the impact of information asymmetries between owners and managers and among owners.
The dataset consist only of firms that (by assumption) do have a choice between having their financial statements audited or not (for example, I exclude all large companies which are legally required to have their financial statements audited).
The independent variables consist of firm capital, firm age, the size of management and various binary variables describing the ownership structure (single owner, family owned, corporate owners, etc.) as well as fixed effects for industry and canton (i.e. state).
This is my regression model:
The Stata output is as follows (for brevity I omit the two categorical variables for industry and canton):
To start with I have two basic questions:
1) Is it okay to use a logit model if the subject (here, the firms) can actually choose the outcome (having their financial statements audited or not)? If it is a problem, how can I mitigate it (e.g. by using a different model)?
In the traditional examples I encountered in literature, the subjects did usually not have a direct influence on the outcome (e.g. do you get mortgage or not, are you admitted to university or not).
2) The key independent variables are all highly significant (except for bOnlySwissOwners and some of the fixed effects). Intuitively this makes sense, especially due to the large sample size. Or am I missing something?
I would appreciate your thoughts as many of you probably have more experience with statistics than I do. If you need additional information, please let me know, I am happy to expand.
Kind regards,
Daniel
I am working on a dataset consisting of 214 thousand firms in Switzerland. My dependent variable, bOptingOut, is a binary variable which equals 1 if a firm chooses not to have their financial statements audited and 0 otherwise (meaning they have their financial statements audited). Basically, I am trying to assess the impact of information asymmetries between owners and managers and among owners.
The dataset consist only of firms that (by assumption) do have a choice between having their financial statements audited or not (for example, I exclude all large companies which are legally required to have their financial statements audited).
The independent variables consist of firm capital, firm age, the size of management and various binary variables describing the ownership structure (single owner, family owned, corporate owners, etc.) as well as fixed effects for industry and canton (i.e. state).
This is my regression model:
Code:
logit bOptingOut lncapital lnAge cntTotalManagement i.bSingleOwner i.bFamilyOwnedStrict i.bOnlyCorporateOwners i.bFullyOwnerManaged i.bOnlySwissOwners i.industry i.firmCanton, nolog vce(robust)
Code:
Logistic regression Number of obs = 214,343 Wald chi2(50) = 11096.22 Prob > chi2 = 0.0000 Log pseudolikelihood = -20750.008 Pseudo R2 = 0.2415 ---------------------------------------------------------------------------------------- | Robust bOptingOut | Coef. Std. Err. z P>|z| [95% Conf. Interval] -----------------------+---------------------------------------------------------------- lnCapital | -.635477 .0207321 -30.65 0.000 -.6761112 -.5948427 lnAge | -.5767789 .020981 -27.49 0.000 -.617901 -.5356568 cntTotalManagement | -.5195666 .016879 -30.78 0.000 -.5526489 -.4864844 1.bSingleOwner | .336793 .04052 8.31 0.000 .2573753 .4162107 1.bFamilyOwnedStrict | .1779222 .049 3.63 0.000 .081884 .2739605 1.bOnlyCorporateOwners | -1.796396 .0488854 -36.75 0.000 -1.89221 -1.700583 1.bFullyOwnerManaged | .4281087 .047943 8.93 0.000 .3341422 .5220753 1.bOnlySwissOwners | .0567386 .0324741 1.75 0.081 -.0069094 .1203866
To start with I have two basic questions:
1) Is it okay to use a logit model if the subject (here, the firms) can actually choose the outcome (having their financial statements audited or not)? If it is a problem, how can I mitigate it (e.g. by using a different model)?
In the traditional examples I encountered in literature, the subjects did usually not have a direct influence on the outcome (e.g. do you get mortgage or not, are you admitted to university or not).
2) The key independent variables are all highly significant (except for bOnlySwissOwners and some of the fixed effects). Intuitively this makes sense, especially due to the large sample size. Or am I missing something?
I would appreciate your thoughts as many of you probably have more experience with statistics than I do. If you need additional information, please let me know, I am happy to expand.
Kind regards,
Daniel
Comment