Dear community
I intend to do one-to-one matching in STATA based on two variables and a condition. There are firms (each classified as family and non-family using a dummy variable), however family firms occupy 70% of the entire dataset. Therefore, I am trying to create a matched sample using non-family firms as the baseline, findind one-to-one match on certain parameters for subsequent analysis.
The two parameters include - Year and Industry Code. Further, I am including a conditional logic, which says that the net sales differential between the matched pair should not exceed 5 percent.
Using the existing joinby command, I am generating many-to-many matches and that too not unqiue for a given family/non-family firm. Therefore, I wanted someone to advise about how should I proceed with my analysis?
Below is the sample data for you to visualize the problem at hand:
I intend to do one-to-one matching in STATA based on two variables and a condition. There are firms (each classified as family and non-family using a dummy variable), however family firms occupy 70% of the entire dataset. Therefore, I am trying to create a matched sample using non-family firms as the baseline, findind one-to-one match on certain parameters for subsequent analysis.
The two parameters include - Year and Industry Code. Further, I am including a conditional logic, which says that the net sales differential between the matched pair should not exceed 5 percent.
Using the existing joinby command, I am generating many-to-many matches and that too not unqiue for a given family/non-family firm. Therefore, I wanted someone to advise about how should I proceed with my analysis?
Below is the sample data for you to visualize the problem at hand:
Company Code | Firm | Year | Industry Code | Dummy (Family/Non-Family) | Net Sales |
11 | ABC | 1997 | 8 | 1 | 219 |
11 | PQR | 1998 | 8 | 1 | 300.2999878 |
11 | XYZ | 1997 | 8 | 0 | 225.8999939 |
Comment