Hii Stata users,
I have two different datasets for two different groups with mostly the same information
This is the relevant data for both groups.
Group 1 (Companies that have been takenover)
Group 2 (Benchmark, companies that have not been takenover)
I would like to know whether the variable OIBDP (firm performance) is significant different between the two groups. And pair the observations of both groups based on Year, SIC and a range of 20% of AT.
I was thinking of doing the Wilcoxon Signed-Rank test. However, searching on the internet on how to do this I got the feeling I have to put the two datasets in one dataset.
Question1: Is there a way to compare two different datasets of should I really combine the two datasets using "append" or "merge".
Question 2: If I have to combine both datasets with append. I was thinking to make a dummy which sets 1 if it belongs to group 1 and 0 if it belongs to group 2. But should I name the variables exactly the same in both datasets and order the variables in the same order in both datasets in order to append? How can I change the names and order of the variables in dataset 2 while working in the same do-file as dataset 1 and afterwards going back to dataset 1 to append?
Question 3: If I have to combine both datasets with merge, I should name the bariable OIBDP different for both groups, right?
Question 4: How do I match the observations of both groups based on Year, SIC and range of 20% AT, while doing the wilcoxon Signed-Rank test?
Thank you for helping in advance!
I have two different datasets for two different groups with mostly the same information
This is the relevant data for both groups.
Group 1 (Companies that have been takenover)
Code:
input int YearMatch str30 CompanyName str4 SIC double AT str8 Attitude double(CSHO DT OIBDP SALES PRCC_F) 2000 "Ultramar Diamond Shamrock Corp" "2911" 5988.4 "Friendly" 83.352 1661.5 1066.4 14396.6 30.875 2003 "META Group Inc" "8732" 86.712 "Friendly" 13.657 .389 1.348 122.485 6.3 1997 "Hein-Werner Corp" "3559" 37.348 "Friendly" 2.771 . 2.646 39.037 7.312 2001 "Budget Group Inc" "7514" 4469.505 "Friendly" 37.255 3292.08 712.693 2160.73 .89 2002 "SangStat Medical Corp" "2834" 192.437 "Friendly" 26.443 20.269 11.114 120.057 11.3 1995 "Meta Software Inc" "7372" 35.095 "Friendly" 9.8 . 6.514 25.281 16.75 1996 "Studio Plus Hotels Inc" "7011" 146.24 "Friendly" 12.529 . 9.612 23.065 15.75 1995 "Research Medical Inc" "3841" 46.545 "Friendly" 9.408 . 12.061 34.024 23 2003 "Commonwealth Industries Inc" "3353" 379.326 "Friendly" 16.011 125.205 28.25 918.396 10.07 1997 "Sofamor Danek Group Inc" "3842" 385.657 "Friendly" 25.182 . 106.302 312.902 65.25 2002 "Corvas International Inc" "2834" 96.593 "Friendly" 27.591 12.558 -20.794 .142 1.55 1997 "Mecklermedia Corp" "2721" 51.781 "Friendly" 8.517 . 3.468 55.193 23.187 1997 "Learning Co Inc" "7372" 416.791 "Friendly" 48.869 . -330.466 392.438 16.062 2000 "C-bridge Internet Solutions" "7372" 98.46 "Friendly" 21.175 .265 -4.451 82.97 3.9063 2004 "Micro Therapeutics Inc" "3841" 61.238 "Friendly" 48.425 0 -17.601 35.725 3.75 1997 "Whitehall Corp" "3721" 48.599 "Friendly" 5.53 . -8.654 65.791 18 2006 "C-COR Inc" "3663" 313.129 "Friendly" 48.008 36.265 -8.251 262.526 7.72 2003 "TippingPoint Technologies Inc" "3577" 43.373 "Friendly" 7.336 1.003 -14.921 5.771 24.02 2005 "Manugistics Group Inc" "7372" 413.311 "Friendly" 84.161 178.076 14.199 176.192 1.86 2000 "Louis Dreyfus Natural Gas" "1311" 1501.965 "Friendly" 43.689 606.909 352.99 489.703 45.8125 1995 "Scanforms Inc" "2761" 16.693 "Friendly" 3.547 . 4.068 24.519 2.5 2003 "Lannett Co Inc" "2834" 29.063 "Friendly" 20.026 3.098 20.261 42.487
Code:
input int DataYearFiscal str30 CompanyName int SIC double(AT CSHO DT OIBDP SALES PRCC_F) 1992 "AAR CORP" 5080 365.151 15.901 . 23.251 382.78 13.5 1993 "AAR CORP" 5080 417.626 15.906 . 29.106 407.754 14.375 1994 "AAR CORP" 5080 425.814 15.961 . 34.766 451.395 15.25 1995 "AAR CORP" 5080 437.846 15.998 . 42.557 504.99 22.125 1996 "AAR CORP" 5080 529.584 18.204 . 55.177 589.328 31 1999 "AAR CORP" 5080 740.998 26.865 180.876 89.031 1024.333 13.875 2002 "AAR CORP" 5080 686.621 31.851 232.914 30.745 606.337 4.5 2003 "AAR CORP" 5080 709.292 32.245 250.162 47.491 651.958 9.58 2004 "AAR CORP" 5080 732.23 32.586 229.494 61.774 747.848 16.04 2005 "AAR CORP" 5080 978.819 36.654 320.704 91.877 897.284 24.08 1992 "ABS INDUSTRIES INC" 3460 41.976 2.526 . 7.907 51.407 14.25 1993 "ABS INDUSTRIES INC" 3460 63.997 5.052 . 10.064 72.36 12.75 1994 "ABS INDUSTRIES INC" 3460 93.811 5.052 . 13.001 92.122 13.75 1992 "ACF INDUSTRIES HOLDING CORP" 3743 1706.454 .015 . 173.367 320.148 . 1993 "ACF INDUSTRIES HOLDING CORP" 3743 1665.921 .015 . 150.525 348.429 . 1994 "ACF INDUSTRIES HOLDING CORP" 3743 1815.1 .015 . 137 380.5 . 1995 "ACF INDUSTRIES HOLDING CORP" 3743 2015.8 .015 . 157 407.2 . 1996 "ACF INDUSTRIES HOLDING CORP" 3743 2218.6 .015 . 183.4 486.8 . 1997 "ACF INDUSTRIES HOLDING CORP" 3743 3181.3 .015 . 176.4 501.9 . 1998 "ACF INDUSTRIES HOLDING CORP" 3743 3257.3 .015 . 188.6 536.5 . 1999 "ACF INDUSTRIES HOLDING CORP" 3743 3563.4 .015 1977.1 186 472.4 . 2000 "ACF INDUSTRIES HOLDING CORP" 3743 3794.5 .015 2042.9 178.7 443.8 .
I was thinking of doing the Wilcoxon Signed-Rank test. However, searching on the internet on how to do this I got the feeling I have to put the two datasets in one dataset.
Question1: Is there a way to compare two different datasets of should I really combine the two datasets using "append" or "merge".
Question 2: If I have to combine both datasets with append. I was thinking to make a dummy which sets 1 if it belongs to group 1 and 0 if it belongs to group 2. But should I name the variables exactly the same in both datasets and order the variables in the same order in both datasets in order to append? How can I change the names and order of the variables in dataset 2 while working in the same do-file as dataset 1 and afterwards going back to dataset 1 to append?
Question 3: If I have to combine both datasets with merge, I should name the bariable OIBDP different for both groups, right?
Question 4: How do I match the observations of both groups based on Year, SIC and range of 20% AT, while doing the wilcoxon Signed-Rank test?
Thank you for helping in advance!
Comment