Dear all,
I'm having some trouble importing data from the Boardex database, and I was wondering if you could help me prepare the data.
Boardex is a database with corporate information for a set of companies, provided as Excel files. The type of information varies from file to file, but the way each file identifies the firm and the director/manager/etc. is always the same, and that is where my problem lies. For each year (Annual Report Date), we have the Company ID (a Boardex-specific id), the Company ISIN (an identifier that can be used to merge with other databases), Director ID/DirectorName, and a few variables of interest that I do not include for copyright reasons. Here is a small extract of the data to show how it is organized:
Annual Report Date | Company Name | Company ID | Company ISIN | Company Ticker | Company Index | Sector | Country | DirectorName | Director ID | Variables of interest 1 | Variables of interest 2 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Martin Michael Gagen | 1466510920 | 1 | 11 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Peter Brian Williams | 1466610928 | 2 | 12 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Brian Paul Larcombe | 400911884 | 3 | 13 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Doctor Richard Douglas Summers | 60765967 | 4 | 14 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Michael James Queen | 663810061 | 5 | 15 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | John William Melbourn | 103147209 | 6 | 16 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Baroness (Sarah Mary) Hogg | 130078114 | 7 | 17 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Doctor John Richard Forrest | 143148447 | 8 | 18 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | The Rt. Hon. Lord Ralph Thomas Camoys | 7662746 | 9 | 19 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61, GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Sir George Russell | 819911086 | 10 | 20 |
Since I need to merge the data with a different database, ISIN is the identifier to use. But, as you can see, this company has two ISINs in the Company ISIN variable. Other companies have more, others have one, and some have none (the last group does not matter, as those observations will be deleted). Basically, I need to parse the Company ISIN field, identify the (variable number of) ISINs, and put them in a single variable (say ISIN). This would require repeating the information in all the other variables for each parsed ISIN. Can this be done in Stata? I know that I would need to reshape the data, but the problem is the Company ISIN variable with all those ISINs.
The data would follow this scheme:
Annual Report Date | Company Name | Company ID | Company ISIN | Company Ticker | Company Index | Sector | Country | DirectorName | Director ID | Variables of interest 1 | Variables of interest 2 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Martin Michael Gagen | 1466510920 | 1 | 11 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Martin Michael Gagen | 1466510920 | 1 | 11 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Peter Brian Williams | 1466610928 | 2 | 12 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Peter Brian Williams | 1466610928 | 2 | 12 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B16PRC61 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Brian Paul Larcombe | 400911884 | 3 | 13 |
Mar-99 | 3I GROUP PLC | 11523753 | GB00B1YW4409 | III, IIIP | FTSE 250(GBP) | Private Equity | United Kingdom - England | Brian Paul Larcombe | 400911884 | 3 | 13 |
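To pin down the logic of the scheme above, here is a minimal sketch written in Python rather than Stata, purely as a description of the target transformation. The function name `expand_isins` is hypothetical, the dictionary keys mirror the column headers of the extract, and the second input row (a company with no ISIN) is made up to show the deletion case. In Stata, `split` followed by `reshape long` look like the natural counterparts, but that is an assumption and not something verified here.

```python
def expand_isins(rows, isin_key="Company ISIN"):
    """Return one row per (original row, ISIN) pair.

    Rows whose ISIN field is empty or missing produce no output rows.
    """
    expanded = []
    for row in rows:
        # Split the field on commas and strip blanks around each ISIN.
        pieces = [p.strip() for p in (row.get(isin_key) or "").split(",")]
        for isin in pieces:
            if not isin:
                continue  # empty piece: rows without an ISIN are dropped
            new_row = dict(row)       # repeat all other variables unchanged
            new_row[isin_key] = isin  # keep a single ISIN per row
            expanded.append(new_row)
    return expanded


rows = [
    {"Company ID": 11523753, "Company ISIN": "GB00B16PRC61, GB00B1YW4409",
     "DirectorName": "Martin Michael Gagen", "Director ID": 1466510920},
    # Hypothetical row with no ISIN, included only to show that it is dropped.
    {"Company ID": 0, "Company ISIN": "",
     "DirectorName": "Hypothetical Director", "Director ID": 0},
]

for r in expand_isins(rows):
    print(r["Company ISIN"], r["DirectorName"], r["Director ID"])
# GB00B16PRC61 Martin Michael Gagen 1466510920
# GB00B1YW4409 Martin Michael Gagen 1466510920
```

Director ID is kept as an integer here so that it is not shown in scientific notation.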
Best wishes,
Nuno