Colleagues,
I have to merge two data sets that may contain duplicate variables. The situation looks like the table below:
What would be a smart way to manage this problem? Ideally, I would like to keep the dataset with the indicators and the variables containing the original values. I have no way of working out from the variable names which data are already in the dataset. The only way to do it is to look at the variable content; then it becomes apparent that some indicators were entered into the data on multiple occasions unnecessarily.
Observation | var1 | var2 | ... | var59 | IndicatorABC (same as var2) | IndicatorDEF | IndicatorXYZ (same as var59) |
AAA1 | 1 | 23 | ... | 63 | 23 | 54 | 63 |
AAA2 | 5 | 23 | ... | 69 | 23 | 45 | 69 |
AAA3 | 63 | 5 | ... | 98 | 5 | 654 | 98 |
AAA4 | 21 | 8 | ... | 45 | 8 | 45 | 45 |
AAA5 | 54 | 2 | ... | 4 | 2 | 24123 | 4 |
AAA6 | 2 | 1 | ... | 1 | 1 | 54 | 1 |
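To make the content-based check concrete, here is a minimal sketch in Python with pandas. The choice of tool is an assumption, since nothing above ties the data to a particular package, and the helper name `find_duplicate_columns` is hypothetical. It compares every column with the columns to its left and reports those whose values match exactly, using only the columns visible in the table (the elided var3 to var58 are left out).

```python
import pandas as pd


def find_duplicate_columns(df):
    """Map each column to the earlier column whose values it repeats exactly.

    Hypothetical helper for illustration. Series.equals requires matching
    dtypes, so an integer column holding 23 and a float column holding 23.0
    would NOT be reported as duplicates.
    """
    duplicates = {}
    columns = list(df.columns)
    for position, later in enumerate(columns):
        for earlier in columns[:position]:
            if earlier in duplicates:
                continue  # already known to be a copy of something else
            if df[later].equals(df[earlier]):
                duplicates[later] = earlier
                break
    return duplicates


# Only the columns shown in the table; var3..var58 are elided in the post.
data = pd.DataFrame({
    "Observation": ["AAA1", "AAA2", "AAA3", "AAA4", "AAA5", "AAA6"],
    "var1": [1, 5, 63, 21, 54, 2],
    "var2": [23, 23, 5, 8, 2, 1],
    "var59": [63, 69, 98, 45, 4, 1],
    "IndicatorABC": [23, 23, 5, 8, 2, 1],
    "IndicatorDEF": [54, 45, 654, 45, 24123, 54],
    "IndicatorXYZ": [63, 69, 98, 45, 4, 1],
})

duplicates = find_duplicate_columns(data)

# Assumption: the later copy is the one to drop; swap keys and values of
# `duplicates` if the indicator names should be kept instead.
deduplicated = data.drop(columns=list(duplicates))

print(duplicates)
print(list(deduplicated.columns))
```

Which copy to drop is a judgment call. The sketch keeps the first occurrence, but the mapping it returns is enough to rename or drop either side. Exact equality is also strict: columns that differ only in type, rounding, or missing-value coding would need to be normalised before comparing.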