I am trying to merge 3 census datasets: personal, family and household. The first one has 3 identifiers: HID, PID and FID. The second one has two, HID and PID, while the last one has HID as its only identifier.
HID is a string and is unique only in the 2nd dataset (household), not in the others. I can successfully merge the second file with either of the other two using HID, and I get a complete 100% merge.
But whenever I try to merge this new merged dataset with the remaining data file, having dropped my _merge variable, I get this error:
"variable HID does not uniquely identify observations in the master data"
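To make the situation concrete, here is a minimal sketch with made-up toy data. The temporary file names, the variable hhsize and all values are assumptions for illustration, not the real census files. As far as I understand, the message appears when a 1:1 (or 1:m) merge is requested while HID repeats in the master data; the m:1 merge at the end is shown only as one possible alternative, not a confirmed fix for the real files.

```stata
* Hypothetical person-level file: HID repeats across persons
clear
input str3 HID byte PID byte FID
"H01" 1 1
"H01" 2 1
"H02" 1 1
end
tempfile person
save `person'

* Hypothetical household-level file: one row per HID
clear
input str3 HID byte hhsize
"H01" 2
"H02" 1
end
tempfile household
save `household'

use `person', clear

* Check whether HID uniquely identifies observations in the master
capture isid HID
display "isid return code: " _rc

* A 1:1 merge on HID would stop with the error quoted above,
* because HID is not unique in the master:
* merge 1:1 HID using `household'

* Many person rows per household row, so request m:1 instead
merge m:1 HID using `household'
tabulate _merge
drop _merge
```

Run as a do-file, the toy example keeps all three person rows and attaches hhsize to each. Whether the same merge type suits the third file depends on which identifiers are unique there, which isid can check in the same way.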
Can anyone help with ideas on how to get around this one?
Thanks