Hi everyone.
I have some datasets that contain first names and birthdays. To anonymize the data, I would like to create numeric IDs and remove the first names. The trick is that these IDs have to be constructed so that I can match the same person across datasets (multiple surveys). I know how to create IDs using group or encode, but the resulting IDs would not be the same across datasets.
Assuming that the combination of first name, birth date, and one other variable (a string) uniquely identifies a person, is there a quick way of doing this in Stata? Or in Excel, if that is easier.
Many thanks.
Laura
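
One possible approach, sketched here in Python rather than Stata or Excel, is to derive the ID from a keyed hash of the identifying fields, so the same person gets the same number in every dataset without any lookup table. Everything named below is an assumption for illustration: the function `make_id`, the placeholder `SECRET_KEY`, the 12-digit ID length, and the sample values. It also assumes birth dates are stored in the same text format in every dataset.

```python
import hashlib
import hmac

# Hypothetical secret; keep it private and use the same one for every dataset.
# Without a secret, IDs could be reversed by hashing guessed names and dates.
SECRET_KEY = b"replace-with-a-private-key"


def make_id(first_name, birth_date, extra, key=SECRET_KEY, digits=12):
    """Return a numeric ID that depends only on the three input fields."""
    # Normalize so that " laura " and "Laura" map to the same ID.
    parts = [str(p).strip().lower() for p in (first_name, birth_date, extra)]
    message = "|".join(parts).encode("utf-8")
    # Keyed SHA-256 digest of the combined fields.
    digest = hmac.new(key, message, hashlib.sha256).hexdigest()
    # Keep the last `digits` decimal digits to get a compact numeric ID.
    return int(digest, 16) % 10**digits


# Same person in two surveys, entered with different spacing and case.
id_survey1 = make_id("Laura", "1990-05-17", "abc")
id_survey2 = make_id(" laura ", "1990-05-17", "ABC")
print(id_survey1 == id_survey2)
```

Truncating to 12 digits keeps the ID small enough to be stored exactly in a double-precision variable, at the cost of a small chance that two different people collide, so it is worth checking for duplicate IDs within each dataset before dropping the names.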