Hello Statalisters!
I need to match two datasets using a string variable (surname) as the key. Since surnames can be misspelled, I'd like to implement an automated fuzzy-matching routine. I am experimenting with matchit and jarowinkler, and the text similarity score differs between the two methods. Would anybody be so kind as to explain how the two scores (similscore in matchit and the score from jarowinkler) are computed and how they differ? Many thanks in advance. Giorgia