Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Levenshtein Distance (fuzzy matching) with a loop

    Hello guys,

    I am currently trying to do fuzzy matching of two "string" variables (var1 and var2) in my dataset using Levenshtein Distance (-strdist package), which seems to fit my needs.

    The only problem that I am having is that I need to calculate the levenshtein distance of each observation in variable 1 with each observation of variable 2, and I am not sure how. As of now, when running strdist var1 var2, i get a pairwise calculation of levenshtein distance between observations in var1 and var2 from the same row. I was wondering if anyone might know how to best implement it?

    Best,
    Fredrick
Working...
X