Hi all,
Can anyone help me with the error message I get when I try to link two datasets using reclink? The two variables I try to link by are:
airlinemaster (which is the airline name, for example in the master dataset it could show up as "Air Canada" and in the using dataset it could show up as "Air Canada LTD.")
and
codemaster (which is a three or two digit numeric abbreviation in the master dataset, for example "CA" for "Air Canada", and "CCA" for "Air Canada LTD." in the using dataset.
my code is this:
reclink airlinemaster codemaster using `wikipedia', gen(myscore) idm(id_master) idu(id_wiki) _merge(merged) minbigram(.80)
when I run it I get this:
reclink airlinemaster codemaster using `wikipedia', gen(myscore) idm(id_master) idu(id_wiki) _merge(merged)
> minbigram(.80)
9 perfect matches found
Going through 1682 observation to assess fuzzy matches, each .=5% complete
...........) required
r(100);
Thanks,
Marilyn.
Can anyone help me with the error message I get when I try to link two datasets using reclink? The two variables I try to link by are:
airlinemaster (which is the airline name, for example in the master dataset it could show up as "Air Canada" and in the using dataset it could show up as "Air Canada LTD.")
and
codemaster (which is a three or two digit numeric abbreviation in the master dataset, for example "CA" for "Air Canada", and "CCA" for "Air Canada LTD." in the using dataset.
my code is this:
reclink airlinemaster codemaster using `wikipedia', gen(myscore) idm(id_master) idu(id_wiki) _merge(merged) minbigram(.80)
when I run it I get this:
reclink airlinemaster codemaster using `wikipedia', gen(myscore) idm(id_master) idu(id_wiki) _merge(merged)
> minbigram(.80)
9 perfect matches found
Going through 1682 observation to assess fuzzy matches, each .=5% complete
...........) required
r(100);
Thanks,
Marilyn.
Comment