  • Merging two datasets using 1:1

    Dear Statalist Users,

    I am trying to merge these two datasets with a 1:1 merge (merge 1:1 ctr_n cst_n GADM_1 yr using "Using File"),
    but Stata says "variables ctr_n cst_n GADM_1 yr do not uniquely identify obs in the using data". As far as I can tell, these variables do uniquely identify the observations, so I cannot figure out the reason. I would be very grateful for any help. Thanks a lot.



    Master Dataset:
    Code:
    * Example generated by -dataex-. To install: ssc install dataex
    clear
    input str30 ctr_n str136 cst_n str26 GADM_1 float yr
    "Angola" "BENGO"    "Bengo"    2000
    "Angola" "BENGO"    "Bengo"    2001
    "Angola" "BENGO"    "Bengo"    2002
    "Angola" "BENGO"    "Bengo"    2003
    "Angola" "BENGO"    "Bengo"    2004
    "Angola" "BENGO"    "Bengo"    2005
    "Angola" "BENGO"    "Bengo"    2006
    "Angola" "BENGO"    "Bengo"    2007
    "Angola" "BENGO"    "Bengo"    2008
    "Angola" "BENGO"    "Bengo"    2009
    "Angola" "BENGO"    "Bengo"    2010
    "Angola" "BENGO"    "Bengo"    2011
    "Angola" "BENGO"    "Bengo"    2012
    "Angola" "BENGO"    "Bengo"    2013
    "Angola" "BENGO"    "Bengo"    2014
    "Angola" "BENGO"    "Bengo"    2015
    "Angola" "BENGO"    "Bengo"    2016
    "Angola" "BENGUELA" "Benguela" 2000
    "Angola" "BENGUELA" "Benguela" 2001
    "Angola" "BENGUELA" "Benguela" 2002
    "Angola" "BENGUELA" "Benguela" 2003
    "Angola" "BENGUELA" "Benguela" 2004
    "Angola" "BENGUELA" "Benguela" 2005
    "Angola" "BENGUELA" "Benguela" 2006
    "Angola" "BENGUELA" "Benguela" 2007
    "Angola" "BENGUELA" "Benguela" 2008
    "Angola" "BENGUELA" "Benguela" 2009
    "Angola" "BENGUELA" "Benguela" 2010
    "Angola" "BENGUELA" "Benguela" 2011
    "Angola" "BENGUELA" "Benguela" 2012
    "Angola" "BENGUELA" "Benguela" 2013
    "Angola" "BENGUELA" "Benguela" 2014
    "Angola" "BENGUELA" "Benguela" 2015
    "Angola" "BENGUELA" "Benguela" 2016
    "Angola" "BIÉ"     "Bie"      2000
    "Angola" "BIÉ"     "Bie"      2001
    "Angola" "BIÉ"     "Bie"      2002
    "Angola" "BIÉ"     "Bie"      2003
    "Angola" "BIÉ"     "Bie"      2004
    "Angola" "BIÉ"     "Bie"      2005
    "Angola" "BIÉ"     "Bie"      2006
    "Angola" "BIÉ"     "Bie"      2007
    "Angola" "BIÉ"     "Bie"      2008
    "Angola" "BIÉ"     "Bie"      2009
    "Angola" "BIÉ"     "Bie"      2010
    "Angola" "BIÉ"     "Bie"      2011
    "Angola" "BIÉ"     "Bie"      2012
    "Angola" "BIÉ"     "Bie"      2013
    "Angola" "BIÉ"     "Bie"      2014
    "Angola" "BIÉ"     "Bie"      2015
    "Angola" "BIÉ"     "Bie"      2016
    "Angola" "Bengo"    "Bengo"    2000
    "Angola" "Bengo"    "Bengo"    2001
    "Angola" "Bengo"    "Bengo"    2002
    "Angola" "Bengo"    "Bengo"    2003
    "Angola" "Bengo"    "Bengo"    2004
    "Angola" "Bengo"    "Bengo"    2005
    "Angola" "Bengo"    "Bengo"    2006
    "Angola" "Bengo"    "Bengo"    2007
    "Angola" "Bengo"    "Bengo"    2008
    "Angola" "Bengo"    "Bengo"    2009
    "Angola" "Bengo"    "Bengo"    2010
    "Angola" "Bengo"    "Bengo"    2011
    "Angola" "Bengo"    "Bengo"    2012
    "Angola" "Bengo"    "Bengo"    2013
    "Angola" "Bengo"    "Bengo"    2014
    "Angola" "Bengo"    "Bengo"    2015
    "Angola" "Bengo"    "Bengo"    2016
    "Angola" "Benguela" "Benguela" 2000
    "Angola" "Benguela" "Benguela" 2001
    "Angola" "Benguela" "Benguela" 2002
    "Angola" "Benguela" "Benguela" 2003
    "Angola" "Benguela" "Benguela" 2004
    "Angola" "Benguela" "Benguela" 2005
    "Angola" "Benguela" "Benguela" 2006
    "Angola" "Benguela" "Benguela" 2007
    "Angola" "Benguela" "Benguela" 2008
    "Angola" "Benguela" "Benguela" 2009
    "Angola" "Benguela" "Benguela" 2010
    "Angola" "Benguela" "Benguela" 2011
    "Angola" "Benguela" "Benguela" 2012
    "Angola" "Benguela" "Benguela" 2013
    "Angola" "Benguela" "Benguela" 2014
    "Angola" "Benguela" "Benguela" 2015
    "Angola" "Benguela" "Benguela" 2016
    "Angola" "Bié"     "Bie"      2000
    "Angola" "Bié"     "Bie"      2001
    "Angola" "Bié"     "Bie"      2002
    "Angola" "Bié"     "Bie"      2003
    "Angola" "Bié"     "Bie"      2004
    "Angola" "Bié"     "Bie"      2005
    "Angola" "Bié"     "Bie"      2006
    "Angola" "Bié"     "Bie"      2007
    "Angola" "Bié"     "Bie"      2008
    "Angola" "Bié"     "Bie"      2009
    "Angola" "Bié"     "Bie"      2010
    "Angola" "Bié"     "Bie"      2011
    "Angola" "Bié"     "Bie"      2012
    "Angola" "Bié"     "Bie"      2013
    "Angola" "Bié"     "Bie"      2014
    end

    Using File:

    Code:
    * Example generated by -dataex-. To install: ssc install dataex
    clear
    input str30 ctr_n double yr str136 cst_n str26 GADM_1 double(pv1 pvs1)
    "Angola" 2008 "BENGO"    "Bengo"       124  .001312947366386652
    "Angola" 2008 "BENGO"    "Bengo"       141 .0014929481549188495
    "Angola" 2012 "BENGO"    "Bengo"      9912   .13892081379890442
    "Angola" 2012 "BENGO"    "Bengo"        91 .0012754029594361782
    "Angola" 2012 "BENGO"    "Bengo"        40 .0005606166669167578
    "Angola" 2008 "BENGO"    "Bengo"       195 .0020647156052291393
    "Angola" 2008 "BENGO"    "Bengo"     84967    .8996548056602478
    "Angola" 2012 "BENGO"    "Bengo"        62  .000868955859914422
    "Angola" 2008 "BENGO"    "Bengo"      4109   .04350726306438446
    "Angola" 2008 "BENGO"    "Bengo"       771  .008163567632436752
    "Angola" 2008 "BENGO"    "Bengo"       163 .0017258904408663511
    "Angola" 2008 "BENGO"    "Bengo"       129 .0013658887473866344
    "Angola" 2012 "BENGO"    "Bengo"     56650    .7939733862876892
    "Angola" 2008 "BENGO"    "Bengo"       115 .0012176528107374907
    "Angola" 2008 "BENGO"    "Bengo"        96 .0010164753766730428
    "Angola" 2012 "BENGO"    "Bengo"      1350  .018920812755823135
    "Angola" 2008 "BENGO"    "Bengo"      2608  .027614247053861618
    "Angola" 2012 "BENGO"    "Bengo"       435  .006096706259995699
    "Angola" 2012 "BENGO"    "Bengo"      2606  .036524176597595215
    "Angola" 2012 "BENGO"    "Bengo"       204 .0028591451700776815
    "Angola" 2008 "BENGO"    "Bengo"        78 .0008258862653747201
    "Angola" 2008 "BENGO"    "Bengo"       144 .0015247130068019032
    "Angola" 2008 "BENGO"    "Bengo"       804  .008512981235980988
    "Angola" 2012 "BENGUELA" "Benguela"   4654  .007903420366346836
    "Angola" 2008 "BENGUELA" "Benguela"   1047 .0016984015237540007
    "Angola" 2008 "BENGUELA" "Benguela"   1429  .002318066544830799
    "Angola" 2008 "BENGUELA" "Benguela"   1863 .0030220840126276016
    "Angola" 2008 "BENGUELA" "Benguela" 508654    .8251181840896606
    "Angola" 2008 "BENGUELA" "Benguela"   8295  .013455817475914955
    "Angola" 2012 "BENGUELA" "Benguela" 430449    .7309882044792175
    "Angola" 2008 "BENGUELA" "Benguela"   1352  .002193160355091095
    "Angola" 2008 "BENGUELA" "Benguela"    862 .0013983019161969423
    "Angola" 2012 "BENGUELA" "Benguela"    432 .0007336221169680357
    "Angola" 2012 "BENGUELA" "Benguela" 120757   .20506946742534637
    "Angola" 2012 "BENGUELA" "Benguela"   3255  .005527638830244541
    "Angola" 2008 "BENGUELA" "Benguela"    864 .0014015462948009372
    "Angola" 2012 "BENGUELA" "Benguela"    561 .0009526898502372205
    "Angola" 2012 "BENGUELA" "Benguela"  27424   .04657141864299774
    "Angola" 2008 "BENGUELA" "Benguela"   1184 .0019206374417990446
    "Angola" 2008 "BENGUELA" "Benguela"   1412  .002290489850565791
    "Angola" 2008 "BENGUELA" "Benguela"   8103  .013144362717866898
    "Angola" 2008 "BENGUELA" "Benguela"  78403    .1271822154521942
    "Angola" 2012 "BENGUELA" "Benguela"    523 .0008881582762114704
    "Angola" 2008 "BENGUELA" "Benguela"   1926  .003124280134215951
    "Angola" 2012 "BENGUELA" "Benguela"    804 .0013653523055836558
    "Angola" 2008 "BENGUELA" "Benguela"   1068 .0017324668588116765
    "Angola" 2008 "BIÉ"     "Bie"        1591  .004355276469141245
    "Angola" 2008 "BIÉ"     "Bie"        7094  .019419442862272263
    "Angola" 2008 "BIÉ"     "Bie"      273728     .749315619468689
    "Angola" 2012 "BIÉ"     "Bie"        3245  .009792235679924488
    "Angola" 2012 "BIÉ"     "Bie"      120015    .3621618449687958
    "Angola" 2008 "BIÉ"     "Bie"         738 .0020202351734042168
    "Angola" 2012 "BIÉ"     "Bie"         561 .0016928949626162648
    "Angola" 2012 "BIÉ"     "Bie"         243 .0007332860841415823
    "Angola" 2012 "BIÉ"     "Bie"         455  .001373025355860591
    "Angola" 2008 "BIÉ"     "Bie"         949  .002597836311906576
    "Angola" 2012 "BIÉ"     "Bie"        1856  .005600736476480961
    "Angola" 2008 "BIÉ"     "Bie"        1408  .003854324109852314
    "Angola" 2008 "BIÉ"     "Bie"        1157 .0031672250479459763
    "Angola" 2008 "BIÉ"     "Bie"        6110  .016725795343518257
    "Angola" 2012 "BIÉ"     "Bie"      201421     .607815682888031
    "Angola" 2008 "BIÉ"     "Bie"        1651  .004519523587077856
    "Angola" 2008 "BIÉ"     "Bie"        1950 .0053380196914076805
    "Angola" 2008 "BIÉ"     "Bie"       66651   .18245351314544678
    "Angola" 2012 "BIÉ"     "Bie"         379 .0011436848435550928
    "Angola" 2008 "BIÉ"     "Bie"         520  .001423471956513822
    "Angola" 2012 "BIÉ"     "Bie"        3210  .009686618112027645
    "Angola" 2008 "BIÉ"     "Bie"         745  .002039397368207574
    "Angola" 2008 "BIÉ"     "Bie"        1012 .0027702953666448593
    "Angola" 2017 "Bengo"    "Bengo"     68614     .651555437383675
    "Angola" 2017 "Bengo"    "Bengo"       963   .00914460439852623
    "Angola" 2017 "Bengo"    "Bengo"       639   .00606791506818095
    "Angola" 2017 "Bengo"    "Bengo"     25317     .240409085729479
    "Angola" 2017 "Bengo"    "Bengo"      2752    .0261328673984882
    "Angola" 2017 "Bengo"    "Bengo"      1271    .0120693584532989
    "Angola" 2017 "Bengo"    "Bengo"      5752    .0546207315683519
    "Angola" 2017 "Benguela" "Benguela"  15039    .0232563174327277
    "Angola" 2017 "Benguela" "Benguela" 388319      .60049670384729
    "Angola" 2017 "Benguela" "Benguela"   4026    .0062258084968523
    "Angola" 2017 "Benguela" "Benguela" 174530     .269893282900058
    "Angola" 2017 "Benguela" "Benguela"  56297    .0870577101210368
    "Angola" 2017 "Benguela" "Benguela"   2712   .00419383821248471
    "Angola" 2017 "Benguela" "Benguela"   5740   .00887633898955097
    "Angola" 2017 "Bié"     "Bie"      224071     .551696544864077
    "Angola" 2017 "Bié"     "Bie"      149507     .368108748267262
    "Angola" 2017 "Bié"     "Bie"        3567   .00878249115472401
    "Angola" 2017 "Bié"     "Bie"        4340     .010685733560836
    "Angola" 2017 "Bié"     "Bie"       14840    .0365383147564071
    "Angola" 2017 "Bié"     "Bie"        2543   .00626124895050831
    "Angola" 2017 "Bié"     "Bie"        7281     .017926918446186
    "Angola" 2012 "CABINDA"  "Cabinda"     856  .007345621939748526
    "Angola" 2008 "CABINDA"  "Cabinda"     397 .0028998844791203737
    "Angola" 2008 "CABINDA"  "Cabinda"     916  .006690917536616325
    "Angola" 2008 "CABINDA"  "Cabinda"     563  .004112430848181248
    "Angola" 2012 "CABINDA"  "Cabinda"   16222    .1392063945531845
    "Angola" 2008 "CABINDA"  "Cabinda"     678  .004952447954565287
    "Angola" 2008 "CABINDA"  "Cabinda"    2262  .016522768884897232
    "Angola" 2008 "CABINDA"  "Cabinda"     366 .0026734452694654465
    "Angola" 2008 "CABINDA"  "Cabinda"   42940    .3136550188064575
    "Angola" 2008 "CABINDA"  "Cabinda"     322 .0023520474787801504
    end
    label values pv1 pv1
    label values pvs1 pvs1



    Last edited by Cansu Oymak; 31 Dec 2023, 03:33.

  • #2
    I used your "using" data example (thank you for supplying) and did the following:
    Code:
    . duplicates report ctr_n cst_n GADM_1 yr
    
    Duplicates in terms of ctr_n cst_n GADM_1 yr
    
    --------------------------------------
       Copies | Observations       Surplus
    ----------+---------------------------
            2 |            2             1
            7 |           21            18
            8 |            8             7
            9 |           27            24
           14 |           42            39
    --------------------------------------
    So Stata is correct: these variables do not uniquely identify the observations in the using data set. For example, 42 observations belong to key combinations that each occur 14 times. Your claim that "Indeed, these variables uniquely identify" is therefore not correct. Why do you think it is correct? Did you confuse the master and using data sets?
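
    To see exactly which key combinations are repeated, something along these lines may help. It is only a sketch: it assumes the using data are in memory, and the variable name dup is made up for illustration.
    Code:
    * isid exits with an error if the variables do not uniquely identify the observations
    * capture noisily shows that message without stopping a do-file
    capture noisily isid ctr_n cst_n GADM_1 yr

    * dup = number of extra copies of each observation's key combination (0 if unique)
    duplicates tag ctr_n cst_n GADM_1 yr, generate(dup)

    * show one example row for each repeated key combination
    duplicates examples ctr_n cst_n GADM_1 yr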

    added in edit: maybe you want a "1:m" merge?
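
    A 1:m merge could be sketched as below. "Master File" is a placeholder name for the master data set, and the check with isid is an extra precaution, not something the merge requires you to run.
    Code:
    * "Master File" is a hypothetical file name
    use "Master File", clear
    * confirm the key is unique in the master, which 1:m requires
    isid ctr_n cst_n GADM_1 yr
    * each master observation may match several using observations
    merge 1:m ctr_n cst_n GADM_1 yr using "Using File"
    * see how many observations matched
    tabulate _merge
    Note that string keys are matched exactly, so "BENGO" and "Bengo" in cst_n count as different values. If the analysis needs one row per key instead, the using data would first have to be reduced (for example with collapse), and the right summary depends on what pv1 and pvs1 measure.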
    Last edited by Rich Goldstein; 31 Dec 2023, 04:45.
