Hello,
I have my master panel dataset with 14 variables and want to merge Dataset B (10 Variables) and Dataset C (1 Variable).
I have tried to merge with:
Merge 1:1 id fyear using DatasetB
Drop _merge
Merge 1:1 id fyear using DatasetC
I have prepared all datasets, so that each observation is uniquely identified by id and fyear ( deleted duplicates) and declared panel datasets and sorted id fyear.
However, I get a large number of unmatched observations. Am I doing anything wrong?
Dataset A:
obs: 31,640
vars: 14
size: 3,417,120
------------------------------------------------------------------------------------------------------------------------------
storage display value
variable name type format label variable label
------------------------------------------------------------------------------------------------------------------------------
fyear double %6.0g Data Year - Fiscal
at double %18.0g Assets - Total
csho double %18.0g Common Shares Outstanding
dlc double %18.0g Debt in Current Liabilities - Total
dltt double %18.0g Long-Term Debt - Total
dvc double %18.0g Dividends Common/Ordinary
ib double %18.0g Income Before Extraordinary Items
ni double %18.0g Net Income (Loss)
oiadp double %18.0g Operating Income After Depreciation
prstkc double %18.0g Purchase of Common and Preferred Stock
sale double %18.0g Sales/Turnover (Net)
xsga double %18.0g Selling, General and Administrative Expense
prcc_f double %18.0g Price Close - Annual - Fiscal
id long %8.0g id Standard and Poor's Identifier
------------------------------------------------------------------------------------------------------------------------------
Sorted by: id fyear
Dataset B:
Contains data from /Users/Guest/Desktop/Stata/Agencypart2prepared.dta
obs: 34,842
vars: 10 20 Apr 2017 15:43
size: 2,369,256
----------------------------------------------------------------------------------------
storage display value
variable name type format label variable label
----------------------------------------------------------------------------------------
datadate long %d Data Date
fyear double %6.0g Data Year - Fiscal
dvt double %18.0g Dividends - Total
oibdp double %18.0g Operating Income Before Depreciation
txt double %18.0g Income Taxes - Total
xad double %18.0g Advertising Expense
xint double %18.0g Interest and Related Expense - Total
xrd double %18.0g Research and Development Expense
id long %8.0g id Standard a
----------+--------------------------------------------------------
datadate | 34842 19628.96 692.1245 17531 20819
fyear | 34840 2012.78 1.893907 2010 2016
dvt | 29645 150.1909 803.536 -255.458 67643.8
oibdp | 29616 890.8212 3714.066 -21913 81730
txt | 30924 132.2633 871.8798 -34831 31051
-------------+--------------------------------------------------------
xad | 12427 107.2393 460.2595 0 9729
xint | 26051 112.075 498.2739 -.664 18562.28
xrd | 15855 178.3253 820.7956 -.202 16085
id | 34842 15752.67 7018.648 5 24304
DatasetC
obs: 35,981
vars: 5 21 Apr 2017 19:56
size: 1,007,468
----------------------------------------------------------------------------------------
storage display value
variable name type format label variable label
----------------------------------------------------------------------------------------
datadate long %d Data Date
fyear double %6.0g Data Year - Fiscal
re double %18.0g Retained Earnings
id long %8.0g id Standard and Poor's Identifier
dup float %9.0g
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
datadate | 35981 19666.62 712.2195 17531 20819
fyear | 35979 2012.882 1.947104 2010 2016
re | 31033 1613.649 10959.86 -117956.9 389427
id | 35981 15782.92 7018.259 5 24361
dup | 35981 .0094494 .0967492 0
My results :
. merge 1:1 id fyear using SetB.dta
(label id already defined)
Result # of obs.
-----------------------------------------
not matched 59,138
from master 27,968 (_merge==1)
from using 31,170 (_merge==2)
matched 3,672 (_merge==3)
-----------------------------------------
. drop _merge
. merge 1:1 id fyear using SetC.dta
(label id already defined)
Result # of obs.
-----------------------------------------
not matched 60,315
from master 43,572 (_merge==1)
from using 16,743 (_merge==2)
matched 19,238 (_merge==3)
-----------------------------------------
Thank you for any help.
I have my master panel dataset with 14 variables and want to merge Dataset B (10 Variables) and Dataset C (1 Variable).
I have tried to merge with:
Merge 1:1 id fyear using DatasetB
Drop _merge
Merge 1:1 id fyear using DatasetC
I have prepared all datasets, so that each observation is uniquely identified by id and fyear ( deleted duplicates) and declared panel datasets and sorted id fyear.
However, I get a large number of unmatched observations. Am I doing anything wrong?
Dataset A:
obs: 31,640
vars: 14
size: 3,417,120
------------------------------------------------------------------------------------------------------------------------------
storage display value
variable name type format label variable label
------------------------------------------------------------------------------------------------------------------------------
fyear double %6.0g Data Year - Fiscal
at double %18.0g Assets - Total
csho double %18.0g Common Shares Outstanding
dlc double %18.0g Debt in Current Liabilities - Total
dltt double %18.0g Long-Term Debt - Total
dvc double %18.0g Dividends Common/Ordinary
ib double %18.0g Income Before Extraordinary Items
ni double %18.0g Net Income (Loss)
oiadp double %18.0g Operating Income After Depreciation
prstkc double %18.0g Purchase of Common and Preferred Stock
sale double %18.0g Sales/Turnover (Net)
xsga double %18.0g Selling, General and Administrative Expense
prcc_f double %18.0g Price Close - Annual - Fiscal
id long %8.0g id Standard and Poor's Identifier
------------------------------------------------------------------------------------------------------------------------------
Sorted by: id fyear
Dataset B:
Contains data from /Users/Guest/Desktop/Stata/Agencypart2prepared.dta
obs: 34,842
vars: 10 20 Apr 2017 15:43
size: 2,369,256
----------------------------------------------------------------------------------------
storage display value
variable name type format label variable label
----------------------------------------------------------------------------------------
datadate long %d Data Date
fyear double %6.0g Data Year - Fiscal
dvt double %18.0g Dividends - Total
oibdp double %18.0g Operating Income Before Depreciation
txt double %18.0g Income Taxes - Total
xad double %18.0g Advertising Expense
xint double %18.0g Interest and Related Expense - Total
xrd double %18.0g Research and Development Expense
id long %8.0g id Standard a
----------+--------------------------------------------------------
datadate | 34842 19628.96 692.1245 17531 20819
fyear | 34840 2012.78 1.893907 2010 2016
dvt | 29645 150.1909 803.536 -255.458 67643.8
oibdp | 29616 890.8212 3714.066 -21913 81730
txt | 30924 132.2633 871.8798 -34831 31051
-------------+--------------------------------------------------------
xad | 12427 107.2393 460.2595 0 9729
xint | 26051 112.075 498.2739 -.664 18562.28
xrd | 15855 178.3253 820.7956 -.202 16085
id | 34842 15752.67 7018.648 5 24304
DatasetC
obs: 35,981
vars: 5 21 Apr 2017 19:56
size: 1,007,468
----------------------------------------------------------------------------------------
storage display value
variable name type format label variable label
----------------------------------------------------------------------------------------
datadate long %d Data Date
fyear double %6.0g Data Year - Fiscal
re double %18.0g Retained Earnings
id long %8.0g id Standard and Poor's Identifier
dup float %9.0g
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
datadate | 35981 19666.62 712.2195 17531 20819
fyear | 35979 2012.882 1.947104 2010 2016
re | 31033 1613.649 10959.86 -117956.9 389427
id | 35981 15782.92 7018.259 5 24361
dup | 35981 .0094494 .0967492 0
My results :
. merge 1:1 id fyear using SetB.dta
(label id already defined)
Result # of obs.
-----------------------------------------
not matched 59,138
from master 27,968 (_merge==1)
from using 31,170 (_merge==2)
matched 3,672 (_merge==3)
-----------------------------------------
. drop _merge
. merge 1:1 id fyear using SetC.dta
(label id already defined)
Result # of obs.
-----------------------------------------
not matched 60,315
from master 43,572 (_merge==1)
from using 16,743 (_merge==2)
matched 19,238 (_merge==3)
-----------------------------------------
Thank you for any help.
Comment