Hi All
I'm relatively new to STATA and could really use some advice.
My objective is to look for/ model the relationship between port disruptions and an exporting firm's revenue. I have one dataset from an exporting firm with its transactions for a period of 5 years, and another with port disruptions over the same period.
The datasets are structured as follows (I've omitted some of the columns from the firm dataset, there are actually about 30):
I need to put them into a master set, so obviously I need to merge them. The problem is that there is no consistent, unique key variable/identifier other than the dates. But the dates may or may not overlap depending on whether a disruption occurred during the journey or not. I thought I would try rangejoin, with the port terminal dataset as using and the firm dataset as the master,and then merge where the port disruption fell within the range between date_time_transaction start and datetime_transaction_end... but I see that rangejoin can only accomodate one range.
Does anyone have any advice on how to proceed from here?
Thanks so much in advance!
I'm relatively new to STATA and could really use some advice.
My objective is to look for/ model the relationship between port disruptions and an exporting firm's revenue. I have one dataset from an exporting firm with its transactions for a period of 5 years, and another with port disruptions over the same period.
The datasets are structured as follows (I've omitted some of the columns from the firm dataset, there are actually about 30):
Port Terminal Dataset | ||||||||||
datetime_disruptionstart | datetimedisruptionend | reasonforevent | durationofevent | |||||||
|
||||||||||
datetime_transaction_start | datetime_transaction_end | transaction_category | invoice_amount | number of cartons | eta | etd | load date |
Does anyone have any advice on how to proceed from here?
Thanks so much in advance!
Comment