I am producing a thesis on the impact of immigration on wages. To address simultaneity I must use an instrumental variable approach. My intended IV is to use historical settlement patterns which is the regional share of immigrants in 1991 (with the rest of the data from 2005-2018). However, the time invariant nature of the IV is clearly problematic. I can think of two ways possibly to deal with this:
1. Take 1991 regional shares and scale for each year by multiplying this by national growth in immigration from 1991 to each year t. Then I will have a time varying IV. However, I do not know if this is valid? The idea is that immigrants are drawn to areas where immigrants have already settled. However, this historical settlement does not directly impact wages today. Further, regions with higher shares of immigrants will be appropriately more proportionally impacted by this scaling up to period t (i.e. each year 2005 to 2018).
2. The second approach has perplexed me. It comes from Dustmann et al. (2013), who states: "compute the 1991 ratio of immigrants to natives for each of these regions, from the Census of Population, interacted with year dummies". I do not understand the intuition of using year dummies. Would it still be time invariant as those interacted dummies would all be 1? Please could anyone clarify what I am missing?
Otherwise, I cannot come up with any other solutions currently. Due to data constraints I am unable to run just a simple lag of an immigrant-native ratio which would be an ideal IV for simplicity.
Thank you very much and I hope you may assist me as this is rather urgent.
1. Take 1991 regional shares and scale for each year by multiplying this by national growth in immigration from 1991 to each year t. Then I will have a time varying IV. However, I do not know if this is valid? The idea is that immigrants are drawn to areas where immigrants have already settled. However, this historical settlement does not directly impact wages today. Further, regions with higher shares of immigrants will be appropriately more proportionally impacted by this scaling up to period t (i.e. each year 2005 to 2018).
2. The second approach has perplexed me. It comes from Dustmann et al. (2013), who states: "compute the 1991 ratio of immigrants to natives for each of these regions, from the Census of Population, interacted with year dummies". I do not understand the intuition of using year dummies. Would it still be time invariant as those interacted dummies would all be 1? Please could anyone clarify what I am missing?
Otherwise, I cannot come up with any other solutions currently. Due to data constraints I am unable to run just a simple lag of an immigrant-native ratio which would be an ideal IV for simplicity.
Thank you very much and I hope you may assist me as this is rather urgent.
Comment