Hi! I´m and undergraduate student and I´m currently in the process of my final thesis. The thesis consists on a gravity model for migration and tries to explain if there are differences in the pull and push factorts whether the destination country is a developed or developing economy being the origin country only developing economies. For the origin country there are 120 countries and for destination only 2, which are Qatar and United States as a representative sample of developed and developing as they receive the largest quantity of migration.
When I estimate the model with ppml, it told me that the variables where indeed to big, so I did use logs as my advisor also told me. The variables I´am currently using are TotalStockMigration as dependent variable and indepedent variables origin gdp, destination gdp, common religion, rta agreement, distance between the capitals and entry cost to start a business in the destination country. On the other hand, I created fixed pair effect and individuals to take into consideration with ppmlhdfe.
The main problem is that when i run the following:ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d,absorb(countpair year) cluster(countpair), it ommits distance and common religion, drops a lot of observations and most of them are not significant.
But if ir run: ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d,absorb(iso3_o year), is when is giving me the most coherent results but still both gdp´s are not significant at 95%.
data:image/s3,"s3://crabby-images/1afbf/1afbf8789a825102e2b97b4c918d2849d29373d0" alt="Click image for larger version
Name: 2022-04-21 (2).png
Views: 1
Size: 26.3 KB
ID: 1660831"
And if I add the cluster, then the gdp destination is significant but the origin not:
data:image/s3,"s3://crabby-images/ef188/ef188a0b31f1a1bd4ccf8c0f9ee7771922f49de9" alt="Click image for larger version
Name: 2022-04-21 (3).png
Views: 2
Size: 30.7 KB
ID: 1660832"
Also, as the thesis tries to answer the question if the pull and push factors are different from when the migrate to a developing country o to a developed, I estimate the regressions by sorting the destination (Qatar or United States). So for that purpose, i try to estimate the following but the results does not seem to be really good:
For QATAR: ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d if iso3num_d==634,absorb(iso3_o year) cluster(iso3_o)
data:image/s3,"s3://crabby-images/16ffd/16ffd2db17fbafae2a374f83629b29250b66b7f1" alt="Click image for larger version
Name: 2022-04-21 (6).png
Views: 1
Size: 26.6 KB
ID: 1660833"
It only gets 52 observations and most of them are omitted (it should get the half which are aprox 229) , I really do not know how to solve this.
For UNITED STATES: ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d if iso3num_d==840,absorb(iso3_o year) cluster(iso3_o)
data:image/s3,"s3://crabby-images/b8d96/b8d96855fc030ae02dcd0257b9b2cebefb4df15f" alt="Click image for larger version
Name: 2022-04-21 (7).png
Views: 1
Size: 25.7 KB
ID: 1660834"
For United States, it gets all the obsrvations but again a lot of them have been omitted.
If you could take a look and help me it would be great! Thankyou for your time and attention!
When I estimate the model with ppml, it told me that the variables where indeed to big, so I did use logs as my advisor also told me. The variables I´am currently using are TotalStockMigration as dependent variable and indepedent variables origin gdp, destination gdp, common religion, rta agreement, distance between the capitals and entry cost to start a business in the destination country. On the other hand, I created fixed pair effect and individuals to take into consideration with ppmlhdfe.
The main problem is that when i run the following:ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d,absorb(countpair year) cluster(countpair), it ommits distance and common religion, drops a lot of observations and most of them are not significant.
But if ir run: ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d,absorb(iso3_o year), is when is giving me the most coherent results but still both gdp´s are not significant at 95%.
And if I add the cluster, then the gdp destination is significant but the origin not:
Also, as the thesis tries to answer the question if the pull and push factors are different from when the migrate to a developing country o to a developed, I estimate the regressions by sorting the destination (Qatar or United States). So for that purpose, i try to estimate the following but the results does not seem to be really good:
For QATAR: ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d if iso3num_d==634,absorb(iso3_o year) cluster(iso3_o)
It only gets 52 observations and most of them are omitted (it should get the half which are aprox 229) , I really do not know how to solve this.
For UNITED STATES: ppmlhdfe lTotalStock ldistcap lgdpcap_o lgdpcap_d rta comrelig entry_cost_d if iso3num_d==840,absorb(iso3_o year) cluster(iso3_o)
For United States, it gets all the obsrvations but again a lot of them have been omitted.
If you could take a look and help me it would be great! Thankyou for your time and attention!
Comment