I have a dataset with annual observations (1990-2016) on neonatal mortality (dependent variable) for countries 1, 2, 3, 4, 5, and 6. The independent variables are indicators 1, 2, 3, 4, and 5. You can download the dataset here.
Because the observations are not independent (repeated annual measurements are nested within countries, so the data are hierarchical and longitudinal), I fitted a mixed-effects linear regression in Stata to explore the effect of these national indicators on neonatal mortality over the 27-year period:
mixed neomortality indicator1 indicator2 indicator3 indicator4 indicator5 year || country_id:
As you can see, I specified a random intercept for country_id. The intraclass correlation coefficient (ICC) is 0.49.
[Attached image: output.jpg (Stata output for the model above)]
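For reference, here is a minimal sketch of how the ICC can be obtained after fitting the model. It assumes the `estat icc` postestimation command is run directly after `mixed`, and it is not necessarily the exact sequence that produced the output above:

```stata
* Sketch only: variable names are copied from the command in this post.
* Random-intercept model with country_id as the grouping level.
mixed neomortality indicator1 indicator2 indicator3 indicator4 indicator5 year || country_id:

* Residual intraclass correlation at the country_id level,
* i.e. the share of residual variance attributable to between-country differences.
estat icc
```

The LR test against an ordinary linear regression is printed at the bottom of the `mixed` output itself, so it needs no extra command.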
My questions are about whether this analysis is statistically sound:
1) Is it correct to add random effects to country_id, or should I have added random effects to other variables?
2) Since this is longitudinal data, I included the variable year as a fixed-effect covariate, because otherwise the model would have no way to account for time. Is this correct?
3) When writing a scientific paper, how should the results from this type of model be reported? I reported the regression coefficients, the 95% CIs, and the p-values for both the crude and the adjusted model (the adjusted model includes all 5 indicators and no further variables). I also reported the results of the LR test, the ICC, and the Akaike and Bayesian information criteria (AIC and BIC). Please comment on whether I missed something or whether, for example, AIC and BIC are not usually reported for this type of model:
[Attached image: table.jpg (results table)]
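For reference, a minimal sketch of how the AIC and BIC for a crude and the adjusted model can be collected in Stata. The stored-estimate names `crude1` and `adjusted` are placeholders made up for illustration, and keeping year in the crude model is an assumption, not necessarily what the table above is based on:

```stata
* Crude model for one indicator (assumption: year is kept in the crude model).
mixed neomortality indicator1 year || country_id:
estimates store crude1      // placeholder name

* Adjusted model with all five indicators, as in the command in this post.
mixed neomortality indicator1 indicator2 indicator3 indicator4 indicator5 year || country_id:
estimates store adjusted    // placeholder name

* AIC and BIC for the most recently fitted model.
estat ic

* AIC and BIC for both stored models side by side.
* The comparison is only meaningful if both models use the same observations.
estimates stats crude1 adjusted
```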
Thank you so much in advance for your time and help!!!