Hello all,
Although familiar with the concepts associated with principal component analysis, I am using it for the first time myself.
I have 17 variables that I am hoping to use to PCA on for the purpose of data reduction. Essentially, these measures represent three different dimensions of "social capital," as identified in the literature (e.g., 1. Solidarity, trust, and tolerance, 2. A strong associational life, and 3. Political and civic engagement). My data is cross-sectional-times-series, and the observations are at the country level. I have 16 countries and five periods, for a total of 80 observations. My Stata version is 14.2.
After running the PCA analysis, there are 5 components with eigenvalues over 1 which cumulatively account for .74 of the variance. The K-M-O measuring sampling adequacy gives an overall measure of .6375 (so, PCA is justified). So, I used predict to create component measures (i.e., pc1, pc2, pc3, pc4, pc5).
My question is about how to go about the next step--creating a single variable by combining these five component variables to serve as the dependent variable for my analysis. Basically, social capital is my dependent variable. Rather than run 5 regressions with each of the five components variables serving as its own separate DV, I would like to combine these five component variables into a single composite (i.e., index) variable to use as my primary dependent variable.
My first question is whether there are any issues with doing this, which I should be aware of? My second question is whether there is a preferred approach for going about creating such a composite/idex varaible? I am also open to any other related tips or advice anyone might have to conducting such an analysis.
Thank you in advance for your time and help!
Best,
Catie
Although familiar with the concepts associated with principal component analysis, I am using it for the first time myself.
I have 17 variables that I am hoping to use to PCA on for the purpose of data reduction. Essentially, these measures represent three different dimensions of "social capital," as identified in the literature (e.g., 1. Solidarity, trust, and tolerance, 2. A strong associational life, and 3. Political and civic engagement). My data is cross-sectional-times-series, and the observations are at the country level. I have 16 countries and five periods, for a total of 80 observations. My Stata version is 14.2.
After running the PCA analysis, there are 5 components with eigenvalues over 1 which cumulatively account for .74 of the variance. The K-M-O measuring sampling adequacy gives an overall measure of .6375 (so, PCA is justified). So, I used predict to create component measures (i.e., pc1, pc2, pc3, pc4, pc5).
My question is about how to go about the next step--creating a single variable by combining these five component variables to serve as the dependent variable for my analysis. Basically, social capital is my dependent variable. Rather than run 5 regressions with each of the five components variables serving as its own separate DV, I would like to combine these five component variables into a single composite (i.e., index) variable to use as my primary dependent variable.
My first question is whether there are any issues with doing this, which I should be aware of? My second question is whether there is a preferred approach for going about creating such a composite/idex varaible? I am also open to any other related tips or advice anyone might have to conducting such an analysis.
Thank you in advance for your time and help!
Best,
Catie
Comment