This is my first post. I am an undergraduate student and I am carrying out my thesis.
I am working on the construction of an index, based on three variables which take values between 0 and 1. I should say that these variables are mean of another variables which can take values between 0 and 1, in turn. I think that my variables are highly correlated, so I use Principal Component Analysis to have a specification for my index. My first question is: Am I right? Can I use PCA for index construction when the variables can take values between 0 and 1 and they are correlated? (Q1). Secondly, searching for bibliography I found that different papers use this methodology for index construction -but not necessary with variables 0-1-, and what I understand for them is that the components are used to relate the variables in a formula, which is the index; this is: if the components are 0.3, 0.4, and 0,7, my index would be:
I=0.3var1 + 0.4var2 + 0.7var3
Is my understanding correct? Is it the way of using PCA?(Q2)
But other researchers use the variability explicated to relate the variables, instead of the components - whose sum is 1.. What strategy is more common used? Does it depend on my data/specification/goal/etc?(Q3)
Thank You.
I am working on the construction of an index, based on three variables which take values between 0 and 1. I should say that these variables are mean of another variables which can take values between 0 and 1, in turn. I think that my variables are highly correlated, so I use Principal Component Analysis to have a specification for my index. My first question is: Am I right? Can I use PCA for index construction when the variables can take values between 0 and 1 and they are correlated? (Q1). Secondly, searching for bibliography I found that different papers use this methodology for index construction -but not necessary with variables 0-1-, and what I understand for them is that the components are used to relate the variables in a formula, which is the index; this is: if the components are 0.3, 0.4, and 0,7, my index would be:
I=0.3var1 + 0.4var2 + 0.7var3
Is my understanding correct? Is it the way of using PCA?(Q2)
But other researchers use the variability explicated to relate the variables, instead of the components - whose sum is 1.. What strategy is more common used? Does it depend on my data/specification/goal/etc?(Q3)
Thank You.
Comment