Hello world,
I have a dataset of 500 records. Each record contains 20 scale variables and 10 dichotomous variables.
I want to divide these 500 records into 3 groups and plot them on a 2-dimensional X-Y plane.
I chose 2 dimension because this is more intuitive than 3 or more dimensions.
Q1. Then would it make sense to generate X and Y coordinate by using pca, and use these coordinates in the subsequent cluster analysis?
Q2. If so, should I better normalise the scale variables, say, by using boxcox transformation?
Q3. Can I input the dichotomous variables into pca as they are?
Cheers
Yoshi Nagao
I have a dataset of 500 records. Each record contains 20 scale variables and 10 dichotomous variables.
I want to divide these 500 records into 3 groups and plot them on a 2-dimensional X-Y plane.
I chose 2 dimension because this is more intuitive than 3 or more dimensions.
Q1. Then would it make sense to generate X and Y coordinate by using pca, and use these coordinates in the subsequent cluster analysis?
Q2. If so, should I better normalise the scale variables, say, by using boxcox transformation?
Q3. Can I input the dichotomous variables into pca as they are?
Cheers
Yoshi Nagao
Comment