Hello everyone,
I'm trying do define a population with specific characteristics (eg: eating well).
I have many dummy variables that i wish to use, positively or negatively (eg: having everyday -breakfast -fruits -vegetable -sweets)
I generated a composite variable "eatingwell= breakfast + fruits + veget +1-sweets"
Now, i want to make sure that this variable is really valid, that is to say my dummy variables are correlated together.
Now the questions:
1) I tried to use Cronbach's alpha, but i'm having a hard time to interpret it. I know that 0.7 is a good level, but i get results ranging from 0.4 to 0.6. Is that enough to use a composite variable?
2) I also tried the command "factor", but i dont understand the meaning of the results, and i dont know if i should use principal factor or principal component factor method.
Is the screeplot a good method to visualize the results of this factor analysis?
3) Finally, i tried to use "candisc list_of_dummy_var , group(composite_var)" , but i get an error message if i put the exact variables that were used to generate my composite variable (pooled within-group SSCP matrix has rank 4 instead of rank 5). If i choose to drop one of the dummy variable, i can use scandisc and the screeplot, but i'm not sure of the meaning of the screeplot. I initially thought it would give me the validity of the dummy variables, but now i wonder if it is not rather the validity of the functions used by the discriminant analysis.
4)Last, is manually generating my composite variable a good way of doing it, or should i use another command such as ".egen compvar "? And then how can i make stata understand that some variable have to be used positively and other negatively?
To sum up: What is the best way to generate a composite variable, and what is the best way to assess it's validity.
Thank you for your help.
Gabriel FERNANDEZ
I'm trying do define a population with specific characteristics (eg: eating well).
I have many dummy variables that i wish to use, positively or negatively (eg: having everyday -breakfast -fruits -vegetable -sweets)
I generated a composite variable "eatingwell= breakfast + fruits + veget +1-sweets"
Now, i want to make sure that this variable is really valid, that is to say my dummy variables are correlated together.
Now the questions:
1) I tried to use Cronbach's alpha, but i'm having a hard time to interpret it. I know that 0.7 is a good level, but i get results ranging from 0.4 to 0.6. Is that enough to use a composite variable?
2) I also tried the command "factor", but i dont understand the meaning of the results, and i dont know if i should use principal factor or principal component factor method.
Is the screeplot a good method to visualize the results of this factor analysis?
3) Finally, i tried to use "candisc list_of_dummy_var , group(composite_var)" , but i get an error message if i put the exact variables that were used to generate my composite variable (pooled within-group SSCP matrix has rank 4 instead of rank 5). If i choose to drop one of the dummy variable, i can use scandisc and the screeplot, but i'm not sure of the meaning of the screeplot. I initially thought it would give me the validity of the dummy variables, but now i wonder if it is not rather the validity of the functions used by the discriminant analysis.
4)Last, is manually generating my composite variable a good way of doing it, or should i use another command such as ".egen compvar "? And then how can i make stata understand that some variable have to be used positively and other negatively?
To sum up: What is the best way to generate a composite variable, and what is the best way to assess it's validity.
Thank you for your help.
Gabriel FERNANDEZ
Comment