Im writing a thesis about prosociality and social status. The data Im using consists of responses from USA, Poland, Germany and Sweden. Before I ran my regression analysis I standardized the variable for objective status(consisting of variables for job prestige, education and income). However, the objective status variable should be standardized at country level, not across all countries together. I standardize each indicator at country level by coding this:
bysort country: egen z2utdanning = std(education)
bysort country: egen z2jobbprestisje = std(jobbprestisje)
bysort country: egen z2inntektsdesil = std(incomedecile)
Then I use rowmean when I merge the variables to a composite:
egen row_objektivstatus= rowmean (z2inntektsdesil z2jobbprestisje z2utdanning)
Is it necessary to standardize the composite variable(row_objektivstatus) by country again, or is it already standardized because each indicator was standardized before I constructed the composite variable?
I also normalize the variable:
bysort country: su row_objektivstatus
bysort country: ge z1objektivstatus = (row_objektivstatus-r(min))/(r(max)-r(min))
bysort country: su z1objektivstatus
The minimum and maximun variable of objective status in USA are 0 and 1 without decimals, while for the other countries the values are equal to 0 and 1, but with decimals. Any suggestions why USA got perfect numbers without decimals? Im just asking because Im not sure I did the normalization and standardization correctly. When I normalize the objective status variables per country, the objective status composite does not have 1 in standard deviation anymore, but it seems like having 0 and 1 as values by normalizing is preferable to having 1 in st deviation for all countries. It is also a bit strange that for every country the mean value is about 0,57 and standard deviation is about 0.23(very small differences between countries). Would it not make sense to have bigger differences in those values?
Thanks in advance!
bysort country: egen z2utdanning = std(education)
bysort country: egen z2jobbprestisje = std(jobbprestisje)
bysort country: egen z2inntektsdesil = std(incomedecile)
Then I use rowmean when I merge the variables to a composite:
egen row_objektivstatus= rowmean (z2inntektsdesil z2jobbprestisje z2utdanning)
Is it necessary to standardize the composite variable(row_objektivstatus) by country again, or is it already standardized because each indicator was standardized before I constructed the composite variable?
I also normalize the variable:
bysort country: su row_objektivstatus
bysort country: ge z1objektivstatus = (row_objektivstatus-r(min))/(r(max)-r(min))
bysort country: su z1objektivstatus
The minimum and maximun variable of objective status in USA are 0 and 1 without decimals, while for the other countries the values are equal to 0 and 1, but with decimals. Any suggestions why USA got perfect numbers without decimals? Im just asking because Im not sure I did the normalization and standardization correctly. When I normalize the objective status variables per country, the objective status composite does not have 1 in standard deviation anymore, but it seems like having 0 and 1 as values by normalizing is preferable to having 1 in st deviation for all countries. It is also a bit strange that for every country the mean value is about 0,57 and standard deviation is about 0.23(very small differences between countries). Would it not make sense to have bigger differences in those values?
Thanks in advance!
Comment