Dear Sirs,
I'm working with Stata 18 for Mac (Intel 64-bit).
I'm using a dataset with 100 variables and 481,000 records, extracted from a relational database. The subjects in the database have been dichotomized according to an integer cut-off, which divides them in 2 groups, representing 88.87% and 11,13% of total sample, respectively. Is there any means to use a variable with a lot of values (such as ICD-9 diagnoses) to select those variables in which the expected distribution for each category are greater than the expected value for the less represented group and to have it reported in a table?
Thank you for your support and Happy New Year
Mattia
I'm working with Stata 18 for Mac (Intel 64-bit).
I'm using a dataset with 100 variables and 481,000 records, extracted from a relational database. The subjects in the database have been dichotomized according to an integer cut-off, which divides them in 2 groups, representing 88.87% and 11,13% of total sample, respectively. Is there any means to use a variable with a lot of values (such as ICD-9 diagnoses) to select those variables in which the expected distribution for each category are greater than the expected value for the less represented group and to have it reported in a table?
Thank you for your support and Happy New Year
Mattia
Comment