So - I am trying to decide which resident to promote. Using their "Performance" as one metric (about 50% of their entire score) My idea was to normalize their survey score and then use that score in their overall total rank.
I realized I might have a problem.
The scoring is 1-10 from 14 reviewers. My plan was to use the median to decrease some potentially biased outlier results.
I really am trying to be fair to all these hard-working people, so getting this right is important to me. Some reviewers gave no score (because they didnt work with that person) and thus missing data exists. That might give me trouble with interobserver reliability. There might be issues with right censoring too?
Slight differences might mean the difference between job and no job.
Dashes mean missing data. Each column is the same reviewer
Resident 1: 9 7 6 7 8 - 6 7 4 6 2 5 6 7 Resident 2: 10 7 8 10 8 8 10 8 10 9 10 6 8 9 Resident 3 9 7 6 - 8 - 8 6 7 6 4 7 8 7 Resident 4 - 8 8 - 7 7 4 7 3 8 - 10 Resident 5 8 9 7 - 9 9 10 8 10 9 8 4 5 8
Am I overthinking this?
I realized I might have a problem.
The scoring is 1-10 from 14 reviewers. My plan was to use the median to decrease some potentially biased outlier results.
I really am trying to be fair to all these hard-working people, so getting this right is important to me. Some reviewers gave no score (because they didnt work with that person) and thus missing data exists. That might give me trouble with interobserver reliability. There might be issues with right censoring too?
Slight differences might mean the difference between job and no job.
Dashes mean missing data. Each column is the same reviewer
Resident 1: 9 7 6 7 8 - 6 7 4 6 2 5 6 7 Resident 2: 10 7 8 10 8 8 10 8 10 9 10 6 8 9 Resident 3 9 7 6 - 8 - 8 6 7 6 4 7 8 7 Resident 4 - 8 8 - 7 7 4 7 3 8 - 10 Resident 5 8 9 7 - 9 9 10 8 10 9 8 4 5 8
Am I overthinking this?
Comment