There's no benefit to normalizing--it won't change the results of your svy: tab command, for example.
The definition of a sampling, or probability, weight, is: \(w_i = \frac{1}{f_i}\), where \(f_i\) is the probability that sample member \(i\) was selected.
"Normalized" weights are probability weights that are scaled (divided by a constant) to sum to some constant C. I've seen C = 1 and C = sample size \(n\). This last was popular before the days of survey software. As I've said, this makes no difference to most analyses.
However there are good reasons not to normalize weights. If you do normalize
1. You lose the possibility of estimating totals, including population counts. This is a serious loss in many studies
2. If you use a finite population correction (fpc option to svyset), then estimates of the design effect DEFF be incorrect.
3. A subject's weight has the interpretation as the number of people in the population "represented" by the subject. This is easy to understand and often important to look at. Normalizing destroys this interpretability
Note that the "probability" weights supplied with many data sets are not sampling weights, but sampling weights adjusted for non-response and post-stratified so that estimated sample totals match population totals known from other sources (like a Census). The reasons against normalizing apply to these weights as well.
The definition of a sampling, or probability, weight, is: \(w_i = \frac{1}{f_i}\), where \(f_i\) is the probability that sample member \(i\) was selected.
"Normalized" weights are probability weights that are scaled (divided by a constant) to sum to some constant C. I've seen C = 1 and C = sample size \(n\). This last was popular before the days of survey software. As I've said, this makes no difference to most analyses.
However there are good reasons not to normalize weights. If you do normalize
1. You lose the possibility of estimating totals, including population counts. This is a serious loss in many studies
2. If you use a finite population correction (fpc option to svyset), then estimates of the design effect DEFF be incorrect.
3. A subject's weight has the interpretation as the number of people in the population "represented" by the subject. This is easy to understand and often important to look at. Normalizing destroys this interpretability
Note that the "probability" weights supplied with many data sets are not sampling weights, but sampling weights adjusted for non-response and post-stratified so that estimated sample totals match population totals known from other sources (like a Census). The reasons against normalizing apply to these weights as well.
Comment