I have a question about the correct method of combining years of BRFSS data when the purpose is to increase sample size. One approach I've found is to multiply the final survey weight for each year by the proportion of that year's sample size to the total sample size for all years of data. For example, combining 2011 to 2014 data, I would calculate the new weights as follows. Of note, these are lines from the do file and not the entire file:
1. Read-in 2011 data do file: new_weight = (2011_old_weight)/4
2. Read-in 2012 data do file: new_weight = (2012_old_weight)/4
3. Read-in 2013 data do file: new_weight = (2013_old_weight)/4
4. Read-in 2014 data do file: new_weight = (2014_old_weight)/4
5. Append 2011-2014 data do file: svyset _psu [pweight= new_weight], strata(_ststr) singleunit(certainty)
- Read-in 2011 data do file: new_weight = weight_2011*(sample_size_2011/total_sample_size),
- Read-in 2012 data do file: new_weight = weight_2012*(sample_size_2012/total_sample_size),
- Read-in 2013 data do file: new_weight = weight_2013*(sample_size_2013/total_sample_size),
- Read-in 2014 data do file: new_weight = weight_2014*(sample_size_2014/total_sample_size)
- Append 2011-2014 data do file: svyset _psu [pweight= new_weight], strata(_ststr) singleunit(certainty)
1. Read-in 2011 data do file: new_weight = (2011_old_weight)/4
2. Read-in 2012 data do file: new_weight = (2012_old_weight)/4
3. Read-in 2013 data do file: new_weight = (2013_old_weight)/4
4. Read-in 2014 data do file: new_weight = (2014_old_weight)/4
5. Append 2011-2014 data do file: svyset _psu [pweight= new_weight], strata(_ststr) singleunit(certainty)
Comment