SAS vs Stata Jackknife

Bruce Bradbury

Join Date: May 2015
Posts: 20

SAS vs Stata Jackknife

16 Jun 2015, 19:38

I'm getting different results when using SAS (9.4) and Stata (v11) for jackknife regression. Anyone know why these are different?
Below are the key parts of my SAS and Stata code. If I run SAS surveyreg using the default Taylor series linearisation, and also run the corresponding Stata code on the same data, I get (near enough to) identical results. When I use the jackknife approach, the std errors are essentially the same as the Taylor series approach in Stata, but about 10% larger in SAS.

SAS CODE

Code:

  title "Regression: Taylor series linearisation";
  proc surveyreg data=temp1 total=lsac.stratum74with73   ;
     strata stratum74with73;
     cluster pcodes;
     weight defwt;
     model ready5_as = edl edm edh /noint;
  run;
  title "Regression: Jackknife method (10% larger SEs)";
  proc surveyreg data=temp1 varmethod=jk ;
     strata stratum74with73;
     cluster pcodes;
     weight defwt;
     model ready5_as = edl edm edh /noint;
  run;

STATA CODE

Code:

  svyset pcodes [pweight=defwt], strata(stratum74with73) fpc(totalstratums)
  * Taylor series linearisation
  svy: regress ready5_as edl edm edh, noconstant
  * Jackknife calculation
  svy jackknife: regress ready5_as edl edm edh, noconstant

Tags: None

Bruce Bradbury

Join Date: May 2015

Posts: 20
#2

16 Jun 2015, 19:47

I think it has something to do with the FPC. If I remove the FPC information from the code, all four approaches give essentially the same results (the larger std errors). But presumably the optimal estimates should take account of this? The SAS documentation implies that the jackknife estimate doesn't, and shouldn't, take account of the FPC, but it would seem the Stata code does in some way. Which is correct?
Comment
Steve Samuels

Join Date: Mar 2014

Posts: 1786
#3

18 Jun 2015, 14:01

The fpc is certainly permitted with the jackknife, at least in some circumstances.. See, for example, Wolter, 2007, p. 174; Lumley, 2004, p. 8

References:

Lumley, T. (2004). Analysis of complex survey samples. Journal of Statistical Software, 9(1), 1-19.
http://www.jstatsoft.org/v09/i08/paper

Wolter, Kirk M. 2007. Introduction to variance estimation. New York: Springer.

Last edited by Steve Samuels; 18 Jun 2015, 14:32.

Steve Samuels
Statistical Consulting
[email protected]

Stata 14.2
1 like
Comment
Bruce Bradbury

Join Date: May 2015

Posts: 20
#4

21 Jun 2015, 06:53

Steve, Thanks for these helpful comments and references. I haven't read the Wolter reference yet, but have found some useful comments on the Stata support site at http://www.stata.com/support/faqs/st...stage-designs/ There, it is suggested that a FPC for the first stage PSU selection is not appropriate if there is sampling within the PSUs (which there certainly is in my case).

So, even though the survey providers do provide information to make an FPC, I think I should not be doing so - under either the linearisation or jackknife approaches.

(The design is a regional stratification, followed by a first stage selection of postal areas, followed by a clustered sampling of people within these postal areas. Only information on the first stage stratification and psu selection is provided.)

Last edited by Bruce Bradbury; 21 Jun 2015, 06:57.
Comment

Announcement

SAS vs Stata Jackknife

Comment

Comment

Comment