SVY linear regression

Katarina Vlcakova

Join Date: Jan 2015

Posts: 6
#1

SVY linear regression

10 Mar 2015, 05:40

Hello everybody, I have a quite well working regression model (used STATA) with using sample weights (prefix svy). Could anyone help me with the postestimation, verification of the model (E.g. heteroskedasticity test, autocorrelation test, normality test). I am new to stata and survey analysis, so I would be glad for any help with this. Please respect, that I use stata only a ew months... thank you
Tags: None
Steve Samuels

Join Date: Mar 2014

Posts: 1786
#2

10 Mar 2015, 16:10

You haven't used "STATA", Katarina, but you have used "Stata". See the FAQ, Section 18.

The assumptions of homoskedasticity, normality, and independence (e.g. no auto-correlation) are all assumptions needed for ordinary least squares. They are not needed for svy: regress, which bases its inference on the sample design (stratification and variation between primary sampling units) and is robust to violations of those assumptions. In other words, there's no need to test for those violations.

What you do want to test is that your model is correct. The basic test of fit is linktest, but you can also augment your model with non-linear ( (e.g. square) terms and interactions, then test that the added terms are all zero.

Last edited by Steve Samuels; 10 Mar 2015, 17:01.

Steve Samuels
Statistical Consulting
[email protected]

Stata 14.2
1 like
Comment
Marcos Almeida

Join Date: Apr 2014

Posts: 4047
#3

10 Mar 2015, 18:31

Hello Katarina,

I wonder if you already know this, but maybe not, since you said you are "new to Stata".

So, apart from the useful tips Steve Samuels gave you, I suggest you take a look on some alternatives of postestimation after - svy:regress -

http://www.stata.com/manuals13/svysv...postestimation

Also, you can get this information directly from your Stata software just by typing - help svy postestimation -

Best,

Marcos

Best regards,

Marcos
Comment
Katarina Vlcakova

Join Date: Jan 2015

Posts: 6
#4

13 Mar 2015, 04:21

Thank you Steve Samuels. I found in forums, something similar, that you wrote... Could you please reccoment some materials or papers to backup my research? Thank you
Comment
Richard Williams

Join Date: Apr 2014

Posts: 4945
#5

13 Mar 2015, 08:51

UCLA (lower third of page) has some FAQs on SVY analysis that may be helpful:

http://www.ats.ucla.edu/stat/stata/faq/

Stata's own FAQ is at

http://www.stata.com/support/faqs/statistics/#survey

I like Steve's points. I guess I more or less already knew that but he states it very well. I wish the manual or a FAQ would elaborate a bit more on issues like this, e.g. diagnostic tests. I may just quote Steve in my next set of notes on this.

With some diagnostics, I also think it is not so bad just to run non-svy commands, e.g. do so to identify outliers. You are often just trying to identify problems and figure out how to solve them -- it isn't like your statistics have to be accurate to 3 decimal places.

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://www3.nd.edu/~rwilliam
Comment
Steve Samuels

Join Date: Mar 2014

Posts: 1786
#6

13 Mar 2015, 17:24

See Rich Valliant's talk: “Regression Diagnostics for Complex Survey Data”, Stata Users’ Conference. Washington DC, 2009.
http://www.stata.com/meeting/dcconf09/dc09_valliant.pdf.

For identifying outliers, it is important to start with a method that is resistant to outliers (slide 24). Unfortunately least squares is the method least resistant. In Stata, qreg will do median regression and will accept pweights in Stata 13.

I

Steve Samuels
Statistical Consulting
[email protected]

Stata 14.2
Comment
Katarina Vlcakova

Join Date: Jan 2015

Posts: 6
#7

17 Mar 2015, 04:59

Thank you so much for your help
Comment
Katarina Vlcakova

Join Date: Jan 2015

Posts: 6
#8

17 Mar 2015, 05:38

Could I have just one quick question? In the linktest, _hatsq was not significant (P-value 0,474), but is a p-value 0,810 in case of _hat OK, Is there no misspecification? Ovtest shows no omitted vars (0.4578). Thank you
Comment
Steve Samuels

Join Date: Mar 2014

Posts: 1786
#9

17 Mar 2015, 15:06

Katarina, the p-value for _hat is not important in judging the fit of the model.

Steve Samuels
Statistical Consulting
[email protected]

Stata 14.2
Comment
Dean Smith

Join Date: Jul 2019

Posts: 1
#10

10 Jul 2019, 09:52

"The assumptions of homoskedasticity, normality, and independence (e.g. no auto-correlation) are all assumptions needed for ordinary least squares. They are notneeded for svy: regress, which bases its inference on the sample design (stratification and variation between primary sampling units) and is robust to violations of those assumptions. In other words, there's no need to test for those violations."

Could you provide the reference for this: That svy: regress (or svy: glm which is what I'm using) is robust to homoskedasticity, normality, and autocorrelation ? Thanks!
Comment

Announcement

SVY linear regression

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment