Hello everyone. I've written a Stata implementation of the Friedman, Hastie and Tibshirani (2010, Journal of Statistical Software) coordinate descent algorithm for elastic-net regression and its famous special cases: the lasso and ridge regression. The resulting command, elasticregress, is now available on SSC -- thanks to Kit Baum for the upload.
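You can install it in the usual way:

    ssc install elasticregress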
The command extends existing Stata lasso implementations, such as lars, by allowing the regularisation parameter to be supplied by the user or chosen by K-fold cross-validation. As a result it tends to have better out-of-sample fit. The plot below compares the performance of elasticregress, lars and OLS as the number of covariates increases. As is well known, OLS performs poorly once the number of covariates grows large relative to the sample size. lars has roughly constant performance as the number of covariates increases, while elasticregress becomes more accurate.
(The estimators are fitted on 1,000 observations. The true coefficients on the standard-normal covariates are drawn from a spike-and-slab distribution with a p = 0.2 chance of being non-zero. Each dot is a mean over 30 replications. Both elasticregress and lars are run with their respective lasso options.)
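To make that setup concrete, here is a minimal sketch of one replication with 100 covariates. It is illustrative rather than the exact simulation code -- in particular, the standard-normal slab is an assumption, and the help file documents the cross-validation options:

    clear
    set seed 1
    set obs 1000
    forvalues j = 1/100 {
        generate x`j' = rnormal()                        // standard-normal covariates
    }
    generate y = rnormal()                               // noise
    forvalues j = 1/100 {
        local b = cond(runiform() < 0.2, rnormal(), 0)   // spike-and-slab coefficient
        quietly replace y = y + `b' * x`j'
    }
    lassoregress y x*, numfolds(10)                      // lasso, lambda by 10-fold CV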
elasticregress tends to be a little faster than lars when estimating the lasso. elasticregress can also estimate the more general elastic-net regression, which regularises with both the L1 and L2 norms and is thus more robust to collinearity among the regressors -- when it does so, it can cross-validate both the regularisation parameter (lambda) and the mixing parameter (alpha).
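For example (a sketch -- the exact defaults and option names are documented in the help file):

    elasticregress y x*, alpha(0.5)   // elastic net with the mixing parameter fixed at 0.5
    elasticregress y x*               // cross-validates alpha as well when it is not supplied

Setting alpha strictly between 0 and 1 mixes the L1 and L2 penalties, which is what gives the elastic net its robustness to collinear regressors.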
Hopefully the help files are self-contained -- do let me know if they're not. If you find a bug, please open an issue on GitHub.