Hello everyone. I have been reading up on LASSO recently, and anticipate that when I have a go at it, I will likely have some categorical variables with 3 or more levels to contend with. I gather that "group lasso" is the correct way to deal with that.* Has anyone has implemented "group lasso" for Stata yet? To this point, my searches have not turned up anything.
Also, to those of you who are knowledgeable about LASSO, what do you think about Simon & Tibshirani's (2012) standardized group lasso?
Simon N, Tibshirani R. STANDARDIZATION AND THE GROUP LASSO PENALTY. Stat Sin. 2012 Jul;22(3):983-1001. doi: 10.5705/ss.2011.075. PMID: 26257503; PMCID: PMC4527185.
Cheers,
Bruce
* For example, here is an excerpt from a 2008 article by Meier et al.
Also, to those of you who are knowledgeable about LASSO, what do you think about Simon & Tibshirani's (2012) standardized group lasso?
Simon N, Tibshirani R. STANDARDIZATION AND THE GROUP LASSO PENALTY. Stat Sin. 2012 Jul;22(3):983-1001. doi: 10.5705/ss.2011.075. PMID: 26257503; PMCID: PMC4527185.
Cheers,
Bruce
* For example, here is an excerpt from a 2008 article by Meier et al.
Already for the special case in linear regression when not only continuous but also categorical
predictors (factors) are present, the lasso solution is not satisfactory as it only selects individ-
ual dummy variables instead of whole factors. Moreover, the lasso solution depends on how
the dummy variables are encoded. Choosing different contrasts for a categorical predictor will
produce different solutions in general. The group lasso (Yuan and Lin, 2006; Bakin, 1999; Cai,
2001; Antoniadis and Fan, 2001) overcomes these problems by introducing a suitable extension
of the lasso penalty.
predictors (factors) are present, the lasso solution is not satisfactory as it only selects individ-
ual dummy variables instead of whole factors. Moreover, the lasso solution depends on how
the dummy variables are encoded. Choosing different contrasts for a categorical predictor will
produce different solutions in general. The group lasso (Yuan and Lin, 2006; Bakin, 1999; Cai,
2001; Antoniadis and Fan, 2001) overcomes these problems by introducing a suitable extension
of the lasso penalty.
Comment