Wish List for Stata 20

Richard Williams

Join Date: Apr 2014

Posts: 4932
#1

Wish List for Stata 20

08 Apr 2025, 11:58

I can’t believe that Stata 19 has been out for over an hour and nobody has started this thread yet!

My ongoing wish is for more powerful factor variable notation that lets you include other functions of variables, like logs and square roots.

More generally, I wish that programs like sem ran faster. I personally don’t want that many new features, I mostly wish that existing features ran better.

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://www3.nd.edu/~rwilliam
Tags: None

Stuck

6 likes
Bert Lloyd

Join Date: Apr 2014

Posts: 105
#2

08 Apr 2025, 12:05

To quote, well, myself:

Originally posted by Bert Lloyd View Post

I wonder, does anyone have the programming / LLM coaxing chops to see which items on the Wishlist are provided?

It might be fun to go back a few editions too, see what share of requests happen within 1 version, 2 versions, ..., never
1 like
Comment
Bert Lloyd

Join Date: Apr 2014

Posts: 105
#3

08 Apr 2025, 12:45

From Wish list for Stata 14:

Originally posted by Bert Lloyd View Post

Developing nice plugins for popular text editors (e.g. Notepad++, WinEdt) -- the user-written ones are nice but official versions (or user-written but with active help from StataCorp) would be a huge improvement.

variable names > 32 characters

and

Originally posted by Bert Lloyd View Post

Allow inlist() to take more than 10 string arguments.

An option for "tab" that will display both value labels and unlabeled values, similar to user-written -tabnl- and -fre-.

From Wish list for Stata 18:

Originally posted by Bert Lloyd View Post

The ability to add custom metadata to .gph files and retrieve using graph describe, similar to graph describe's current ability to retrieve r(dtafile), r(command), etc.

(Some long-ago discussion here: https://www.statalist.org/forums/for...generated-pdfs)

From Wish list for Stata 19:

Originally posted by Bert Lloyd View Post

[*]Keep focus on the do-file editor while a do-file is running, e.g., do not have graphs pop up in front of the do-file editor.

and

Originally posted by Bert Lloyd View Post

Improvements to graph text, e.g.

Aligning subscripts and superscripts in graphs' text elements as requested by John Mullahy

allowing "hats" and other decorations over characters as requested by Michael Anbar

Ideally there would be a simple way to enter "latex mode", e.g.
title("{latex: $\hat{\beta}_{0}^{i}$}")
or
title("{latex: $\hat{\beta}_{0}^{i}$}")

and

Originally posted by Bert Lloyd View Post

While in mata interactively:

1. allow selecting mata code from the do-file editor and Control-D to run the selected mata code

2. if I enter do dofile.do interactively or hit Control-D from the do-file editor, quietly end mata interactive mode and run the do-file without an error.

(I realize these might be a bit contradictory, even just one of them would be helpful if it's not feasible to add both. I don't really have a preference for one over the other.)
1 like
Comment
Rich Goldstein

Join Date: Mar 2014

Posts: 4436
#4

08 Apr 2025, 13:39

I would like -hsearch- to be resurrected, even if the algorithm is not changed - I still use this at least once a month to find programs that I've forgotten the name of

I would like to repeat my previous requests for more design help; in particular, randomization schedules of various kinds (I know there are a number of user-written commands but I would like something built-in as a guard against the user retiring - actually this has already happened for at least one author) and matching (again, I know there are a number of user-written routines; I use and like -ultimatch- and have heard good things about -kmatch-)
2 likes
Comment
James Peluso

Join Date: Apr 2025

Posts: 1
#5

08 Apr 2025, 14:47

Originally posted by Richard Williams View Post

I can’t believe that Stata 19 has been out for over an hour and nobody has started this thread yet!

My ongoing wish is for more powerful factor variable notation that lets you include other functions of variables, like logs and square roots.

More generally, I wish that programs like sem ran faster. I personally don’t want that many new features, I mostly wish that existing features ran better.

Dr. Williams:

You have Stata MP2? What model CPU(s)? Total Cores? How large are your sem models? Observations? Variables?

I have Stata MP8, running on an Intel i9. 8 cores. This system is fast, but current-day CPU’s leave it in the dust.

Cordially,

James Peluso, M.S.
Data Scientist

25+ year Stata / SAS / JMP user
Comment
Chen Samulsion

Join Date: Jan 2018

Posts: 862
#6

08 Apr 2025, 18:33

Here is a retrospect of wishlist(s):
Wish list for Stata 14
Wish list for Stata 15
Wish list for Stata 16
Wish list for Stata 17
Wish list for Stata 18
Wish list for Stata 19

Dear Bert Lloyd, I second your advice:

Originally posted by Bert Lloyd View Post

I wonder, does anyone have the programming / LLM coaxing chops to see which items on the Wishlist are provided?

It might be fun to go back a few editions too, see what share of requests happen within 1 version, 2 versions, ..., never
1 like
Comment
Joseph Coveney

Join Date: Apr 2014

Posts: 4373
#7

08 Apr 2025, 21:05

A tall order, maybe, but go for it: Hamiltonian Monte Carlo.
3 likes
Comment
daniel klein

Join Date: Mar 2014

Posts: 3818
#8

09 Apr 2025, 01:00

A small order by comparison: in addition to the various c(sysdir_*) and c(adopath), please provide c() values that return the current setting of net set from, net set ado, and net set other; in other words, please provide easy access to the results reported by net query.

Last edited by daniel klein; 09 Apr 2025, 01:04.
1 like
Comment
Erik Ruzek

Join Date: Oct 2017

Posts: 408
#9

09 Apr 2025, 12:05

Originally posted by Joseph Coveney View Post

A tall order, maybe, but go for it: Hamiltonian Monte Carlo.

Similarly as they've done with H2o, perhaps they could just use the Stan engine for this. Robert Grant worked on a solution and developed the stan command. As of 2020, it was his view that one would be better just calling Stan via Python (via Stata). It seems that Stata developers themselves could make the Stata to Stan integration much more seamless.
1 like
Comment
Erik Reinbergs

Join Date: Oct 2022

Posts: 31
#10

10 Apr 2025, 09:22

Wishlist for sem/gsem:
WLSMV estimation similar to lavaan / mplus (with scaled/robust model fit statistics)

FIML for categorical data with MML estimation similar to mplus

I agree with the above ideas that having a built in bayes command with the HMC/NUTS sampler (via Stan backend) would be great. Sure calling Stan from Python from Stata is possible - but that is so complicated compared to using the same user friendly built in bayes stata commands with a different sampler on the backend.
1 like
Comment
Richard Williams

Join Date: Apr 2014

Posts: 4932
#11

10 Apr 2025, 11:04

I don’t trust AI because AI just makes things up sometimes, but maybe Stata could do better. Ask things like “Hey Chuck, regress y on x1, x2, and x3 with robust standard errors.” Or, perhaps more likely, “Hey Chuck, what are factor variables and how do I use them?” Chuck being the legendary Chuck Huber, who I estimate has been the star of approximately 90% of the world’s statistics videos.

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://www3.nd.edu/~rwilliam
1 like
Comment
Richard Williams

Join Date: Apr 2014

Posts: 4932
#12

10 Apr 2025, 13:07

It turns out Chuck Huber wrote a blog post a few years ago about how to use chatgpt and Stata together. As is, it seems more complicated than I would like, but I bet Stata could come up with something better and simpler if it wanted to.

https://blog.stata.com/2023/07/25/a-...o-run-chatgpt/

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://www3.nd.edu/~rwilliam
Comment
Jonathan Afilalo

Join Date: Nov 2016

Posts: 36
#13

11 Apr 2025, 07:51

The integration of h2o commands in STATA 19 has been GREATLY appreciated, however, the omission of h2o's famous autoML functionalities is disappointing. As it stands, we are relegated to using the "old school" grid search approach for hyperparameter tuning, which is time consuming and sometimes fails to identify the optimal set of hyperparameters for optimal accuracy. The same applies for model selection, we are relegated to training a bunch of models and manually comparing them.

h2o pioneered the autoML approach for hyperparameter tuning and model selection, automatically comparing different families of models with their optimal hyperparameters, and variable selection. Long story short, my wishlist for STATA 20 would be to integrate h2o's autoML functionalities.
Comment

Announcement

Wish List for Stata 20

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment