Thanks to Kit Baum, the dataframe package is now available on SSC.
Description
dataframe is intended to bypass some of the limitations of only having one dataset in memory at a time. dataframe is similar to preserve, with the key differences being:
I've included this in the mata forum, since the command is just a Stata wrapper on a mata function. I'd be interested to hear if anyone has any comments on coding efficiency, etc (see the ado file for source code). In particular, one challenge I had was storing value labels. I defined a struct, vallabelstruct with members pointer(string vector) vector text and pointer(real vector) vector vals. That is, if vallabelstruct were a vallabelstruct scalar, vallabelstruct.text would have length equal to the number of value labels in the dataset and the length of each vector being pointed to would be the number of entries for that particular value label. I want to use st_vlload() to populate vallabelstruct. I was hoping to be able to use syntax like st_vlload("mylabelname",*vallabelstruct.vals,*vall abelstruct.text), however this syntax is won't work. Does anyone have any ideas?
Thank you,
Andrew Maurer
Description
dataframe is intended to bypass some of the limitations of only having one dataset in memory at a time. dataframe is similar to preserve, with the key differences being:
- dataframe stores the dataset in memory, rather than to disk
- multiple dataframes may exist in memory at the same time
- subsets of data may be stored, rather than the whole dataset
Code:
ssc install dataframe
Thank you,
Andrew Maurer
Comment