Dear Statalisters, I am eager to hear your insights/thoughts and experiences on this matter:
I have been analyzing a considerable number of datasets, one by one. Each dataset is relatively small, around 70kB, with 10,000 to 20,000 observations and tab-delimited format (.txt). While the time Stata takes to read each dataset into memory is not significant individually, collectively, with billions of datasets, these tiny processes take a lot of time, sometimes taking a few days.
Are you aware of any potentially faster alternatives to the import delimited command? I have tried infile but it is slower than import delimited.
All the best,
Tiago
I have been analyzing a considerable number of datasets, one by one. Each dataset is relatively small, around 70kB, with 10,000 to 20,000 observations and tab-delimited format (.txt). While the time Stata takes to read each dataset into memory is not significant individually, collectively, with billions of datasets, these tiny processes take a lot of time, sometimes taking a few days.
Are you aware of any potentially faster alternatives to the import delimited command? I have tried infile but it is slower than import delimited.
All the best,
Tiago
Comment