Hello,
I've run some benchmarks comparing the speed of R and Stata for common data manipulation, on datasets ranging from 100MB to 5GB. Below is the result for 500MB (1e7 rows), which may interest some of you: http://www.princeton.edu/~mattg/pictures/1e7.png

You can find more information on the github repository https://github.com/matthieugomez/benchmark-stata-r , which contains a quick summary of the result, the R and Stata scripts I runned, and the results as a .csv file. Feel free to edit the scripts if you spot mistakes.
I've also written a guide on data manipulations in R for Stata Users : http://www.princeton.edu/~mattg/statar/
The guide targets topics I could not find elsewhere : equivalent to egen by commands, panel data commands, macros, and inplace transformations of large datasets. The guide is centered on the packages data.table and dplyr, which bring R syntax closer to Stata, while being generally an order of magnitude faster than Stata for common data manipulations. I hope this guide will be useful to some Stata users.
Matthieu
I've run some benchmarks comparing the speed of R and Stata for common data manipulation, on datasets ranging from 100MB to 5GB. Below is the result for 500MB (1e7 rows), which may interest some of you: http://www.princeton.edu/~mattg/pictures/1e7.png
You can find more information on the github repository https://github.com/matthieugomez/benchmark-stata-r , which contains a quick summary of the result, the R and Stata scripts I runned, and the results as a .csv file. Feel free to edit the scripts if you spot mistakes.
I've also written a guide on data manipulations in R for Stata Users : http://www.princeton.edu/~mattg/statar/
The guide targets topics I could not find elsewhere : equivalent to egen by commands, panel data commands, macros, and inplace transformations of large datasets. The guide is centered on the packages data.table and dplyr, which bring R syntax closer to Stata, while being generally an order of magnitude faster than Stata for common data manipulations. I hope this guide will be useful to some Stata users.
Matthieu
Comment