u/nc_bound

How to access more computing power?

Hi all, I'm in the midst of learning random forest in r/Rstudio, and I'm using hstats to test to look for interactions. It is taking forever, even using the default method of using only a subsample of the data. This is making it extremely difficult to learn, and over the long run, I'm gonna need to do this a lot.

Currently I'm running it on my MacBook Pro which is massively overheating, smallest runs are taking six hours, and I need to do many of them, for many different studies, over the next year or two.

Any suggestions for accessing more computing power?

I'm very new to all of this, having "grown up" with SPSS, the linear model, and good old regression. So, it help if any approach to boosting computing power can be figured out by a regular non-computer saavy guy like me. Eg, it sounds like Rstudio Server could be easy to get running on a cloud?

I can think of: 1. get a dedicated heavy duty computer. This would be a big commitment, especially at my resource-scarce institution, and although I'm optimistic these methods will prove valuable for my work, dumping a couple of grand into a machine is still risky. 2. rent time on a cloud computing site. Much lower up front investment, and if, down the road, the methods prove valuable for me, then I could later commit to a dedicated computer. 3. I'm a prof at a university...maybe there are resources in my university system.

Thank you for any ideas, advice, warnings, etc.

reddit.com
u/nc_bound — 1 day ago