I thought someone would be using a Threadripper. Does no one use one?
With PyTorch and cuML you can push all of your ML workload to GPUs. They are faster and cheaper than a Threadripper.
Thank you very much. I was under the impression that for GBDTs a many-core CPU is faster than a GPU. I will try it.
I have NVLink on both my machines. I haven't used it for Numerai data, although I have used it for some large computer vision models in the past. I don't know of any automatic way to use multiple GPUs as one for this use-case (model parallelism). When people say multi-GPU training, they usually mean training large batches on multiple GPUs (data parallelism), which is the easy case and trivially automated by all frameworks.
It's fairly straightforward to place some layers (i.e. the weights for those layers) on different GPUs. I've done this with TensorFlow and JAX. Still a bit slower than using a single GPU, but noticeably faster than doing the same without NVLink, because device-to-device copies are much faster with it.
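For anyone curious what "placing layers on different GPUs" looks like in practice, here is a minimal sketch in PyTorch (the post above used TensorFlow and JAX, but the idea is the same). The layer sizes and the two-layer net are made up for illustration, and the example falls back to CPU so it also runs on a machine without GPUs:

```python
# Hypothetical sketch of manual model parallelism: each layer's weights
# live on a different device, and the activations are copied between
# devices during the forward pass. The inter-device copy in forward()
# is exactly the hop that NVLink speeds up.
import torch
import torch.nn as nn

# Pick real GPUs if present, otherwise fall back to CPU so this runs anywhere.
dev0 = torch.device("cuda:0" if torch.cuda.device_count() >= 1 else "cpu")
dev1 = torch.device("cuda:1" if torch.cuda.device_count() >= 2 else dev0)

class TwoDeviceMLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(310, 128).to(dev0)  # first layer on device 0
        self.fc2 = nn.Linear(128, 1).to(dev1)    # second layer on device 1

    def forward(self, x):
        h = torch.relu(self.fc1(x.to(dev0)))
        # device-to-device copy of the activations before the next layer
        return self.fc2(h.to(dev1))

model = TwoDeviceMLP()
out = model(torch.randn(64, 310))
print(out.shape)  # torch.Size([64, 1])
```

Data parallelism, by contrast, keeps a full copy of the model on every GPU and only splits the batch, which is why frameworks can automate it so easily.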
My CPU is a first-gen Threadripper, 12 cores / 24 threads, and fairly slow compared to latest-gen CPUs (I've overclocked it though, so all cores run at 4 GHz all the time). I use it mostly for pre- and post-processing. I've been contemplating whether I should upgrade to a second-gen 2990WX (32 cores / 64 threads), which is the biggest one the motherboard supports. I don't really need it though.
Thanks for a very thorough answer!
3090 x 2… Do the house lights dim just a little when you fire that up? Nice setup!
An Asus Eee PC 901, shipped with an Intel Atom N270 CPU clocked at 1.6 GHz and 1 GB of RAM.
It runs Debian 9 and I use it for inference only!!!
My XGBoost models run fine. I had to struggle a bit more with my PyTorch models, as it is a 32-bit device and PyTorch does not work on 32-bit systems. So I broke my models down and replay them as simple linear algebra using NumPy.
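The trick of replaying a trained PyTorch model as plain NumPy linear algebra can be sketched like this. The network shape, parameter names, and the idea of exporting weights with `numpy.savez` on the training machine are my assumptions for illustration, not the poster's actual setup:

```python
# Hypothetical sketch: inference for a small trained MLP using only NumPy,
# e.g. on a 32-bit device where PyTorch itself won't install. The weights
# would have been exported on the training machine (e.g. via np.savez from
# model.state_dict()); here they are random placeholders.
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def forward(x, params):
    # Mirrors PyTorch's nn.Linear convention: y = x @ W.T + b
    h = relu(x @ params["fc1.weight"].T + params["fc1.bias"])
    return h @ params["fc2.weight"].T + params["fc2.bias"]

# Placeholder parameters with the shapes nn.Linear(8, 16) / nn.Linear(16, 1)
# would produce; in practice you would load these with np.load("weights.npz").
rng = np.random.default_rng(0)
params = {
    "fc1.weight": rng.normal(size=(16, 8)),
    "fc1.bias":   rng.normal(size=16),
    "fc2.weight": rng.normal(size=(1, 16)),
    "fc2.bias":   rng.normal(size=1),
}

preds = forward(rng.normal(size=(4, 8)), params)
print(preds.shape)  # (4, 1)
```

As long as the model is just linear layers and simple activations, the forward pass is a handful of matrix multiplies, which even an Atom netbook can manage for inference.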
Of course for training I have other devices: an MSI laptop (i7, 16 GB, GTX 1060) and a retired open-air mining rig (AMD CPU, 12 GB, GTX 1080), but no data science war machine.
All my models run in the cloud. I started with Azure ML Studio, moved to Colab and Python as others suggested, then to Kaggle notebooks, and now use Deepnote for my daily submissions.