Using spark api to get the prediction data

hi all,

Has anyone used spark to get the prediction data? i used spark scala api to do most of the work, from beginning to generating the csv file. It is much slower than python scripts since its distributed processing and computing. But it may be useful if having more than one local machines.

Thanks

I have put it into github, GitHub - liang3zy22/numeraiexampleinspark. I will continue to update the codes.