Challenges shared by Richard

nyuton · February 28, 2022, 5:07pm

I guess it’s worth sharing here. Continue reading on Twitter.

https://twitter.com/richardcraib/status/1498167957263839233?s=20&t=fOGaXayexv6TG476YqwE3g

luee · February 28, 2022, 6:08pm

In my experience, simulated data is kinda useless for training, especially when the data is so noisy. Fun blue sky project but I strongly doubt we can build a convincing generative model for the kind of data where a spearman of 3-4% is considered good, clearly our data is barely understood by our models

lcrmorin · March 3, 2022, 3:43pm

Depending on how you see it adding noise is a form of augmentation. I think custom noise (swapping / masking / etc.) layers can practically add robustness and performance. Custom layers are way more practical than metric learning or topological data analysis.

Topic		Replies	Views
Numerai Self-Supervised Learning & Data Augmentation Projects Data Science	114	11076	March 22, 2023
Synthetic data generation using GANs Data Science	8	2485	September 6, 2021
Feature reversing input noise Data Science	21	6294	May 18, 2021
Are the new data more noisy? Tournament	5	983	October 4, 2021
Generate Forged Signature From Real one for data augmentation Data Science	1	1806	March 26, 2022

Challenges shared by Richard

Related topics