GFlowNets for Signal Miner: A New Way to Find Diverse, High-Performing Models

Hey everyone,

I wanted to share something exciting I’ve been working on that could significantly improve the search for diverse, high-performing models in the Numerai ecosystem. If you’ve been using Signal Miner (GitHub), you already know the key challenge: predictive edge is small and fleeting. Strong models might hit 5% correlation, maybe 10% at best, and poor performance (even negative correlation) is inevitable in certain rounds.

But one thing we do know: multiple diverse competing solutions can exist and score well. Having many well-scoring models is better than having just the single best one.


GFlowNets: A New Paradigm for Model Search

Last week, I attended a talk by Alex Hernandez-Garcia introducing GFlowNets, a fascinating idea championed by Yoshua and Emmanuel Bengio. If you haven’t heard of them, the quote below is a good place to start.

Quote from Dr. Bengio:

“I have rarely been as enthusiastic about a new research direction. We call them GFlowNets, for Generative Flow Networks. They live somewhere at the intersection of reinforcement learning, deep generative models, and energy-based probabilistic modeling. They are also related to variational models and inference, and I believe they open new doors for non-parametric Bayesian modeling, generative active learning, and unsupervised or self-supervised learning of abstract representations to disentangle both the explanatory causal factors and the mechanisms that relate them.”

What makes GFlowNets special?

Unlike traditional deterministic optimization, GFlowNets are generative models trained to produce diverse outputs that all perform well on a given task. Instead of just finding the best single solution, they learn a probability distribution over good solutions. This is incredibly useful for problems like drug discovery, where you need to explore many promising molecules rather than just one.
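To make “a probability distribution over good solutions” concrete, here is a toy stdlib-only sketch with made-up rewards: a perfectly trained GFlowNet would sample each candidate with probability proportional to its reward, so every high-reward candidate shows up often, not only the single argmax.

```python
import random
from collections import Counter

# Made-up rewards over four candidate "solutions". A trained GFlowNet samples
# x with probability proportional to R(x): here A, B, and C should all appear
# often, while the low-reward D almost never does.
rewards = {"A": 9.0, "B": 8.5, "C": 8.8, "D": 0.1}

def sample_proportional(rewards, rng):
    # Exact reward-proportional sampling -- the target distribution a
    # GFlowNet is trained to approximate.
    total = sum(rewards.values())
    r = rng.random() * total
    for x, w in rewards.items():
        r -= w
        if r <= 0:
            return x
    return x  # guard against float round-off

rng = random.Random(0)
counts = Counter(sample_proportional(rewards, rng) for _ in range(10_000))
```

A greedy optimizer would return “A” every time; the reward-proportional sampler keeps A, B, and C all in play, which is exactly the diversity we want.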


Applying GFlowNets to Signal Miner

This approach is exactly what we need for Signal Miner. Right now, Signal Miner samples LightGBM hyperparameters uniformly at random. But what if, instead of uniform sampling, we trained a GFlowNet to generate hyperparameter sets that consistently produce high correlation with the Numerai target?
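For contrast, here is roughly what the current uniform baseline looks like. The grid below is illustrative only, not the repo’s actual search space:

```python
import random

# Illustrative discretised LightGBM search space (the real grids live in the
# Signal Miner repo; these values are placeholders).
SPACE = {
    "learning_rate": [0.01, 0.05, 0.1, 0.33334],
    "max_depth": [3, 5, 7, 9],
    "num_leaves": [15, 18, 31, 63],
    "n_estimators": [23, 100, 500, 2000],
}

def sample_uniform(rng):
    # Today's baseline: every hyperparameter drawn uniformly at random,
    # with no feedback from past evaluation scores.
    return {k: rng.choice(v) for k, v in SPACE.items()}

params = sample_uniform(random.Random(42))
```

Every draw is independent of every score we have already observed, which is precisely what a learned sampler could improve on.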

From the NeurIPS paper:

“This paper is about the problem of learning a stochastic policy for generating an object (like a molecular graph) from a sequence of actions, such that the probability of generating an object is proportional to a given positive reward for that object.”

GFlowNets allow us to:

  • Find a diverse set of benchmark-beating models (instead of just one optimal set of hyperparameters).
  • Adapt dynamically to what works best, rather than relying on uniform random sampling.
  • Speed up hyperparameter search by focusing on promising regions instead of exhaustive grid search.
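The “adapt dynamically” point can be felt in a toy stdlib sketch: keep per-choice weights and nudge them toward choices that scored well, and the sampler drifts away from uniform on its own. (A real GFlowNet learns a sequential policy via a flow-matching or trajectory-balance objective; this only conveys the adaptive-sampling intuition, and the rewards below are invented.)

```python
import random

# Start uniform over three made-up learning-rate choices...
weights = {"lr=0.01": 1.0, "lr=0.1": 1.0, "lr=0.33": 1.0}
# ...and pretend these are the average rewards each choice earns.
REWARD = {"lr=0.01": 0.2, "lr=0.1": 1.0, "lr=0.33": 0.5}

def sample(weights, rng):
    # Weight-proportional sampling over the current beliefs.
    choices, w = zip(*weights.items())
    return rng.choices(choices, weights=w, k=1)[0]

rng = random.Random(1)
for _ in range(500):
    choice = sample(weights, rng)
    # Multiplicative update: choices that earn reward get sampled more often.
    weights[choice] *= 1.0 + 0.1 * REWARD[choice]
```

After a few hundred rounds the sampler concentrates on the high-reward choice while still occasionally exploring the others.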

Implementation & First Results

I built a Signal Miner environment within the GFlowNet framework, available here:
:link: GitHub: GFlowNet Signal Miner

To train, just run:

python main.py env=signalminer proxy=signalminer logger.do.online=True

This streams training results to Weights & Biases.

Early Findings

I ran a lightweight experiment on a subset of Numerai Classic data and features. Each training batch evaluates and updates on 10 sets of parameters, and over 12K rounds have been processed so far. (Clarification: “reward” and “proxy” refer to the same quantity.)
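For reference, the reward/proxy here is just correlation with the target. A stdlib-only sketch of that idea (`proxy_reward` and the positivity shift are my own illustration; note that Numerai scoring actually uses a rank-based correlation, and GFlowNet rewards must be positive, hence the shift):

```python
from math import sqrt

def pearson(xs, ys):
    # Plain Pearson correlation, written out so the sketch stays stdlib-only.
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)

def proxy_reward(predictions, target):
    # "Reward" and "proxy" are the same quantity: how well a candidate
    # model's predictions correlate with the Numerai target, shifted into
    # a strictly positive range as GFlowNet rewards require.
    return max(pearson(predictions, target) + 1.0, 1e-6)
```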

The trend is clear: over time, the GFlowNet generates better hyperparameters that achieve higher mean reward. Interestingly, the max score has not increased significantly yet, which suggests further tuning is needed. But for an initial test, this is very promising!

Example of GFlowNet-Generated Hyperparameters

In GFlowNet terminology, an output is called a state; it represents the sequence of actions that led to a final configuration. Here’s an example of what GFlowNets produce when searching for optimal hyperparameters:

Final state: [1, 5, 10, 2, 3, 8, 7, 9]
Sequence of actions: [(1,), (5,), (10,), (2,), (3,), (8,), (7,), (9,), (-1,)]
Human-readable version:
colsample_bytree: 0.34 | reg_lambda: 0.0 | learning_rate: 0.33334 | max_bin: 5 | max_depth: 9 | num_leaves: 18 | min_child_samples: 1000 | n_estimators: 23
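To make the state-to-config mapping concrete, here is a hypothetical decoder: each integer in the state indexes into that hyperparameter’s grid, and the trailing (-1,) action simply terminates the trajectory. The grids and dimension ordering below are invented for illustration (the real ones live in the repo); I use a shortened three-dimension state so the example stays small.

```python
# Hypothetical per-dimension grids: each action index picks one value.
GRIDS = [
    ("learning_rate", [0.01, 0.05, 0.1, 0.2, 0.33334]),
    ("max_depth", [3, 5, 7, 9, 11]),
    ("num_leaves", [7, 15, 18, 31, 63]),
]

END_OF_SEQUENCE = -1  # the trailing (-1,) action that terminates a trajectory

def decode(state):
    # Map each action index in the state to a concrete hyperparameter value.
    return {name: grid[i] for (name, grid), i in zip(GRIDS, state)}

config = decode([4, 3, 2])
```

With these made-up grids, the state `[4, 3, 2]` decodes to `learning_rate: 0.33334 | max_depth: 9 | num_leaves: 18`, mirroring the human-readable line above.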


Why This Matters

This experiment is just the beginning. GFlowNets show that generative AI extends well beyond language models: they use probabilistic reasoning to explore solution spaces efficiently. In our case, we want to generate a field of diverse, high-performing models, not just one single best model.

This aligns with principles of ensemble learning and diversification—critical to Numerai’s long-term success. It also aligns with the vision of Signal Miner: mine for your own unique alpha. Everyone is a winner!


Next Steps

The main challenge now is speed—we need to evaluate many models to properly train the GFlowNet. However, new research is showing how we can use cheaper (faster) low-fidelity proxies to help train GFlowNets more efficiently.

Check out Multi-Fidelity Active Learning with GFlowNets for more on this. Incorporating this approach could make training significantly faster and more cost-effective, allowing us to explore even more parameter spaces efficiently.
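As a rough sketch of the multi-fidelity idea: score every candidate with a cheap, noisy proxy, then spend the expensive full evaluation only on the top fraction. Everything below is a stand-in (`true_score` plays the role of a full LightGBM training run, `cheap_score` a low-fidelity proxy such as training on a small subsample of rows or eras):

```python
import random

def true_score(x):
    # Stand-in for the expensive evaluation (a full training run).
    # Peaks at x = 0.7.
    return -(x - 0.7) ** 2

def cheap_score(x, rng):
    # Stand-in for the low-fidelity proxy: the true score plus noise,
    # mimicking evaluation on a small data subsample.
    return true_score(x) + rng.gauss(0, 0.05)

def multi_fidelity_select(candidates, keep_frac, rng):
    # Rank all candidates by the cheap proxy, keep the top fraction,
    # then run the expensive evaluation only on the survivors.
    ranked = sorted(candidates, key=lambda x: cheap_score(x, rng), reverse=True)
    survivors = ranked[: max(1, int(len(ranked) * keep_frac))]
    return max(survivors, key=true_score)

rng = random.Random(0)
candidates = [i / 100 for i in range(100)]
best = multi_fidelity_select(candidates, keep_frac=0.1, rng=rng)
```

Here only 10% of the candidates ever see the expensive evaluation, yet the selection still lands close to the true optimum; that is the cost saving the multi-fidelity paper exploits.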

But I’m excited about the potential. This could be a powerful new tool for Numerai tournament participants.

Would love to hear your thoughts! Have you tried GFlowNets? Do you think this approach could be applied elsewhere in the tournament? Let’s discuss! :rocket:
