Objective Function

javiermoral · March 8, 2021, 12:36pm

I’m writing to ask which objective functions you use most when training your models. I was thinking of customizing an objective function for boosted models in order to beat the common methods already available. I know Spearman’s correlation is non-differentiable because of its sort and rank steps, but I found some references that try to deal with these problems.

I’ve tried SoDeep loss functions when training my MLPs and it was a complete disaster, so it would be nice to hear some tips from you all. Do you stick with RMSE, MSE, MAE, MAPE, LOGLOSS…?
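To make the non-differentiability concrete, here is a minimal NumPy sketch. The helper names `ranks` and `spearman` and the toy arrays are illustrative assumptions, and ties are not handled. Spearman is computed as the Pearson correlation of the ranks, so a small nudge to one prediction that does not change the ordering leaves the score unchanged, which means the gradient is zero almost everywhere.

```python
import numpy as np

def ranks(a):
    """Return 0-based ranks of a 1D array (illustrative helper, no tie handling)."""
    order = np.argsort(a)
    r = np.empty(len(a), dtype=float)
    r[order] = np.arange(len(a))
    return r

def spearman(x, y):
    """Spearman correlation computed as the Pearson correlation of the ranks."""
    return np.corrcoef(ranks(x), ranks(y))[0, 1]

# Toy data, made up for illustration only.
preds = np.array([0.10, 0.40, 0.35, 0.80])
target = np.array([0.00, 0.50, 0.25, 1.00])

base = spearman(preds, target)

# Nudge one prediction slightly: the ordering, and so the score, is unchanged.
step = 1e-3
nudged = preds.copy()
nudged[2] += step
finite_diff = (spearman(nudged, target) - base) / step

print(base, finite_diff)
```

Because the finite difference is zero here, gradient-based training gets no signal from the exact Spearman score, which is why smooth surrogates for the rank step are used instead.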

silentj · March 8, 2021, 6:14pm

I’ve tried using KL divergence for learning to rank (see here: https://theiconic.tech/learning-to-rank-is-good-for-your-ml-career-part-2-lets-implement-listnet-11af69d1704). I ended up getting slightly worse results than with regular MSE, so I didn’t explore it much. I might come back to it eventually; it seemed cool at the time.
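For readers unfamiliar with the idea, a ListNet-style loss can be sketched as the KL divergence between the softmax of the targets and the softmax of the predicted scores (the "top-one" probabilities). This is a minimal NumPy sketch under that assumption; the function names are hypothetical and the linked article's implementation may differ.

```python
import numpy as np

def log_softmax(z):
    """Numerically stable log-softmax of a 1D array."""
    z = np.asarray(z, dtype=float)
    z = z - np.max(z)  # shift so the largest entry is 0
    return z - np.log(np.sum(np.exp(z)))

def listnet_kl(scores, targets):
    """KL(P_targets || P_scores) over top-one probabilities (illustrative)."""
    log_p = log_softmax(targets)
    log_q = log_softmax(scores)
    p = np.exp(log_p)
    return float(np.sum(p * (log_p - log_q)))

# Toy example: one list (for instance, one era) of four items.
targets = np.array([0.0, 0.25, 0.5, 1.0])
same_order = targets + 5.0   # shifted scores give identical softmax probabilities
reversed_order = targets[::-1]

print(listnet_kl(same_order, targets), listnet_kl(reversed_order, targets))
```

The loss is zero when the score distribution matches the target distribution and positive otherwise, and it is differentiable in the scores because it avoids any explicit sort.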

greenprophet · March 9, 2021, 2:13am

I just used pearsonr since it seemed close enough without the sort. With a NN and PyTorch era batches, I got better validation with pearsonr + MSELoss than with MSELoss alone. I only have a couple of rounds started on live, though.

Would like to eventually get some ranking and feature neutralization directly in the loss.

javiermoral · March 10, 2021, 10:41am

Did you code the Pearson correlation loss function yourself? Or is it implemented elsewhere?

greenprophet · March 10, 2021, 6:31pm

I used this code

https://gist.github.com/ncullen93/58e71c4303b89e420bd8e0b0aa54bf48

pytorch_correlations.py

def pearsonr(x, y):
    """
    Mimics `scipy.stats.pearsonr`

    Arguments
    ---------
    x : 1D torch.Tensor
    y : 1D torch.Tensor

    Returns

(The gist preview is truncated here; the full file is at the link above.)

Define the loss functions:

    criterion = nn.MSELoss()
    corr_loss_fn = pearsonr

Then, inside the PyTorch training loop, I call the loss functions like this. Depending on your model’s outputs, the indexing might be different. I’m also not sure whether the constant weights on the losses matter; I get confused about this.

    preds = model(x)
    loss = criterion(preds[0], y)
    corr_loss = 1 - corr_loss_fn(preds[0].squeeze(), y.squeeze())
    if USE_CORR_LOSS:
        loss += corr_loss * 0.05
    if phase == 'train':
        loss.backward()
        optimizer.step()
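For anyone who wants to try this without the gist, here is a self-contained sketch of the same idea: MSE plus a small penalty of 1 minus the Pearson correlation. The `pearson_corr` and `combined_loss` functions, the tiny linear model, and the random data are all stand-ins written for illustration, not the gist's code; only the 0.05 weight is taken from the snippet above.

```python
import torch
import torch.nn as nn

def pearson_corr(x, y, eps=1e-8):
    """Differentiable Pearson correlation of two 1D tensors (illustrative)."""
    xc = x - x.mean()
    yc = y - y.mean()
    # eps guards against division by zero when a tensor is constant
    return (xc * yc).sum() / (xc.norm() * yc.norm() + eps)

def combined_loss(preds, target, corr_weight=0.05):
    """MSE plus a weighted (1 - Pearson correlation) penalty."""
    mse = nn.functional.mse_loss(preds, target)
    corr_penalty = 1.0 - pearson_corr(preds, target)
    return mse + corr_weight * corr_penalty

# Stand-in model and data, made up for this sketch.
torch.manual_seed(0)
model = nn.Linear(3, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
x = torch.randn(16, 3)
y = torch.randn(16)

# One training step on a single batch (for instance, one era).
optimizer.zero_grad()
loss = combined_loss(model(x).squeeze(-1), y)
loss.backward()
optimizer.step()
```

On the question of the constant: the weight sets how strongly the correlation term pulls relative to MSE. Since `1 - corr` lies in [0, 2] while MSE depends on the target scale, a reasonable value is data-dependent and is probably best treated as a hyperparameter to tune on validation.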

lucky_chicken · March 14, 2021, 12:25am

I’m using fast-soft-sort for my neural nets. It is better than MSE for me, but still worse than simple xgboost. I must be doing something wrong.

javiermoral · March 28, 2021, 3:44pm

Same for me, it does not work as well as expected.

javiermoral · April 13, 2021, 5:41pm

I can’t see how the gradient and the hessian are computed in your code
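For context on what a boosted model needs here: a custom objective for a library such as XGBoost is a function that returns, for every row, the first derivative (gradient) and second derivative (Hessian) of the loss with respect to the prediction. Below is a minimal NumPy sketch for plain squared error, with a finite-difference check. The function name and toy arrays are assumptions, and in XGBoost's native API the callable receives the predictions and the training `DMatrix` rather than a label array, so a thin wrapper would be needed.

```python
import numpy as np

def squared_error_objective(preds, labels):
    """Per-row gradient and Hessian of 0.5 * (preds - labels)**2 w.r.t. preds."""
    grad = preds - labels          # first derivative
    hess = np.ones_like(preds)     # second derivative is constant
    return grad, hess

def per_row_loss(p, labels):
    """The loss whose derivatives are returned above."""
    return 0.5 * (p - labels) ** 2

# Toy data, made up for illustration only.
preds = np.array([0.2, 0.7, 0.5])
labels = np.array([0.0, 1.0, 0.5])
grad, hess = squared_error_objective(preds, labels)

# Finite-difference check of the gradient for the first row.
h = 1e-6
bumped = preds.copy()
bumped[0] += h
numeric = (per_row_loss(bumped, labels)[0] - per_row_loss(preds, labels)[0]) / h

print(grad, hess, numeric)
```

With a neural network, autograd computes the gradient from the loss, so no explicit gradient or Hessian appears in the training loop; the explicit pair is only needed when plugging a loss into a boosting library.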
