How do you handle your losses different scales

Hi,

Thanks you for the very interesting library!  Looking at the code I was wondering how you were balancing the different losses scales. The MSE loss is probably on a completely different scale than the cross-entropy loss. How do you make sure that the MSE loss does not dominate compared to your other losses?

I never actually implemented this myself, but I found two papers that implement custom loss weights. This enables you to have any number of losses each with different scales, but then each gets its own weight for the calculation of the total loss.

The paper in question are section 3 of [Multi-Task Learning Using Uncertainty to Weigh Losses](https://arxiv.org/pdf/1705.07115.pdf) and then section 2 of [Auxiliary Tasks in Multi-task Learning](https://arxiv.org/pdf/1805.06334.pdf) where some other authors refine the formula a bit more from the first paper.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How do you handle your losses different scales #4

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How do you handle your losses different scales #4

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions