This issue tracks the candidate methods that could be implemented as a `Scalarizer` in `torchjd.scalarization` (see #666).
| Name | Ref | Stateful | Existing implementations | Special Remarks |
|------|-----|----------|--------------------------|-----------------|
| Sum | - | No | (trivial) | |
| Mean | - | No | (trivial) | Sometimes called Equal Weights (EW) in research papers. |
| Linear | - | No | (trivial) | Sometimes called Linear Scalarization (LS) in research papers. Name it Linear or Constant? |
| Random | Reasonable Effectiveness of Random Weighting: A Litmus Test for Multi-Task Learning (TMLR 2022, 200 citations) | No | LibMTL (official) | Sometimes called Random Loss Weighting (RLW) in research papers. Name it RLW or Random? See the sketch below the table. |
| STCH (Smooth TCHebycheff) | Smooth Tchebycheff Scalarization for Multi-Objective Optimization (ICML 2024, 87 citations) | No | official, LibMTL | See the smoothed objective below the table. |
| GLS (Geometric Loss Strategy) | MultiNet++: Multi-Stream Feature Aggregation and Geometric Loss Strategy for Multi-Task Learning (CVPR workshop 2019, 149 citations) | No | LibMTL | See the geometric-mean formula below the table. |
| IMTL-L | Towards Impartial Multi-Task Learning (ICLR 2021, 279 citations) | ? | official, LibMTL (maybe this is IMTL-G) | Needs more investigation. |
| FAMO | FAMO: Fast Adaptive Multitask Optimization (NeurIPS 2023, 124 citations) | ? | official, LibMTL | Not sure this is a Scalarizer; needs more investigation. |
| GradNorm | GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks (ICML 2018, 2334 citations) | Yes, trainable state | unofficial, LibMTL | Not sure this is a Scalarizer; needs more investigation. |
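The Random entry is simple enough that a sketch may help the discussion. The snippet below is a minimal illustration, not torchjd's actual API: the function name `rlw` and the softmax-of-standard-normal weight distribution (the main variant studied in the RLW paper) are assumptions on my part.

```python
import torch


def rlw(losses: torch.Tensor) -> torch.Tensor:
    """Random Loss Weighting: scalarize a 1-D vector of per-task losses
    with random weights resampled at every call (softmax over i.i.d.
    standard normal draws). Illustrative sketch only, not torchjd's API.
    """
    weights = torch.softmax(torch.randn_like(losses), dim=0)
    return (weights * losses).sum()


# Usage: scalar = rlw(torch.stack([loss_a, loss_b])); scalar.backward()
```

Since the weights are resampled independently at each step and nothing is carried over, this supports the "Stateful: No" entry above.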
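For context on STCH (paraphrasing the cited paper, up to notation): it replaces the max in the weighted Tchebycheff scalarization $\max_i \lambda_i \left(f_i(x) - z_i^*\right)$ with a log-sum-exp smoothing,

$$g^{\mathrm{STCH}}_{\mu}(x \mid \lambda) = \mu \log \sum_{i=1}^{m} \exp\left(\frac{\lambda_i \left(f_i(x) - z_i^*\right)}{\mu}\right),$$

where $\lambda$ is a fixed preference vector, $z^*$ an (approximate) ideal point, and $\mu > 0$ a smoothing parameter; the non-smooth Tchebycheff objective is recovered as $\mu \to 0^+$. Since everything is determined by fixed hyperparameters, it should indeed be stateless.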
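Similarly, GLS scalarizes by taking the geometric mean of the $n$ task losses (again up to notation, from the MultiNet++ paper):

$$L_{\mathrm{GLS}} = \left(\prod_{i=1}^{n} L_i\right)^{1/n},$$

which is stateless and less sensitive than a plain sum to tasks whose losses live on very different scales.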