EPDMS score is 0.0 for some tokens, even when all subscores are 1.0.

<img width="558" height="29" alt="Image" src="https://github.com/user-attachments/assets/e2b18358-be4f-4684-bdc1-cdb7bd480254" />

I am using navsim==2.2 to evaluate the EPDMS metric on the navtest split. (This evaluation setting, while not the two-stage pseudo-simulation, has been used in several recent papers).

### 1. Description
I've encountered a consistent issue where certain specific tokens receive a final EPDMS score of 0.0, even though every individual subscore (NC, DAC, DDC, TLC, EP, TTC, etc.) for that token is 1.0.

According to the EPDMS formula (a product of penalty terms and a weighted average of other terms), if all subscores are 1.0, the final EPDMS should also be 1.0. This 0.0 result seems to be an error. Is there anything I might have missed?

### 2. Other info
Hardware: The issue is reproducible on both NVIDIA H100 and Ascend 910B hardware.
Consistency: The exact same tokens fail (score 0.0) consistently across different models and multiple test runs.
Dependencies : I am aware of potential issues with older numpy versions (like 1.23.4). I have already upgraded numpy to 1.26.4, deleted the entire metric_cache then regenerated it, but the problem persists.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

EPDMS score is 0.0 for some tokens, even when all subscores are 1.0. #172

1. Description

2. Other info

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

EPDMS score is 0.0 for some tokens, even when all subscores are 1.0. #172

Description

1. Description

2. Other info

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions