Q1 mods: Accommodate multiple-choice and numeric #5

hickman-santini · 2025-04-29T16:56:44Z

Especially check functions with @BEN (functions.py). I added one measly test (sorry 😞). Especially the head-to-head (recalculated peer scores for pros vs bot team) are pretty dang complicated.

There's a bug in calibration curves now but I'm tapped out - I bet it's simple, I just went about filtering to binaries in the worst way possible.

Have not touched the CP vs bots analysis.

* accommodate mc, numeric and binary * something wrong still

* ... but multiple-choice are biting me again * going to refactor a bit in next commit

@ben

* to accommodate multiple-choice and numerics * big chore * could be buggy * wrote one (1) test (sorry @ben)

hickman-santini · 2025-04-30T13:13:03Z

functions.py

+    if q_type == 'numeric':
+        forecasts = [f for f in forecasts if isinstance(f, list)]
+
+        if not forecasts:
+            return np.nan
+
+        cdfs_array = np.array(forecasts, dtype=float)
+        mean_cdf = np.mean(cdfs_array, axis=0)
+
+        return mean_cdf


@CodexVeritas this is what we're talking about here: https://metaculus.slack.com/archives/C02JGTBC7DJ/p1745932666973469 (may be wrong)

Molly Hickman added 7 commits April 15, 2025 11:50

don't exclude binaries; wip

5efddef

ingest, process forecasts from heroku

b9a355c

compute head-to-head scores for bots vs pros

5726553

* accommodate mc, numeric and binary * something wrong still

head-to-head score works...

2413830

* ... but multiple-choice are biting me again * going to refactor a bit in next commit

WIP: adapting get median forecasts

da0404c

ugh

1544025

rewrote head-to-head calc, get_median_forecast, weighted scores calc

a3f2fae

* to accommodate multiple-choice and numerics * big chore * could be buggy * wrote one (1) test (sorry @ben)

hickman-santini requested a review from CodexVeritas April 29, 2025 16:56

discarded unused first attempts at head-to-head scores

7d7459f

hickman-santini commented Apr 30, 2025

View reviewed changes

reinstating the calculate all peer scores

37b7d15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Q1 mods: Accommodate multiple-choice and numeric #5

Q1 mods: Accommodate multiple-choice and numeric #5

Uh oh!

hickman-santini commented Apr 29, 2025

Uh oh!

hickman-santini Apr 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Q1 mods: Accommodate multiple-choice and numeric #5

Are you sure you want to change the base?

Q1 mods: Accommodate multiple-choice and numeric #5

Uh oh!

Conversation

hickman-santini commented Apr 29, 2025

Uh oh!

hickman-santini Apr 30, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants