Skip to content

Conversation

@hickman-santini
Copy link
Contributor

Especially check functions with @BEN (functions.py). I added one measly test (sorry 😞). Especially the head-to-head (recalculated peer scores for pros vs bot team) are pretty dang complicated.

There's a bug in calibration curves now but I'm tapped out - I bet it's simple, I just went about filtering to binaries in the worst way possible.

Have not touched the CP vs bots analysis.

image

Molly Hickman added 7 commits April 15, 2025 11:50
* accommodate mc, numeric and binary
* something wrong still
* ... but multiple-choice are biting me again
* going to refactor a bit in next commit
* to accommodate multiple-choice and numerics
* big chore
* could be buggy
* wrote one (1) test (sorry @ben)
Comment on lines +348 to +357
if q_type == 'numeric':
forecasts = [f for f in forecasts if isinstance(f, list)]

if not forecasts:
return np.nan

cdfs_array = np.array(forecasts, dtype=float)
mean_cdf = np.mean(cdfs_array, axis=0)

return mean_cdf
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@CodexVeritas this is what we're talking about here: https://metaculus.slack.com/archives/C02JGTBC7DJ/p1745932666973469 (may be wrong)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants