@NickCEBM NickCEBM commented Apr 9, 2025

The function cleanup_dataset, which runs post-scrape during data handling, is breaking because of bad date data. A date (not sure which trial yet) is showing up as 202-04-30, which pd.to_datetime can't parse.

If we can find the trial, we might be able to make a good guess at what that date should be, but that is annoying and finicky. I'm therefore comfortable using the built-in error handling in pandas to coerce a broken date to NaT. A broken date might as well not exist for us, so let's treat it that way.
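
For reference, a minimal sketch of the coercion described above. The body of cleanup_dataset isn't shown in this PR, so the helper name `coerce_dates`, the column names `trial_id` and `completion_date`, and the sample rows are all made up for illustration; only `pd.to_datetime(..., errors="coerce")` is the actual pandas behaviour being relied on.

```python
import pandas as pd


def coerce_dates(df: pd.DataFrame, column: str) -> pd.DataFrame:
    """Hypothetical helper: parse a date column, turning unparseable values into NaT."""
    out = df.copy()
    # errors="coerce" replaces anything pandas cannot parse with NaT
    # instead of raising, so one malformed date no longer stops the run.
    out[column] = pd.to_datetime(out[column], errors="coerce")
    return out


# Illustrative data only: the second row mimics the malformed 202-04-30 value.
trials = pd.DataFrame(
    {
        "trial_id": ["T001", "T002", "T003"],
        "completion_date": ["2020-04-30", "202-04-30", "2021-01-15"],
    }
)

cleaned = coerce_dates(trials, "completion_date")

# Rows that had a raw value but ended up as NaT are the broken dates,
# which is one way to track down the offending trial later.
was_coerced = cleaned["completion_date"].isna() & trials["completion_date"].notna()
bad_rows = trials.loc[was_coerced]
```

Comparing the raw and coerced columns, as `bad_rows` does, would also give us the trial behind the broken date if we ever want to fix it at the source rather than drop it.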

@NickCEBM NickCEBM requested a review from inglesp April 9, 2025 13:11
@NickCEBM NickCEBM merged commit 3f12cd4 into master Apr 9, 2025
1 check passed
@NickCEBM NickCEBM deleted the Broken-Date-Fix branch April 9, 2025 13:54