Skip to content

Conversation

@f90
Copy link

@f90 f90 commented Mar 27, 2025

The basic pitch model uses min-max normalization of the CQT, meaning that each 2s input audio chunk is normalized so that relative volume differences between audio chunks don't get reflected in the output. This PR adds a fixed normalization scheme and code to replace the min-max normalization layer of the trained model with the global normalization

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant