Prevent FLCE backward when reduction="none" #1126

Draft
nick-knight wants to merge 1 commit into linkedin:main from nick-knight:add_flce_assert

Summary

This PR adds an assertion to prevent the backward pass of FLCE (fused linear cross-entropy loss) from being invoked when reduction="none". This unsupported functionality was previously discussed in #968. This PR also trims some out-of-date text from an existing comment about this functionality gap, and relocates the rest of the comment from the forward pass to the site of the assertion.
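
To illustrate the guard pattern described above, here is a minimal, dependency-free sketch. It is not the code from this PR: `toy_forward`, `toy_backward`, and the `ctx` fields are hypothetical names, and plain Python lists stand in for tensors. The only point is that the forward pass records the `reduction` mode and the backward pass asserts on it before doing any work.

```python
import math
from types import SimpleNamespace


def toy_forward(ctx, logits, targets, reduction="mean"):
    """Toy cross-entropy over lists of floats; records `reduction` on ctx.

    Hypothetical stand-in for a fused forward pass, not the library's API.
    """
    if reduction not in ("mean", "sum", "none"):
        raise ValueError(f"unknown reduction: {reduction!r}")

    losses = []
    for row, target in zip(logits, targets):
        # Numerically stable log-sum-exp for one row of logits.
        row_max = max(row)
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        losses.append(log_sum_exp - row[target])

    # Remember the mode so the backward pass can check it.
    ctx.reduction = reduction

    if reduction == "none":
        return losses
    total = sum(losses)
    return total / len(losses) if reduction == "mean" else total


def toy_backward(ctx, grad_output):
    """Toy backward pass that refuses to run for reduction="none"."""
    # The guard sits at the point where the unsupported path would start.
    assert ctx.reduction != "none", (
        'backward is not supported when reduction="none"'
    )
    # Placeholder only: a real backward would scale stored gradients.
    return grad_output
```

With this sketch, calling `toy_backward` after a forward pass with `reduction="mean"` or `reduction="sum"` proceeds normally, while calling it after `reduction="none"` raises `AssertionError`. Note that Python removes `assert` statements when run with `-O`, so a guard that must always fire would need an explicit `raise` instead.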

Testing Done

I was unable to test this PR according to the contributor guidelines. I saw numerous failures with make test and make test-convergence, and didn't try to debug them. Thus I am leaving this as a Draft PR.

  • Hardware Type: H100-80GB-HBM3
  • run make test to ensure correctness
  • run make checkstyle to ensure code style
  • run make test-convergence to ensure convergence
