@jlamypoirier (Collaborator) commented Jan 29, 2026

✨ Description

Content extracted from #428

Fix and improve LM losses

  • Tweak entropy losses to improve readability.
  • Extract reusable fused_predicted_logits_from_labels from _fused_cross_entropy_base_from_labels.
  • Support arbitrary tensor dimensions in entropy losses.
  • Add TP support for z loss.
  • Fix broken entropy loss tests for loss masking.
  • Generalize and simplify entropy loss tests: add z loss support, improve distributed coverage, and add fp16 and bf16 tests.
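
For readers unfamiliar with the quantities named above, here is a rough reference sketch of cross-entropy from labels, z loss, and loss masking. It is not the fused implementation from this PR: it uses plain Python lists instead of tensors, the function names are hypothetical, and averaging over unmasked tokens is an assumption about the normalization. It takes z loss to be the mean of the squared log-partition function, which is the common definition.

```python
import math


def logsumexp(row):
    """Numerically stable log(sum(exp(x))) over one row of logits."""
    m = max(row)
    return m + math.log(sum(math.exp(x - m) for x in row))


def reference_entropy_and_z_loss(logits, labels, mask=None):
    """Hypothetical reference, not the PR's code.

    logits: one row of vocabulary logits per token (leading dims already flattened).
    labels: one target index per token.
    mask:   optional per-token booleans; False entries are excluded from both losses.
    Returns (cross_entropy, z_loss), each averaged over unmasked tokens (assumption).
    """
    if mask is None:
        mask = [True] * len(labels)
    ce_total, z_total, count = 0.0, 0.0, 0
    for row, label, keep in zip(logits, labels, mask):
        if not keep:
            continue
        log_z = logsumexp(row)
        # Cross-entropy from labels: log Z minus the logit of the target class.
        ce_total += log_z - row[label]
        # z loss penalizes the squared log-partition function.
        z_total += log_z**2
        count += 1
    if count == 0:
        return 0.0, 0.0
    return ce_total / count, z_total / count
```

A reference of this kind is mainly useful for checking a fused or tensor-parallel kernel against a slow but obviously correct computation, including the masked case.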

Miscellaneous changes, including:

  • Fix slow tokenization in tests.
  • Fix some c10d imports.
  • Automate CPU support for safe_barrier, etc.

@jlamypoirier marked this pull request as ready for review January 30, 2026 04:00

2 participants