Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add checkpoint deletion options to configuration and Orbax checkpoint manager
#3488 opened Mar 23, 2026 by awonak Loading…
3 of 4 tasks
Fix: Upload docker images on pypi release
#3487 opened Mar 23, 2026 by SurbhiJainUSC Loading…
4 tasks done
[Test] Rope change
#3486 opened Mar 23, 2026 by RissyRan Draft
4 tasks
Add option to replicate attn weights on FSDP or EP
#3480 opened Mar 21, 2026 by gobbleturk Loading…
[Distillation] Fix eod masking + strategy refactoring pull ready
#3478 opened Mar 20, 2026 by vlad-karp Loading…
4 tasks done
Split logical names in moe module pull ready
#3473 opened Mar 20, 2026 by NuojCheng Loading…
4 tasks done
Fix DiLoCo training compatibility issues
#3471 opened Mar 20, 2026 by khatwanimohit Loading…
4 tasks done
Fix param mapping for Qwen2 gemini-review
#3466 opened Mar 20, 2026 by ChingTsai Loading…
4 tasks done
Fix src/MaxText references in GPU/runner Dockerfiles pull ready
#3462 opened Mar 20, 2026 by bvandermoon Loading…
4 tasks done
maxtext/tunix lora integration [WIP]
#3453 opened Mar 18, 2026 by andytwigg Draft
4 tasks
Add simplified APIs for model obtaining maxtext models
#3450 opened Mar 18, 2026 by A9isha Loading…
4 tasks done
modify process_data to generate separate user/system parts in prompts
#3445 opened Mar 18, 2026 by andytwigg Loading…
4 tasks done
let tunix see lr hyperparams for logging pull ready
#3444 opened Mar 18, 2026 by andytwigg Loading…
4 tasks done
NNX train
#3442 opened Mar 18, 2026 by charlesli640 Draft
4 tasks done
Add abort_on_nan_loss and abort_on_inf_loss options
#3440 opened Mar 18, 2026 by Steboss Loading…
4 tasks done
Added SFT Pre-Processing for Grain Input Pipeline
#3437 opened Mar 18, 2026 by ajkv-google Loading…
4 tasks done
Use non-tokamax group sizes for expert parallelism
#3436 opened Mar 18, 2026 by xuefgu Draft
4 tasks done
Use Tokamax's representative group sizes.
#3434 opened Mar 17, 2026 by copybara-service bot Loading…
ProTip! What’s not been updated in a month: updated:<2026-02-24.