Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add zero1 aot support in train compile
#3872 opened May 11, 2026 by NuojCheng Collaborator Draft
4 tasks done
Implement custom MoE HashRouter, TopKRouter, and sqrtsoftplus
#3871 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
Conditionally branch tokamax.ragged_dot calls based on use_manual_quantization
#3869 opened May 11, 2026 by zxhe-sean Collaborator Loading…
4 tasks done
Fix ckpt conversion for qwen3-moe models (transformers==5.8.0)
#3868 opened May 11, 2026 by hengtaoguo Collaborator Loading…
4 tasks done
DeepSeek V4 Integration
#3867 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
Implement DeepSeek-V4 Compressed Attention Layers
#3866 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
DeepSeek-V4 Core Primitives
#3865 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
[DeepSeek v3] Add grad mask and update MLA init gemini-review
#3864 opened May 10, 2026 by gagika Collaborator Loading…
4 tasks done
Enable Qwen3-Omni SFT on ChartQA
#3863 opened May 10, 2026 by hengtaoguo Collaborator Draft
4 tasks
Refactor: extract shared post-training hooks and update SFT implementation
#3862 opened May 10, 2026 by igorts-git Collaborator Loading…
4 tasks done
Implement automated CI failure investigator gemini-review
#3861 opened May 9, 2026 by shralex Collaborator Loading…
4 tasks done
Optimize MaxText unit and integration test suite runtime
#3860 opened May 9, 2026 by shralex Collaborator Loading…
4 tasks done
Update optimization docs and add TPU v7x guide
#3857 opened May 8, 2026 by jacoguzo Collaborator Loading…
4 tasks done
Improve error message when tokenize_*_data config doesn't match dataset gemini-review
#3856 opened May 8, 2026 by aireenmei Collaborator Loading…
4 tasks done
Update docker image guide
#3855 opened May 8, 2026 by melissawm Collaborator Loading…
1 task done
Update JAX to 0.10.0 for pre-training
#3854 opened May 8, 2026 by SurbhiJainUSC Collaborator Draft
4 tasks done
Update First Run tutorial
#3853 opened May 8, 2026 by melissawm Collaborator Loading…
1 task done
Trigger tests using PR comments
#3850 opened May 8, 2026 by shralex Collaborator Loading…
4 tasks done
Simplify maybe_initialize_jax_distributed_system()
#3847 opened May 7, 2026 by SurbhiJainUSC Collaborator Loading…
4 tasks done
[NNX] NNX migration prep (4.5/N): Linen<->NNX checkpoint converter
#3843 opened May 7, 2026 by ecnal-cienet Collaborator Draft
4 tasks done
Improve eval metrics logging gemini-review
#3840 opened May 7, 2026 by aireenmei Collaborator Loading…
4 tasks done
[Qwen3.5] Onboard model to Checkpointing Util & Verify Correctness
#3839 opened May 7, 2026 by Rohan-Bierneni Collaborator Loading…
4 tasks done
ProTip! Exclude everything labeled bug with -label:bug.