-
Notifications
You must be signed in to change notification settings - Fork 507
Pull requests: AI-Hypercomputer/maxtext
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[New Model Bringup] Initial Commit to enable Text-only architecture for Qwen3.5
#3712
opened Apr 21, 2026 by
Rohan-Bierneni
Collaborator
Loading…
4 tasks done
Add unit tests for param_mapping.py
#3711
opened Apr 21, 2026 by
bvandermoon
Collaborator
Loading…
4 tasks done
Remove deprecated cross-program prefetch flags.
#3710
opened Apr 21, 2026 by
copybara-service
Bot
Loading…
Save run manifest for distillation reproducibility.
gemini-review
#3709
opened Apr 21, 2026 by
gagika
Collaborator
Loading…
4 tasks done
Add optional --skip-validation flag to benchmark recipes and XPK workload creation
#3708
opened Apr 21, 2026 by
RUEI4341
Contributor
Loading…
4 tasks done
distillation: resume + xpk launcher + metrics refactor
gemini-review
pull ready
#3701
opened Apr 20, 2026 by
gagika
Collaborator
Loading…
4 tasks done
fix masking error when using mlp_bias=True causing NaN during gradien…
#3699
opened Apr 19, 2026 by
snehalv2002
Collaborator
Loading…
4 tasks done
[Distillation] base learn-to-init llama attention for distillation
#3688
opened Apr 17, 2026 by
vlad-karp
Collaborator
Loading…
4 tasks done
[DO NOT REVIEW] Update generated requirements for post-training with JAX 0.9.2
#3684
opened Apr 16, 2026 by
SurbhiJainUSC
Collaborator
•
Draft
4 tasks done
docs: add LoRA tutorial and layer customization guide
#3682
opened Apr 16, 2026 by
chiajunglien
•
Draft
4 tasks
[Inference] Diverse Beam Search Integration
#3681
opened Apr 16, 2026 by
yipkingster
Loading…
5 tasks done
Add MoE load balancing loss to distillation
#3679
opened Apr 16, 2026 by
JamesDeng42
Collaborator
Loading…
4 tasks done
Adds ability to pass
chat_template_path argument into MaxText SFT, loading a separate chat_template from the tokenizer that's provided.
#3675
opened Apr 15, 2026 by
copybara-service
Bot
Loading…
4 tasks done
Make all links internal (where possible)
#3671
opened Apr 15, 2026 by
melissawm
Collaborator
Loading…
1 task done
Previous Next
ProTip!
Follow long discussions with comments:>50.