Pull requests: THUDM/slime

Pull requests list

Preserve reloadable process group options
#2095 opened Jun 17, 2026 by EazyReal Contributor Draft
fix(scripts): correct model config source path in FP8 low_precision scripts
#2094 opened Jun 17, 2026 by aoshen02 Contributor
2 tasks done
Disk-level delta weight sync
#2089 opened Jun 16, 2026 by nanjiangwill Collaborator
fix(opd): score teacher logprobs at rollout temperature, not 0
#2085 opened Jun 15, 2026 by EazyReal Contributor
feat(rl): add REINFORCE advantage estimator
#2083 opened Jun 15, 2026 by EazyReal Contributor
feat(coding_agent_rl): add SWE-bench harness evaluation path
#2079 opened Jun 15, 2026 by aoshen02 Contributor Draft
3 tasks
fix(rollout): isolate per-trajectory exceptions in generate_and_rm_group
#2078 opened Jun 15, 2026 by aoshen02 Contributor
fix(script): correct GLM-4.7 expert_model_parallel_size for single-node 8 GPU
#2077 opened Jun 15, 2026 by aoshen02 Contributor
1 task
Support Qwen3.5-VL (dense + MoE) via Megatron-Bridge
#2075 opened Jun 14, 2026 by demouo Contributor
feat(rollouts) external rollouts endpoint with publish-only weight sync
#2071 opened Jun 12, 2026 by jvmncs
4 tasks done
fix(sglang): authenticate engine control-plane and router calls
#2068 opened Jun 12, 2026 by EazyReal Contributor
fix(metrics): make compute_pass_rate ragged-safe for over-sampled batches
#2064 opened Jun 12, 2026 by EazyReal Contributor
fix(rollout): apply rollout sample filter in the rollout manager
#2061 opened Jun 12, 2026 by EazyReal Contributor
[DON'T MERGE] run CI run-ci-megatron
#2053 opened Jun 11, 2026 by zhuzilin Contributor