Pull requests: PaddlePaddle/FastDeploy
⚡ Bolt: Memoize module availability and device properties lookups
contributor
External developers
#7525
opened Apr 20, 2026 by
google-labs-jules
bot
[Cherry-Pick][BugFix] Fix clear_parameters hang issue in MTP during weight cleanup in RL (#7522)
#7523
opened Apr 20, 2026 by
Deleter-D
Collaborator
3 of 5 tasks
[BugFix] Fix clear_parameters hang issue in MTP during weight cleanup in RL
#7522
opened Apr 20, 2026 by
Deleter-D
Collaborator
2 of 5 tasks
[Optimization] Support async D2H copy for MTP logprobs & Clean up overlap schedule condition checks
#7521
opened Apr 20, 2026 by
Sunny-bot1
Collaborator
5 tasks
[Optimize] Import performance while enable cpu prefix caching
#7520
opened Apr 20, 2026 by
Jiang-Jia-Jun
Collaborator
Draft
5 tasks
[XPU] add support for rope3d
XPU
#7518
opened Apr 20, 2026 by
RuohengMa
Contributor
5 tasks
[Cherry-Pick] [FDConfig] Unify num_experts_per_tok to moe_k in ModelConfig for MoE model compatibility(#7509)
#7517
opened Apr 20, 2026 by
xyxinyang
Collaborator
5 tasks done
[Cherry-Pick][Optimization]Change default workers and max-concurrency when launch api-server(#7457)
#7516
opened Apr 20, 2026 by
EmmonsCurse
Collaborator
5 tasks
[XPU] improve attn precision
XPU
#7515
opened Apr 20, 2026 by
lizan1999
Contributor
5 tasks
[CI] Disable auto CUDA arch injection to avoid duplicate gencode flags
#7513
opened Apr 20, 2026 by
EmmonsCurse
Collaborator
5 tasks done
[Feature]【Hackathon 10th Spring No.47】Add MiniMax-M1 integration tests and multi-GPU support
contributor
External developers
#7511
opened Apr 20, 2026 by
bobby-cloudforge
4 tasks done
[Feature]【Hackathon 10th Spring No.47】Add MiniMax-M1 model for FastDeploy
contributor
External developers
#7510
opened Apr 20, 2026 by
bobby-cloudforge
9 tasks done
[FDConfig] Unify num_experts_per_tok to moe_k in ModelConfig for MoE model compatibility
#7509
opened Apr 20, 2026 by
xyxinyang
Collaborator
5 tasks done
【Hackathon 10th Spring No.50】Add MiniCPM4.1-8B model support with μP scaling
contributor
External developers
#7506
opened Apr 20, 2026 by
bobby-cloudforge
5 tasks done
[Feature]【Hackathon 10th Spring No.48】SD3 and Flux diffusion model implementation
contributor
External developers
#7505
opened Apr 20, 2026 by
bobby-cloudforge
11 of 13 tasks
[CI]【Hackathon 10th Spring No.45-part2】Add SM75/SM80 compile guards for cutlass and MoE tail ops
contributor
External developers
#7504
opened Apr 20, 2026 by
bobby-cloudforge
5 tasks done
【Hackathon 10th Spring No.46】Add Windows platform guards for Python runtime (Part 3/3)
contributor
External developers
#7503
opened Apr 20, 2026 by
bobby-cloudforge
4 tasks done
【Hackathon 10th Spring No.46】Add Windows build system support (Part 2/3)
contributor
External developers
#7502
opened Apr 20, 2026 by
bobby-cloudforge
4 tasks done
【Hackathon 10th Spring No.46】Add Windows C++ compile guards (Part 1/3)
contributor
External developers
#7501
opened Apr 20, 2026 by
bobby-cloudforge
5 tasks done
[PD] Fix PD interaction and error response
#7500
opened Apr 20, 2026 by
juncaipeng
Collaborator
[Scheduler][BugFix] Fix token_budget calculation to use actual decode request count
cherry-pick: release/2.5
cherry-pick: release/2.6
#7499
opened Apr 20, 2026 by
kevincheng2
Collaborator
2 of 4 tasks
[Loader] Add values natural order check to layers grouped validation
#7498
opened Apr 20, 2026 by
bukejiyu
Collaborator
1 of 5 tasks
[RL][Cherry-Pick] Fix the out-of-bounds issue caused by int32 in the R3 kernel
#7496
opened Apr 20, 2026 by
gongshaotian
Collaborator
5 tasks done
[Iluvatar] Fix cannot import name mtp_save_first_token
contributor
External developers
#7495
opened Apr 20, 2026 by
wuyujiji
Contributor
5 tasks done