Pull requests: NVIDIA-NeMo/RL
ci: Bump Megatron-Bridge to 97e8dba
CI:L1
Run doctests, unit tests, and functional tests
#3025
opened Jul 1, 2026 by
svcnvidia-nemo-ci
Contributor
cp: feat: R3 gym notq router replay (2915) into r0.7.0
cherry-pick
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#3023
opened Jul 1, 2026 by
svcnvidia-nemo-ci
Contributor
fix: (qwen3-omni) disable vLLM sequence_parallel_chunk custom op
r0.7.0
#3021
opened Jul 1, 2026 by
yuekaizhang
Contributor
1 task
cp: feat(xtoken): multi-teacher support for cross-tokenizer off-policy distillation (2797) into r0.7.0
cherry-pick
CI:L1
Run doctests, unit tests, and functional tests
Documentation
Improvements or additions to documentation
#3019
opened Jul 1, 2026 by
svcnvidia-nemo-ci
Contributor
fix: Keep QARL nightlies schedulable on H100 nodes
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
r0.7.0
#3018
opened Jul 1, 2026 by
mxinO
Contributor
4 tasks
ci: fix grpo-nanov3-30BA3B-2n8g-megatron_generation.yaml config
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
r0.7.0
#3017
opened Jul 1, 2026 by
ashors1
Contributor
4 tasks
ci: Add nightly GB200 super test
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#3016
opened Jun 30, 2026 by
ashors1
Contributor
4 tasks
fix: reduce NUM_MINUTES to 240 in dapo-deepseek-v3-64n8g.v2 test
#3014
opened Jun 30, 2026 by
kajalj22
Contributor
1 task
fix: stabilize GB200 16n4g issue-2579 repro
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
#3013
opened Jun 30, 2026 by
jinglinglingling
Contributor
fix(perf): fix CUDA OOM in grpo-qwen3-30ba3b-4n8g performance test
CI:L1
Run doctests, unit tests, and functional tests
r0.7.0
#3005
opened Jun 30, 2026 by
NolenLiang
Contributor
feat(grpo): stream rollout batches by prompt group
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
Documentation
Improvements or additions to documentation
#3000
opened Jun 30, 2026 by
yfw
Contributor
4 tasks
feat(vllm): MTP speculative-decoding inference with refit drafter
#2999
opened Jun 30, 2026 by
yfw
Contributor
4 tasks
refactor(sglang): allocate ports from reserved ranges
community-request
waiting-on-customer
Waiting on the original author to respond
#2997
opened Jun 29, 2026 by
xiuhu17
Contributor
feat: Dynamo K8s Integration
Documentation
Improvements or additions to documentation
#2990
opened Jun 29, 2026 by
jthomson04
Contributor
4 tasks
fix(megatron): prefer nvrx strategy for HF import saves
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
r0.7.0
#2989
opened Jun 29, 2026 by
jinglinglingling
Contributor
feat(modelopt): support real NVFP4 QAT rollout for MoE and Mamba
CI:L1
Run doctests, unit tests, and functional tests
Documentation
Improvements or additions to documentation
Feature
#2983
opened Jun 29, 2026 by
HollowMan6
Member
Draft
4 tasks done
refactor(ppo): reuse LossPostProcessor in value worker + add offload_to_cpu
CI:L1
Run doctests, unit tests, and functional tests
#2980
opened Jun 28, 2026 by
bg51717
Contributor
3 of 4 tasks
fix: complete truncation in docs/conf.py html_theme_options (closes #2816)
community-request
Documentation
Improvements or additions to documentation
waiting-on-maintainers
Waiting on maintainers to respond
#2979
opened Jun 28, 2026 by
botbikamordehai2-sketch
Use JSON for vLLM env list parsing
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2978
opened Jun 28, 2026 by
fallintoplace
3 of 4 tasks
Avoid leaking env vars across Ray worker groups
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2977
opened Jun 28, 2026 by
fallintoplace
3 of 4 tasks
Fix setuptools package discovery for nemo_rl subpackages
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2976
opened Jun 28, 2026 by
fallintoplace
3 of 4 tasks