Hi, I have a few questions regarding Nemotron Nano Omni and would like to raise them as an issue.
- Are there any plans to support async GRPO for Nano-Omni (including non-colocated setups and in-flight weight updates)?
- Related to (1): when I tried async GRPO with Nemo-RL on my own in a non-colocated setting, cross-node communication of pixel_values was extremely heavy. Since Ray does not yet seem to support IB RDMA, I ran into significant difficulties scaling to multi-node. Have you encountered the same issue, and are there any plans to address it?
- My understanding is that R3 (Routing Replay; https://arxiv.org/abs/2510.11370), which is supported in Verl and slime, has not yet been implemented in Nemo-RL. Are there plans to support it?
- Are there plans to support fully async training, including partial rollouts?
Thanks in advance!
Hi, I have a few questions regarding Nemotron Nano Omni and would like to raise them as an issue.
Thanks in advance!