Papers by Gopeshh Raaj Subbaraj
2 papers found
Conference
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Matt Riemer, Gopeshh Raaj Subbaraj, Glen Berseth et al.
ICLR 2025, arXiv:2412.14355
6 citations
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Md Rifat Arefin, Gopeshh Raaj Subbaraj, Nicolas Gontier et al.
ICLR 2025, arXiv:2411.02344
5 citations