Poster "monte carlo estimation" Papers
5 papers found
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
Yiran Guo, Lijie Xu, Jie Liu et al.
NeurIPS 2025, arXiv:2505.23564
18 citations
VinePPO: Refining Credit Assignment in RL Training of LLMs
Amirhossein Kazemnejad, Milad Aghajohari, Eva Portelance et al.
ICML 2025, arXiv:2410.01679
56 citations
Model-Based Minimum Bayes Risk Decoding for Text Generation
Yuu Jinnai, Tetsuro Morimura, Ukyo Honda et al.
ICML 2024, arXiv:2311.05263
9 citations
Sliced-Wasserstein Estimation with Spherical Harmonics as Control Variates
Rémi Leluc, Aymeric Dieuleveut, François Portier et al.
ICML 2024, arXiv:2402.01493
9 citations
Sliced Wasserstein with Random-Path Projecting Directions
Khai Nguyen, Shujian Zhang, Tam Le et al.
ICML 2024, arXiv:2401.15889
17 citations