Poster "reward evaluation" Papers
2 papers found
Conference
Policy Gradient with Kernel Quadrature
Tetsuro Morimura, Satoshi Hayakawa
ICLR 2025arXiv:2310.14768
1
citations
Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization
Hyeonah Kim, Minsu Kim, Sungsoo Ahn et al.
ICML 2024arXiv:2306.01276
9
citations