Poster "policy gradient optimization" Papers
3 papers found
Conference
Collaborative Discrete-Continuous Black-Box Prompt Learning for Language Models
Hualin Zhang, Haozhen Zhang, Zhekai Liu et al.
ICLR 2025
Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts
Onur Celik, Aleksandar Taranovic, Gerhard Neumann
ICML 2024arXiv:2403.06966
16
citations
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
Tianchen Zhou, Hairi, Haibo Yang et al.
ICML 2024arXiv:2405.03082
3
citations