"policy gradient optimization" Papers
4 papers found
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang, Zuxuan Wu, Zhen Xing et al.
AAAI 2025, arXiv:2311.14768
20 citations
Collaborative Discrete-Continuous Black-Box Prompt Learning for Language Models
Hualin Zhang, Haozhen Zhang, Zhekai Liu et al.
ICLR 2025
Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts
Onur Celik, Aleksandar Taranovic, Gerhard Neumann
ICML 2024, arXiv:2403.06966
16 citations
Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
Tianchen Zhou, Hairi, Haibo Yang et al.
ICML 2024, arXiv:2405.03082
3 citations