α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Piotr Milos
Piotr Milos
3
papers
70
total citations
papers (3)
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
ICML 2024
arXiv
41
citations
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
ICML 2024
arXiv
26
citations
Contrastive Representations for Temporal Reasoning
NEURIPS 2025
arXiv
3
citations