Papers by Harshit Sikchi
5 papers found
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu, Shuozhe Li, Harshit Sikchi et al.
ICLR 2025, arXiv:2504.13368, 3 citations
Proto Successor Measure: Representing the Behavior Space of an RL Agent
Siddhant Agarwal, Harshit Sikchi, Peter Stone et al.
ICML 2025, arXiv:2411.19418, 8 citations
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning
Joey Hejna, Rafael Rafailov, Harshit Sikchi et al.
ICLR 2024
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit Sikchi, Qinqing Zheng, Amy Zhang et al.
ICLR 2024 (spotlight), arXiv:2302.08560, 41 citations
Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit Sikchi, Rohan Chitnis, Ahmed Touati et al.
ICLR 2024, arXiv:2311.02013, 14 citations