"downstream task optimization" Papers
2 papers found
Conference
RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs
Jiaxing Wu, Lin Ning, Luyang Liu et al.
AAAI 2025paperarXiv:2409.04421
7
citations
RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression
Uri Gadot, Shie Mannor, Assaf Shocher et al.
CVPR 2025arXiv:2501.12216
3
citations