Poster "online learning" Papers

50 papers found

Agnostic Continuous-Time Online Learning

Pramith Devulapalli, Changlong Wu, Ananth Grama et al.

NEURIPS 2025

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

Zhaowei Zhang, Fengshuo Bai, Qizhi Chen et al.

ICLR 2025, arXiv:2502.19148
20 citations

Conformal Online Learning of Deep Koopman Linear Embeddings

Ben Gao, Jordan Patracone, Stephane Chretien et al.

NEURIPS 2025, arXiv:2511.12760
1 citation

Event-Driven Online Vertical Federated Learning

Ganyu Wang, Boyu Wang, Bin Gu et al.

ICLR 2025, arXiv:2506.14911
1 citation

Exploring the Noise Robustness of Online Conformal Prediction

HuaJun Xi, Kangdao Liu, Hao Zeng et al.

NEURIPS 2025, arXiv:2501.18363
2 citations

Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation

Kim Yong Tan, Yueming Lyu, Ivor Tsang et al.

ICLR 2025, arXiv:2502.01692
3 citations

Feature-Based Online Bilateral Trade

Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.

ICLR 2025, arXiv:2405.18183
5 citations

Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality

Junyan Liu, Ziyun Chen, Kun Wang et al.

NEURIPS 2025, arXiv:2505.18828
1 citation

Learning-Augmented Algorithms for $k$-median via Online Learning

Anish Hebbar, Rong Ge, Amit Kumar et al.

NEURIPS 2025

Lifelong Test-Time Adaptation via Online Learning in Tracked Low-Dimensional Subspace

Dexin Duan, Rui Xu, Peilin Liu et al.

NEURIPS 2025

Longhorn: State Space Models are Amortized Online Learners

Bo Liu, Rui Wang, Lemeng Wu et al.

ICLR 2025, arXiv:2407.14207
31 citations

Markov Persuasion Processes: Learning to Persuade From Scratch

Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni et al.

NEURIPS 2025, arXiv:2402.03077
9 citations

Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets

Zixian Yang, Sushil Varma, Lei Ying

NEURIPS 2025, arXiv:2510.14097

Non-Stationary Dueling Bandits Under a Weighted Borda Criterion

Joe Suk, Arpit Agarwal

ICLR 2025, arXiv:2403.12950
2 citations

Online Learning in the Repeated Mediated Newsvendor Problem

Nataša Bolić, Tom Cesari, Roberto Colomboni et al.

NEURIPS 2025

Online Reinforcement Learning in Non-Stationary Context-Driven Environments

Pouya Hamadanian, Arash Nasr-Esfahany, Malte Schwarzkopf et al.

ICLR 2025, arXiv:2302.02182
3 citations

Online robust locally differentially private learning for nonparametric regression

Chenfei Gu, Qiangqiang Zhang, Ting Li et al.

NEURIPS 2025

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.

ICLR 2025, arXiv:2410.02275
6 citations

Prediction with expert advice under additive noise

Alankrita Bhatt, Victoria Kostina

NEURIPS 2025

Replicable Online Learning

Saba Ahmadi, Siddharth Bhandari, Avrim Blum

NEURIPS 2025, arXiv:2411.13730
3 citations

ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams

Chris Dongjoo Kim, Jihwan Moon, Sangwoo Moon et al.

CVPR 2025, arXiv:2504.14875
1 citation

Robust Contextual Pricing

Anupam Gupta, Guru Guruganesh, Renato Leme et al.

NEURIPS 2025

Statistical Parity with Exponential Weights

Stephen Pasteris, Chris Hicks, Vasilios Mavroudis

NEURIPS 2025

Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts

Chiao-An Yang, Kuan-Chuan Peng, Raymond A. Yeh

ICCV 2025, arXiv:2507.16946

Adaptive Robust Learning using Latent Bernoulli Variables

Aleksandr Karakulev, Dave Zachariah, Prashant Singh

ICML 2024, arXiv:2312.00585
1 citation

A General Online Algorithm for Optimizing Complex Performance Metrics

Wojciech Kotlowski, Marek Wydmuch, Erik Schultheis et al.

ICML 2024

Conformalized Adaptive Forecasting of Heterogeneous Trajectories

Yanfei Zhou, Lars Lindemann, Matteo Sesia

ICML 2024, arXiv:2402.09623
12 citations

Efficient Online Set-valued Classification with Bandit Feedback

Zhou Wang, Xingye Qiao

ICML 2024, arXiv:2405.04393
1 citation

Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing

Ioannis Maniadis Metaxas, Georgios Tzimiropoulos, Ioannis Patras

ECCV 2024, arXiv:2407.11168
2 citations

Factored-Reward Bandits with Intermediate Observations

Marco Mussi, Simone Drago, Marcello Restelli et al.

ICML 2024

Federated Combinatorial Multi-Agent Multi-Armed Bandits

Fares Fourati, Mohamed-Slim Alouini, Vaneet Aggarwal

ICML 2024, arXiv:2405.05950
8 citations

Graph2Tac: Online Representation Learning of Formal Math Concepts

Lasse Blaauwbroek, Mirek Olšák, Jason Rute et al.

ICML 2024, arXiv:2401.02949
15 citations

High-dimensional Linear Bandits with Knapsacks

Wanteng Ma, Dong Xia, Jiashuo Jiang

ICML 2024, arXiv:2311.01327

Imitation Learning in Discounted Linear MDPs without exploration assumptions

Luca Viano, Efstratios Panteleimon Skoulakis, Volkan Cevher

ICML 2024, arXiv:2405.02181
8 citations

Learning from One Continuous Video Stream

Joao Carreira, Michael King, Viorica Patraucean et al.

CVPR 2024, arXiv:2312.00598
10 citations

Locality Sensitive Sparse Encoding for Learning World Models Online

Zichen Liu, Chao Du, Wee Sun Lee et al.

ICLR 2024, arXiv:2401.13034
18 citations

Monotone Individual Fairness

Yahav Bechavod

ICML 2024, arXiv:2403.06812
3 citations

Nash Incentive-compatible Online Mechanism Learning via Weakly Differentially Private Online Learning

Joon Suk Huh, Kirthevasan Kandasamy

ICML 2024, arXiv:2407.04898
2 citations

Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization

Kwang-Sung Jun, Jungtaek Kim

ICML 2024, arXiv:2402.07341
4 citations

Online Cascade Learning for Efficient Inference over Streams

Lunyiu Nie, Zhimin Ding, Erdong Hu et al.

ICML 2024, arXiv:2402.04513
16 citations

Online Isolation Forest

Filippo Leveni, Guilherme Weigert Cassales, Bernhard Pfahringer et al.

ICML 2024, arXiv:2505.09593
3 citations

Online Learning in Betting Markets: Profit versus Prediction

Haiqing Zhu, Alexander Soen, Yun Kuen Cheung et al.

ICML 2024, arXiv:2406.04062
1 citation

Online Learning in CMDPs: Handling Stochastic and Adversarial Constraints

Francesco Emanuele Stradi, Jacopo Germano, Gianmarco Genalti et al.

ICML 2024

Online Learning under Budget and ROI Constraints via Weak Adaptivity

Matteo Castiglioni, Andrea Celli, Christian Kroer

ICML 2024, arXiv:2302.01203
9 citations

Online Learning with Bounded Recall

Jon Schneider, Kiran Vodrahalli

ICML 2024, arXiv:2205.14519
1 citation

Online Matrix Completion: A Collaborative Approach with Hott Items

Dheeraj Baby, Soumyabrata Pal

ICML 2024, arXiv:2408.05843

Online Variational Sequential Monte Carlo

Alessandro Mastrototaro, Jimmy Olsson

ICML 2024, arXiv:2312.12616
4 citations

Performative Prediction with Bandit Feedback: Learning through Reparameterization

Yatong Chen, Wei Tang, Chien-Ju Ho et al.

ICML 2024, arXiv:2305.01094
12 citations

T4P: Test-Time Training of Trajectory Prediction via Masked Autoencoder and Actor-specific Token Memory

Daehee Park, Jaeseok Jeong, Sung-Hoon Yoon et al.

CVPR 2024, arXiv:2403.10052
18 citations

Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook et al.

ICML 2024, arXiv:2402.01567
22 citations