"online learning" Papers
69 papers found • Page 1 of 2
Conference
Agnostic Continuous-Time Online Learning
Pramith Devulapalli, Changlong Wu, Ananth Grama et al.
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Zhaowei Zhang, Fengshuo Bai, Qizhi Chen et al.
Asynchronous Distributed Gaussian Process Regression
Zewen Yang, Xiaobing Dai, Sandra Hirche
Conformal Online Learning of Deep Koopman Linear Embeddings
Ben Gao, Jordan Patracone, Stephane Chretien et al.
Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning
Dravyansh Sharma, Alec Sun
Event-Driven Online Vertical Federated Learning
Ganyu Wang, Boyu Wang, Bin Gu et al.
Exploring the Noise Robustness of Online Conformal Prediction
HuaJun Xi, Kangdao Liu, Hao Zeng et al.
Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation
Kim Yong Tan, YUEMING LYU, Ivor Tsang et al.
FCOM: A Federated Collaborative Online Monitoring Framework via Representation Learning
Tanapol Kosolwattana, Huazheng Wang, Raed Al Kontar et al.
Feature-Based Online Bilateral Trade
Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.
Improved Bounds for Swap Multicalibration and Swap Omniprediction
Haipeng Luo, Spandan Senapati, Vatsal Sharan
Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality
Junyan Liu, Ziyun Chen, Kun Wang et al.
Learning-Augmented Algorithms for $k$-median via Online Learning
Anish Hebbar, Rong Ge, Amit Kumar et al.
Lifelong Test-Time Adaptation via Online Learning in Tracked Low-Dimensional Subspace
Dexin Duan, Rui Xu, Peilin Liu et al.
LLMs Are In-Context Bandit Reinforcement Learners
Giovanni Monea, Antoine Bosselut, Kianté Brantley et al.
Longhorn: State Space Models are Amortized Online Learners
Bo Liu, Rui Wang, Lemeng Wu et al.
Markov Persuasion Processes: Learning to Persuade From Scratch
Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni et al.
Mixture of Online and Offline Experts for Non-Stationary Time Series
Zhilin Zhao, Longbing Cao, Yuanyu Wan
Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets
Zixian Yang, Sushil Varma, Lei Ying
Non-Stationary Dueling Bandits Under a Weighted Borda Criterion
Joe Suk, Arpit Agarwal
Offline-to-Online Hyperparameter Transfer for Stochastic Bandits
Dravyansh Sharma, Arun Suggala
Online Learning in the Repeated Mediated Newsvendor Problem
Nataša Bolić, Tom Cesari, Roberto Colomboni et al.
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian, Arash Nasr-Esfahany, Malte Schwarzkopf et al.
Online robust locally differentially private learning for nonparametric regression
Chenfei Gu, Qiangqiang Zhang, Ting Li et al.
On the Universal Near Optimality of Hedge in Combinatorial Settings
Zhiyuan Fan, Arnab Maiti, Lillian Ratliff et al.
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.
PhiNets: Brain-inspired Non-contrastive Learning Based on Temporal Prediction Hypothesis
Satoki Ishikawa, Makoto Yamada, Han Bao et al.
Prediction with expert advice under additive noise
Alankrita Bhatt, Victoria Kostina
Replicable Online Learning
Saba Ahmadi, Siddharth Bhandari, Avrim Blum
ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams
Chris Dongjoo Kim, Jihwan Moon, Sangwoo Moon et al.
Robust Contextual Pricing
Anupam Gupta, Guru Guruganesh, Renato Leme et al.
Statistical Parity with Exponential Weights
Stephen Pasteris, Chris Hicks, Vasilios Mavroudis
Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts
Chiao-An Yang, Kuan-Chuan Peng, Raymond A. Yeh
Tradeoffs between Mistakes and ERM Oracle Calls in Online and Transductive Online Learning
Idan Attias, Steve Hanneke, Arvind Ramaswami
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Thomy Phan, Taoan Huang, Bistra Dilkina et al.
Adaptive Online Experimental Design for Causal Discovery
Muhammad Qasim Elahi, Lai Wei, Murat Kocaoglu et al.
Adaptive Robust Learning using Latent Bernoulli Variables
Aleksandr Karakulev, Dave Zachariah, Prashant Singh
A General Online Algorithm for Optimizing Complex Performance Metrics
Wojciech Kotlowski, Marek Wydmuch, Erik Schultheis et al.
Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Nikhil Vyas, Depen Morwani, Rosie Zhao et al.
Conformalized Adaptive Forecasting of Heterogeneous Trajectories
Yanfei Zhou, Lars Lindemann, Matteo Sesia
Designing Decision Support Systems using Counterfactual Prediction Sets
Eleni Straitouri, Manuel Gomez-Rodriguez
Doubly Perturbed Task Free Continual Learning
Byung Hyun Lee, Min-hwan Oh, Se Young Chun
Efficient Learning in Polyhedral Games via Best-Response Oracles
Darshan Chakrabarti, Gabriele Farina, Christian Kroer
Efficient Online Set-valued Classification with Bandit Feedback
Zhou Wang, Xingye Qiao
Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Ioannis Maniadis Metaxas, Georgios Tzimiropoulos, ioannis Patras
Factored-Reward Bandits with Intermediate Observations
Marco Mussi, Simone Drago, Marcello Restelli et al.
Federated Combinatorial Multi-Agent Multi-Armed Bandits
Fares Fourati, Mohamed-Slim Alouini, Vaneet Aggarwal
Graph2Tac: Online Representation Learning of Formal Math Concepts
Lasse Blaauwbroek, Mirek Olšák, Jason Rute et al.
High-dimensional Linear Bandits with Knapsacks
Wanteng Ma, Dong Xia, Jiashuo Jiang
Imitation Learning in Discounted Linear MDPs without exploration assumptions
Luca Viano, EFSTRATIOS PANTELEIMON SKOULAKIS, Volkan Cevher