"gradient estimation" Papers

16 papers found

Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training

Reza Shirkavand, Peiran Yu, Qi He et al.

NEURIPS 2025arXiv:2502.03604
1
citations

FLOPS: Forward Learning with OPtimal Sampling

Tao Ren, Zishi Zhang, Jinyang Jiang et al.

ICLR 2025arXiv:2410.05966
2
citations

Memory-Reduced Meta-Learning with Guaranteed Convergence

Honglin Yang, Ji Ma, Xiao Yu

AAAI 2025paperarXiv:2412.12030
1
citations

Neural Evolution Strategy for Black-box Pareto Set Learning

Chengyu Lu, Zhenhua Li, Xi Lin et al.

NEURIPS 2025

PseuZO: Pseudo-Zeroth-Order Algorithm for Training Deep Neural Networks

Pengyun Yue, Xuanlin Yang, Mingqing Xiao et al.

NEURIPS 2025

Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations

Shaocong Ma, Heng Huang

ICLR 2025arXiv:2510.19975
12
citations

Soft Merging of Experts with Adaptive Routing

Haokun Liu, Muqeeth Mohammed, Colin Raffel

ICLR 2025arXiv:2306.03745
82
citations

Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning

Yong Liu, Zirui Zhu, Chaoyu Gong et al.

NEURIPS 2025arXiv:2402.15751
37
citations

Zeroth-Order Methods for Nonconvex Stochastic Problems with Decision-Dependent Distributions

Yuya Hikima, Akiko Takeda

AAAI 2025paperarXiv:2412.20330
3
citations

Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient

Hao Di, Haishan Ye, Yueling Zhang et al.

ICML 2024spotlightarXiv:2405.17761
2
citations

Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization

Sam Reifenstein, Timothee Leleu, Yoshihisa Yamamoto

ICML 2024arXiv:2405.01731
3
citations

Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine Workers

Ron Dorfman, Naseem Yehya, Kfir Levy

ICML 2024arXiv:2402.02951
5
citations

From Fourier to Neural ODEs: Flow Matching for Modeling Complex Systems

Xin Li, Jingdong Zhang, Qunxi Zhu et al.

ICML 2024oralarXiv:2405.11542
7
citations

GFlowNet Training by Policy Gradients

Puhua Niu, Shili Wu, Mingzhou Fan et al.

ICML 2024arXiv:2408.05885
3
citations

On Gradient-like Explanation under a Black-box Setting: When Black-box Explanations Become as Good as White-box

Yi Cai, Gerhard Wunder

ICML 2024arXiv:2308.09381
3
citations

Reparameterized Importance Sampling for Robust Variational Bayesian Neural Networks

Yunfei Long, Zilin Tian, Liguo Zhang et al.

ICML 2024