"gradient estimation" Papers
16 papers found
Conference
Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training
Reza Shirkavand, Peiran Yu, Qi He et al.
NEURIPS 2025arXiv:2502.03604
1
citations
FLOPS: Forward Learning with OPtimal Sampling
Tao Ren, Zishi Zhang, Jinyang Jiang et al.
ICLR 2025arXiv:2410.05966
2
citations
Memory-Reduced Meta-Learning with Guaranteed Convergence
Honglin Yang, Ji Ma, Xiao Yu
AAAI 2025paperarXiv:2412.12030
1
citations
Neural Evolution Strategy for Black-box Pareto Set Learning
Chengyu Lu, Zhenhua Li, Xi Lin et al.
NEURIPS 2025
PseuZO: Pseudo-Zeroth-Order Algorithm for Training Deep Neural Networks
Pengyun Yue, Xuanlin Yang, Mingqing Xiao et al.
NEURIPS 2025
Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations
Shaocong Ma, Heng Huang
ICLR 2025arXiv:2510.19975
12
citations
Soft Merging of Experts with Adaptive Routing
Haokun Liu, Muqeeth Mohammed, Colin Raffel
ICLR 2025arXiv:2306.03745
82
citations
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
Yong Liu, Zirui Zhu, Chaoyu Gong et al.
NEURIPS 2025arXiv:2402.15751
37
citations
Zeroth-Order Methods for Nonconvex Stochastic Problems with Decision-Dependent Distributions
Yuya Hikima, Akiko Takeda
AAAI 2025paperarXiv:2412.20330
3
citations
Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient
Hao Di, Haishan Ye, Yueling Zhang et al.
ICML 2024spotlightarXiv:2405.17761
2
citations
Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization
Sam Reifenstein, Timothee Leleu, Yoshihisa Yamamoto
ICML 2024arXiv:2405.01731
3
citations
Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine Workers
Ron Dorfman, Naseem Yehya, Kfir Levy
ICML 2024arXiv:2402.02951
5
citations
From Fourier to Neural ODEs: Flow Matching for Modeling Complex Systems
Xin Li, Jingdong Zhang, Qunxi Zhu et al.
ICML 2024oralarXiv:2405.11542
7
citations
GFlowNet Training by Policy Gradients
Puhua Niu, Shili Wu, Mingzhou Fan et al.
ICML 2024arXiv:2408.05885
3
citations
On Gradient-like Explanation under a Black-box Setting: When Black-box Explanations Become as Good as White-box
Yi Cai, Gerhard Wunder
ICML 2024arXiv:2308.09381
3
citations
Reparameterized Importance Sampling for Robust Variational Bayesian Neural Networks
Yunfei Long, Zilin Tian, Liguo Zhang et al.
ICML 2024