Poster "gradient propagation" Papers
3 papers found
Conference
DS-VLM: Diffusion Supervision Vision Language Model
Zhen Sun, Yunhang Shen, Jie Li et al.
ICML 2025
1
citations
Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis
Hyunwoo Lee, Hayoung Choi, Hyunju Kim
ICLR 2025arXiv:2410.02242
6
citations
Do Transformer World Models Give Better Policy Gradients?
Michel Ma, Tianwei Ni, Clement Gehring et al.
ICML 2024arXiv:2402.05290
7
citations