"hardware-aware optimization" Papers
4 papers found
Conference
Adaptive Computation Pruning for the Forgetting Transformer
Zhixuan Lin, Johan Obando-Ceron, Xu Owen He et al.
COLM 2025paperarXiv:2504.06949
3
citations
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
Yuxian Gu, Qinghao Hu, Haocheng Xi et al.
NEURIPS 2025arXiv:2508.15884
16
citations
Kolmogorov-Arnold Transformer
Xingyi Yang, Xinchao Wang
ICLR 2025arXiv:2409.10594
92
citations
Neural Tangent Knowledge Distillation for Optical Convolutional Networks
Jinlin Xiang, Minho Choi, Yubo Zhang et al.
NEURIPS 2025arXiv:2508.08421
1
citations