Paper "inference efficiency" Papers
3 papers found
Conference
Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration
Zheqi Lv, Keming Ye, Zishu Wei et al.
AAAI 2025paperarXiv:2501.07596
1
citations
Overfill: Two-Stage Models for Efficient Language Model Decoding
Woojeong Kim, Junxiong Wang, Jing Nathan Yan et al.
COLM 2025paperarXiv:2508.08446
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation
Liyao Jiang, Negar Hassanpour, Mohammad Salameh et al.
AAAI 2025paperarXiv:2412.14283
5
citations