α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Baris Kasikci
Baris Kasikci
2
papers
296
total citations
papers (2)
QUEST: Query-Aware Sparsity for Efficient Long-Context LLM Inference
ICML 2024
arXiv
248
citations
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models
ICLR 2025
arXiv
48
citations