Shang Yang
6 papers, 467 total citations
Papers (6)
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
ICLR 2025, 179 citations
NVILA: Efficient Frontier Visual Language Models
CVPR 2025, 157 citations
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
CVPR 2023, 111 citations
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
NeurIPS 2025, 16 citations
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
ECCV 2024, 3 citations
SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference
ICCV 2025, 1 citation