Weizhu Chen
12 papers
1,083 total citations
Papers (12)
Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
NeurIPS 2021
arXiv
234 citations
LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models
ICLR 2024
arXiv
202 citations
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models
NeurIPS 2023
arXiv
164 citations
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
ICLR 2025
arXiv
122 citations
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
NeurIPS 2023
arXiv
120 citations
In-Context Learning Unlocked for Diffusion Models
NeurIPS 2023
arXiv
100 citations
Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning
AAAI 2025
arXiv
65 citations
Meet in the Middle: A New Pre-training Paradigm
NeurIPS 2023
arXiv
27 citations
MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning
AAAI 2025
arXiv
17 citations
SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning
NeurIPS 2025
arXiv
16 citations
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
ICLR 2024
arXiv
16 citations
Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models
ECCV 2022
0 citations