Mengzhou Xia
7 papers, 1,558 total citations
papers (7)
Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation (ICLR 2024, arXiv): 430 citations
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning (ICLR 2024, arXiv): 426 citations
LESS: Selecting Influential Data for Targeted Instruction Tuning (ICML 2024, arXiv): 400 citations
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications (ICML 2024, arXiv): 184 citations
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning (NeurIPS 2025, arXiv): 89 citations
Language Models as Science Tutors (ICML 2024, arXiv): 15 citations
Trainable Transformer in Transformer (ICML 2024, arXiv): 14 citations