α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Xuandong Zhao
Xuandong Zhao
1
Affiliations
Affiliations
UC Berkeley
10
papers
713
total citations
papers (10)
Provable Robust Watermarking for AI-Generated Text
ICLR 2024
arXiv
279
citations
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews
ICML 2024
arXiv
183
citations
Weak-to-Strong Jailbreaking on Large Language Models
ICML 2025
arXiv
95
citations
DE-COP: Detecting Copyrighted Content in Language Models Training Data
ICML 2024
arXiv
73
citations
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification
AAAI 2025
arXiv
31
citations
Improving LLM Safety Alignment with Dual-Objective Optimization
ICML 2025
arXiv
21
citations
Assessing Judging Bias in Large Reasoning Models: An Empirical Study
COLM 2025
arXiv
14
citations
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models
ICLR 2025
arXiv
11
citations
LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
COLM 2025
arXiv
6
citations
DIS-CO: Discovering Copyrighted Content in VLMs Training Data
ICML 2025
arXiv
0
citations