α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zidi Xiong
Zidi Xiong
6
papers
825
total citations
papers (6)
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
NEURIPS 2023
arXiv
571
citations
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
ICLR 2024
arXiv
84
citations
GuardAgent: Safeguard LLM Agents via Knowledge-Enabled Reasoning
ICML 2025
69
citations
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content
ICML 2024
arXiv
67
citations
CBD: A Certified Backdoor Detector Based on Local Dominant Probability
NEURIPS 2023
arXiv
23
citations
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models
ICLR 2025
arXiv
11
citations