Mintong Kang
13 papers
944 total citations
Papers (13)
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
NEURIPS 2023
arXiv
571
citations
EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
ICLR 2025
arXiv
111
citations
DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
NEURIPS 2023
arXiv
50
citations
ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning
ICML 2025
arXiv
43
citations
Fairness in Federated Learning via Core-Stability
NEURIPS 2022
arXiv
41
citations
$R^2$-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
ICLR 2025
arXiv
34
citations
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models
ICML 2024
arXiv
31
citations
AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models
ICLR 2025
arXiv
24
citations
Certifying Some Distributional Fairness with Subpopulation Decomposition
NEURIPS 2022
arXiv
17
citations
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models
ICLR 2025
arXiv
11
citations
Certifiably Byzantine-Robust Federated Conformal Prediction
ICML 2024
arXiv
5
citations
PolyGuard: Massive Multi-Domain Safety Policy-Grounded Guardrail Dataset
NEURIPS 2025
3
citations
FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning
ICCV 2025
arXiv
3
citations