Huan Zhang
Affiliations (2): UC Davis, UCLA
21 papers, 1,061 total citations
Papers (21)
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations (NeurIPS 2020; arXiv; 358 citations)
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability (ICML 2024; arXiv; 156 citations)
General Cutting Planes for Bound-Propagation-Based Neural Network Verification (NeurIPS 2022; arXiv; 127 citations)
Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS 2021; arXiv; 96 citations)
Fast Certified Robust Training with Short Warmup (NeurIPS 2021; arXiv; 67 citations)
Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation (NeurIPS 2022; arXiv; 56 citations)
Robust Mixture-of-Expert Training for Convolutional Neural Networks (ICCV 2023; arXiv; 38 citations)
Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation (ICML 2024; arXiv; 34 citations)
An Efficient Adversarial Attack for Tree Ensembles (NeurIPS 2020; arXiv; 26 citations)
Provably Bounding Neural Network Preimages (NeurIPS 2023; arXiv; 24 citations)
Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks (CVPR 2025; arXiv; 23 citations)
Are AlphaZero-like Agents Robust to Adversarial Perturbations? (NeurIPS 2022; arXiv; 15 citations)
Automatic Perturbation Analysis for Scalable Certified Robustness and Beyond (NeurIPS 2020; arXiv; 15 citations)
Causal Composition Diffusion Model for Closed-loop Traffic Generation (CVPR 2025; arXiv; 13 citations)
Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models (CVPR 2025; arXiv; 9 citations)
SDP-CROWN: Efficient Bound Propagation for Neural Network Verification with Tightness of Semidefinite Programming (ICML 2025; arXiv; 4 citations)
VIP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers (ECCV 2022; 0 citations)
Robustness between the worst and average case (NeurIPS 2021; 0 citations)
Position: TrustLLM: Trustworthiness in Large Language Models (ICML 2024; 0 citations)
Beta-CROWN: Efficient Bound Propagation with Per-neuron Split Constraints for Neural Network Robustness Verification (NeurIPS 2021; 0 citations)
Fine-grained Local Sensitivity Analysis of Standard Dot-Product Self-Attention (ICML 2024; 0 citations)