"adversarial training" Papers
Generalized Smooth Variational Inequalities: Methods with Adaptive Stepsizes
Daniil Vankov, Angelia Nedich, Lalitha Sankar
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Mantas Mazeika, Long Phan, Xuwang Yin et al.
Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training
Jiacheng Zhang, Feng Liu, Dawei Zhou et al.
Improving Adversarial Energy-Based Model via Diffusion Process
Cong Geng, Tian Han, Peng-Tao Jiang et al.
Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training
Yuanqi Yao, Gang Wu, Kui Jiang et al.
LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training
Khoi M. Le, Trinh Pham, Tho Quan et al.
Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency
Runqi Lin, Chaojian Yu, Bo Han et al.
Learning a Dynamic Privacy-preserving Camera Robust to Inversion Attacks
Jiacheng Cheng, Xiang Dai, Jia Wan et al.
Learning Decision Trees and Forests with Algorithmic Recourse
Kentaro Kanamori, Takuya Takagi, Ken Kobayashi et al.
Learning Differentially Private Diffusion Models via Stochastic Adversarial Distillation
Bochao Liu, Pengju Wang, Shiming Ge
Lyapunov-Stable Deep Equilibrium Models
Haoyu Chu, Shikui Wei, Ting Liu et al.
Modular Learning of Deep Causal Generative Models for High-dimensional Causal Inference
Md Musfiqur Rahman, Murat Kocaoglu
On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Yihao Zhang, Hangzhou He, Jingyu Zhu et al.
Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation
Yuzheng Wang, Zhaoyu Chen, Dingkang Yang et al.
Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off
Yuansan Liu, Ruqing Zhang, Mingkun Zhang et al.
Position: What makes an image realistic?
Lucas Theis
Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective
Zhaoxin Wang, Handing Wang, Cong Tian et al.
Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders
Yi Yu, Yufei Wang, Song Xia et al.
Refining Minimax Regret for Unsupervised Environment Design
Michael Beukman, Samuel Coward, Michael Matthews et al.
Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image Restoration
Xiaole Tang, Hu Xin, Xiang Gu et al.
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.
Rethinking Robustness of Model Attributions
Sandesh Kamath, Sankalp Mittal, Amit Deshpande et al.
Robust Classification via a Single Diffusion Model
Huanran Chen, Yinpeng Dong, Zhengyi Wang et al.
RODEO: Robust Outlier Detection via Exposing Adaptive Out-of-Distribution Samples
Hossein Mirzaei, Mohammad Jafari Varnousfaderani, Hamid Reza Dehbashi et al.
Shedding More Light on Robust Classifiers under the lens of Energy-based Models
Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.
Stable Unlearnable Example: Enhancing the Robustness of Unlearnable Examples via Stable Error-Minimizing Noise
Yixin Liu, Kaidi Xu, Xun Chen et al.
The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks
Ziquan Liu, Yufei Cui, Yan Yan et al.
Towards Efficient Training and Evaluation of Robust Models against $l_0$ Bounded Adversarial Perturbations
Xuyang Zhong, Yixiao Huang, Chen Liu
Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models
Francesco Croce, Naman D. Singh, Matthias Hein
Towards Robust Image Stitching: An Adaptive Resistance Learning against Compatible Attacks
Zhiying Jiang, Xingyuan Li, Jinyuan Liu et al.
Uniformly Stable Algorithms for Adversarial Training and Beyond
Jiancong Xiao, Jiawei Zhang, Zhi-Quan Luo et al.
Unleashing Network Potentials for Semantic Scene Completion
Fengyun Wang, Qianru Sun, Dong Zhang et al.