Instant adversarial purification with adversarial consistency distillation

Chun Tong Lei, Hon Ming Yam, Zhongliang Guo, Chun Pong Lau

Research output: Working paperPreprint

Abstract

Neural networks, despite their remarkable performance in widespread applications, including image classification, are also known to be vulnerable to subtle adversarial noise. Although some diffusion-based purification methods have been proposed, for example, DiffPure, those methods are time-consuming. In this paper, we propose One Step Control Purification (OSCP), a diffusion-based purification model that can purify the adversarial image in one Neural Function Evaluation (NFE) in diffusion models. We use Latent Consistency Model (LCM) and ControlNet for our one-step purification. OSCP is computationally friendly and time efficient compared to other diffusion-based purification methods; we achieve defense success rate of 74.19\% on ImageNet, only requiring 0.1s for each purification. Moreover, there is a fundamental incongruence between consistency distillation and adversarial perturbation. To address this ontological dissonance, we propose Gaussian Adversarial Noise Distillation (GAND), a novel consistency distillation framework that facilitates a more nuanced reconciliation of the latent space dynamics, effectively bridging the natural and adversarial manifolds. Our experiments show that the GAND does not need a Full Fine Tune (FFT); PEFT, e.g., LoRA is sufficient.
Original languageEnglish
PublisherarXiv
Publication statusPublished - 2 Sept 2024

Fingerprint

Dive into the research topics of 'Instant adversarial purification with adversarial consistency distillation'. Together they form a unique fingerprint.
  • Instant adversarial purification with adversarial consistency distillation

    Lei, C. T., Yam, H. M., Guo, Z., Qian, Y. & Lau, C. P., 13 Aug 2025, 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Los Alamitos: IEEE Computer Society, p. 24331-24340 10 p. (IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Open Access

Cite this