RobustFace: a novel image restoration technique for face adversarial robustness improvement

Publications

RobustFace: a novel image restoration technique for face adversarial robustness improvement

Year : 2025

Publisher : Springer

Source Title : Multimedia Tools and Applications

Document Type :

Abstract

Machine Learning (ML) models, particularly Deep Learning (DL), have made rapid progress and achieved significant milestones across various applications, including numerous safety-critical contexts. However, these models have recently been discovered to be susceptible to adversarial attacks, which are well-crafted input images. The primary objective of this paper is to propose a novel methodology titled “RobustFace“, which is based on deep image restoration networks, that significantly improves the facial adversarial robustness of various image-classification models. Adversarial images are created using the Private Fast Gradient Sign Method (P-FGSM), StyleGAN and Fast Landmark Manipulation (FLM) methods. The adversarial images are then enhanced using deep image restoration networks to bring back them into the original space. The encoded weighted local magnitude patterns (WLMP) are extracted and provided to different types of classifiers to detect facial adversarial images from the clean images. The effectiveness of RobustFace has been demonstrated on two real-world datasets and experimental outcomes show that it significantly improves facial adversarial robustness on all evaluating classifiers. It improves the highest classification accuracy from 98.75% to 99.00% on P-FGSM attacks, from 77.94% to 85.25% on adversarial attacks generated by StyleGAN and from 65.52% to 69.50% for FLM attacks.