A comprehensive evaluation framework for deep model robustness. (May 2023)
- Record Type:
- Journal Article
- Title:
- A comprehensive evaluation framework for deep model robustness. (May 2023)
- Main Title:
- A comprehensive evaluation framework for deep model robustness
- Authors:
- Guo, Jun
Bao, Wei
Wang, Jiakai
Ma, Yuqing
Gao, Xinghai
Xiao, Gang
Liu, Aishan
Dong, Jian
Liu, Xianglong
Wu, Wenjun - Abstract:
- Highlights: Establish a comprehensive framework for model robustness, containing 23 different metrics. Provide an open-sourced platform to support easy-to-use robustness evaluation and continuous integration. Conduct large-scale experiments and provide preliminary suggestions to the evaluation of model robustness. Abstract: Deep neural networks (DNNs) have achieved remarkable performance across a wide range of applications, while they are vulnerable to adversarial examples, which motivates the evaluation and benchmark of model robustness. However, current evaluations usually use simple metrics to study the performance of defenses, which are far from understanding the limitation and weaknesses of these defense methods. Thus, most proposed defenses are quickly shown to be attacked successfully, which results in the "arm race" phenomenon between attack and defense. To mitigate this problem, we establish a model robustness evaluation framework containing 23 comprehensive and rigorous metrics, which consider two key perspectives of adversarial learning (i.e., data and model). Through neuron coverage and data imperceptibility, we use data-oriented metrics to measure the integrity of test examples; by delving into model structure and behavior, we exploit model-oriented metrics to further evaluate robustness in the adversarial setting. To fully demonstrate the effectiveness of our framework, we conduct large-scale experiments on multiple datasets including CIFAR-10, SVHN, andHighlights: Establish a comprehensive framework for model robustness, containing 23 different metrics. Provide an open-sourced platform to support easy-to-use robustness evaluation and continuous integration. Conduct large-scale experiments and provide preliminary suggestions to the evaluation of model robustness. Abstract: Deep neural networks (DNNs) have achieved remarkable performance across a wide range of applications, while they are vulnerable to adversarial examples, which motivates the evaluation and benchmark of model robustness. However, current evaluations usually use simple metrics to study the performance of defenses, which are far from understanding the limitation and weaknesses of these defense methods. Thus, most proposed defenses are quickly shown to be attacked successfully, which results in the "arm race" phenomenon between attack and defense. To mitigate this problem, we establish a model robustness evaluation framework containing 23 comprehensive and rigorous metrics, which consider two key perspectives of adversarial learning (i.e., data and model). Through neuron coverage and data imperceptibility, we use data-oriented metrics to measure the integrity of test examples; by delving into model structure and behavior, we exploit model-oriented metrics to further evaluate robustness in the adversarial setting. To fully demonstrate the effectiveness of our framework, we conduct large-scale experiments on multiple datasets including CIFAR-10, SVHN, and ImageNet using different models and defenses with our open-source platform. Overall, our paper provides a comprehensive evaluation framework, where researchers could conduct comprehensive and fast evaluations using the open-source toolkit, and the analytical results could inspire deeper understanding and further improvement to the model robustness. … (more)
- Is Part Of:
- Pattern recognition. Volume 137(2023)
- Journal:
- Pattern recognition
- Issue:
- Volume 137(2023)
- Issue Display:
- Volume 137, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 137
- Issue:
- 2023
- Issue Sort Value:
- 2023-0137-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-05
- Subjects:
- Adversarial examples -- Evaluation metrics -- Model robustness
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2023.109308 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25712.xml