Towards Certifiable Adversarial Sample Detection

Shumailov, Ilia; Zhao, Yiren; Mullins, Robert; Anderson, Ross

arXiv:2002.08740 (cs)

[Submitted on 20 Feb 2020]

Abstract: Convolutional Neural Networks (CNNs) are deployed in more and more classification systems, but adversarial samples can be maliciously crafted to trick them, and are becoming a real threat. There have been various proposals to improve CNNs' adversarial robustness but these all suffer performance penalties or other limitations. In this paper, we provide a new approach in the form of a certifiable adversarial detection scheme, the Certifiable Taboo Trap (CTT). The system can provide certifiable guarantees of detection of adversarial inputs for certain $l_{\infty}$ sizes under a reasonable assumption, namely that the training data have the same distribution as the test data. We develop and evaluate several versions of CTT with a range of defense capabilities, training overheads and certifiability on adversarial samples. Against adversaries with various $l_p$ norms, CTT outperforms existing defense methods that focus purely on improving network robustness. We show that CTT has small false positive rates on clean test data, minimal compute overheads when deployed, and can support complex security policies.

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as: arXiv:2002.08740 [cs.LG]
(or arXiv:2002.08740v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2002.08740

Submission history

From: Ilia Shumailov
[v1] Thu, 20 Feb 2020 14:10:00 UTC (1,427 KB)
