On the Effectiveness of Interval Bound Propagation for Training Verifiably Robust Models

Gowal, Sven; Dvijotham, Krishnamurthy; Stanforth, Robert; Bunel, Rudy; Qin, Chongli; Uesato, Jonathan; Arandjelovic, Relja; Mann, Timothy; Kohli, Pushmeet

arXiv:1810.12715 (cs)

[Submitted on 30 Oct 2018 (v1), last revised 29 Aug 2019 (this version, v4)]

Abstract: Recent work has shown that it is possible to train deep neural networks that are provably robust to norm-bounded adversarial perturbations. Most of these methods are based on minimizing an upper bound on the worst-case loss over all possible adversarial perturbations. While these techniques show promise, they often result in difficult optimization procedures that remain hard to scale to larger networks. Through a comprehensive analysis, we show how a simple bounding technique, interval bound propagation (IBP), can be exploited to train large provably robust neural networks that beat the state-of-the-art in verified accuracy. While the upper bound computed by IBP can be quite weak for general networks, we demonstrate that an appropriate loss and clever hyper-parameter schedule allow the network to adapt such that the IBP bound is tight. This results in a fast and stable learning algorithm that outperforms more sophisticated methods and achieves state-of-the-art results on MNIST, CIFAR-10 and SVHN. It also allows us to train the largest model to be verified beyond vacuous bounds on a downscaled version of ImageNet.
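
The abstract names interval bound propagation without spelling out the computation. The following is a minimal sketch of the general idea, assuming a small fully connected ReLU network and an L-infinity perturbation of radius eps around the input. It is not the authors' implementation, and it omits the loss and hyper-parameter schedule the abstract refers to. The function names (affine_bounds, relu_bounds, ibp_forward) and the toy weights are hypothetical, chosen only for illustration.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]


def affine_bounds(W: Matrix, b: Vector, lower: Vector, upper: Vector) -> Tuple[Vector, Vector]:
    """Propagate elementwise bounds lower <= x <= upper through x -> W x + b."""
    out_lower, out_upper = [], []
    for row, bias in zip(W, b):
        center = bias
        radius = 0.0
        for w, lo, hi in zip(row, lower, upper):
            center += w * (lo + hi) / 2.0        # image of the box midpoint
            radius += abs(w) * (hi - lo) / 2.0   # |w| times the box half-width
        out_lower.append(center - radius)
        out_upper.append(center + radius)
    return out_lower, out_upper


def relu_bounds(lower: Vector, upper: Vector) -> Tuple[Vector, Vector]:
    """ReLU is monotonic, so it can be applied to each endpoint separately."""
    return [max(0.0, lo) for lo in lower], [max(0.0, hi) for hi in upper]


def ibp_forward(layers: Sequence[Tuple[Matrix, Vector]], x: Vector, eps: float) -> Tuple[Vector, Vector]:
    """Bound the network output over all inputs within eps of x (L-infinity)."""
    lower = [xi - eps for xi in x]
    upper = [xi + eps for xi in x]
    for i, (W, b) in enumerate(layers):
        lower, upper = affine_bounds(W, b, lower, upper)
        if i < len(layers) - 1:  # no activation after the final layer
            lower, upper = relu_bounds(lower, upper)
    return lower, upper


# Toy two-layer network (hypothetical weights, for illustration only).
toy_layers = [
    ([[1.0, -1.0], [2.0, 0.5]], [0.0, -1.0]),
    ([[1.0, 1.0]], [0.0]),
]
toy_lower, toy_upper = ibp_forward(toy_layers, [1.0, 0.5], 0.25)
print(toy_lower, toy_upper)  # [0.625] [2.875]
```

The unperturbed input [1.0, 0.5] maps to 1.75 in this toy network, which lies inside the computed interval. The interval is sound but may be loose, which is the weakness the abstract says training can compensate for.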

Comments:	[v2] Best paper at NeurIPS SECML 2018 Workshop; [v4] Accepted at ICCV 2019 under the title "Scalable Verified Training for Provably Robust Image Classification"
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:1810.12715 [cs.LG]
	(or arXiv:1810.12715v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.12715

Submission history

From: Sven Gowal
[v1] Tue, 30 Oct 2018 13:12:47 UTC (143 KB)
[v2] Mon, 5 Nov 2018 11:48:21 UTC (148 KB)
[v3] Mon, 28 Jan 2019 16:53:04 UTC (370 KB)
[v4] Thu, 29 Aug 2019 12:23:52 UTC (379 KB)
