Delve into the Performance Degradation of Differentiable Architecture Search

Zhang, Jiuling; Ding, Zhiming

doi:10.1145/3459637.3482248

Computer Science > Machine Learning

arXiv:2109.13466 (cs)

[Submitted on 28 Sep 2021]

Title:Delve into the Performance Degradation of Differentiable Architecture Search

Authors:Jiuling Zhang, Zhiming Ding

View PDF

Abstract:Differentiable architecture search (DARTS) is widely considered to be easy to overfit the validation set which leads to performance degradation. We first employ a series of exploratory experiments to verify that neither high-strength architecture parameters regularization nor warmup training scheme can effectively solve this problem. Based on the insights from the experiments, we conjecture that the performance of DARTS does not depend on the well-trained supernet weights and argue that the architecture parameters should be trained by the gradients which are obtained in the early stage rather than the final stage of training. This argument is then verified by exchanging the learning rate schemes of weights and parameters. Experimental results show that the simple swap of the learning rates can effectively solve the degradation and achieve competitive performance. Further empirical evidence suggests that the degradation is not a simple problem of the validation set overfitting but exhibit some links between the degradation and the operation selection bias within bilevel optimization dynamics. We demonstrate the generalization of this bias and propose to utilize this bias to achieve an operation-magnitude-based selective stop.

Comments:	Accepted as a full paper at the CIKM 2021 conference
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2109.13466 [cs.LG]
	(or arXiv:2109.13466v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.13466
Related DOI:	https://doi.org/10.1145/3459637.3482248

Submission history

From: Jiuling Zhang [view email]
[v1] Tue, 28 Sep 2021 03:37:56 UTC (2,242 KB)

Computer Science > Machine Learning

Title:Delve into the Performance Degradation of Differentiable Architecture Search

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Delve into the Performance Degradation of Differentiable Architecture Search

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators