TEA-DNN: the Quest for Time-Energy-Accuracy Co-optimized Deep Neural Networks

Cai, Lile; Barneche, Anne-Maelle; Herbout, Arthur; Foo, Chuan Sheng; Lin, Jie; Chandrasekhar, Vijay Ramaseshan; Sabry, Mohamed M.

Computer Science > Neural and Evolutionary Computing

arXiv:1811.12065 (cs)

[Submitted on 29 Nov 2018 (v1), last revised 21 Oct 2019 (this version, v2)]

Title:TEA-DNN: the Quest for Time-Energy-Accuracy Co-optimized Deep Neural Networks

Authors:Lile Cai, Anne-Maelle Barneche, Arthur Herbout, Chuan Sheng Foo, Jie Lin, Vijay Ramaseshan Chandrasekhar, Mohamed M. Sabry

View PDF

Abstract:Embedded deep learning platforms have witnessed two simultaneous improvements. First, the accuracy of convolutional neural networks (CNNs) has been significantly improved through the use of automated neural-architecture search (NAS) algorithms to determine CNN structure. Second, there has been increasing interest in developing hardware accelerators for CNNs that provide improved inference performance and energy consumption compared to GPUs. Such embedded deep learning platforms differ in the amount of compute resources and memory-access bandwidth, which would affect performance and energy consumption of CNNs. It is therefore critical to consider the available hardware resources in the network architecture search. To this end, we introduce TEA-DNN, a NAS algorithm targeting multi-objective optimization of execution time, energy consumption, and classification accuracy of CNN workloads on embedded architectures. TEA-DNN leverages energy and execution time measurements on embedded hardware when exploring the Pareto-optimal curves across accuracy, execution time, and energy consumption and does not require additional effort to model the underlying hardware. We apply TEA-DNN for image classification on actual embedded platforms (NVIDIA Jetson TX2 and Intel Movidius Neural Compute Stick). We highlight the Pareto-optimal operating points that emphasize the necessity to explicitly consider hardware characteristics in the search process. To the best of our knowledge, this is the most comprehensive study of Pareto-optimal models across a range of hardware platforms using actual measurements on hardware to obtain objective values.

Comments:	Accepted by ISLPED2019
Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
Cite as:	arXiv:1811.12065 [cs.NE]
	(or arXiv:1811.12065v2 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1811.12065

Submission history

From: Lile Cai [view email]
[v1] Thu, 29 Nov 2018 11:05:28 UTC (1,812 KB)
[v2] Mon, 21 Oct 2019 07:39:19 UTC (3,029 KB)

Computer Science > Neural and Evolutionary Computing

Title:TEA-DNN: the Quest for Time-Energy-Accuracy Co-optimized Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:TEA-DNN: the Quest for Time-Energy-Accuracy Co-optimized Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators