Skip to main navigation Skip to search Skip to main content

A case for adaptive deep neural networks in edge computing

Francis McNamee, Schahram Dustdar, Peter Kilpatrick, Weisong Shi, Ivor Spence, Blesson Varghese*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Deep Neural Networks (DNNs) are an application class that benefit from being distributed across the edge and cloud. A DNN is partitioned such that specific layers of the DNN are deployed onto the edge and the cloud to meet performance and privacy objectives. However, there is limited understanding of: whether and how evolving operational conditions (increased CPU and memory utilization at the edge or reduced data transfer rates between the edge and cloud) affect the performance of already deployed DNNs, and whether a new partition configuration is required to maximize performance. A DNN that adapts to changing operational conditions is referred to as an ‘adaptive DNN’. This paper investigates whether there is a case for adaptive DNNs by considering four questions: (i) Are DNNs sensitive to operational conditions? (ii) How sensitive are DNNs to operational conditions? (iii) Do individual or a combination of operational conditions equally affect DNNs? (iv) Is DNN partitioning sensitive to hardware architectures? The exploration is carried out in the context of 8 pre-trained DNN models and the results presented are from analyzing nearly 8 million data points. The results highlight that network conditions affect DNN performance more than CPU or memory related operational conditions. Repartitioning is noted to provide a performance gain in a number of cases, but a specific trend is not noted in relation to the underlying hardware architecture. Nonetheless, the need for adaptive DNNs is confirmed.
Original languageEnglish
Title of host publication2021 IEEE 14th international conference on cloud computing (CLOUD)
EditorsClaudio Agostino Ardagna, Carl Chang, Ernesto Daminai, Rajiv Ranjan, Zhongjie Wang, Robert Wang, Jia Zhang, Wensheng Zhang
Place of PublicationPiscataway, NJ
PublisherIEEE Computer Society
Pages43-52
Number of pages10
ISBN (Electronic)9781665400602
ISBN (Print)9781665400619
DOIs
Publication statusPublished - 8 Nov 2021
Event2021 14th IEEE International Conference on Cloud Computing - Online virtual congress
Duration: 5 Sept 202211 Sept 2022
Conference number: 14
https://conferences.computer.org/cloud/2021/

Publication series

NameIEEE international conference on cloud computing (CLOUD)
PublisherIEEE
ISSN (Print)2159-6182
ISSN (Electronic)2159-6190

Conference

Conference2021 14th IEEE International Conference on Cloud Computing
Abbreviated titleCLOUD 2021
Period5/09/2211/09/22
Internet address

Fingerprint

Dive into the research topics of 'A case for adaptive deep neural networks in edge computing'. Together they form a unique fingerprint.

Cite this