Skip to main navigation Skip to search Skip to main content

NEUKONFIG: reducing edge service downtime when repartitioning DNNs

Ayesha Abdul Majeed, Peter Kilpatrick, Ivor Spence, Blesson Varghese

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Deep Neural Networks (DNNs) may be partitioned across the edge and the cloud to improve the performance efficiency of inference. DNN partitions are determined based on operational conditions such as network speed. When operational conditions change DNNs will need to be repartitioned to maintain the overall performance. However, repartitioning using existing approaches, such as Pause and Resume, will incur a service downtime on the edge. This paper presents the NEUKONFIG framework that identifies the service downtime incurred when repartitioning DNNs and proposes approaches for reducing edge service downtime. The proposed approaches are based on ‘Dynamic Switching’ in which, when the network speed changes and given an existing edge-cloud pipeline, a new edge-cloud pipeline is initialised with new DNN partitions. Incoming inference requests are switched to the new pipeline for processing data. Experimental studies are carried out on a lab-based testbed to demonstrate that Dynamic Switching reduces the downtime by at least an order of magnitude when compared to a baseline using Pause and Resume that has a downtime of 6 seconds. A trade-off in the edge service downtime and memory required is noted. The Dynamic Switching approach that requires the same amount of memory as the baseline reduces the edge service downtime to 0.6 seconds and to less than 1 millisecond in the best case when twice the amount of memory as the baseline is available.
Original languageEnglish
Title of host publication2021 IEEE international conference on cloud engineering (IC2E)
Place of PublicationPiscataway, NJ
PublisherIEEE Computer Society
Pages1-8
Number of pages8
ISBN (Electronic)9781665449700
ISBN (Print)9781665449717
DOIs
Publication statusPublished - 22 Nov 2021
Event9th IEEE International Conference on Cloud Engineering -
Duration: 4 Oct 20218 Oct 2021
Conference number: 9
https://conferences.computer.org/IC2E/2021/

Conference

Conference9th IEEE International Conference on Cloud Engineering
Abbreviated titleIC2E 2021
Period4/10/218/10/21
Internet address

Keywords

  • Edge computing
  • Distribution DNN
  • Service downtime

Fingerprint

Dive into the research topics of 'NEUKONFIG: reducing edge service downtime when repartitioning DNNs'. Together they form a unique fingerprint.

Cite this