Towards Virtualization-Agnostic Latency for Time-Sensitive Applications

  • Haoran Li
  • , Meng Xu
  • , Chong Li
  • , Chenyang Lu
  • , Christopher Gill
  • , Linh Phan
  • , Insup Lee
  • , Oleg Sokolsky

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

As time-sensitive applications are deployed spanning multiple edge clouds, delivering consistent and scalable latency performance across different virtualized hosts becomes increasingly challenging. In contrast to traditional real-time systems requiring deadline guarantees for all jobs, the latency service-level objectives of cloud applications are usually defined in terms of tail latency, i.e., the latency of a certain percentage of the jobs should be below a given threshold. This means that neither dedicating entire physical CPU cores, nor combining virtualization with deadline-based techniques such as compositional real-time scheduling, can meet the needs of these applications in a resource-efficient manner. To address this limitation, and to simplify the management of edge clouds for latency-sensitive applications, we introduce virtualization-agnostic latency (VAL) as an essential property to maintain consistent tail latency assurances across different virtualized hosts. VAL requires that an application experience similar latency distributions on a shared host as on a dedicated one. Towards achieving VAL in edge clouds, this paper presents a virtualization-agnostic scheduling (VAS) framework for time-sensitive applications sharing CPUs with other applications. We show both theoretically and experimentally that VAS can effectively deliver VAL on shared hosts. For periodic and sporadic tasks, we establish theoretical guarantees that VAS can achieve the same task schedule on a shared CPU as on a full CPU dedicated to time-sensitive services. Moreover, this can be achieved by allocating the minimal CPU bandwidth to time-sensitive services, thereby avoiding wasting CPU resources. VAS has been implemented on Xen 4.10.0. In case studies running time-sensitive workloads on Redis and Spark streaming services, we show that in practice the task schedule on a shared CPU can closely approximate the one on a full CPU.

Original languageEnglish
Title of host publicationRTNS 2021 - 29th International Conference on Real-Time Networks and Systems
PublisherAssociation for Computing Machinery
Pages35-45
Number of pages11
ISBN (Electronic)9781450390019
DOIs
StatePublished - Apr 7 2021
Event29th International Conference on Real-Time Networks and Systems, RTNS 2021 - Nantes, France
Duration: Apr 7 2021 → …

Publication series

NameACM International Conference Proceeding Series

Conference

Conference29th International Conference on Real-Time Networks and Systems, RTNS 2021
Country/TerritoryFrance
CityNantes
Period04/7/21 → …

Keywords

  • Deferrable Server
  • Real-Time Scheduling
  • Virtualization

Fingerprint

Dive into the research topics of 'Towards Virtualization-Agnostic Latency for Time-Sensitive Applications'. Together they form a unique fingerprint.

Cite this