Application Re-mapping for Fault-Tolerance in Ambient Intelligent Systems

Abstract

As technology advances, devices become smaller and cheaper, making it possible to build systems containing large numbers (possibly hundreds or more) of miniature processing elements. Such platforms, although superficially similar to traditional distributed systems, pose additional unique challenges. Because of the desire to minimize costs, coupled with the sheer number of devices, it will be difficult to perform manufacture-time testing, and runtime failures will likewise be common. One challenge is to efficiently harness the capabilities of large numbers of low-power (and relatively low-performance) processing elements in the presence of failures such as depleted battery resources, as well as failures from unpredictable sources (e.g., electrical and mechanical failures). It will, however, be possible to employ a fraction of the multitude of resources as redundant or spare devices, re-mapping applications onto them from failing ones.

This paper investigates the use of code migration as a general means of performing such application re-mapping in the presence of intermittent communication and device failures, as well as limited battery resources. It presents a new technique, Pre-Copying with Remote Execution (PCRE), an extension of code migration that enables more efficient application re-mapping for symmetric applications under energy and communication constraints.

It is shown that PCRE provides a 28.6% improvement in system lifetime and 9.8% improvement in energy efficiency for the applications investigated, over the baseline code migration strategy. Naturally, the re-mapping of applications involves overheads in computation and communication, and PCRE reduces these overheads to within 10% of the ideal case of doubled energy resources.

Cite as:

Stanley-Marbell P., Zamora N.H., Marculescu D., Marculescu R. (2003) Application Re-Mapping for Fault-Tolerance in Ambient Intelligent Systems. In: Basten T., Geilen M., de Groot H. (eds) Ambient Intelligence: Impact on Embedded System Design. Springer, Boston, MA. https://doi.org/10.1007/0-306-48706-3_16

BibTeX:

@Inbook{Stanley-Marbell2003,
    author={Stanley-Marbell, Phillip and Zamora, Nicholas H. and Marculescu, Diana and Marculescu, Radu},
    editor={Basten, Twan and Geilen, Marc and de Groot, Harmke},
    title={Application Re-Mapping for Fault-Tolerance in Ambient Intelligent Systems},
    bookTitle={Ambient Intelligence: Impact on Embedded System Design},
    year={2003},
    publisher={Springer US},
    address={Boston, MA},
    pages={315--335},
    isbn={978-0-306-48706-4},
    doi={10.1007/0-306-48706-3_16},
    url={https://doi.org/10.1007/0-306-48706-3_16}
}