Reliable computation with unreliable computers. Issue 4 (1st July 2015)
- Record Type:
- Journal Article
- Title:
- Reliable computation with unreliable computers. Issue 4 (1st July 2015)
- Main Title:
- Reliable computation with unreliable computers
- Authors:
- Brown, Andrew D.
Mills, Rob
Dugan, Kier James
Reeve, Jeff S.
Furber, Steve B. - Abstract:
- Abstract : As computing systems continue their unquenchable rise towards and through million core architectures, two considerations that used to be unimportant become more and more dominant: power consumption (be it FLOPS/W or W/mm2) and reliability. This study is concerned with the latter: in a system of a million cores, it is unrealistic to expect 100% functionality on power‐up; equally, operational availability degrades with time. Monitoring and maintaining the health of such a system using traditional techniques is costly, and most rely on the concept of some sort of central overseer or monitor to make a final judgement about system availability, giving a single point of failure. Large systems of the future will consist of hardware and software that work synergistically to cope with isolated points of failure, allowing the gross behaviour of the system to degrade gracefully and in a meaningful way in the face of faults. This study describes one such system: spiking neural network architecture is a million‐core machine with layered fault‐tolerance built in at many levels. The authors show how the system may be used to solve the canonical distributed heat diffusion equation, and how the quality of solution is modulated by the effects of partial system failure.
- Is Part Of:
- IET computers & digital techniques. Volume 9:Issue 4(2015)
- Journal:
- IET computers & digital techniques
- Issue:
- Volume 9:Issue 4(2015)
- Issue Display:
- Volume 9, Issue 4 (2015)
- Year:
- 2015
- Volume:
- 9
- Issue:
- 4
- Issue Sort Value:
- 2015-0009-0004-0000
- Page Start:
- 230
- Page End:
- 237
- Publication Date:
- 2015-07-01
- Subjects:
- multiprocessing systems -- neural nets
reliable computation -- unreliable computers -- computing systems -- core architectures -- FLOPS/W -- reliability -- gross behaviour -- spiking neural network architecture -- million core machine -- distributed heat diffusion equation -- partial system failure
Computers -- Periodicals
Digital electronics -- Periodicals
Computer engineering -- Periodicals
Computer architecture -- Periodicals
Computer organization -- Periodicals
621.39 - Journal URLs:
- http://digital-library.theiet.org/content/journals/iet-cdt ↗
http://ieeexplore.ieee.org/servlet/opac?punumber=4117424 ↗
http://www.ietdl.org/IET-CDT ↗
https://ietresearch.onlinelibrary.wiley.com/journal/1751861x ↗
http://www.theiet.org/ ↗ - DOI:
- 10.1049/iet-cdt.2014.0110 ↗
- Languages:
- English
- ISSNs:
- 1751-8601
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4363.252300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17051.xml