Liu, Junxiu, Harkin, Jim, Li, Yuhua ORCID: https://orcid.org/0000-0003-2913-4478 and Maguire, Liam 2014. Online traffic-aware fault detection for networks-on-chip. Journal of Parallel and Distributed Computing 74 (1) , pp. 1984-1993. 10.1016/j.jpdc.2013.09.001 |
Abstract
A key requirement for modern Networks-on-Chip (NoC) is the ability to detect and diagnose faults and failures. This paper addresses the challenge of fault diagnosis using online testing where the interruption of the runtime operation (performance) under diagnosis is minimised. A novel Monitor Module (MM) is proposed to detect NoC interconnect faults which minimise the intrusion of the regular NoC traffic throughput by (1) using a channel tester which only examines NoC channels when they are idle; and (2) using a testing interval parameter based on the Binary Exponential Back off algorithm to dynamically balance the level of testing when recovering from temporary faults. The paper presents results on the minimal impact on NoC throughput for a range of testing conditions and also highlights the minimal area overhead of the MM (11.56%) compared with an adaptive NoC router implemented on FPGA hardware. Simulation results demonstrate non-intrusion of the NoC runtime traffic throughput when channel are fault free, and also how throughput loss is minimised when faults are identified.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
ISSN: | 0743-7315 |
Date of Acceptance: | 5 September 2013 |
Last Modified: | 07 Nov 2022 09:26 |
URI: | https://orca.cardiff.ac.uk/id/eprint/129144 |
Citation Data
Cited 48 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |