Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

A cascade framework for on-device uncertainty-aware event detection on microcontrollers

Jia, Hong, Kwon, Young D., Ma, Dong, Pham, N., Qendro, Lorena, Vu, Tam and Mascolo, Cecilia 2026. A cascade framework for on-device uncertainty-aware event detection on microcontrollers. Pervasive and Mobile Computing 119 , 102208. 10.1016/j.pmcj.2026.102208

[thumbnail of 1-s2.0-S1574119226000490-main.pdf] PDF - Published Version
Available under License Creative Commons Attribution.

Download (3MB)

Abstract

Pervasive sensing enables diverse wearable event detection (WED) applications, but deploying machine learning models on resource-constrained microcontrollers (MCUs) poses significant challenges, particularly in ensuring prediction reliability under data shifts or out-of-distribution (OOD) inputs. While Uncertainty quantification methods offer a way to assess this reliability, many are computationally prohibitive for MCUs, and detecting multiple events concurrently further exacerbates resource constraints. Addressing these combined challenges, this paper presents an uncertainty and resource-aware framework designed for reliable and efficient multi-event WED on MCUs, significantly extending our preliminary work. The proposed framework achieves this by integrating Evidential Deep Learning (EDL) for efficient, single-pass uncertainty estimation with a novel cascade learning architecture. This architecture promotes resource efficiency via: (i) intra-event sharing using uncertainty-aware early exits within a staged model (shallow, medium, deep), allowing simpler samples to terminate inference earlier; and (ii) inter-event sharing using a multi-head design where multiple event detectors share a common backbone, minimizing overhead. System efficiency is further enhanced through MCU-specific optimizations, including targeted architecture search, quantization, efficient uncertainty operator implementation using standard TensorFlow Lite Micro (TFLM) operations, and library footprint reduction. We conducted extensive experiments on four distinct wearable datasets (Oesense, KWS, ECG5000, and HHAR) and two MCU platforms (STM32F446ZE, STM32H747XI), comparing the proposed framework against strong baselines including Deep Ensembles and Vanilla EDL. Results demonstrate the proposed framework’s effectiveness, achieving competitive accuracy and uncertainty performance (e.g., up to 22% lower NLL than data augmentation) while drastically reducing resource consumption, offering up to 8.64 faster inference, up to 8.57 lower energy use, and 55% smaller memory footprint compared to ensemble methods. The proposed framework enables the deployment of reliable, uncertainty-aware multi-event detection on a wider range of low-power MCUs.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Schools > Computer Science & Informatics
Publisher: Elsevier
ISSN: 1574-1192
Date of First Compliant Deposit: 23 March 2026
Date of Acceptance: 10 March 2026
Last Modified: 23 Mar 2026 15:30
URI: https://orca.cardiff.ac.uk/id/eprint/185969

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics