Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Generating synthetic datasets with deep learning models for human physical fatigue analysis

Lambay, Arsalan Jaweed, Liu, Ying ORCID: https://orcid.org/0000-0001-9319-5940, Morgan, Phillip ORCID: https://orcid.org/0000-0002-5672-0758 and Ji, Ze ORCID: https://orcid.org/0000-0002-8968-9902 2025. Generating synthetic datasets with deep learning models for human physical fatigue analysis. Machines 13 (3) , 235. 10.3390/machines13030235

[thumbnail of machines-13-00235.pdf]
Preview
PDF - Published Version
Available under License Creative Commons Attribution.

Download (7MB) | Preview

Abstract

There has been a growth of collaborative robots in Industry 5.0 due to the research in automation involving human-centric workplace design. It has had a substantial impact on industrial processes; however, physical exertion in human workers is still an issue, requiring solutions that combine technological innovation with human-centric development. By analysing real-world data, machine learning (ML) models can detect physical fatigue. However, sensor-based data collection is frequently used, which is often expensive and constrained. To overcome this gap, synthetic data generation (SDG) uses methods such as tabular generative adversarial networks (GANs) to produce statistically realistic datasets that improve machine learning model training while providing scalability and cost-effectiveness. This study presents an innovative approach utilising conditional GAN with auxiliary conditioning to generate synthetic datasets with essential features for detecting human physical fatigue in industrial scenarios. This approach allows us to enhance the SDG process by effectively handling the heterogeneous and imbalanced nature of human fatigue data, which includes tabular, categorical, and time-series data points. These generated datasets will be used to train specialised ML models, such as ensemble models, to learn from the original dataset from the extracted feature and then identify signs of physical fatigue. The trained ML model will undergo rigorous testing using authentic, real-world data to evaluate its sensitivity and specificity in recognising how closely generated data match with actual human physical fatigue within industrial settings. This research aims to provide researchers with an innovative method to tackle data-driven ML challenges of data scarcity and further enhance ML technology’s efficiency through training on SD. This study not only provides an approach to create complex realistic datasets but also helps in bridging the gap of Industry 5.0 data challenges for the purpose of innovations and worker well-being by improving detection capabilities.

Item Type: Article
Date Type: Published Online
Status: Published
Schools: Schools > Engineering
Schools > Psychology
Publisher: MDPI
ISSN: 2075-1702
Date of First Compliant Deposit: 14 March 2025
Date of Acceptance: 11 March 2025
Last Modified: 14 Mar 2025 15:46
URI: https://orca.cardiff.ac.uk/id/eprint/176885

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics