Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

SAITI-DCGAN: Self-attention based deep convolutional generative adversarial networks for data augmentation of infrared thermal images

Wu, Zhichao, Wei, Changyun, Xia, Yu and Ji, Ze ORCID: https://orcid.org/0000-0002-8968-9902 2024. SAITI-DCGAN: Self-attention based deep convolutional generative adversarial networks for data augmentation of infrared thermal images. Applied Sciences 14 (23) , 11391. 10.3390/app142311391

[thumbnail of applsci-14-11391-v2.pdf] PDF - Published Version
Available under License Creative Commons Attribution.

Download (8MB)
License URL: https://creativecommons.org/licenses/by/4.0/
License Start date: 6 December 2024

Abstract

Defect detection plays a crucial role in industrial production, and the implementation of this technology has significant implications for improving both product quality and processing efficiency. However, the limited availability of defect samples for training deep-learning-based object detection models within industrial processes poses challenges for model training. In this paper, we propose a novel deep convolutional generative adversarial network with self-attention mechanism for the data augmentation of infrared thermal images for the application of aluminum foil sealing. To further expand its applicability, the proposed method is designed not only to address the specific needs of aluminum foil sealing but also to serve as a robust framework that can be adapted to a wide range of industrial defect detection tasks. To be specific, the proposed approach integrates a self-attention module into the generator, adopts spectral normalization in both the generator and discriminator, and introduces a two time-scale update rule to coordinate the training process of these components. The experimental results validated the superiority of the proposed approach in terms of the synthesized image quality and diversity. The results show that our approach can capture intricate details and distinctive features of defect images of aluminum foil sealing. Furthermore, ablation experiments demonstrated that the combination of self-attention, spectral normalization, and two time-scale update rules significantly enhanced the quality of image generation, while achieving a balance between stability and training efficiency. This innovative framework marks a notable technical breakthrough in the field of industrial defect detection and image synthesis, offering broad application prospects.

Item Type: Article
Date Type: Published Online
Status: Published
Schools: Engineering
Additional Information: License information from Publisher: LICENSE 1: URL: https://creativecommons.org/licenses/by/4.0/, Start Date: 2024-12-06
Publisher: MDPI
Date of First Compliant Deposit: 20 December 2024
Date of Acceptance: 5 December 2024
Last Modified: 20 Dec 2024 15:45
URI: https://orca.cardiff.ac.uk/id/eprint/174887

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics