Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

FruitQuery: A lightweight query-based instance segmentation model for in-field fruit ripeness determination

Zhao, Ziang ORCID: https://orcid.org/0009-0004-6600-5581, Hicks, Yulia ORCID: https://orcid.org/0000-0002-7179-4587, Sun, Xianfang ORCID: https://orcid.org/0000-0002-6114-0766 and Luo, Chaoxi 2025. FruitQuery: A lightweight query-based instance segmentation model for in-field fruit ripeness determination. Smart Agricultural Technology 12 , 101068. 10.1016/j.atech.2025.101068

[thumbnail of 2025FruitHicks.pdf] PDF - Published Version
Download (5MB)
License URL: http://creativecommons.org/licenses/by/4.0/
License Start date: 1 June 2025

Abstract

Accurate fruit instance segmentation at different ripeness stages is critical for developing autonomous harvesting robots, particularly given the unstructured in-field conditions. In this paper, we combine two in-field fruit datasets of peaches and strawberries for multiple ripeness stages determination, and propose a lightweight query-based instance segmentation model named FruitQuery. The combined dataset contains 3 peach ripeness stages and 4 strawberry ripeness stages, covering various unstructured conditions of two popular fruits. The model FruitQuery consists of three parts: a backbone, a pixel decoder and Transformer decoders. Efficient multi-head self-attention modules are introduced to the backbone to reduce computational overhead, and a pyramid pooling module is added to the pixel decoder to enhance multi-scale feature fusion. Transformer decoders are then applied to learn a fixed number of queries from features and generate instance masks, avoiding postprocessing like non-maximum suppression. FruitQuery runs in an end-to-end way and incorporates the convolution and Transformer to capture fine-grained features related to different fruits at different ripeness stages. Extensive experiments on the combined fruit dataset demonstrate that our FruitQuery achieves the highest average precision of 67.02 with only 14.08M parameters, outperforming 13 state-of-the-art models with 33 variants. It is noted that FruitQuery surpasses three series of YOLO (v8, v9 and v10) by a large margin. Ablation studies and visualizations also show its robust feature extraction with fewer parameter usage, indicating that the query-based design is effective in localizing fruit. These results highlight FruitQuery's compelling balance between segmentation performance and model size, offering the potential for in-field application.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Schools > Engineering
Schools > Computer Science & Informatics
Additional Information: License information from Publisher: LICENSE 1: URL: http://creativecommons.org/licenses/by/4.0/, Start Date: 2025-06-01
Publisher: Elsevier
ISSN: 2772-3755
Date of First Compliant Deposit: 10 June 2025
Date of Acceptance: 1 June 2025
Last Modified: 10 Jun 2025 09:00
URI: https://orca.cardiff.ac.uk/id/eprint/178949

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics