Pan, Xianwei, Wen, Lijie, Li, Yuhua ORCID: https://orcid.org/0000-0003-2913-4478, Zhang, Yijia and Lu, Mingyu
2025.
LLM-BCgrading: Large language model-based Chinese medical long text classification for bladder cancer grade prediction.
Digital Health
11
10.1177/20552076251393290
|
|
PDF
- Published Version
Available under License Creative Commons Attribution Non-commercial. Download (3MB) |
Abstract
Background Traditional cystoscopic biopsy-based methods for histological grading of bladder cancer (BC) are invasive, subject to sampling errors, and susceptible to interobserver variability among pathologists. To address these challenges, this study explores a large language model (LLM)-based noninvasive approach to BC grade prediction using long Chinese medical texts. Methods We retrospectively collected admission records and computed tomography urography (CTU) descriptions from 642 patients pathologically diagnosed with BC. Each paired text was annotated as low grade or high grade according to histopathological results. We developed LLM-BCgrading to leverage HuatuoGPT-7B for Chinese medical long-text representation and integrated a gated multiplicative attention mechanism (GMAM) to selectively emphasize discriminative features. To address class imbalance and clinical risk asymmetry, the model was optimized with a cost-sensitive loss function. Performance was evaluated on a fixed internal test set with additional evaluation on an independent external validation cohort to assess generalizability. Results The best-performing configuration combined both admission records and CTU descriptions via an attention-based fusion strategy and GMAM, achieving balanced accuracy of 0.757, macro F1 score of 0.749, and macro AUC of 0.740. The ablation results demonstrated that incorporating both texts significantly improved classification performance compared with single-text configurations, and the GMAM consistently outperformed conventional attention mechanisms. Dimensionality experiments identified 256 as the optimal embedding size, balancing computational efficiency and predictive performance. Conclusion Our findings demonstrate that LLMs can effectively process Chinese medical long-texts for accurate preoperative prediction of BC grade. Attention-based fusion, cost-sensitive optimization, and interpretability based on Shapley additive explanations further support the robustness and clinical relevance of this LLM-driven framework.
| Item Type: | Article |
|---|---|
| Date Type: | Published Online |
| Status: | In Press |
| Schools: | Schools > Computer Science & Informatics |
| Publisher: | SAGE Publications |
| ISSN: | 2055-2076 |
| Date of First Compliant Deposit: | 27 November 2025 |
| Date of Acceptance: | 16 October 2025 |
| Last Modified: | 27 Nov 2025 09:30 |
| URI: | https://orca.cardiff.ac.uk/id/eprint/182668 |
Actions (repository staff only)
![]() |
Edit Item |





Dimensions
Dimensions