aalto1 untyped-item.component.html
A clustering-based approach to address correlated features in predicting genitourinary toxicity from MRI-guided prostate SBRT
Loading...
Access rights
openAccess
acceptedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Date
Major/Subject
Mcode
Degree programme
Language
en
Pages
11
Series
Medical Physics, Volume 52, issue 6, pp. 5104-5114
Abstract
Background: It is common in outcome analysis to work with a large set of candidate prognostic features. However, such high-dimensional input and relatively small sample size leads to risk of overfitting, low generalizability, and correlation bias. Purpose: This study addresses the issue of correlation bias mitigation in the context of predicting genitourinary (GU) toxicity in prostate cancer patients underwent MRI-guided stereotactic body radiation therapy (SBRT). Methods: Typical dimension reduction or feature selection methods include regularization for sparsity or information criterion. However, when heavy correlation occurs with (subsets of) input features, the assigned weights of correlated features can be diluted to an extent that the corresponding features are no more effective in the prediction, leading to suboptimal feature discovery and prediction. We propose to perform advanced hierarchical clustering and then apply regression modeling to cluster centroids. This approach addresses the challenges posed by high dimensionality and ill-conditioning, and improves accuracy and reliability of the resulting prediction models. Performance of the proposed method was evaluated on typical regression models with intrinsic feature reduction methods, namely Least Absolute Shrinkage and Selection Operator (LASSO) regularized logistic regression (LR), support vector machine (SVM), and decision trees (DT). Results: Extensive experiments show that introducing cluster-based feature compaction and representation improves all regression models under fair hyperparameter tuning conditions. Although LASSO and LR with clustered features had similar performance during training and validation, with LASSO-LR being slightly better, the cluster-based feature method achieved significantly better performance on the test set by achieving 0.91 AUC and 0.86 accuracy, demonstrating its advantage in stability and robustness. The overall best test performance is achieved by combining feature clustering to five representatives with SVM. Additional correlation study identified individual features closely representing the cluster centroids as exposure volume of rectum at 2 Gy rectum, trigone exposure at 2 Gy and 41 Gy, urethra at 42 Gy urethra, and rectal wall at 42 Gy rectal wall. This indicates the importance of hot spot control of urethra, trigone, and rectal wall for toxicity control. Conclusions: These findings underscore the superiority of the clustering method in mitigating correlation bias and enhancing predictive model accuracy. The current model also achieves state of the art (SOTA) performance in predicting GU toxicity in MRI-guided prostate SBRT. Correlating dose features to feature cluster centroids reveals the importance of hot spot control on urethra, trigone, and rectal wall to reduce toxicity risk.
Description
Publisher Copyright: © 2025 American Association of Physicists in Medicine.
Other note
Citation
Rezapoor, P, Pham, J, Neilsen, B, Liu, H, Cao, M, Yang, Y, Sheng, K, Ma, T M, Lamb, J, Steinberg, M, Kishan, A U, Taylor, Z & Ruan, D 2025, 'A clustering-based approach to address correlated features in predicting genitourinary toxicity from MRI-guided prostate SBRT', Medical Physics, vol. 52, no. 6, pp. 5104-5114. https://doi.org/10.1002/mp.17834