Exploring gene expression and clinical data for identifying prostate cancer severity levels using machine learning methods

Küçük Resim Yok

Tarih

2023

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Institute of Electrical and Electronics Engineers Inc.

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Prostate cancer (PCa) is the most common type of cancer in men worldwide. It is a cancer that starts in the small walnut-shaped male gland called the prostate. From the prostate, it can form a metastasis into other organs. If detected and diagnosed early the survival rate may increase to 95%. Therefore, early detection and diagnosis are important tasks performed by a pathologist. The pathologist identifies the severity levels using a scale called the Gleason grading group (GGG). The GGG is found by pathologists by looking at a biopsy sample and assigning a grade of low, intermediate, or high to the sample. The pathologist then assesses a second sample in the same manner. The GGG is found by adding these two scores provides the total Gleason score. In this paper, we have explored tissue microarray (TMA) and clinical data collected by pathologists of Alberta Precision Laboratory, for predicting the severity level of prostate cancer using various machine learning methods. Traditional classifiers, such as Naïve Bayes, Decision Tree, Support Vector Machine with Radial basis function (RBF), Logistic Regression, and ensemble classifiers, such as Random Forest, and Bagging with k-nearest neighbors have been applied through the machine learning pipeline containing imputation and sampling techniques. An integrated SMOTE-Tomek Links method is adopted for handling the class imbalance problem. The highest accuracy obtained is 99.64% from the Random Forest method.

Açıklama

Anahtar Kelimeler

Gleason Grading Group, Machine Learning, Prostate Cancer, Random Forest, Severity Levels

Kaynak

Annual IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)

WoS Q Değeri

N/A

Scopus Q Değeri

N/A

Cilt

Sayı

2023-September

Künye

Marouf, A. A., Alhajj, R., Rokne, J. G., Ghose, S. ve Bismar, T. A. (2023). Exploring gene expression and clinical data for identifying prostate cancer severity levels using machine learning methods. Annual IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) içinde (186-191. ss.). Regina, 24-27 September 2023. https://doi.org/10.1109/CCECE58730.2023.10288946