Datasets
Standard Dataset
Dataset for Detection and Score Grading for Prostate Adenocarcinoma using Semantic Segmentation
- Citation Author(s):
- Submitted by:
- Kasikrit Damkliang
- Last updated:
- Mon, 08/05/2024 - 02:53
- DOI:
- 10.21227/jy12-2c41
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
Prostate cancer is a major global health challenge. In this study, we present an approach for the detection and grading of prostate cancer through the semantic segmentation of adenocarcinoma tissues, specifically focusing on distinguishing between Gleason patterns 3 and 4. Our method leverages deep learning techniques to improve diagnostic accuracy and enhance patient treatment strategies. We developed a new dataset comprising 100 digitized whole-slide images of prostate needle core biopsy specimens, which are publicly available for research purposes. Our proposed model integrates dilated attention mechanisms and a residual convolutional U-Net architecture to enhance the richness of feature representations. Class imbalance is addressed using pixel expansion and class weights, and a five-fold cross-validation method ensures robust training and validation. The model achieves an average accuracy of 0.82 on unseen test data. Segmentation and grading results were validated by a team of expert pathologists. Based on experimental results, this study demonstrates the potential of our proposed method and model as a promising tool for the detection and grading of prostate cancer in clinical settings.
Dataset contains 100 slides, each slide consists of image patches (image20x directory) with the size of 256 x 256 pixels in PNG RGB 8 bit depth, and it respective ground truth masks (mask20x directory). In addtion, plot20X contains sanity image and mask plots. This is the directory structure of each slide.
<slide_id>
image20X
mask20X
plot20X