Identification of intraductal carcinoma of the prostate on tissue specimens using Raman micro-spectroscopy: A diagnostic accuracy case–control study with multicohort validation
Authors:
Andrée-Anne Grosset aff001; Frédérick Dallaire aff001; Tien Nguyen aff001; Mirela Birlea aff001; Jahg Wong aff001; François Daoust aff001; Noémi Roy aff001; André Kougioumoutzakis aff001; Feryel Azzi aff001; Kelly Aubertin aff001; Samuel Kadoury aff001; Mathieu Latour aff003; Roula Albadine aff003; Susan Prendeville aff007; Paul Boutros aff008; Michael Fraser aff008; Rob G. Bristow aff013; Theodorus van der Kwast aff013; Michèle Orain aff014; Hervé Brisson aff014; Nazim Benzerdjeb aff001; Hélène Hovington aff014; Alain Bergeron aff014; Yves Fradet aff014; Bernard Têtu aff014; Fred Saad aff001; Frédéric Leblond aff001; Dominique Trudel aff001
Authors place of work:
Centre de recherche du Centre hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
aff001; Institut du cancer de Montréal, Montreal, Quebec, Canada
aff002; Department of Pathology and Cellular Biology, Université de Montréal, Montreal, Quebec, Canada
aff003; Department of Computer Engineering and Software Engineering, Polytechnique Montréal, Montreal, Quebec, Canada
aff004; Department of Engineering Physics, Polytechnique Montréal, Montreal, Quebec, Canada
aff005; Department of Pathology, Centre hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
aff006; Laboratory Medicine Program, University Health Network, Toronto, Ontario, Canada
aff007; Informatics & Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
aff008; Department of Human Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
aff009; Department of Urology, University of California, Los Angeles, Los Angeles, California, United States of America
aff010; Institute for Precision Health, University of California, Los Angeles, Los Angeles, California, United States of America
aff011; Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, California, United States of America
aff012; Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
aff013; Oncology Division, Centre de recherche du Centre hospitalier universitaire de Québec–Université Laval, Quebec City, Quebec, Canada
aff014; Centre de recherche sur le cancer, Université Laval, Quebec City, Quebec, Canada
aff015; Department of Surgery, Université Laval, Quebec City, Quebec, Canada
aff016
Published in the journal:
Identification of intraductal carcinoma of the prostate on tissue specimens using Raman micro-spectroscopy: A diagnostic accuracy case–control study with multicohort validation. PLoS Med 17(8): e32767. doi:10.1371/journal.pmed.1003281
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pmed.1003281
Summary
Background
Prostate cancer (PC) is the most frequently diagnosed cancer in North American men. Pathologists are in critical need of accurate biomarkers to characterize PC, particularly to confirm the presence of intraductal carcinoma of the prostate (IDC-P), an aggressive histopathological variant for which therapeutic options are now available. Our aim was to identify IDC-P with Raman micro-spectroscopy (RμS) and machine learning technology following a protocol suitable for routine clinical histopathology laboratories.
Methods and findings
We used RμS to differentiate IDC-P from PC, as well as PC and IDC-P from benign tissue on formalin-fixed paraffin-embedded first-line radical prostatectomy specimens (embedded in tissue microarrays [TMAs]) from 483 patients treated in 3 Canadian institutions between 1993 and 2013. The main measures were the presence or absence of IDC-P and of PC, regardless of the clinical outcomes. The median age at radical prostatectomy was 62 years. Most of the specimens from the first cohort (Centre hospitalier de l’Université de Montréal) were of Gleason score 3 + 3 = 6 (51%) while most of the specimens from the 2 other cohorts (University Health Network and Centre hospitalier universitaire de Québec–Université Laval) were of Gleason score 3 + 4 = 7 (51% and 52%, respectively). Most of the 483 patients were pT2 stage (44%–69%), and pT3a (22%–49%) was more frequent than pT3b (9%–12%). To investigate the prostate tissue of each patient, 2 consecutive sections of each TMA block were cut. The first section was transferred onto a glass slide to perform immunohistochemistry with H&E counterstaining for cell identification. The second section was placed on an aluminum slide, dewaxed, and then used to acquire an average of 7 Raman spectra per specimen (between 4 and 24 Raman spectra, 4 acquisitions/TMA core). Raman spectra of each cell type were then analyzed to retrieve tissue-specific molecular information and to generate classification models using machine learning technology. Models were trained and cross-validated using data from 1 institution. Accuracy, sensitivity, and specificity were 87% ± 5%, 86% ± 6%, and 89% ± 8%, respectively, to differentiate PC from benign tissue, and 95% ± 2%, 96% ± 4%, and 94% ± 2%, respectively, to differentiate IDC-P from PC. The trained models were then tested on Raman spectra from 2 independent institutions, reaching accuracies, sensitivities, and specificities of 84% and 86%, 84% and 87%, and 81% and 82%, respectively, to diagnose PC, and of 85% and 91%, 85% and 88%, and 86% and 93%, respectively, for the identification of IDC-P. IDC-P could further be differentiated from high-grade prostatic intraepithelial neoplasia (HGPIN), a pre-malignant intraductal proliferation that can be mistaken as IDC-P, with accuracies, sensitivities, and specificities > 95% in both training and testing cohorts. As we used stringent criteria to diagnose IDC-P, the main limitation of our study is the exclusion of borderline, difficult-to-classify lesions from our datasets.
Conclusions
In this study, we developed classification models for the analysis of RμS data to differentiate IDC-P, PC, and benign tissue, including HGPIN. RμS could be a next-generation histopathological technique used to reinforce the identification of high-risk PC patients and lead to more precise diagnosis of IDC-P.
Keywords:
Cancer detection and diagnosis – biomarkers – machine learning – Lesions – Lymphocytes – Pathologists – prostate cancer – Prostate gland
Introduction
Prostate cancer (PC) is the most common cancer in North American men, and the second leading cause of death by cancer in men in the United States [1]. Diagnostics by pathologists involve visualizing hematoxylin and eosin (H&E)–stained 4-μm-thick tissue sections under the microscope, but there is a lack of reliable biomarkers to accurately characterize PC to ensure that precision medicine can benefit affected men [2]. Importantly, there are no clinically implemented biomarkers for the identification of intraductal carcinoma of the prostate (IDC-P), an aggressive variant of PC. In the vast majority of IDC-P cases, IDC-P occurs in combination with usual, invasive PC, and it is identified in approximately 20% of PC cases [3]. Given its consistent association with PC recurrence, PC metastasis, and PC-specific death, the precise reporting of IDC-P is of the utmost importance [3,4]. Molecular investigation of tumors or tumor regions with and without IDC-P have identified probabilistic differences in the frequency of different driver genes [5], in transcriptional and epigenomic profiles [6], in their microenvironment [7], in tumor evolutionary features [8], and in their visibility to multi-parametric magnetic resonance imaging [9]. However, once the intraductal nature of a prostatic lesion has been established, only morphological criteria can currently be used to diagnose IDC-P, resulting in reports of low interobserver concordance [10,11]. Importantly, IDC-P can be mistaken for other intraductal proliferations such as high-grade prostatic intraepithelial neoplasia (HGPIN), and vice versa [10,11]. These misinterpretations crucially affect the care of men with PC, as HGPIN and IDC-P are associated with opposite clinical significance, HGPIN being presumed to be a precursor of PC [11]. Biomarkers are available to solve this diagnostic pitfall, including phosphatase and tensin homolog (PTEN) loss of expression and ETS transcription factor ERG overexpression (both detected by immunohistochemistry [IHC]). However, these biomarkers have low sensitivity (60%–75%) and are thus not used frequently by genitourinary pathologists [12]. Reliable biomarkers of IDC-P with high sensitivity and specificity (>85%) would thus help reinforce the identification of such high-risk patients and lead to more appropriate patient management, ensuring the therapies are in line with IDC-P status [13–16].
Apart from standard molecular pathology techniques, tissue characterization methods have evolved with the use of optical microscopy, lately with a steep increase in data acquisition and data analysis capacities [17–21]. Among optical microscopy techniques, confocal Raman micro-spectroscopy (RμS) measures light scattering resulting from interactions with specific molecular bonds (among others, in proteins, lipids, DNA, and RNA), allowing for the global molecular characterization of a specimen [22]. The first RμS spectrum of normal prostate tissue was reported by Stone et al. in 2002, describing spectra acquired from snap-frozen tissues [23]. Subsequently, other groups reported the capacity to distinguish the different zones of the prostate as well as different prostatic cell lines, whether benign or malignant, with high sensitivity and specificity [24–26]. RμS has also been used on human tissues to successfully predict the occurrence of end-stage PC (i.e., castration-resistant PC) as well as to determine PC grade [27–30]. Reported results demonstrated the potential of RμS to identify PC; however, to our knowledge the technique has not been used to characterize subtypes of PC such as IDC-P.
Previous RμS studies were performed using snap-frozen samples, formalin-fixed paraffin-embedded (FFPE) tissues, or cytospin preparations of cell lines, deposited on expensive substrates such as CaF2, quartz, and gold-coated glass. Importantly, when performing a clinical diagnosis of PC, no tissue is available for snap freezing outside research purposes [31]. The previously reported RμS protocols for FFPE samples also involved tedious sample preparation, such as long dewaxing procedures or thick tissue sections. Those are key issues limiting clinical implementation of RμS.
As the currently available biomarkers of IDC-P are not sufficiently robust to be of clinical relevance and as RμS has been previously used to identify PC, but not IDC-P, we hypothesized that RμS could be used as a diagnostic biomarker of IDC-P. To investigate the central issue of identifying biomarkers of IDC-P, we developed a FFPE tissue slide preparation protocol that mirrors standard hospital procedures to facilitate clinical implementation of RμS for the characterization of PC. We then conducted a RμS study aimed at differentiating PC from benign prostate epithelium, as well as differentiating IDC-P from PC and benign prostatic epithelium, including HGPIN. This was achieved in FFPE radical prostatectomy specimens from 483 patients from 3 Canadian institutions.
Methods
Study overview
We conducted this study to present RμS as a promising, ancillary technique that could be integrated into the pathological workflow (Figs 1, S1 and S2). Tissue samples from 483 PC patients from 3 different institutions were studied: Centre hospitalier de l’Université de Montréal (CHUM), University Health Network (UHN), and Centre hospitalier universitaire de Québec–Université Laval (CHUQc-UL). FFPE tissue microarrays (TMAs) were used to allow high-throughput RμS acquisitions (S3 Fig). Two adjacent sections of each TMA block were cut. The first section was transferred onto a glass slide to perform IHC to detect AMACR/p63/34BE12 with H&E counterstaining [32]. The second section was placed on an aluminum slide with low Raman activity (Miro5011, Anomet, Brampton, ON, Canada). TMA sections on aluminum slides were dewaxed for 8 minutes according to the CHUM standard clinical dewaxing protocol. Briefly, slides were agitated for 1 minute in each bath: 2 xylene substitute baths (VWR, Radnor, PA, US), 3 100% ethanol baths (Alcools de Commerce, Boucherville, QC, Canada), and 3 distilled water baths. A vacuum dryer was used for 20 minutes to avoid residual water on the slides prior to RμS measurements, which were performed without any additional tissue processing.
IHC H&E-stained slides were digitized using a Nanozoomer Digital Pathology slide scanner (Hamamatsu, Bridgewater, NJ, US) before identification of lymphocytes, benign prostatic tissue, PC tissue, IDC-P, and HGPIN on the IHC-stained slide by 5 observers (AAG, TN, MB, JW, and FA, under the supervision of DT). A second pathologist (RA) confirmed the presence of IDC-P on targeted cores, ensuring all cases with morphological characteristics that failed diagnostic criteria were interpreted as negative for IDC-P [33].
Each acquisition using a Raman micro-spectrometer lasted for 50 seconds. All acquisitions were supervised by a research assistant specialized in RμS (MB). After all acquisitions were completed, support vector machine (SVM) classification models were trained with data from the CHUM cohort. In an SVM model, spectra are represented as points in a high-dimensional space where each dimension corresponds to a feature, i.e., a spectral wavelength shift associated with a distribution of Raman signal intensities. Support vectors are the basic entities or parameters computed by the algorithm to classify between different tissue classes. As support vectors are actual data points in the N-dimensional feature space, they have a geometrical interpretation with respect to the decision boundary, i.e., a multidimensional plane (a hyperplane with N − 1 dimensions), that separates the 2 targeted tissue classes. The SVM algorithm finds the optimal decision boundary, corresponding to the hyperplane that best separates the data into 2 classes within the feature space, by maximizing a loss function that depends on the geometrical distances of all data points to the decision boundary (S2 Fig). The SVM algorithm takes as input the following parameters, or hyperparameters: (i) the regularization parameter C, (ii) the kernel function (linear or Gaussian), and (iii) the kernel coefficient γ (see “Statistical analysis and tissue classification”). The SVM statistical model was then applied to 2 independent testing cohorts (UHN and CHUQc-UL). Our results determined the accuracy, sensitivity, and specificity of our statistical models to identify each cell and tissue type in PC patient samples. This study is reported as per diagnostic studies guidelines (STARD checklist; S1 Table).
For this study, a prospective analysis plan was constructed in 2015 as part of an internal grant covering RμS analysis of PC as well as other optical analysis of PC. Based on team discussions and external reviews, the analysis plan was reviewed in early 2019 to include a comparison between HGPIN and IDC-P. In 2020, following peer review comments, the revision of IDC-P by a second pathologist (RA) as well as confusion matrices analysis were also added.
Human tissue samples
This multi-institutional retrospective study included a total of 483 PC patients and was approved by the CHUM ethics review board (15.107), after approval of the construction of the TMAs by local ethics review boards. All patients signed an informed consent allowing for the use of their prostate tissue samples in research. The TMAs from CHUM, UHN, and CHUQc-UL include patients treated by first-line radical prostatectomy and recruited from January 1, 1993, to December 31, 2013. FFPE PC tissues from surgery (radical prostatectomies) were used for the construction of TMAs, either by random selection of PC tissue (CHUM) or by targeted selection of representative grades (UHN and CHUQc-UL). All TMAs included benign tissues (tissue within normal, non-tumor range) from the radical prostatectomies performed to treat PC.
Raman micro-spectroscopy
All Raman spectra were acquired using a Renishaw inVia confocal Raman microscope (Renishaw, Gloucestershire, UK) equipped with a 785-nm line focus laser. Each acquisition (4 acquisitions/TMA core) lasted 50 seconds (i.e., 5 accumulations of 10 seconds) with 150-mW laser output power using the 50× short working distance objective of the microscope (numerical aperture = 0.75). A rectangular area of 24 μm2 (8 μm × 3 μm, approximately corresponding to single-cell analysis) within cell-rich tissue, whether PC cells, IDC-P cells, benign epithelial cells, or lymphocytes, was targeted at each acquisition. Recognizable structures such as glandular lumens or the cobbly surface of cancer cell sheets were used to aim the laser on cells rather than extracellular matrix or areas without tissue such as glandular lumens. A grating of 1,200 lines/mm allowed the visualization of Raman shifts between 602 and 1,726 cm−1. Spatial registration of Raman spectra with the IHC H&E-stained slide ensured acquisitions were performed at the exact location of the cells from the tissues of interest. As aluminum slides are inevitably streaked, the final white-light tissue images on aluminum slides were processed with a filter in the frequency domain to avoid streak visualization. All Raman spectra files are available from the Dryad Digital Repository database [34].
Statistical analysis and tissue classification
Aluminum background and intrinsic tissue fluorescence in the spectra were removed from the raw spectrum with the rolling ball algorithm [35]. As the resulting Raman spectra consisted of more than 1,000 spectral wavelengths shifts with a resolution of approximately 1.1 cm−1, a dimensional reduction procedure based on a linear SVM with L1 regularization was used prior to producing the classification models. This procedure allowed preselection of only those features (individual intensity values within a spectrum) that were most relevant in distinguishing tissue classes. The regularization method that was used, known as Lasso regression, assigns a weight to each feature within an optimization function (i.e., loss function) and gives a non-0 weight only to features that contribute significantly to establishing a decision boundary (S2 Fig).
Although SVM classifiers are linear classifiers, they can also be implemented using a nonlinear kernel function mapping the original data to another high-dimensional space, allowing improved classification performance by capturing more complex (i.e., nonlinear) attributes of the data. Here, the method used to produce the classification models from the preselected feature set was an SVM with a nonlinear Gaussian kernel. Prior to submitting the data to the classification algorithm [36], either for training or testing, each feature set underwent a standardization so that individual features had a mean of 0 and a unit variance.
Hyperparameters for the feature selection step and the subsequent classification model development were selected by performing a grid search. For the feature selection step, an SVM with a linear kernel was used with a regularization parameter C varying between 0.05 and 0.5, with larger values corresponding to more features being retained. For the development of the classification models (using only the preselected features), the regularization parameter C of the Gaussian SVM varied between 0.1 and 1,000, effectively acting as a penalty term for misclassified points. The kernel coefficient γ, which defines the variance of the Gaussian kernel, was varied between 10−4 and 10−1.
For each combination of hyperparameters (C and γ), the performance was assessed through 5-fold cross-validation. For this procedure, the training dataset was split into 5 nonoverlapping subsets. Each individual subset was used as a validation set while the other 4 were used to train a model, to assess the performance associated with a combination of hyperparameters. Cross-validation predictive performance was computed by averaging accuracy, sensitivity, and specificity across all folds, and the standard deviation was reported as modeling uncertainty. The model selected for testing was the one associated with hyperparameters yielding optimal classification. This final model was trained on the complete CHUM cohort and then tested on the UHN and CHUQc-UL cohorts. Training and testing performances were assessed through receiver operating characteristic (ROC) curves in which sensitivity and specificity were optimized by selecting the point with the minimal distance to the upper left corner (S5 Fig).
Results
RμS on TMAs
Three independent PC patient cohorts from Canadian hospitals were analyzed (Table 1). TMAs with core diameters ranging from 0.6 mm to 1.2 mm were assembled beforehand to ensure that all samples fit onto a small number of slides, to increase imaging throughput (S3 Fig) [37,38]. All Raman spectra were acquired using a confocal Raman microscope (inVia model, Renishaw, Gloucestershire, UK) equipped with a 785-nm line focus laser with an output power of 150 mW. Time for each acquisition was 50 seconds (5 accumulations of 10 seconds); the total time required for tissue processing and Raman spectra acquisition was <90 minutes for 1 patient. This was significantly shorter than other ancillary histopathology techniques, e.g., ~24 hours for IHC and several days for gene sequencing. Following a series of preprocessing steps including background removal, the Raman spectra were used to create classification models trained on the CHUM cohort and independently tested on the UHN and CHUQc-UL cohorts. Testing the model on data that were not used at any stage of the training phase ensures better clinical validity compared to internal validation strategies or the use of separate cohorts from a single institution for training and testing. Overall, 4 classification models were produced: The first identified lymphocyte clusters within prostate tissue, a classification that is reliably performed by pathologists; the second distinguished benign and malignant prostate epithelial cells to ensure recognition of cells from the same lineage; the third distinguished IDC-P from invasive carcinomas; and the fourth distinguished HGPIN from IDC-P for an accurate identification of the intraductal proliferation. Specifically, this protocol was developed to complement conventional pathology analyses in the identification of IDC-P (Figs 1, S1 and S2).
Identification of different cell types in PC tissue by RμS
Following the development of classification models for the detection of lymphocytes, 3 other classification models were developed to characterize PC and IDC-P: (i) benign versus cancer, (ii) IDC-P versus cancer, and (iii) HGPIN versus IDC-P. The potential of RμS to accurately differentiate cell types was quantified using a machine learning technique using as input (from the feature selection algorithm, see Methods) the Raman peaks contributing the most to the variability between different classes. As differentiation between lymphocytes and PC cells is straightforward in histopathology, we first tested the development of a RμS classification model to confirm the capacities of the system. This model could differentiate lymphocytes from PC cells with cross-validation accuracy, sensitivity, and specificity of 98%, 99%, and 98%, respectively (Methods; S2 Table). Classification performances were similar when testing the model on 2 independent cohorts. The peaks contributing the most to the classification models were assigned to vibrational modes and biochemical constituents (S4 Fig; S3 Table) [24–28,39,40].
Machine learning/feature selection was then applied to distinguish benign prostate epithelial cells from PC cells. The diagnosis was determined by 1 pathologist (DT) on slides stained with antibodies targeting AMACR/p63/34BE12 to distinguish PC from benign glands [41] prior to RμS acquisitions on adjacent tissue sections (Fig 2A). The CHUM cohort, comprising 99 patients with benign prostatic tissue (400 spectra) and 272 patients with PC tissue (1,088 spectra), was used as the training set for the classification model. The Raman spectra for all CHUM patients were classified with an accuracy, sensitivity, and specificity of 87% ± 5%, 86% ± 6%, and 89% ± 8%, respectively (Table 2; S7A Fig). The model was then applied to the UHN cohort including 49 patients with benign prostatic tissue (196 spectra) and 76 patients with PC tissue (818 spectra). Performance on this testing cohort was comparable to that of the training dataset (CHUM) with an accuracy, sensitivity, and specificity of 84%, 84%, and 82%, respectively (S7B Fig). In the other testing cohort (CHUQc-UL), with 68 patients with benign prostatic tissue (272 spectra) and 135 patients with PC tissue (1,450 spectra), accuracy, sensitivity, and specificity were 86%, 87%, and 81%, respectively (S7C Fig).
We identified 32 important Raman spectral differences between benign and malignant prostate tissue, and those features were used to produce the machine learning classification models. From these, the 10 Raman peaks contributing the most to the classification of benign and cancer tissues were identified (Fig 2B; Table 3) [24–28,39,40]. The peaks at 1,450 cm−1 and 1,484 cm−1 were significantly increased in the average Raman spectrum of PC tissue compared to benign prostate tissue. Biochemical constituents assigned to these peaks were mostly from DNA and RNA, as well as from the backbone of proteins and from lipids. All other biochemical components of PC tissue identified by RμS were decreased compared to benign prostate tissue. More specifically, the nucleobase adenine from DNA and RNA, and the amino acids proline, tyrosine, valine, phenylalanine, and tryptophan were reduced in the average Raman spectrum of PC tissue.
RμS as a biomarker of IDC-P
To train our statistical model, we used the cohort from CHUM. A total of 15 patients (17 cores) were identified with IDC-P (Fig 3A). When IDC-P was compared to adjacent invasive PC from the same core, RμS could not differentiate IDC-P from PC (in the UHN cohort, sensitivity was of 21%). Since several histopathological studies have reported that adjacent PC is similar to IDC-P [42–46], we combined the spectra of both regions of the prostate tissue, for a total of 112 spectra. We used the average Raman spectrum of each core with IDC-P and compared this to the average Raman spectrum of each PC core without IDC-P (272 patients, 1,088 spectra). The classification using machine learning (cross-validation) was performed, achieving an accuracy, sensitivity, and specificity of 95% ± 2%, 96% ± 4%, and 94% ± 2%, respectively (Table 2; S7D Fig). We then used our 2 other cohorts to test this classification model. From the UHN cohort, 14 patients were identified with IDC-P and adjacent PC on at least 1 TMA core. We acquired 139 Raman spectra from the 22 TMA cores from the 14 patients. For the classification, these spectra were compared to 767 PC spectra of TMA cores without IDC-P from 71 patients. Performances were comparable to cross-validation results from the training set, with an accuracy, sensitivity, and specificity of 91%, 88%, and 93%, respectively (S7E Fig). In the CHUQc-UL cohort, 9 patients (16 cores) with IDC-P were studied, and from these cores, we acquired 104 IDC-P spectra. PC from TMA cores without IDC-P from 93 patients was investigated, leading to 1,017 PC Raman spectra. The identification of IDC-P with the machine learning model using this cohort was performed with an accuracy, sensitivity, and specificity of 85%, 85%, and 86%, respectively (S7F Fig).
By comparing usual invasive PC to IDC-P, the feature selection algorithm retrieved 92 features from the training cohort. As when distinguishing benign versus cancer tissue, we identified the 10 most important Raman peaks that were used by machine learning (Fig 3B; Table 4) [24–28,39,40]. Specifically, the DNA and RNA backbone were increased in IDC-P compared to invasive PC. For proteins, α-helix and β-sheet secondary structures, specifically for the amide III peak, were more intense in the average Raman spectrum of IDC-P. Importantly, 3 features were associated with the amino acid tyrosine peak at 1,171 cm−1. Other amino acids (i.e., proline, valine, and phenylalanine) were identified mostly in IDC-P. We also observed a decrease in a few biochemical constituents in this aggressive variant of PC: the amino acid tryptophan (759 cm−1 peak), the nucleobase guanine, fatty acids, and the amide I peak from the protein α-helix (1,667 cm−1 peak).
We then tested the capacity of RμS to distinguish 2 intraductal proliferations, HGPIN and IDC-P (Fig 4A). The Raman spectra for all CHUM patients were classified (training, cross-validation) with an accuracy, sensitivity, and specificity of 97.5% ± 1.4%, 98.2% ± 1.5%, and 97.1% ± 1.4%, respectively (Table 2; S7G Fig). The model was then used on the UHN cohort, including 9 patients with HGPIN (23 spectra), and on the CHUQc-UL cohort, including 13 patients with HGPIN (30 spectra). Performance on testing cohorts was comparable to that of the training dataset (CHUM), with an accuracy of 97.8%–98.3%, a sensitivity of 95.5%–96.4%, and a specificity of 100% (S7H and S7I Fig).
For HGPIN and IDC-P, a total of 19 features were selected by the model. From these, we analyzed 5 Raman peaks that were the dominant contributors to the classification (Fig 4B; Table 5). The biochemical constituent predominantly found in HGPIN was adenine from DNA and RNA. Proteins were increased in IDC-P compared to HGPIN, more specifically the amino acid phenylalanine (1,003 cm−1 peak) and the β-sheet secondary structure (1,242 cm−1 peak).
Discussion
Beyond distinguishing between PC and benign prostatic tissue with accuracy ≥84%, RμS indicated the presence of IDC-P within the diagnostic FFPE prostatic tissue with an accuracy of at least 85% throughout the 3 studied independent cohorts. Importantly, IDC-P could also be distinguished from HGPIN with accuracy >97%.
A precise diagnosis of IDC-P is a challenge for genitourinary pathologists [10,11], especially since no specific biomarker is clinically available to reliably identify this aggressive histological variant of PC. Indeed, once a lesion has been confirmed to be intraductal on a prostate biopsy, the most common biomarkers of IDC-P, ERG overexpression and PTEN loss, are used only by 25% of all genitourinary pathologists [12]. Here we showed RμS combined with machine learning technology could be used as a specific molecular biomarker of IDC-P, results that are in line with the described capacity to identify PC in FFPE and snap-frozen samples [24–26]. Interestingly, from the Raman peaks that were associated with IDC-P (Table 4), 2 were also associated with end-stage, castration-resistant PC by Wang et al. (1,171 cm−1 and 1,247 cm−1) [27]. As the association between IDC-P and castration-resistant PC is well established [15,47,48], we believe these results support the value of our classification models.
To ensure maximal clinical validity, we studied 3 nonoverlapping cohorts, composed of 76 to 272 PC patients, from different institutions to independently train and test our machine learning classification models. This study on RμS includes large cohorts of patients from different institutions, with stringent morphological and immunohistochemical classification of the lesions. Other groups have conducted PC RμS studies on only 1 patient or on single-center cohorts composed of a maximum of 50 patients [23–30,49]. Studying a large group of men with PC from different institutions paves the way to the use of Raman spectra as a biomarker of IDC-P.
In addition to showing the capacity to detect a diagnostic signature for a histological variant of PC, our RμS protocol is fully compatible with the standard clinical histopathological workflow. First, in contrast to other published protocols using fresh or frozen tissues, our analyses were performed using FFPE tissues. Because PC is most often visually undetectable and impalpable, the entirety of prostate specimens examined in the specific context of PC diagnosis are FFPE in toto, i.e., no tissue is available for snap freezing outside research purposes [31]. Importantly, blindly harvesting tissue to eventually perform RμS on snap-frozen tissue—a method that is adopted for other organ systems instead of FFPE-based evaluations—could lead to analyzing a tissue devoid of PC or, even worse, underestimating disease severity if significant portions of the tumor are unavailable for routine H&E evaluation.
Second, we developed our protocol to enable smooth implementation into a clinical laboratory setting without disruption of routine service. Cut at the same thickness as standard tissue sections (4 μm) and dewaxed using our regular routine dewaxing protocol, RμS tissue sections can be treated using the same apparatus at the same settings as the vast majority of the tissue sections processed in a histopathology laboratory. The aluminum slides we used were of the same size as standard microscope glass slides and compatible with all chemicals used to dewax/prepare the sample for RμS and are in addition inexpensive. Tissue sections do not need further labeling before RμS acquisitions. Classification models can also be applied without technical engineering assistance with only basic training in the use of a RμS microscope. The cost of a Raman confocal microscope is in the same range as the cost of an automated immunostainer to perform IHC, and a Raman confocal microscope is also smaller than an automated autostainer. Therefore, a laboratory setup in which some slides are analyzed by standard molecular pathology techniques (e.g., IHC) while others are sent for RμS evaluation can be envisioned with minimal disruption of the standard clinicopathological workflow.
As our results involve large-scale validation and the use of a clinically implementable slide preparation protocol, the inclusion of lesions that fall short of the diagnostic criteria of IDC-P would have improved the clinical significance of our results. Indeed, these difficult-to-classify lesions are the lesions for which pathologists are expected to use RμS. However, as IDC-P was present in 6% to 18% of the investigated patients, as expected by the small diameter of the TMA cores and the limited size of IDC-P within PC [50], the number of borderline, difficult-to-classify lesions in our TMA set was insufficient for proper classification. Importantly, the RμS algorithms we designed provide a quantitative evaluation of the probability of a diagnosis (e.g., IDC-P, 75% probability), therefore decreasing the impact of this limitation.
Before clinical implementation, in which RμS could be used similarly to IHC (Figs 1, S1 and S2), next steps for research include, among others, validation in cohorts using material entirely processed in each center (from tissue fixation to slide preparation to RμS acquisitions) to ensure reproducibility of all the steps of the protocol regardless of the laboratory. Importantly, a thorough validation of the use of RμS in different disease classifications and organ types [51] will also facilitate the clinical implementation of RμS, by maximizing the use of the Raman microscopes. Moreover, other RμS modalities such as surface-enhanced Raman spectroscopy (SERS) or coherent anti-Stokes Raman spectroscopy (CARS) could be tested to improve the acquisition speed (allowing one to analyze larger portions of the specimens) and/or to modify the substrate on which RμS is performed to allow the use of glass slides. Altogether, RμS is a promising tool for histopathological ancillary studies, but further large-scale, multicenter studies are needed before actual clinical implementation.
The extent of similarity between IDC-P and immediately adjacent PC will also have to be investigated. Indeed, when IDC-P was present in a core, the Raman spectra from PC and IDC-P on the core were indistinguishable, despite their different localization with respect to prostatic ducts (inside for IDC-P, outside for PC). We thus combined the spectra from IDC-P and from the adjacent invasive PC tissue from the same core. As IDC-P and immediately adjacent PC have been shown to have similar expression of biomarkers such as ERG and PTEN [42–46], this similarity is not unexpected. However, beyond the scientific phylogeny questions raised by these similarities, from a technical standpoint, it will be of tremendous importance to characterize the “Raman-identical” zone around IDC-P. Among other factors, the size of this zone is likely to define the needed precision when evaluating an intraductal lesion.
Overall, we provided a large study of the use of RμS to detect PC and IDC-P in 3 independent cohorts of men with PC. Our results are not only in line with the current literature associating the important Raman features of IDC-P with the development of castration-resistant PC, but they also provide solid evidence to pursue the clinical implementation of RμS as an ancillary technique to refine the diagnosis of PC. In perspective, a prospective study on fully annotated specimens, including difficult-to-classify lesions, will ensure the transition from the testing of research TMAs to clinical workflow.
Supporting information
S1 Fig [s]
RμS workflow.
S2 Fig [tif]
Machine learning workflow.
S3 Fig [tif]
Prostate cancer tissue microarray.
S4 Fig [a]
Identification of lymphocyte clusters in PC tissue by RμS.
S5 Fig [roc]
Receiver operating characteristic curves.
S6 Fig [a]
Average spectra and respective variance.
S7 Fig [g]
Confusion matrices.
S1 Table [docx]
The STARD checklist.
S2 Table [docx]
Classification performance when distinguishing lymphocyte clusters and PC in training and testing cohorts.
S3 Table [docx]
Most important features used for the classification of lymphocytes and cancer within prostate tissue and their associated Raman peaks.
Zdroje
1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020;70:7–30. doi: 10.3322/caac.21590 31912902
2. Nuhn P, De Bono JS, Fizazi K, Freedland SJ, Grilli M, Kantoff PW, et al. Update on systemic prostate cancer therapies: management of metastatic castration-resistant prostate cancer in the era of precision oncology. Eur Urol. 2019;75:88–99. doi: 10.1016/j.eururo.2018.03.028 29673712
3. Chua MLK, van der Kwast TH, Bristow RG. Intraductal carcinoma of the prostate: anonymous to ominous. Eur Urol. 2017;72:496–8. doi: 10.1016/j.eururo.2017.04.004 28416351
4. Egevad L, Judge M, Delahunt B, Humphrey PA, Kristiansen G, Oxley J, et al. Dataset for the reporting of prostate carcinoma in core needle biopsy and transurethral resection and enucleation specimens: recommendations from the International Collaboration on Cancer Reporting (ICCR). Pathology. 2019;51:11–20. doi: 10.1016/j.pathol.2018.10.003 30477882
5. Bottcher R, Kweldam CF, Livingstone J, Lalonde E, Yamaguchi TN, Huang V, et al. Cribriform and intraductal prostate cancer are associated with increased genomic instability and distinct genomic alterations. BMC Cancer. 2018;18:8. doi: 10.1186/s12885-017-3976-z 29295717
6. Elfandy H, Armenia J, Pederzoli F, Pullman E, Pertega-Gomes N, Schultz N, et al. Genetic and epigenetic determinants of aggressiveness in cribriform carcinoma of the prostate. Mol Cancer Res. 2019;17:446–56. doi: 10.1158/1541-7786.MCR-18-0440 30333152
7. Bhandari V, Hoey C, Liu LY, Lalonde E, Ray J, Livingstone J, et al. Molecular landmarks of tumor hypoxia across cancer types. Nat Genet. 2019;51:308–18. doi: 10.1038/s41588-018-0318-2 30643250
8. Taylor RA, Fraser M, Livingstone J, Espiritu SM, Thorne H, Huang V, et al. Germline BRCA2 mutations drive prostate cancers with distinct evolutionary trajectories. Nat Commun. 2017;8:13671. doi: 10.1038/ncomms13671 28067867
9. Houlahan KE, Salmasi A, Sadun TY, Pooli A, Felker ER, Livingstone J, et al. Molecular hallmarks of multiparametric magnetic resonance imaging visibility in prostate cancer. Eur Urol. 2019;76:18–23. doi: 10.1016/j.eururo.2018.12.036 30685078
10. Iczkowski KA, Egevad L, Ma J, Harding-Jackson N, Algaba F, Billis A, et al. Intraductal carcinoma of the prostate: interobserver reproducibility survey of 39 urologic pathologists. Ann Diagn Pathol. 2014;18:333–42. doi: 10.1016/j.anndiagpath.2014.08.010 25263387
11. Humphrey PA, Moch H, Cubilla AL, Ulbright TM, Reuter VE. The 2016 WHO classification of tumours of the urinary system and male genital organs—part B: prostate and bladder tumours. Eur Urol. 2016;70:106–19. doi: 10.1016/j.eururo.2016.02.028 26996659
12. Gandhi JS, Smith SC, Paner GP, McKenney JK, Sekhri R, Osunkoya AO, et al. Reporting practices and resource utilization in the era of intraductal carcinoma of the prostate: a survey of genitourinary subspecialists. Am J Surg Pathol. 2020;44:673–80. doi: 10.1097/PAS.0000000000001417 31876580
13. Porter LH, Lawrence MG, Ilic D, Clouston D, Bolton DM, Frydenberg M, et al. Systematic review links the prevalence of intraductal carcinoma of the prostate to prostate cancer risk categories. Eur Urol. 2017;72:492–5. doi: 10.1016/j.eururo.2017.03.013 28342640
14. Yamamoto A, Kato M, Tsuzuki T, Gotoh M, Matsukawa Y, Sassa N, et al. Effect of chemotherapy on survival for intraductal carcinoma of the prostate (IDC-P) in patients with castration-resistant prostate cancer. J Clin Oncol. 2017;35:e601.
15. Porter LH, Hashimoto K, Lawrence MG, Pezaro C, Clouston D, Wang H, et al. Intraductal carcinoma of the prostate can evade androgen deprivation, with emergence of castrate-tolerant cells. BJU Int. 2018;121:971–8. doi: 10.1111/bju.14043 28977728
16. Trinh VQ, Benzerdjeb N, Chagnon-Monarque S, Dionne N, Delouya G, Kougioumoutzakis A, et al. Retrospective study on the benefit of adjuvant radiotherapy in men with intraductal carcinoma of prostate. Radiat Oncol. 2019;14:60. doi: 10.1186/s13014-019-1267-3 31018850
17. Cheng JX, Xie XS. Vibrational spectroscopic imaging of living systems: an emerging platform for biology and medicine. Science. 2015;350:aaa8870. doi: 10.1126/science.aaa8870 26612955
18. Gardecki JA, Singh K, Wu CL, Tearney GJ. Imaging the human prostate gland using 1–mum-resolution optical coherence tomography. Arch Pathol Lab Med. 2019;143:314–8. doi: 10.5858/arpa.2018-0135-OA 30550349
19. Kallepitis C, Bergholt MS, Mazo MM, Leonardo V, Skaalure SC, Maynard SA, et al. Quantitative volumetric Raman imaging of three dimensional cell cultures. Nat Commun. 2017;8:14843. doi: 10.1038/ncomms14843 28327660
20. Lu FK, Basu S, Igras V, Hoang MP, Ji M, Fu D, et al. Label-free DNA imaging in vivo with stimulated Raman scattering microscopy. Proc Natl Acad Sci U S A. 2015;112:11624–9. doi: 10.1073/pnas.1515121112 26324899
21. Wei M, Shi L, Shen Y, Zhao Z, Guzman A, Kaufman LJ, et al. Volumetric chemical imaging by clearing-enhanced stimulated Raman scattering microscopy. Proc Natl Acad Sci U S A. 2019;116:6608–17. doi: 10.1073/pnas.1813044116 30872474
22. Butler HJ, Ashton L, Bird B, Cinque G, Curtis K, Dorney J, et al. Using Raman spectroscopy to characterize biological materials. Nat Protoc. 2016;11:664–87. doi: 10.1038/nprot.2016.036 26963630
23. Stone N, Kendall C, Shepherd N, Crow P, Barr H. Near-infrared Raman spectroscopy for the classification of epithelial pre-cancers and cancers. J Raman Spectrosc. 2002;33:564–73.
24. Patel II, Martin FL. Discrimination of zone-specific spectral signatures in normal human prostate using Raman spectroscopy. Analyst. 2010;135:3060–9. doi: 10.1039/c0an00518e 20949203
25. Taleb A, Diamond J, McGarvey JJ, Beattie JR, Toland C, Hamilton PW. Raman microscopy for the chemometric analysis of tumor cells. J Phys Chem B. 2006;110:19625–31. doi: 10.1021/jp061981q 17004830
26. Crow P, Barrass B, Kendall C, Hart-Prieto M, Wright M, Persad R, et al. The use of Raman spectroscopy to differentiate between different prostatic adenocarcinoma cell lines. Br J Cancer. 2005;92:2166–70. doi: 10.1038/sj.bjc.6602638 15928665
27. Wang L, He D, Zeng J, Guan Z, Dang Q, Wang X, et al. Raman spectroscopy, a potential tool in diagnosis and prognosis of castration-resistant prostate cancer. J Biomed Opt. 2013;18:87001. doi: 10.1117/1.JBO.18.8.087001 23907278
28. Devpura S, Thakur JS, Sarkar FH, Sakr WA, Naik VM, Naik R. Detection of benign epithelia, prostatic intraepithelial neoplasia, and cancer regions in radical prostatectomy tissues using Raman spectroscopy. Vib Spectrosc. 2010;53:227–32.
29. Tollefson M, Magera J, Sebo T, Cohen J, Drauch A, Maier J, et al. Raman spectral imaging of prostate cancer: can Raman molecular imaging be used to augment standard histopathology? BJU Int. 2010;106:484–8. doi: 10.1111/j.1464-410X.2010.09185.x 20201840
30. Crow P, Stone N, Kendall CA, Uff JS, Farmer JA, Barr H, et al. The use of Raman spectroscopy to identify and grade prostatic adenocarcinoma in vitro. Br J Cancer. 2003;89:106–8. doi: 10.1038/sj.bjc.6601059 12838309
31. Samaratunga H, Montironi R, True L, Epstein JI, Griffiths DF, Humphrey PA, et al. International Society of Urological Pathology (ISUP) consensus conference on handling and staging of radical prostatectomy specimens. Working group 1: specimen handling. Mod Pathol. 2011;24:6–15. doi: 10.1038/modpathol.2010.178 20834234
32. Grosset AA, Loayza-Vega K, Adam-Granger E, Birlea M, Gilks B, Nguyen B, et al. Hematoxylin and eosin counterstaining protocol for immunohistochemistry interpretation and diagnosis. Appl Immunohistochem Mol Morphol. 2019;27:558–63. doi: 10.1097/PAI.0000000000000626 29271792
33. Hickman RA, Yu H, Li J, Kong M, Shah RB, Zhou M, et al. Atypical intraductal cribriform proliferations of the prostate exhibit similar molecular and clinicopathologic characteristics as intraductal carcinoma of the prostate. Am J Surg Pathol. 2017;41:550–6. doi: 10.1097/PAS.0000000000000794 28009609
34. Grosset AA, Dallaire F, Nguyen T, Birlea M, Wong J, Daoust F, et al. Data from: Identification of intraductal carcinoma of the prostate on tissue specimens using Raman micro-spectroscopy: a diagnostic accuracy case-control study with multicohort validation. Dryad. 2020 Jul 16. doi: 10.5061/dryad.cjsxksn3p
35. Perez-Pueyo R, Soneira MJ, Ruiz-Moreno S. Morphology-based automated baseline removal for Raman spectra of artistic pigments. Appl Spectrosc. 2010;64:595–600. doi: 10.1366/000370210791414281 20537226
36. Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20:273–97.
37. Kononen J, Bubendorf L, Kallioniemi A, Barlund M, Schraml P, Leighton S, et al. Tissue microarrays for high-throughput molecular profiling of tumor specimens. Nat Med. 1998;4:844–7. doi: 10.1038/nm0798-844 9662379
38. Datta MW, True LD, Nelson PS, Amin MB. The role of tissue microarrays in prostate cancer biomarker discovery. Adv Anat Pathol. 2007;14:408–18. doi: 10.1097/PAP.0b013e318155709a 18049130
39. Kast RE, Tucker SC, Killian K, Trexler M, Honn KV, Auner GW. Emerging technology: applications of Raman spectroscopy for prostate cancer. Cancer Metastasis Rev. 2014;33:673–93. doi: 10.1007/s10555-013-9489-6 24510129
40. Talari ACS, Movasaghi Z, Rehman S, Rehman IU. Raman spectroscopy of biological tissues. Appl Spectrosc Rev. 2015;50:46–111.
41. Epstein JI, Egevad L, Humphrey PA, Montironi R, Members of the ISUP Immunohistochemistry in Diagnostic Urologic Pathology Group. Best practices recommendations in the application of immunohistochemistry in the prostate: report from the International Society of Urologic Pathology consensus conference. Am J Surg Pathol. 2014;38:e6–19. doi: 10.1097/PAS.0000000000000238 25029122
42. Haffner MC, Weier C, Xu MM, Vaghasia A, Gurel B, Gumuskaya B, et al. Molecular evidence that invasive adenocarcinoma can mimic prostatic intraepithelial neoplasia (PIN) and intraductal carcinoma through retrograde glandular colonization. J Pathol. 2016;238:31–41. doi: 10.1002/path.4628 26331372
43. Lotan TL, Gumuskaya B, Rahimi H, Hicks JL, Iwata T, Robinson BD, et al. Cytoplasmic PTEN protein loss distinguishes intraductal carcinoma of the prostate from high-grade prostatic intraepithelial neoplasia. Mod Pathol. 2013;26:587–603. doi: 10.1038/modpathol.2012.201 23222491
44. Schneider TM, Osunkoya AO. ERG expression in intraductal carcinoma of the prostate: comparison with adjacent invasive prostatic adenocarcinoma. Mod Pathol. 2014;27:1174–8. doi: 10.1038/modpathol.2013.248 24406865
45. Morais CL, Han JS, Gordetsky J, Nagar MS, Anderson AE, Lee S, et al. Utility of PTEN and ERG immunostaining for distinguishing high-grade PIN from intraductal carcinoma of the prostate on needle biopsy. Am J Surg Pathol. 2015;39:169–78. doi: 10.1097/PAS.0000000000000348 25517949
46. Downes MR, Satturwar S, Trudel D, van der Kwast TH. Evaluation of ERG and PTEN protein expression in cribriform architecture prostate carcinomas. Pathol Res Pract. 2017;213:34–8. doi: 10.1016/j.prp.2016.10.007 27913052
47. Zhao T, Liao B, Yao J, Liu J, Huang R, Shen P, et al. Is there any prognostic impact of intraductal carcinoma of prostate in initial diagnosed aggressively metastatic prostate cancer? Prostate. 2015;75:225–32. doi: 10.1002/pros.22906 25307858
48. Chen Z, Chen N, Shen P, Gong J, Li X, Zhao T, et al. The presence and clinical implication of intraductal carcinoma of prostate in metastatic castration resistant prostate cancer. Prostate. 2015;75:1247–54. doi: 10.1002/pros.23005 25917338
49. Stone N, Hart Prieto MC, Crow P, Uff J, Ritchie AW. The use of Raman spectroscopy to provide an estimation of the gross biochemistry associated with urological pathologies. Anal Bioanal Chem. 2007;387:1657–68. doi: 10.1007/s00216-006-0937-9 17123068
50. Trudel D, Downes MR, Sykes J, Kron KJ, Trachtenberg J, van der Kwast TH. Prognostic impact of intraductal carcinoma and large cribriform carcinoma architecture after prostatectomy in a contemporary cohort. Eur J Cancer. 2014;50:1610–6. doi: 10.1016/j.ejca.2014.03.009 24703897
51. Jermyn M, Desroches J, Aubertin K, St-Arnaud K, Madore WJ, De Montigny E, et al. A review of Raman spectroscopy advances with an emphasis on clinical translation challenges in oncology. Phys Med Biol. 2016;61:R370–400. doi: 10.1088/0031-9155/61/23/R370 27804917
Článek vyšel v časopise
PLOS Medicine
2020 Číslo 8
- Čokoláda podávaná v malých dávkách neškodí. Vědecky prokázáno!
- MUDr. Václav Šmíd, Ph.D.: Jaterní fibróza a iniciální stadia cirhózy jsou potenciálně vratné stavy
- Esenciální fosfolipidy v podpůrné léčbě jaterní steatózy asociované s metabolickou dysfunkcí
- Prof. Jan Škrha: Metformin je bezpečný, ale je třeba jej bezpečně užívat a léčbu kontrolovat
- FDA varuje před selfmonitoringem cukru pomocí chytrých hodinek. Jak je to v Česku?
Nejčtenější v tomto čísle
- Social distancing to slow the US COVID-19 epidemic: Longitudinal pretest–posttest comparison group study
- Coming together to improve access to medicines: The genesis of the East African Community’s Medicines Regulatory Harmonization initiative
- Age and the association between apolipoprotein E genotype and Alzheimer disease: A cerebrospinal fluid biomarker–based case–control study
- A clinical algorithm for same-day HIV treatment initiation in settings with high TB symptom prevalence in South Africa: The SLATE II individually randomized clinical trial