1. Summary
With approximately 13,300 new cases every year, kidney cancer is the seventh-most-common type of cancer in the U.K. [1]. Clear cell renal cell carcinoma (ccRCC), a subtype of kidney cancer, whose name is derived from the appearance of its tumour cells under the microscope, is by far the most prevalent [2,3]. Studying its highly heterogeneous and vascularised tumour microenvironment (TME) is important for improving our understanding of the disease and its progression [4].
An important technique in clinical oncology and cancer research is the process of immunostaining, which facilitates the visualisation of various proteins in the cells of cancer tissue using artificial colouration [5] to distinguish between different cell types. Immunostaining assists pathologists in diagnosing cancer and deciding on treatment options [6,7,8]. Multiple immunofluorescence (mIF) allows different proteins to be visualised simultaneously by the enzymatic reaction between fluorescent-coated tyramide and horseradish peroxidase (HRP) [6,9,10]. In this work, we employed mIF with three different fluorophores to decorate ccRCC tissue sections for Hoechst 33342, cluster of differentiation 3 (CD3), and 29 cluster of differentiation 8 (CD8). The first is a widely used counterstaining fluorescent dye used to highlight cell nuclei [11], while the other two highlight specific cell subtypes: CD3 identifies T lymphocytes, and CD8 marks cytotoxic T lymphocytes.
Digitising whole slide images (WSIs) of tumour tissue as gigapixel images (typically around 100,000 × 100,000 pixels in size) has become an increasingly common practice in the last decade, not only in research, but also clinical settings [12]. The contemporaneous advent of deep learning, which flourishes with the availability of large amounts of data, has sparked leaps in the computer vision community. These advancements, combined with the availability of digital pathology images, pave the way towards developing automated methods for WSI analysis. Potential applications vary from slide-level tasks such as patient risk stratification [13,14], to specific image tasks such as detecting cellular subtypes and their spatial distribution [15,16,17]. In this setting, deep learning not only has the potential to help reduce the workload of pathologists, but also to alleviate inter-observer bias, which is a common problem in pathology [18,19].
In an effort to facilitate deep learning research in digital pathology, we present a dataset of ten WSIs of ccRCC tissue, alongside the corresponding clinical data. The fact that our images contain three channels of information (Hoechst 33342, CD3, and CD8) makes our dataset particularly well-suited to the task of virtual staining [20], where a deep learning model is tasked with translating from one type of stain to another. In other words, given an image of stain A, the model should produce an image that appears as if the tissue section had instead been stained with another stain B. Our dataset, which is available in the BioImage Archive (
2. Data Description
Our dataset consists of WSIs digitised from the tumour tissue of ten patients with ccRCC. The slides were sourced from the Pathology Archive in Lothian NHS (Ethics Reference 10/S1402/33). Using mIF, the slides were stained with Hoechst, CD3, and CD8 before being scanned at an objective of x40 on an Axioscan Zeiss scanner, resulting in a dataset of ten WSIs, each with three channels (Hoechst, CD3, and CD8).
We present the slides in two different formats: as raw WSIs and as preprocessed non-overlapping image patches of size 256 × 256 pixels covering the entire tissue region of the WSI. Furthermore, we provide the associated patients’ clinical data in CSV format.
2.1. Raw Whole-Slide Images
We supply all ten WSIs in CZI format named according to the following convention:
2.2. Preprocessed Image Patches
The
Listing 1. Structure of the JSON file accompanying each patch. |
In addition to the self-explanatory metadata fields referencing the original WSI file and patch coordinates, there is a field named
In total, the dataset consists of 627,519 non-overlapping patches. The 256 × 256 pixel patches under magnification correspond to a physical size of about 58 × 58 m. Statistics on the representation of each cell type in the dataset are provided in Table 2.
2.3. Clinical Data
We provide a CSV file containing clinical data for the ten patients (
3. Methods
3.1. Multiplex Immunofluorescence Protocol
The method of staining the slides and obtaining the WSIs was described in the work of Wölflein et al. [17], but we include it here for completeness. The Leica BOND RX automated immunostainer (Leica Microsystems, Milton Keynes, U.K.) was utilised to perform mIF. The sections were dewaxed at 72 °C using BOND dewax solution (Leica, AR9222) and rehydrated in absolute alcohol and deionised water, respectively. The sections were treated with BOND epitope retrieval 1 (ER1) buffer (Leica, AR9961) for 20 min at 100 °C to unmask the epitopes. The endogenous peroxidase was blocked with peroxide block (Leica, DS9800), followed by serum-free protein block (Agilent, x090930-2). The sections were incubated with the first primary antibody (CD8, Agilent, M710301-2, 1:400 dilution) for 40 min at room temperature, followed by anti-mouse HRP conjugated secondary antibody (Agilent, K400111-2) for 40 min. Then, the CD8 antigen was visualised by Cy5-conjugated tyramide signal amplification (TSA) (Akoya Bioscience, NEL745001KT). Redundant antibodies, which were not covalently bound, were stripped off by ER1 buffer at 95 °C for 20 min. Then, the second primary antibody (CD3, Agilent, A045229-2, 1:400 dilution) was visualised by TSA Cy3, taking the same steps of the peroxide block to the ER1 buffer stripping of the first antibody visualisation. Cell nuclei were counterstained by Hoechst 33342 (Thermo Fisher, H3570, 1:100), and the sections were mounted with prolong gold antifade mountant (Thermo Fisher, P36930).
3.2. Whole-Slide Image Acquisition
The fluorescence images were captured using a Zeiss Axio Scan Z1 at an objective of x40 magnification. We used three different fluorescent channels (Hoechst 33342, Cy3, and Cy5) simultaneously to capture individual channel images under object magnification with the respective exposure times of 10 ms, 20 ms, and 30 ms. Figure 2 shows the density curves of the three different channel intensities across the entire dataset.
3.3. Patch Processing
3.3.1. Intensity Normalisation
PNG files store pixels as 8-bit integers, which limits the dynamic range of the images. However, when examining the intensity histograms in Figure 3, we observed that most pixel luminance was concentrated at the lower end of the range. A naïve quantisation of the image to the range would lose most of the important information, specifically the variation at the lower end. To address this, we applied a form of thresholding.
Each histogram in Figure 3 exhibits one main peak (disregarding the leftmost maximum at an intensity close to zero, corresponding to background pixels). Therefore, we found it sufficient to assume that the histogram follows a normal distribution , the parameters of which we obtained using maximum likelihood estimation. In practice, most of the important information is contained between the peak and three standard deviations to the right, i.e., in the range , indicated by the red lines in Figure 3. Eliminating intensities to the left of that peak () reduces the background noise. Moreover, pixels with high intensities () are rare and can thus be discarded as well because they do not add much information. As a result, we transformed the intensities x to the range by the function:
Note that we estimated the parameters and derived from the histograms of the entire WSIs and not on a per-patch basis, due to the height variance between the patches. Furthermore, the described intensity normalisation procedure was applied to each stain separately, as illustrated by the sample patch in Figure 4.
3.3.2. Nucleus Segmentation
As indicated in Section 2.2, we supply the normalised image patches of each of the three channels (Hoechst, CD3, and CD8). However, we also include masks for each of the three channels (see Table 1), which are generated by a nucleus segmentation algorithm. These masks can be used to evaluate the quality of virtual staining algorithms [17,20] or even directly train segmentation models.
Our approach to nucleus segmentation uses the Hoechst channel as the starting point, instead of directly segmenting cells on the CD3/CD8 channels because those are less reliable. First, we segmented all nuclei in this channel using the StarDist algorithm [27], a popular deep-learning-based nucleus segmentation method. We employed StarDist because it is able to produce plausible non-overlapping masks even in crowded areas where instance segmentation models such as Mask-RCNN [28] tend to generate blobs of multiple cells [27]. This is because StarDist represents cells as star-convex polygons, whereas instance segmentation models simply operate on a pixel level. Figure 4g depicts the result of StarDist with a probability threshold of 0.6 and no cell expansion, as we employed it in our pipeline. Following Hoechst cell segmentation, we applied a threshold on the CD3 channel to identify which nuclei in the Hoechst mask were CD3+ (Figure 4h). We repeated this process for the CD8 channel as well (Figure 4i). The entire nucleus segmentation pipeline (i.e., the aforementioned steps) was implemented as scripts using the QuPath software [29].
There were two factors that impacted the quality of the masks. First, Hoechst and CD3 stains may sometimes not align perfectly, which is evident in Figure 4e, where some of the high-intensity blobs do not match exactly with Figure 4d. This is because, while Hoechst stains the cell nuclei, CD3 is expressed only in a tiny part of a T cell’s cytoplasm. Analogous reasoning applies to CD8. The second factor is the thickness of the slides (4 m), which causes some cells to be out of focus, which becomes evident by the varying intensity levels in Figure 4a–c. As a result of both of these factors, there may be some cases where CD3+ or CD8+ cells may, by mistake, not be classified as such.
Conceptualisation, G.W., I.H.U., D.J.H. and O.A.; methodology, G.W. and I.H.U.; software, G.W.; validation, G.W.; formal analysis, G.W.; investigation, I.H.U. and G.W.; resources, I.H.U. and D.J.H.; data curation, I.H.U. and G.W.; writing—original draft preparation, G.W.; writing—review and editing, I.H.U., D.J.H. and O.A.; visualisation, G.W.; supervision, O.A. and D.J.H.; project administration, O.A. and D.J.H.; funding acquisition, D.J.H. All authors have read and agreed to the published version of the manuscript.
The work was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of NHS Lothian NRS BioResource, REC-approved Research Tissue Bank (REC Approval Ref. 13/ES/0126, 3 February 2015).
Informed consent was obtained from all subjects involved in the study.
We would like to thank Craig Marshall, Lothian Biorepository, who granted access to the samples.
The funders had no role in the design of the study; in the collection, analyses, or interpretation of the data; in the writing of the manuscript; nor in the decision to publish the results.
The following abbreviations are used in this manuscript:
ccRCC | clear cell renal cell carcinoma |
TME | tumour microenvironment |
mIF | multiplex immunofluorescence |
IHC | immunohistochemistry |
WSI | whole-slide image |
GAN | generative adversarial network |
CD3 | cluster of differentiation 3 |
CD8 | cluster of differentiation 8 |
TSA | tyramide signal amplification |
HRP | horseradish peroxidase |
JSON | JavaScript object notation |
PNG | portable network graphics |
CSV | comma-separated values |
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1. Thumbnail image of one of the WSIs in the dataset, displaying the Hoechst channel in blue, CD3 in yellow, and CD8 in red. Note that the individual cells are too small to be identified at the low resolution of this image.
Figure 2. Intensity histograms of all 10 WSIs in the dataset (each WSI corresponds to a differently coloured line).
Figure 3. Intensity histograms (left axes) and fit normal distributions (right axes) of a sample WSI’s Hoechst and CD3 channels. The CD8 histograms behave similarly.
Figure 4. A [Forumla omitted. See PDF.] pixel patch extracted from the WSI in Figure 1, showing raw and normalised intensities for Hoechst, CD3, and CD8, as well as masks for different cell types. CD8+ cells are a subset of CD3+ cells because CD3 highlights all T cells, whereas CD8 binds only to cytotoxic T cells. (a) Hoechst. (b) CD3. (c) CD8. (d) normalised Hoechst. (e) normalised CD3. (f) normalised CD8. (g) StarDist [27] cell mask. (h) CD3+ cells. (i) CD8+ cells.
Types of image files associated with each patch, alongside their respective
Mode | Channel | Description |
---|---|---|
|
|
normalised Hoechst patch |
|
|
normalised CD3 patch |
|
|
normalised CD8 patch |
|
|
segmentation mask of all detected cells |
|
|
segmentation mask of CD3+ cells (subset of Hoechst cells) |
|
|
segmentation mask of CD8+ cells (subset of CD3+ cells) |
|
|
segmentation mask of CD3- cells (subset of Hoechst cells) |
Representation of cell subtypes across the dataset. Presence refers to the percentage of patches that contain at least one cell of the respective subtype. Area coverage means the percentage of pixels that are occupied by each cell subtype.
Hoechst | CD3 | CD8 | |
---|---|---|---|
Total cells | 15,956,049 | 3,390,533 | 1,894,016 |
Cells per patch | 25.42 | 5.40 | 3.02 |
Presence | 99.95% | 93.08% | 71.61% |
Area coverage | 26.48% | 05.01% | 03.02% |
Columns in the clinical data table. Note that the “Disease-free months” column indicates a lower bound, as some patients may have experienced recurrence after the period of data collection.
Column Name | Format | Description |
---|---|---|
ICAIRD number |
|
patient ID |
Gender | M or |
gender |
Response | recurrence within 5 years after surgery | |
Age at surgery | whole number | age at surgery in years |
Disease-free months | float | number of months with no recurrence |
Fuhrman nuclear grade | Fuhrman grade [ |
|
ISUP nuclear grade | ISUP grade [ |
|
Tumour stage | tumour size according to TNM system [ |
|
Tumour size | float | tumour size in cm |
Node status | lymph node status according to TNM system [ |
|
Necrosis | whether necrosis is detected | |
Leibovich score (Fuhrman) | Leibovich score [ |
|
Leibovich score (ISUP) | Leibovich score [ |
References
1. Cancer Research UK. Kidney Cancer Statistics. Available online: https://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/kidney-cancer (accessed on 30 September 2022).
2. Siegel, R.L.; Miller, K.D.; Fuchs, H.E.; Jemal, A. Cancer Statistics, 2021. CA Cancer J. Clin.; 2021; 71, pp. 7-33. [DOI: https://dx.doi.org/10.3322/caac.21654]
3. Moch, H.; Cubilla, A.L.; Humphrey, P.A.; Reuter, V.E.; Ulbright, T.M. The 2016 WHO Classification of Tumours of the Urinary System and Male Genital Organs—Part A: Renal, Penile, and Testicular Tumours. Eur. Urol.; 2016; 70, pp. 93-105. [DOI: https://dx.doi.org/10.1016/j.eururo.2016.02.029]
4. De Filippis, R.; Wölflein, G.; Um, I.H.; Caie, P.D.; Warren, S.; White, A.; Suen, E.; To, E.; Arandjelović, O.; Harrison, D.J. Use of high-plex data reveals novel insights into the tumour microenvironment of clear cell renal cell carcinoma. Cancers; 2022; 14, 5387. [DOI: https://dx.doi.org/10.3390/cancers14215387] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36358805]
5. Coons, A.H.; Creech, H.J.; Jones, R.N.; Berliner, E. The Demonstration of Pneumococcal Antigen in Tissues by the Use of Fluorescent Antibody. J. Immunol.; 1942; 45, pp. 159-170. [DOI: https://dx.doi.org/10.4049/jimmunol.45.3.159]
6. Kalyuzhny, A.E. Immunohistochemistry—Essential Elements and Beyond; Springer: Berlin/Heidelberg, Germany, 2016.
7. Goldstein, N.S.; Hewitt, S.M.; Taylor, C.R.; Yaziji, H.; Hicks, D.G. Members of Ad-Hoc Committee on Immunohistochemistry Standardization. Recommendations for Improved Standardization of Immunohistochemistry. Appl. Immunohistochem. Mol. Morphol.; 2007; 15, pp. 124-133. [DOI: https://dx.doi.org/10.1097/PAI.0b013e31804c7283] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17525622]
8. Donaldson, J.G. Immunofluorescence Staining. Curr. Protoc. Cell Biol.; 2015; 69, pp. 3-4. [DOI: https://dx.doi.org/10.1002/0471143030.cb0403s69] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26621373]
9. Zaidi, A.U.; Enomoto, H.; Milbrandt, J.; Roth, K.A. Dual Fluorescent in Situ Hybridization and Immunohistochemical Detection with Tyramide Signal Amplification. J. Histochem. Cytochem.; 2000; 48, pp. 1369-1375. [DOI: https://dx.doi.org/10.1177/002215540004801007]
10. Buchwalow, I.B.; Böcker, W. Immunohistochemistry: Basics and Methods; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2010.
11. Chazotte, B. Labeling Nuclear DNA with Hoechst 33342. Cold Spring Harb. Protoc.; 2011; 2011, pdb-prot5557. [DOI: https://dx.doi.org/10.1101/pdb.prot5557] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/21205857]
12. Caie, P.D.; Dimitriou, N.; Arandjelović, O. Precision Medicine in Digital Pathology via Image Analysis and Machine Learning. Artificial Intelligence and Deep Learning in Pathology; Elsevier: Amsterdam, The Netherlands, 2021; pp. 149-173.
13. Kather, J.N.; Krisam, J.; Charoentong, P.; Luedde, T.; Herpel, E.; Weis, C.A.; Gaiser, T.; Marx, A.; Valous, N.A.; Ferber, D. et al. Predicting survival from colorectal cancer histology slides using deep learning: A retrospective multicenter study. PLoS Med.; 2019; 16, e1002730. [DOI: https://dx.doi.org/10.1371/journal.pmed.1002730] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30677016]
14. Yao, J.; Zhu, X.; Huang, J. Deep multi-instance learning for survival prediction from whole-slide images. Proceedings of the Medical Image Computing and Computer Assisted Intervention–MICCAI 2019: 22nd International Conference; Shenzhen, China, 13–17 October 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 496-504.
15. Abousamra, S.; Gupta, R.; Hou, L.; Batiste, R.; Zhao, T.; Shankar, A.; Rao, A.; Chen, C.; Samaras, D.; Kurc, T. et al. Deep learning-based mapping of tumor infiltrating lymphocytes in whole-slide images of 23 types of cancer. Front. Oncol.; 2022; 11, 5971. [DOI: https://dx.doi.org/10.3389/fonc.2021.806603] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35251953]
16. Cooper, J.; Um, I.H.; Arandjelović, O.; Harrison, D.J. Lymphocyte Classification from Hoechst Stained Slides with Deep Learning. Cancers; 2022; 14, 5957. [DOI: https://dx.doi.org/10.3390/cancers14235957] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36497439]
17. Wölflein, G.; Um, I.H.; Harrison, D.J.; Arandjelović, O. HoechstGAN: Virtual Lymphocyte Staining Using Generative Adversarial Networks. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV); Waikoloa, HI, USA, 2–8 January 2023; pp. 4997-5007.
18. Warren, A.Y.; Harrison, D. WHO/ISUP classification, grading and pathological staging of renal cell carcinoma: Standards and controversies. World J. Urol.; 2018; 36, pp. 1913-1926. [DOI: https://dx.doi.org/10.1007/s00345-018-2447-8] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30123932]
19. Bektas, S.; Bahadir, B.; Kandemir, N.O.; Barut, F.; Gul, A.E.; Ozdamar, S.O. Intraobserver and interobserver variability of Fuhrman and modified Fuhrman grading systems for conventional renal cell carcinoma. Kaohsiung J. Med Sci.; 2009; 25, pp. 596-600. [DOI: https://dx.doi.org/10.1016/S1607-551X(09)70562-5] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19858038]
20. Mulrane, L.; Rexhepaj, E.; Penney, S.; Callanan, J.J.; Gallagher, W.M. Automated image analysis in histopathology: A valuable tool in medical diagnostics. Expert Rev. Mol. Diagn.; 2008; 8, pp. 707-725. [DOI: https://dx.doi.org/10.1586/14737159.8.6.707] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18999923]
21. Wölflein, G.; Um, I.H.; Harrison, D.J.; Arandjelović, O. Whole Slide Images and Patches of Clear Cell Renal Cell Carcinoma Counterstained with Multiple Immunofluorescence for Hoechst, CD3, and CD8. 2022; Available online: https://www.ebi.ac.uk/biostudies/bioimages/studies/S-BIAD605 (accessed on 6 February 2023).
22. Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. Proceedings of the Advances in Neural Information Processing Systems; Ghahramani, Z.; Welling, M.; Cortes, C.; Lawrence, N.; Weinberger, K. Curran Associates, Inc.: New York, NY, USA, 2014; Volume 27.
23. Isola, P.; Zhu, J.Y.; Zhou, T.; Efros, A.A. Image-to-image translation with conditional adversarial networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; Honolulu, HI, USA, 21–26 July 2017; pp. 1125-1134.
24. Fuhrman, S.A.; Lasky, L.C.; Limas, C. Prognostic significance of morphologic parameters in renal cell carcinoma. Am. J. Surg. Pathol.; 1982; 6, pp. 655-663. [DOI: https://dx.doi.org/10.1097/00000478-198210000-00007] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/7180965]
25. Amin, M.B.; Greene, F.L.; Edge, S.B.; Compton, C.C.; Gershenwald, J.E.; Brookland, R.K.; Meyer, L.; Gress, D.M.; Byrd, D.R.; Winchester, D.P. The eighth edition AJCC cancer staging manual: Continuing to build a bridge from a population-based to a more “personalized” approach to cancer staging. CA Cancer J. Clin.; 2017; 67, pp. 93-99. [DOI: https://dx.doi.org/10.3322/caac.21388] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28094848]
26. Leibovich, B.C.; Blute, M.L.; Cheville, J.C.; Lohse, C.M.; Frank, I.; Kwon, E.D.; Weaver, A.L.; Parker, A.S.; Zincke, H. Prediction of progression after radical nephrectomy for patients with clear cell renal cell carcinoma: A stratification tool for prospective clinical trials. Cancer Interdiscip. Int. J. Am. Cancer Soc.; 2003; 97, pp. 1663-1671. [DOI: https://dx.doi.org/10.1002/cncr.11234] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/12655523]
27. Schmidt, U.; Weigert, M.; Broaddus, C.; Myers, G. Cell Detection with Star-Convex Polygons. Proceedings of the Medical Image Computing and Computer Assisted Intervention—MICCAI 2018—21st International Conference; Granada, Spain, 6–20 September 2018; pp. 265-273.
28. He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision; Venice, Italy, 22–29 October 2017; pp. 2961-2969.
29. Bankhead, P.; Loughrey, M.B.; Fernández, J.A.; Dombrowski, Y.; McArt, D.G.; Dunne, P.D.; McQuaid, S.; Gray, R.T.; Murray, L.J.; Coleman, H.G. et al. QuPath: Open source software for digital pathology image analysis. Sci. Rep.; 2017; 7, 16878. [DOI: https://dx.doi.org/10.1038/s41598-017-17204-5] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29203879]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
In recent years, there has been an increased effort to digitise whole-slide images of cancer tissue. This effort has opened up a range of new avenues for the application of deep learning in oncology. One such avenue is virtual staining, where a deep learning model is tasked with reproducing the appearance of stained tissue sections, conditioned on a different, often times less expensive, input stain. However, data to train such models in a supervised manner where the input and output stains are aligned on the same tissue sections are scarce. In this work, we introduce a dataset of ten whole-slide images of clear cell renal cell carcinoma tissue sections counterstained with Hoechst 33342, CD3, and CD8 using multiple immunofluorescence. We also provide a set of over 600,000 patches of size 256 × 256 pixels extracted from these images together with cell segmentation masks in a format amenable to training deep learning models. It is our hope that this dataset will be used to further the development of deep learning methods for digital pathology by serving as a dataset for comparing and benchmarking virtual staining models.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details




1 School of Computer Science, University of St Andrews, North Haugh, St Andrews KY16 9SX, Scotland, UK
2 School of Medicine, University of St Andrews, North Haugh, St Andrews KY16 9TF, Scotland, UK
3 School of Medicine, University of St Andrews, North Haugh, St Andrews KY16 9TF, Scotland, UK; Division of Laboratory Medicine, Lothian NHS University Hospitals, Edinburgh EH16 6SA, Scotland, UK