Rheumatoid arthritis (RA), the most prevalent autoimmune arthritis, is characterized by inflammation in synovial tissue. There is great interest in using synovial tissue samples to optimize treatment decisions for patients with RA in the long term (1–3) and in the short term; rigorous histologic assessments of synovial tissue are needed as benchmarks for molecular approaches such as single cell RNA sequencing (RNA-seq) (4). Many groups have compared and validated various methods of quantifying antibody-stained cells such as T cells, B cells, plasma cells, and macrophages in RA synovium (5). Three general methods of measurement include semiquantitative scoring (whereby a pathologist generates an ordinal summary score ranging from 0 to 4 to describe the level of infiltrate), manual counting of individual cells (which is laborious and not well suited for large-scale studies), and digital image assessment (whereby computer software is used to quantify cell types). Although ordinal semiquantitative scores are considered valid, the more granular continuous data generated from manual counts and digital image analysis are more sensitive for discriminating clinically involved and uninvolved joints (6). However, these approaches are currently limited by the need for special antibody stains and field selection bias. Careful studies comparing digital image analysis of either 60, 20, 6, or 1 high-power fields (representing 7.5 mm2, 2.5 mm2, 0.75 mm2, and 0.125 mm2 of tissue, respectively) found that the variance increases when the number of high-power fields drops from 20 to 6 (7).
Hematoxylin and eosin (H&E) stain is the most widely used method for assessing any type of tissue. Some groups, including our group, have also validated semiquantitative assessments of H&E-stained synovium in RA (8,9). A key limitation of these semiquantitative methods is that they rely on the subjective selection of a subset of high-power fields from the whole slide image. Given that synovial inflammation tends to be heterogeneous, variability in the selected high-power fields for assessment can lead to different scores. In a recent study from our group, the weighted kappa for interrater reliability between two highly experienced pathologists assessing synovial lymphocyte infiltration on the same slides was only 67% (9).
Advances in computational pathology have been applied with great success in oncology to predict cancer outcomes and treatment response (10–13). In general, computational pathology seeks to extract meaningful information from histology samples such as detecting, segmenting, and characterizing key features. By automating these fundamental tasks, the entire sample can be efficiently analyzed, and the burden on the pathologist and other human biases in selecting a subset of high-power fields from the whole slide image can be reduced (14). However, these computer vision approaches have not yet been widely adopted for H&E-stained synovial tissue analysis. Considering that hospitals are beginning to incorporate digital pathology images of whole slides into medical records (15), and our site alone performs over 5,000 arthroplasties per year, there is an opportunity to apply computer vision at an enterprise scale to enable precise, rapid, and cost-effective quantification of inflammation in synovial tissue samples.
In recent years, deep learning methods, such as convolutional neural networks, have been successfully applied for quantifying histological features and phenotypes in H&E-stained tissue samples (16,17). Deep learning is a powerful tool that often outperforms traditional computer vision methods. However, the main drawback of data-driven methods, like deep learning, is the need for large datasets with labeled annotations. Acquiring a sufficiently large training dataset is often prohibitively time consuming and expensive. Therefore, labeled data are often unavailable, and conventional computer vision approaches, such as our proposed pipeline, are needed.
Synovial samples with dense inflammatory infiltrates typically have more nuclei (18). We hypothesized that the density of nuclei in H&E-stained synovial tissue samples can be used as a simple, fast, quantitative measurement of inflammation. Here, we share an algorithm (Figure 1) using computer vision techniques including color deconvolution, local adaptive thresholding, watershed segmentation, and shape filtering. We compare the output of computer vision assessments of RA synovial nuclei density, which yield hundreds of thousands of nuclei per whole slide image, with semiquantitative histology scores, bulk and single cell RNA-seq, and clinical measures of disease activity. These comparisons provide proof of concept that nuclei density is a robust measurement of synovial inflammation, thereby laying the groundwork for advancing computer vision in rheumatic disease.
METHODS Patient dataThis study includes data from 170 patients with RA undergoing arthroplasty at the Hospital for Special Surgery (HSS) in New York. All patients met either the American College of Rheumatology (ACR)/European League Against Rheumatism 2010 classification criteria and/or the ACR 1987 criteria for RA (19,20). Patient data, including age, duration of disease, body mass index, tender and swollen joint counts, Disease Activity Score in 28 joints (DAS28), erythrocyte sedimentation rate (ESR), C-reactive protein (CRP), rheumatoid factor (RF), and anti-cyclic citrullinated peptide (CCP), were collected (9). This study was approved by the HSS Institutional Review Board (approval number 2014-233), the Rockefeller University Institutional Review Board (approval number DOR0822), and the Biomedical Research Alliance of New York (approval number 15-08-114-385). All participating patients provided their signed informed consent. The patients and the public were not involved in the design of this study.
This study was approved by the HSS Institutional Review Board (approval no. 2014-233), the Rockefeller University Institutional Review Board (approval no. DOR0822), and the Biomedical Research Alliance of New York (approval no. 15-08-114-385). All participating patients provided their signed informed consent.
Tissue quantificationWhole slide images were split into a set of smaller 1,024 × 1,024 pixel-sized images, termed “tiles,” representing an area of 0.25 mm2 (Figure 1A). To equitably compare nuclei counts between tiles, it is necessary to quantify and normalize for tissue. The RBG (Red Blue Green) image was transformed into a grayscale image such that each pixel contains an intensity value between 0 and 255. Next, we employed an intensity value threshold of 30 to annotate pixels above this value as tissue. Finally, a tissue metric was calculated based on the proportion of the tile determined to contain tissue.
Hematoxylin (blue) preferentially stains cell nuclei, whereas eosin (magenta-red) stains cytoplasm (21). To separate the nuclei from the cytoplasm, color deconvolution was applied to transform the color image into grayscale images representing the concentration of each dye (22). Color deconvolution was performed using RBG channel weighting vectors of (0.65, 0.70, 0.29) for hematoxylin and (0.07, 0.99, 0.11) for eosin. The resulting grayscale images have normalized pixel intensities ranging from 0 to 255 (Figure 1B).
Using the hematoxylin image, it is easier to separate cell nuclei from the background. Pixels in the image were classified as background (eg, empty space, cytoplasm, and stroma) or foreground (ie, nuclei) based on a threshold. Otsu's method was used to automate selecting a threshold to separate the nuclei from the background (23). Due to the heterogeneous staining intensity, local adaptive thresholding was used to select threshold values for different regions of an image. First, a global threshold was applied resulting in a binary image with a rough estimate for the locations of nuclei (Figure 1B). Local regions were then defined based on an enlarged bounding box for each connected component in the global binary image. Local threshold values were then computed and applied to each local region to provide a refined estimate for the locations of nuclei.
Nuclei segmentation and shape filteringIn regions with densely clustered nuclei, an object in the binary image can contain multiple adjacent or overlapping nuclei. Nuclei segmentation was thus performed using the watershed method to find boundary lines between individual nuclei (24). We applied the distance transform to the binary image resulting in a map that represents the shortest Euclidean distance between the foreground and background (25). The distance map and the hematoxylin grayscale image were normalized and summed together (18). The watershed method was then applied to the combined distance and image intensity map to identify individual nuclei.
Staining artifacts incorrectly identified as nuclei were removed by filtering objects based on their shape with metrics like circularity and Euler number (the total number of objects minus the total number of holes in an object). Objects with a circularity less than 0.4 and a Euler number less than 0 were removed (Figure 1C). Nuclei density was calculated by normalizing the total count of individual nuclei by the tissue metric. Whole slide image-level mean nuclei densities were also calculated for each patient by averaging all tile-level nuclei densities.
Pathologist validation of nuclei detectedNuclei identified by the algorithm were validated by a pathologist on a sample of 10 tiles (two tiles from each lymphocyte grade). We used the open-source software QuPath to annotate nuclei that were either mislabeled or missing (26). Algorithm performance was evaluated by calculating sensitivity, specificity, and the F1 score.
Histologic scoringGiven that the whole synovium is available for direct visual inspection after arthroplasty, the pathologist selected the area that appeared most grossly inflamed. If there were no obvious niduses of inflammation, synovial samples were preferentially obtained from areas of inflamed synovium or standard locations as previously described (9). Histology features were scored by a pathologist using a systematic approach described in (9) and is also available at
RNA was extracted from paired bulk synovial tissue samples and previously sequenced as described in (Immport Accession #SDY1299) (9). In brief, these libraries were prepared using TruSeq messenger RNA (mRNA) Stranded Library kits; 50-bp paired-end reads were sequenced on a HiSeq2500 platform, and reads were aligned to hg19 using STAR (27). Samples with more than 0.1% globin mRNA were excluded from further analysis. Consensus clustering identified three gene expression clusters (low, mixed, and high inflammatory). A principal component analysis was performed on the top 5,000 most variably expressed genes. The limma R package was used to identify differentially expressed genes using mean nuclei count as a continuous variable (28). Differentially expressed genes were defined as those with a false discovery rate (FDR) less than 0.05.
An enrichment analysis of 18 synovial cell type subsets (200 marker genes per cell) was performed (Immport Accession #SDY998) (4). Genes were ranked according to the moderated t-statistic, and the fgsea R package was used for enrichment analysis using the marker genes as gene sets (28,29).
RESULTS Clinical characteristicsAmong the 170 patients with RA, 81% were female, 41% were seropositive for RF, and 64% were seropositive for anti-CCP. Disease activity was moderate to high in 70% of patients, with 50% DASs of 3.2 or more and 17% DASs of 5.1 or more. Patient characteristics are presented in Supplemental Table 1.
Automated quantification of synovial nucleiOur algorithm was used to identify synovial nuclei in 170 whole slide images of patients with RA and detected a median of 112,657 (range 8,160 to 821,717) nuclei per whole slide image. This large range of nuclei detected was partially due to the varying synovial tissue sample sizes. The whole slide images were sectioned into a median of 579 (range 108 to 1,850) tiles. After tissue quantification and removal of tiles containing mostly empty space, the whole slide images had a remaining median of 249 (range 29 to 1,103) tiles, corresponding to a median tissue area of 1.25 cm2. A median of 430 (range 1 to 3,437) individual nuclei was detected in each tile. To account for varying synovial tissue sample sizes, we normalized the whole slide image nuclei count by tissue area per tile and refer to this value herein as the nuclei density.
Algorithm validation and robustnessThrough a series of experiments, algorithm performance was evaluated under varying conditions to determine its accuracy and robustness. The spatial distribution of nuclei (ie, sparse, dense, and clustered) did not have a large impact on the algorithm performance provided because there was some separation between the nuclei (Figure 2A). There were instances of multiple overlapping nuclei being identified as one, but in most cases, even a trained expert would have difficulty in separating the nuclei. The algorithm's performance under varying H&E staining concentrations was evaluated by initially selecting multiple tiles and then digitally modifying them to generate a broad range of realistic H&E stain variations (30). The algorithm appears to be robust to a range of staining variations (Figure 2B). An expert pathologist reviewed 10 sample tiles ranging from low to high inflammatory grades. Each tile was annotated for mislabeled or missing nuclei. Using that assessment as the gold standard, the algorithm detected nuclei with a sensitivity of 97% and a specificity of 100%, as shown in Fig. 2C.
Synovial tissue samples with a high histological score for lymphocytic inflammation typically contained more nuclei (Figure 3A and 3B). Nuclei density varies between tiles in a whole slide image, and the distribution of nuclei density can provide useful insights into inflammatory burden. For example, the nuclei density histograms demonstrated that increased lymphocyte infiltration is associated with increased nuclei density (Figure 3C and 3D). This pattern is consistent when considering all patients with RA across each lymphocyte grade (Figure 3E). With increasing lymphocyte grade, the peak of the histogram shifted to the right, and the histogram had a longer tail. Interestingly, lymphocyte grade 4 had a wide distribution with two peaks indicating that the most inflamed samples were the most heterogenous. This heterogeneity can be seen in the representative example shown for grade 4 inflammation, in which a large range of nuclei densities were observed over adjacent tiles.
Nuclei density was found to have the strongest correlation with lymphocytic inflammation and was also correlated with many other histology features (Figure 4A). Sublining giant cells and stromal features such as detritus, fibrosis, and mucin were not significantly related to nuclei density. The one-way Kruskal-Wallis test demonstrated that there was a statistically significant difference in the mean nuclei density (P < 0.05) between grades of lymphocytes and other histology features correlated with lymphocytes (Figure 4B).
RNA-seq gene expression analysis was previously performed on a subset (n = 36) of synovial samples and characterized as having either low, mixed, or high inflammatory gene expression (9). Mean nuclei density was statistically different among RNA subtypes (P < 0.05) and was increased in high inflammatory RNA subtype samples (Figure 5A). From the principal components analysis, the first principal component (PC1) explained 43% of the variance and was lowest in low inflammatory subtypes and highest in high inflammatory subtypes (Figure 5B). The PC1 score can therefore be interpreted as a summary bulk gene expression score for general inflammatory burden. Nuclei density was significantly correlated with the PC1 score (r = 0.45, P < 0.05) (Figure 5C). We next used linear modeling to identify 915 genes that were significantly associated with nuclei density (FDR less than 0.05) (Figure 5D). Genes that were significantly increased in association with nuclei density included chemokines, chemokine receptors (such as CXCL9, CXCL10, and CCR5), and genes important for B cell activation and ectopic lymphoid structure formations (such as IL21R, TNFSF13B, and CD38), as well as JAK2, EOMES, and CTLA4. Genes that were significantly decreased in association with nuclei density included genes in the TGFB pathway such as TGFBR3, SMAD3, BMP4, and TIMP2, as well as cartilage intermediate layer protein 1 (CILP) and nerve growth factor receptors and neural guidance molecules such as SEMA4C, NTRK1, and NTRK2. A comparison of gene expression changes supervised according to nuclei density (log Fold Change vs. significance in the volcano plot) highly overlap with previously identified high inflammatory RNA subset genes identified using unsupervised clustering (presented as orange vs. green in the volcano plot in Figure 5D). Of the 2,460 genes that were increased in high inflammatory samples, 486 were also increased in association with nuclei density, and none were decreased in association with nuclei density. Of the 3,106 genes that were decreased in high inflammatory samples, 360 were also decreased in association with nuclei density, and none were increased in association with nuclei density.
Eighteen synovial cell subsets were previously defined in the RA synovium using single cell RNA-seq (4). We used gene-set enrichment analysis on 200 marker genes for each of the synovial cell type subsets in our paired bulk tissue RNA-seq data to determine whether nuclei density was associated with gene expression changes in any of these subsets. Consistent with the prior report, cells associated with leukocyte-rich RA synovium, such as age-related memory B cells, interferon (IFN) activated and IL1B+ monocytes, and HLA-DR+ (Human Leukocyte Antigen - DR Isotype) sublining fibroblasts were enriched in samples with increased nuclei density, whereas cells that were previously associated with leukocyte poor synovium were negatively associated with nuclei density, such as lining fibroblasts and M2-like NUPR1+ monocytes (Figure 5E). Our analysis additionally detected significant enrichment in T-cell subsets such as GZMK+/GZMB+ T cells, GZMK+ T cells, C1QA+ monocytes, and cytotoxic T cells in accordance with increased nuclei density.
Comparison of nuclei density with clinical measuresNuclei density was compared with clinical measures of disease activity. Mean nuclei density was significantly increased in samples labeled as RF+ or having high titer CCP (Figure 6A and 6B). Similarly, nuclei density was significantly correlated with elevated acute phase reactants, ESR and CRP, and global physician assessment of disease (Figure 6C-6E). There was no significant correlation between nuclei density and age, duration of disease, body mass index, tender joint counts, swollen joint counts, patient global assessment of disease, or pain (data not shown).
Here, we describe the development and application of a computer vision algorithm to automatically count hundreds of thousands of cell nuclei within H&E-stained RA synovium whole slide images. We assessed the validity of the algorithm by comparing nuclei density with histology features scored by a pathologist, gene expression analysis of inflammation, and clinical features of RA. We demonstrated that nuclei density was associated with histology features, such as lymphocytes, that are features of inflammation in RA. Nuclei density was also significantly increased in high inflammatory RNA gene expression RA subsets and significantly correlated with the PC1 inflammation score. We also found that marker genes from single cell RNA-seq subsets, such as age-related memory B cells, IFN activated and IL1B+ monocytes, and HLA-DR+ sublining fibroblasts, that were previously found to be increased in samples with increased leukocyte infiltration as assessed by flow cytometry were enriched in samples with increasing nuclei density. Our analysis further identified additional significant associations with other T and monocyte cell types. This discrepancy was likely due to nuclei density potentially being a more accurate assessment of leukocyte infiltration by not requiring tissue freezing or dissociation, which can lead to cell death. Nuclei density was associated with RA clinical features (ie, RF, CCP, ESR, and CRP) related to synovial inflammation. Results demonstrate that whole slide nuclei density is a robust measurement of inflammatory burden and correlates with multiple orthogonal measurements of inflammation.
Computer vision nuclei counting provides the potential to transform assessment of inflammation in synovial and other tissue biopsies by providing a precise, automatic, cost-efficient, and quantitative approach amenable to universal distribution and use. Automated nuclei detection is an important step in developing standardized and reproducible assessments of synovial inflammation, which are essential to build guidelines for incorporating histological findings in diagnosing RA and optimizing treatment decisions. Highly inflamed synovium tended to be heterogeneously distributed with some areas having approximately 200 nuclei per tile and others with upward of approximately 1,000 nuclei per tile (Figure 3C). This finding illustrates a key challenge in interrater reliability when pathologists subjectively choose a subset of high-power fields for assessment. Moreover, nuclei density is a continuous measurement and likely can better characterize and discriminate among inflammatory burden types.
The synovial samples assessed in this study were acquired at arthroplasty, and some suggest that these are less informative because the patients at arthroplasty have “end-stage RA.” However, we have demonstrated that patients with RA have active disease at the time of arthroplasty, with moderate to high disease activity present in 70% of patients. Moreover, we propose that studies of inflammation in long-standing RA deserve consideration, because this population represents the vast majority of patients with RA and their ongoing management is an enormous financial burden and warrants optimization.
Another limitation of this study was the assessment of only one synovial biopsy per joint. Consistent with the patchiness of inflammation in a single whole slide image reported here, others have shown that there is a high degree of patchiness across the whole joint, and the accepted standard in the field is assessment of a minimum of six synovial biopsies for any one joint in order to accurately represent the inflammation in that joint. As opposed to ultrasound guided synovial biopsies, arthroplasty yields the entire joint explant, which was inspected by our pathologist, and the site of the most obvious inflammation (opaque and dull) was selected for biopsy. This selection bias may enrich for detection of inflammation. It is also important to note that the purpose of this algorithm is to address interrater variability between two pathologists rating the same image not to address intra-joint variability. Studies comparing the variance within individual inflamed joints indicate that six biopsy samples across a joint are needed to decrease the variance to less than 10% (31). This need for multiple samples across a joint compounds the time needed for a pathologist assessment as well as interrater variability, underscoring the need to develop automated computer vision to standardize assessments of synovial inflammation.
The algorithm uses classical computer vision techniques to identify synovial nuclei based on color, shape, and size. Other histology features and artifacts, such as excess staining with a similar appearance, can be incorrectly identified as nuclei. Dense clusters of nuclei, especially those with adjacent or overlapping nuclei, can be incorrectly labeled as a single nucleus. Typically, the nuclei count far exceeds the number of falsely identified objects as demonstrated in the validation study. Thus, whole slide image statistics are robust to this type of error, but local statistics on smaller areas might be affected.
This work is an initial step toward the application of artificial intelligence and computer vision for understanding RA and developing automated tools for analyzing H&E-stained RA synovial samples. The proposed algorithm was developed for detecting individual nuclei but can be refined to identify patterns or groups of nuclei such as follicular structures, and it differentiate between cell types such as lymphocytes and other stromal cells. Inflammation and inflammatory infiltrates are common to many pathologies, and the proposed methodologies can be adapted for analyzing whole slide images of other tissue types and diseases.
ACKNOWLEDGMENTSAccelerating Medicines Partnership (AMP) is a public–private partnership (AbbVie Inc., Arthritis Foundation, Bristol-Myers Squibb Company, GlaxoSmithKline, Janssen Research and Development, Lupus Foundation of America, Lupus Research Alliance, Merck Sharp & Dohme Corp., National Institute of Allergy and Infectious Diseases, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Pfizer Inc., Rheumatology Research Foundation, Sanofi, and Takeda Pharmaceuticals International, Inc.) created to develop new ways of identifying and validating promising biological targets for diagnostics and drug development.
AUTHOR CONTRIBUTIONS Study conception and designSusan Goodman, Dana E. Orange.
Acquisition and analysis of dataBella Mehta, Diyu Pearce-Fisher.
Algorithm design and analysisSteven Guan, David Slater, James R. Thompson.
Analysis of dataEdward DiCarlo, Tania Pannellini, Fan Zhang, Soumya Raychaudhuri, Caryn Hale, Caroline S. Jiang.
DATA SHARINGThe methods for scoring synovial histology features are available at
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022. This work is published under http://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Objective
We quantified inflammatory burden in rheumatoid arthritis (RA) synovial tissue by using computer vision to automate the process of counting individual nuclei in hematoxylin and eosin images.
Methods
We adapted and applied computer vision algorithms to quantify nuclei density (count of nuclei per unit area of tissue) on synovial tissue from arthroplasty samples. A pathologist validated algorithm results by labeling nuclei in synovial images that were mislabeled or missed by the algorithm. Nuclei density was compared with other measures of RA inflammation such as semiquantitative histology scores, gene-expression data, and clinical measures of disease activity.
Results
The algorithm detected a median of 112,657 (range 8,160-821,717) nuclei per synovial sample. Based on pathologist-validated results, the sensitivity and specificity of the algorithm was 97% and 100%, respectively. The mean nuclei density calculated by the algorithm was significantly higher (P < 0.05) in synovium with increased histology scores for lymphocytic inflammation, plasma cells, and lining hyperplasia. Analysis of RNA sequencing identified 915 significantly differentially expressed genes in correlation with nuclei density (false discovery rate is less than 0.05). Mean nuclei density was significantly higher (P < 0.05) in patients with elevated levels of C-reactive protein, erythrocyte sedimentation rate, rheumatoid factor, and cyclized citrullinated protein antibody.
Conclusion
Nuclei density is a robust measurement of inflammatory burden in RA and correlates with multiple orthogonal measurements of inflammation.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details



1 The MITRE Corporation, McLean, Virginia
2 Hospital for Special Surgery, New York, New York; Weill Cornell Medicine, New York, New York
3 Hospital for Special Surgery, New York, New York
4 Center for Data Sciences, Brigham and Women's Hospital, Boston, Massachusetts; Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts; Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts; Division of Rheumatology, Inflammation and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts
5 Center for Data Sciences, Brigham and Women's Hospital, Boston, Massachusetts; Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts; Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts; Division of Rheumatology, Inflammation and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts; Centre for Genetics and Genomics Versus Arthritis, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
6 Rockefeller University, New York, New York
7 Hospital for Special Surgery, New York, New York; Rockefeller University, New York, New York