Introduction
Biological frailty and analytic morphomics (i.e. body composition) have increasingly gained interest in recent years in relation to treatment outcomes, such as complications and (disease‐free) survival. Frailty, a state of increased vulnerability towards stressors, leads to an increased risk of developing adverse health outcomes and is an important predictor of complications after interventional procedures, such as surgery and chemotherapy. For example, frail patients undergoing colorectal surgery have a fourfold increased risk to develop major post operative complications. One of the hallmark signs of frailty is sarcopenia, the involuntary depletion of skeletal muscle mass. It is estimated that up to 25% of persons under 70 years of age and over 50% of persons of 80 years and older experience sarcopenia. In addition, up to 80% of patients with advanced cancer are affected by cancer‐induced cachexia, a clinical condition that also results in skeletal muscle wasting with or without the loss of body fat. Patients with cachexia are more prone to a reduced therapy effect, and patients with low skeletal muscle mass experience increased chemotherapy toxicity. This ultimately results in death in nearly one‐third of all cancer patients.
Over the last years, numerous studies have used abdominal computed tomography (CT) scans to quantify skeletal muscle mass, for example, in clinical and surgical oncology, vascular surgery, and transplantation surgery patients. Furthermore, multiple studies measured visceral and/or subcutaneous adipose tissue on CT scans. However, different software programmes have been used to perform these body composition analyses, such as FatSeg, OsiriX, ImageJ, and sliceOmatic. To be able to adequately compare study results, the comparability of these various software programmes should be known. Therefore, the aim of this study was to investigate the agreement of these four different software packages for the assessment of cross‐sectional skeletal muscle and subcutaneous and visceral adipose tissue measurements on abdominal CT scans.
Materials and Methods
Patients
Fifty abdominal CT scans of patients who were scheduled for rectal cancer resection at Erasmus MC University Medical Centre (Rotterdam, the Netherlands) between 2005 and 2012 were randomly selected. All CT scans were routinely performed as part of the pre‐operative diagnostic work up or assessment of down staging after neo‐adjuvant therapy. Only one CT scan was used per patient. None of the patients had an ostomy, abdominal wall deformity, abdominal wall tumour, or a CT scan with artefacts at the level of L3 that could potentially influence measurements. Self‐reported weight and height in the pre‐operative workup were retrospectively collected from electronic patient files.
Skeletal muscle and adipose tissue area measurements
The cross‐sectional skeletal muscle area (CSMA), subcutaneous adipose tissue area (SAT), and visceral adipose tissue area (VAT) (cm2), including renal adipose tissue, were measured at the mid third lumbar vertebra (L3) level on a slice showing both transversal processes. The CSMA measurements included the following muscles: psoas, paraspinal, transverse abdominal, external oblique, internal oblique, and rectus abdominis. All abdominal CT scans were assessed on identical slices in a random order by two medically trained observers [AG (observer A) and JLAvV (observer B)], with great knowledge about radiological anatomy and extensive experience in skeletal muscle and adipose tissue area measurements using various software programmes. Observer A performed measurements twice on identical a priori selected slices, whereas observer B performed a second reading without a priori selected slice numbers. The observers were blinded for each other's measurements and for patient details. For each observer, the time interval between two readings in the same patient with different software programmes was at least 1 week. This resulted in an interval of at least 4 weeks between two readings within one patient with the same software programme. Only the first reading of observer B was used for the inter‐software and inter‐observer comparisons.
The CSMA was corrected for height squared (m2), resulting in the L3 muscle index (SMI, cm2/m2). Patients were classified as having sarcopenia or not having sarcopenia according to previously described cut‐off values (52.4 cm2/m2 for men and 38.5 cm2/m2 for women). Predefined cut‐off values for VAT to define visceral obesity of 163.8 cm2 for men and 80.1 cm2 for women were used. For subcutaneous adipose tissue, no cut‐off values have been reported in the literature.
Four software programmes were compared: FatSeg [developed by the Biomedical Imaging Group Rotterdam of Erasmus MC, Rotterdam, the Netherlands, using MeVisLab (Mevis Medical Solutions, Bremen, Germany)], OsiriX (Pixmeo SARL, Geneva, Switzerland), ImageJ (National Institutes of Health, Bethesda, MD, USA), and sliceOmatic (TomoVision, Magog, Canada). The CSMA, VAT, and SAT were segmented using standard Hounsfield unit (HU) thresholds in all four software programmes. An intensity window between −30 and +150 HU was used for skeletal muscle tissue. For adipose tissue, an intensity window between −190 and −30 HU was used. Because the tissue of interest is manually selected, competency in anatomic radiology is a prerequisite for these measurements.
FatSeg
FatSeg is an in‐house developed software programme to perform soft tissue measurements on CT scans and was developed using the MeVisLab development environment for medical image processing and visualization version 2.4 (available from
OsiriX
The open‐source 32‐bit edition of OsiriX version 5.8.5 (available from
ImageJ
ImageJ version 1.48 is a freely downloadable public domain software programme developed by the National Institutes of Health for image processing and analyzing (available from
SliceOmatic
SliceOmatic (TomoVIsion, Magog, Canada) version 5.0 (64 bit; available from
Cutaneous tissue disclosure
In OsiriX, cutaneous tissue is not included in the SAT measurement, because this is not automatically selected using the ‘Grow Region (2D/3D Segmentation)’ tool. SliceOmatic also allows to exclude encompassed skin. However, not all software programmes allow to reliably exclude cutaneous tissue from SAT as a consequence of their measurement method: the delineation of tissue of interest using inner and outer contours. Consequently, to ensure highly comparable measurements in three rather than two software programmes, cutaneous tissue was included in the SAT measurements with sliceOmatic. A comparison of SAT measurements using sliceOmatic with and without the inclusion of cutaneous tissue resulted in a median difference of 2.3% (interquartile range 0.8–3.8) and was considered acceptable.
Statistical analysis
Continuous data are presented as mean with the standard error of the mean. Normality was tested using the Shapiro–Wilk test. Differences between the different software packages and within and between observers were compared using the paired samples t‐test for normally distributed data and the Wilcoxon signed rank test for data that were not normally distributed. The inter‐software and inter‐observer and intra‐observer agreement for the cross‐sectional skeletal muscle, visceral adipose tissue, and subcutaneous adipose tissue measurements were calculated using intra‐class correlation coefficients (ICCs) with 95% confidence interval using a two‐way mixed single measures model with absolute agreement. For the inter‐observer correlation, reading 1 of observer B was compared with reading 1 of observer A. Ninety‐five per cent limits of agreement were determined to investigate the agreement between the various software programmes, according to the method described by Bland and Altman. The presence of proportional systematic bias was determined by linear regression analysis of the difference and mean of two measurements. The inter‐software, inter‐observer and intra‐observer agreement of the assessment of sarcopenia and visceral obesity were analyzed using Cohen's κ coefficients. The ICCs and Cohen's κ coefficients were interpreted as poor (0.00–0.49), fair to good (0.50–0.74), and excellent (0.75–1.00), as proposed by Shrout and Fleiss. The Jaccard similarity coefficient, ranging from 0 to 1, was used to compare the similarity and diversity of measurements by dividing the area of the intersection by the size of the union of two measurements. An overlay of two measurements was created, and the Jaccard similarity coefficient was calculated using MeVisLab version 2.7.1 (MeVis Medical Solutions AG, Bremen, Germany). A Jaccard similarity coefficient of 1 represents perfect overlap of two samples, whereas 0 represents no overlap. Two‐tailed P‐values <0.05 were considered statistically significant. All statistical analyses were performed using IBM SPSS Statistics for Windows version 21.0 (IBM Corp. Armonk, NY, USA).
Results
Patients
The study population consisted of 29 men (58%) and 21 women (42%) with a median age of 62 years (range 33–81) and a median body mass index of 24.6 kg/m2 (range 16.5–38.8). Ten patients had stage II (20.0%), 24 stage III (48.0%), and 15 stage IV (30.0%) rectal cancer. Tumour stage was unknown for one patient. The mean CSMA, VAT, and SAT for all measurements are provided in Tables and .
Mean cross‐sectional skeletal muscle and visceral and subcutaneous adipose tissue area (cm2) measurements and intra‐observer agreement indices (i.e. ICC) using FatSeg, OsiriX, ImageJ, and sliceOmatic of observer ASoftware | Observer A | ||||||
Reading 1 (cm2) | SEM | Reading 2 (cm2) | SEM | Mean difference (95% CI) | P‐value | ICC (95% CI) | |
Skeletal muscle area | |||||||
FatSeg | 139.0 | 5.2 | 139.3 | 5.2 | −0.3 (−0.6; 0.0) | 0.072 | 0.999 (0.999–1.000) |
OsiriX | 139.4 | 5.2 | 138.7 | 5.1 | 0.7 (0.4; 1.0) | <0.001 | 0.999 (0.999–1.000) |
ImageJ | 139.0 | 5.2 | 139.3 | 5.1 | −0.3 (−0.6; −0.1) | 0.013 | 1.000 (0.999–1.000) |
sliceOmatic | 138.7 | 5.2 | 138.6 | 5.2 | 0.1 (−0.2; 0.4) | 0.441 | 1.000 (0.999–1.000) |
Visceral adipose tissue area | |||||||
FatSeg | 149.9 | 13.1 | 149.2 | 13.1 | 0.7 (0.3; 1.0) | <0.001 | 1.000 (1.000–1.000) |
OsiriX | 147.6 | 13.0 | 147.3 | 13.0 | 0.3 (−0.3; 0.8) | 0.220 | 1.000 (1.000–1.000) |
ImageJ | 148.6 | 13.0 | 150.8 | 12.8 | −2.2 (−7.5; 3.1) | 0.003 | 0.979 (0.964–0.988) |
sliceOmatic | 147.1 | 13.0 | 146.6 | 13.0 | 0.5 (0.2; 0.9) | 0.004 | 1.000 (1.000–1.000) |
Subcutaneous adipose tissue area | |||||||
FatSeg | 158.9 | 11.2 | 158.9 | 11.2 | 0.1 (−0.2; 0.3) | 0.359 | 1.000 (1.000–1.000) |
OsiriX | 155.9 | 11.2 | 155.7 | 11.3 | 0.2 (−0.1; 0.4) | 0.137 | 1.000 (1.000–1.000) |
ImageJ | 158.9 | 11.2 | 159.1 | 11.3 | −0.2 (−0.5; 0.0) | 0.201 | 1.000 (1.000–1.000) |
sliceOmatic | 158.8 | 11.3 | 158.8 | 11.3 | 0.0 (−0.3; 0.2) | 0.448 | 1.000 (1.000–1.000) |
Software | Observer A | Observer B | Mean difference (95% CI) | P‐value | ICC (95% CI) | ||
Reading 1 (cm2) | SEM | Reading 1 (cm2) | SEM | ||||
Skeletal muscle area (CSMA) | |||||||
FatSeg | 139.0 | 5.2 | 140.1 | 5.2 | −1.1 (−1.4; −0.8) | <0.001a | 0.999 (0.989–1.000) |
OsiriX | 139.4 | 5.2 | 139.7 | 5.1 | −0.3 (−0.5; 0.0) | 0.047a | 1.000 (0.999–1.000) |
ImageJ | 139.0 | 5.2 | 139.8 | 5.2 | −0.8 (−1.0; −0.5) | <0.001a | 0.999 (0.997–1.000) |
sliceOmatic | 138.7 | 5.2 | 139.3 | 5.2 | −0.6 (−0.9; −0.2) | 0.006a | 0.999 (0.998–1.000) |
Visceral adipose tissue area (VAT) | |||||||
FatSeg | 149.9 | 13.1 | 148.7 | 13.1 | 1.2 (0.8; 1.5) | <0.001b | 1.000 (0.999–1.000) |
OsiriX | 147.6 | 13.0 | 147.3 | 13.0 | 0.3 (−0.3; 0.8) | 0.133b | 1.000 (1.000–1.000) |
ImageJ | 148.6 | 13.0 | 148.4 | 13.1 | 0.3 (−0.1; 0.6) | 0.015b | 1.000 (1.000–1.000) |
sliceOmatic | 147.1 | 13.0 | 146.9 | 13.0 | 0.2 (−0.1; 0.5) | 0.412b | 1.000 (1.000–1.000) |
Subcutaneous adipose tissue area (SAT) | |||||||
FatSeg | 158.9 | 11.2 | 159.2 | 11.3 | −0.3 (−0.5; −0.1) | 0.005b | 1.000 (1.000–1.000) |
OsiriX | 155.9 | 11.2 | 155.8 | 11.3 | 0.1 (−0.3; 0.5) | 0.918b | 1.000 (1.000–1.000) |
ImageJ | 158.9 | 11.2 | 158.7 | 11.2 | 0.2 (−0.2; 0.5) | 0.306b | 1.000 (1.000–1.000) |
sliceOmatic | 158.8 | 11.3 | 158.5 | 11.2 | 0.2 (0.0; 0.5) | 0.183b | 1.000 (1.000–1.000) |
Inter‐software agreement
The inter‐software ICCs were excellent (≥0.999) for the CSMA, VAT, and SAT for all software programmes with P‐values <0.001 (Table ). Figure and Supporting Information, Figures S1 and S2 show the Bland–Altman 95% limits of agreement plots, with the mean difference and 95% limits of agreement for the CSMA, VAT, and SAT for both observers. All plots show a good agreement between the various software programmes. Small limits of agreement are observed in the CSMA measurements, whereas these limits of agreement are greater for the VAT and SAT measurements. Proportional systematic bias was observed between FatSeg and OsiriX for CSMA (P = 0.049) for observer B (Figure A) and between FatSeg and sliceOmatic for SAT (P = 0.031) for observer A (Supporting Information, Figure S2C). Furthermore, proportional systematic bias was frequently observed between programmes for VAT measurements (Supporting Information, Figure S1). Comparable results were achieved when non a priori selected slices of observer B (reading 2) were analyzed (data not shown). The mean Jaccard similarity coefficients for the inter‐software comparisons are summarized in Table and depicted in Figure .
Mean cross‐sectional skeletal muscle and visceral and subcutaneous adipose tissue area (cm2) measurements and inter‐software agreement indices (i.e. ICC) using FatSeg, OsiriX, ImageJ, and sliceOmatic of reading 1 of observer BSoftware programme | Mean difference (cm2) (95% CI) | SEM | P‐value | ICC (95% CI) |
Skeletal muscle area (CSMA) | ||||
FatSeg–OsiriX | −0.4 (−0.8; 0.0) | 0.184 | 0.047 | 0.999 (0.999–1.000) |
FatSeg–ImageJ | 0.0 (−0.3; 0.3) | 0.151 | 0.992 | 1.000 (0.999–1.000) |
FatSeg–sliceOmatic | 0.3 (−0.2; 0.8) | 0.230 | 0.207 | 0.999 (0.998–0.999) |
OsiriX–ImageJ | 0.4 (0.1; 0.7) | 0.161 | 0.023 | 0.999 (0.999–1.000) |
OsiriX–sliceOmatic | 0.7 (0.3; 1.1) | 0.189 | 0.001 | 0.999 (0.998–1.000) |
ImageJ–sliceOmatic | 0.3 (−0.1; 0.7) | 0.208 | 0.165 | 0.999 (0.999–1.000) |
Visceral adipose tissue area (VAT) | ||||
FatSeg–OsiriX | 2.3 (1.6; 2.9) | 0.326 | <0.001 | 0.999 (0.995–1.000) |
FatSeg–ImageJ | 1.2 (0.8; 1.7) | 0.203 | <0.001 | 1.000 (0.999–1.000) |
FatSeg–sliceOmatic | 2.8 (2.3; 3.2) | 0.238 | <0.001 | 0.999 (0.971–1.000) |
OsiriX–ImageJ | −1.0 (−1.5; −0.6) | 0.237 | <0.001 | 1.000 (0.999–1.000) |
OsiriX–sliceOmatic | 0.5 (0.0; 0.9) | 0.229 | 0.044 | 1.000 (1.000–1.000) |
ImageJ–sliceOmatic | 1.5 (1.2; 1.8) | 0.158 | <0.001 | 1.000 (0.995–1.000) |
Subcutaneous adipose tissue area (SAT) | ||||
FatSeg–OsiriX | 3.0 (2.5; 3.6) | 0.256 | <0.001 | 0.999 (0.948–1.000) |
FatSeg–ImageJ | 0.1 (–0.3; 0.4) | 0.180 | 0.698 | 1.000 (1.000–1.000) |
FatSeg–sliceOmatic | 0.2 (−0.1; 0.5) | 0.141 | 0.240 | 1.000 (1.000–1.000) |
OsiriX–ImageJ | −3.0 (−3.5; −2.5) | 0.260 | <0.001 | 0.999 (0.956–1.000) |
OsiriX–sliceOmatic | −2.9 (−3.3; −2.5) | 0.211 | <0.001 | 0.999 (0.932–1.000) |
ImageJ–sliceOmatic | 0.1 (−0.2; 0.4) | 0.139 | 0.485 | 1.000 (1.000–1.000) |
Mean Jaccard similarity coefficients | ||||||
Observer A (reading 1) (range) | Observer B (reading 1) (range) | |||||
CSMA | VAT | SAT | CSMA | VAT | SAT | |
FatSeg–OsiriX | 0.978 (0.940–0.997) | 0.964 (0.825–0.996) | 0.965 (0.928–0.976) | 0.983 (0.948–0.997) | 0.973 (0.886–0.997) | 0.965 (0.926–0.998) |
FatSeg–ImageJ | 0.982 (0.935–0.996) | 0.981 (0.912–0.999) | 0.988 (0.900–0.999) | 0.987 (0.959–0.998) | 0.981 (0.903–0.998) | 0.990 (0.968–0.998) |
FatSeg–sliceOmatic | 0.978 (0.937–0.996) | 0.970 (0.908–0.997) | 0.989 (0.964–0.998) | 0.981 (0.927–0.996) | 0.972 (0.860–0.997) | 0.987 (0.960–0.998) |
OsiriX–ImageJ | 0.982 (0.935–0.996) | 0.968 (0.856–0.995) | 0.964 (0.900–0.998) | 0.983 (0.948–0.997) | 0.976 (0.891–0.998) | 0.966 (0.927–0.997) |
OsiriX–sliceOmatic | 0.979 (0.941–0.997) | 0.974 (0.876–0.997) | 0.988 (0.900–0.998) | 0.985 (0.944–0.997) | 0.973 (0.884–0.998) | 0.967 (0.923–0.998) |
ImageJ–sliceOmatic | 0.979 (0.950–0.994) | 0.967 (0.809–0.995) | 0.966 (0.928–0.997) | 0.983 (0.932–0.997) | 0.975 (0.855–0.998) | 0.988 (0.965–0.999) |
Intra‐observer and inter‐observer agreement
The ICCs for the intra‐observer agreement of observer A were all 0.979 or higher for the different software programmes, approaching perfect correlation (Table ). The ICCs for the inter‐observer agreement also approached perfect agreement (all ≥0.999, Table ). The mean CSMA was significantly lower for observer A compared with observer B for all software programmes. A significantly higher mean VAT of observer A was found using FatSeg (149.9 cm2 vs. 148.7 cm2, P < 0.001) and ImageJ (148.6 cm2 vs. 148.4 cm2, P = 0.015) compared with observer B, whereas the mean VAT of both observers did not significantly differ for OsiriX (P = 0.133) and sliceOmatic (P = 0.412). The mean SAT did significantly differ for FatSeg (158.9 cm2 vs. 159.2 cm2, P = 0.005) between the observers. Comparable results were observed when non a priori selected slices of reading 2 of observer B were used for analyses (data not shown). The mean Jaccard similarity coefficients for the inter‐observer and intra‐observer comparisons are summarized in Table and depicted in Supporting Information, Figures S3 and S4. All remaining worst Jaccard similarity coefficients are provided in Supporting Information, Figure S5.
Mean Jaccard indices for inter‐observer (reading 1 of observer A vs. reading 1 of observer B) and intra‐observer comparisons (reading 1 vs. reading 2 of observer A)Mean Jaccard similarity coefficients | ||||||
Inter‐observer (range) | Intra‐observer (range) | |||||
CSMA | VAT | SAT | CSMA | VAT | SAT | |
FatSeg | 0.981 (0.949–0.997) | 0.976 (0.908–0.998) | 0.991 (0.969–0.999) | 0.982 (0.961–1.000) | 0.984 (0.916–0.999) | 0.991 (0.956–1.000) |
OsiriX | 0.985 (0.960–0.997) | 0.973 (0.835–0.997) | 0.989 (0.960–1.000) | 0.984 (0.953–0.997) | 0.975 (0.838–0.998) | 0.990 (0.967–1.000) |
ImageJ | 0.982 (0.931–0.993) | 0.980 (0.905–0.997) | 0.988 (0.899–0.999) | 0.985 (0.948–1.000) | 0.982 (0.891–0.998) | 0.990 (0.900–1.000) |
sliceOmatic | 0.981 (0.939–0.997) | 0.976 (0.876–0.996) | 0.989 (0.959–0.999) | 0.986 (0.961–0.997) | 0.980 (0.901–0.998) | 0.993 (0.967–1.000) |
The classification of sarcopenia and visceral obesity
The inter‐software Cohen's κ's of the first reading of observer A for the classification of sarcopenia were 0.96 (between FatSeg and Osirix, OsiriX and ImageJ, and ImageJ and sliceOmatic), 0.92 (between FatSeg and ImageJ, and Osirix and sliceOmatic), and 0.88 (between FatSeg and sliceOmatic). No inter‐software differences were found in the classification of patients with and without sarcopenia for observer B. According to the cut‐off values used, all software programmes diagnosed sarcopenia in 16 men (55.2%) and 8 women (38.1%). This resulted in a Cohen's κ of 1.00 for all comparisons between software programmes (P < 0.001).
The Cohen's κ for the intra‐observer agreement of sarcopenia assessment of observer A was 0.96 using FatSeg and ImageJ and 1.00 for OsiriX and sliceOmatic (all P < 0.001).
The Cohen's κ for the inter‐observer agreement (reading 1 of observer A vs. reading 1 of observer B) of sarcopenia assessment was 0.92 for sliceOmatic, 0.96 for FatSeg and ImageJ, and 1.00 for Osirix (all P < 0.001).
The classification of visceral obesity
In total, 17 men (58.6%) and 9 women (42.9%) were classified as visceral obese using FatSeg, OsiriX, ImageJ, and sliceOmatic in all readings. This resulted in a Cohen's κ of 1.00 for all comparisons (all P < 0.001).
Discussion
This study shows that the inter‐software agreement was excellent for all software programmes. Furthermore, the inter‐observer and intra‐observer agreements were excellent for four distinct software programmes to assess CSMA, VAT, and SAT on abdominal CT scans with high Jaccard similarity coefficients.
Body composition analyses using abdominal CT scans are increasingly being performed. In multiple surgical populations, such as vascular, gastrointestinal, urological, gynaecological, and transplantation surgery, the association between low skeletal muscle mass and an increased risk of post operative complications, recurrent disease, or impaired survival has been shown. Low skeletal muscle mass is also related to discharge destination in elderly trauma patients, associated with an increased risk of dose‐limiting chemotherapy toxicity and with morbidity and mortality in various oncologic populations, such as lung cancer and melanoma patients. Furthermore, CT‐assessed visceral obesity is associated with worse short‐term and long‐term outcome in distinct patient populations undergoing surgery. Various software programmes have been used to measure body composition in these studies. The current study shows that the results of these studies can reliably be compared. Based on our findings, it is likely that this is also true for other software programmes, which similarly compute skeletal muscle area by quantifying selected voxels within preset HU intensity thresholds [e.g. studies that used software programmes designed in MATLAB (MathWorks, Natick, MA, USA)].
Software programmes for various body composition measurements on CT images, such as adipose tissue surface area, skeletal muscle tissue surface area, and liver volumetric measurements, have been compared in multiple previous studies, demonstrating high levels of agreement. Excellent agreement levels between sliceOmatic and ImageJ, as well as between observers using sliceOmatic for CSMA measurements have previously been reported. Furthermore, excellent agreement levels between OsiriX and ImageJ have been observed for paraspinal muscle measurements on magnetic resonance images. Nevertheless, this is the first study to compare multiple software programmes for the measurement of CSMA, VAT, and SAT, showing that previous studies investigating the association between skeletal muscle mass on the one side and visceral or subcutaneous adipose tissue on the other side, and patient outcomes can reliably be compared.
The skeletal muscle area (cm2) measured at a single cross‐sectional CT image at the level of the third lumbar vertebra (L3) is linearly related to total body skeletal muscle mass and is therefore corrected for height squared (m2), as is conventional for body composition measures. This results in the L3 muscle index (cm2/m2). Another frequently used method is measuring the total psoas area (TPA). The principle of TPA measurements is identical to L3 muscle area measurements, using single cross‐sectional CT images. Therefore, the findings of this study may be extrapolated to TPA measurements as well. Nevertheless, this should be confirmed in a future study.
Significant differences were observed between the mean skeletal muscle areas within and between observers. However, these mean differences are small and consequently could be considered as not clinically relevant. Differences in individual measurements resulted, for instance, from the incorrect annotation of skeletal muscle tissue (see Figure 2A2 for an example of an intra‐observer difference). However, we decided not to correct measurements in retrospect to show inter‐observer and intra‐observer agreements. In our opinion, this study reflects daily practice, with observers who have excellent (radiological) anatomical knowledge performing body composition measurements. Regardless of these human errors and some inter‐observer and intra‐observer differences, high comparability between software programmes was observed.
Significant differences between VAT measurements were also observed with greater mean differences between software. This could be due to the greater complexity of the measurement technique, as intraluminal content (i.e. fat in stool) needs to be manually erased. The greatest significant mean differences in SAT could partly been explained by the fact that in OsiriX the cutaneous adipose tissue in not included in the SAT, in contrast to the other software programmes. Furthermore, every tissue of interest needs to be manually selected in OsiriX, in contrast to the other programmes in which methods of delineating or a painting brush can be used to select regions of interest.
Significant differences in the mean VAT (FatSeg and ImageJ) within observer A and in the mean VAT (FatSeg, OsiriX, and sliceOmatic) and SAT (FatSeg, OsiriX, and ImageJ) between observers (reading 2 of observer A with non a priori selected slices; data not shown) were found, whereas the CSMA did not significantly differ. One explanation for the differences in VAT and SAT could be the random slice selection. After all, the distribution of the intra‐abdominal content (e.g. bowel) can greatly differ between slices. Consequently, single slice measurement of visceral adipose tissue would not be clinical applicable and should be reserved for clinical research of patient cohorts rather than individual patients. For SAT, the variance of subcutaneous adipose tissue distribution could have led to the observed differences. Nevertheless, all differences are relatively small and could therefore be considered as not clinically relevant. The inter‐observer agreement levels for OsiriX and sliceOmatic are in line with previous studies that showed a strong and significant correlation between CSMA measurements of two observers. The inter‐observer agreements for FatSeg and ImageJ have never been reported before, whereas a high agreement for the classification of patients with sarcopenia, as expressed in Cohen's κ, has previously been reported.
Several limitations apply to the current study and the used software programmes. First, both observers in the current study were experienced in quantifying skeletal muscle mass using these software programmes prior to conducting this study. Therefore, the agreement rates that were obtained may not apply to less experienced users. Second, OsiriX is only compatible with Macintosh, which is less commonly used in clinical practice. Furthermore, FatSeg is not freely downloadable as it is an in‐house developed software programme that has not been made publically available, in contrast to OsiriX and ImageJ. A license is required for the use of sliceOmatic. Third, this study could only assess the agreement of the measurement with different software programmes on the same data. Intra‐scanner and inter‐scanner reproducibility of the measurement could not be assessed with the current study design. Last, previous studies reported an approximate time of 8 min to quantify skeletal muscle, visceral, and subcutaneous adipose tissue in liver transplant patients using sliceOmatic. Although some differences in user‐friendliness were observed while performing the measurements, these were not objectively observed and scored in the current study. Consequently, these are not described.
Conclusions
In conclusion, this study showed that four different software programmes have an excellent agreement to measure VAT and SAT, and CSMA in particular on abdominal CT scans, which enables reliable comparison of results of studies that use these different software programmes. Multiple slice analysis is preferred for VAT and SAT measurements.
Acknowledgements
The authors would like to thank Laurens Groenendijk and Elsaline Rijkse of the Imaging Trial Office, department of Radiology and Nuclear Medicine, Erasmus MC University Medical Centre, Rotterdam, the Netherlands, for anonymizing and providing the CT scans, and Yves Martel of sliceOmatic (TomoVision, Magog, Canada) for providing a temporary free license to use the software package. The authors certify that they comply with the ethical guidelines for publishing in the Journal of Cachexia, Sarcopenia and Muscle: update 2015.
JLAvV designed the study, performed the measurements (observer B), analyzed and interpreted the data, and wrote the manuscript. SL designed the study, interpreted the data, and wrote the manuscript. AG performed the measurements (observer A) and analyzed the data. MK and WJN developed and provided the FatSeg software programme and provided technical advice and support for the measurements and the calculation of the Jaccard similarity coefficients. SPW provided statistical and methodological advice and interpreted the data. JWA Burger provided the clinical data. RWFdB and JNMIJ interpreted the data and supervised the study. All authors critically revized the manuscript and approved the manuscript for publication.
Conflict of interest
J.L.A. van Vugt, S. Levolger, A. Gharbharan, M. Koek, J.W.A. Burger, S.P. Willemsen, R.W.F. de Bruin and J.N.M IJzermans have no conflicts of interest. W.J. Niessen is co‐founder and shareholder of Quantib BV.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2017. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Background
The association between body composition (e.g. sarcopenia or visceral obesity) and treatment outcomes, such as survival, using single‐slice computed tomography (CT)‐based measurements has recently been studied in various patient groups. These studies have been conducted with different software programmes, each with their specific characteristics, of which the inter‐observer, intra‐observer, and inter‐software correlation are unknown. Therefore, a comparative study was performed.
Methods
Fifty abdominal CT scans were randomly selected from 50 different patients and independently assessed by two observers. Cross‐sectional muscle area (CSMA, i.e. rectus abdominis, oblique and transverse abdominal muscles, paraspinal muscles, and the psoas muscle), visceral adipose tissue area (VAT), and subcutaneous adipose tissue area (SAT) were segmented by using standard Hounsfield unit ranges and computed for regions of interest. The inter‐software, intra‐observer, and inter‐observer agreement for CSMA, VAT, and SAT measurements using FatSeg, OsiriX, ImageJ, and sliceOmatic were calculated using intra‐class correlation coefficients (ICCs) and Bland–Altman analyses. Cohen's κ was calculated for the agreement of sarcopenia and visceral obesity assessment. The Jaccard similarity coefficient was used to compare the similarity and diversity of measurements.
Results
Bland–Altman analyses and ICC indicated that the CSMA, VAT, and SAT measurements between the different software programmes were highly comparable (ICC 0.979–1.000, P < 0.001). All programmes adequately distinguished between the presence or absence of sarcopenia (κ = 0.88–0.96 for one observer and all κ = 1.00 for all comparisons of the other observer) and visceral obesity (all κ = 1.00). Furthermore, excellent intra‐observer (ICC 0.999–1.000, P < 0.001) and inter‐observer (ICC 0.998–0.999, P < 0.001) agreement for all software programmes were found. Accordingly, excellent Jaccard similarity coefficients were found for all comparisons (mean ≥ 0.964).
Conclusions
FatSeg, OsiriX, ImageJ, and sliceOmatic showed an excellent agreement for CSMA, VAT, and SAT measurements on abdominal CT scans. Furthermore, excellent inter‐observer and intra‐observer agreement were achieved. Therefore, results of studies using these different software programmes can reliably be compared.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
1 Department of Surgery, Erasmus MC University Medical Centre, Rotterdam, the Netherlands
2 Department of Radiology and Medical Informatics, Erasmus MC University Medical Centre, Rotterdam, the Netherlands
3 Department of Radiology and Medical Informatics, Erasmus MC University Medical Centre, Rotterdam, the Netherlands; Faculty of Applied Sciences, Delft University of Technology, Delft, the Netherlands
4 Department of Surgical Oncology, Erasmus MC – Daniel den Hoed Cancer Institute, Rotterdam, the Netherlands
5 Department of Biostatistics, Erasmus MC University Medical Centre, Rotterdam, the Netherlands