About the Authors:
Haotian Lin
Contributed equally to this work with: Haotian Lin, Erping Long
Roles Conceptualization, Funding acquisition, Investigation, Project administration, Writing – original draft, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
ORCID logo http://orcid.org/0000-0003-4672-9721
Erping Long
Contributed equally to this work with: Haotian Lin, Erping Long
Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Xiaohu Ding
Roles Methodology, Validation, Visualization, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Hongxing Diao
Roles Project administration, Resources, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Zicong Chen
Roles Formal analysis, Methodology, Software, Validation, Writing – review & editing
Affiliation: School of Public Health, Sun Yat-sen University, Guangzhou, China
Runzhong Liu
Roles Data curation, Investigation, Methodology, Software, Writing – review & editing
Affiliation: School of Public Health, Sun Yat-sen University, Guangzhou, China
Jialing Huang
Roles Formal analysis, Methodology, Validation, Writing – review & editing
Affiliation: School of Public Health, Sun Yat-sen University, Guangzhou, China
ORCID logo http://orcid.org/0000-0001-7158-3782
Jingheng Cai
Roles Methodology, Writing – review & editing
Affiliation: School of Mathematics, Sun Yat-sen University, Guangzhou, China
Shuangjuan Xu
Roles Data curation, Methodology, Writing – review & editing
Affiliation: School of Mathematics, Sun Yat-sen University, Guangzhou, China
Xiayin Zhang
Roles Formal analysis, Investigation, Supervision, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Dongni Wang
Roles Project administration, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Kexin Chen
Roles Data curation, Resources, Writing – review & editing
Affiliation: Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
Tongyong Yu
Roles Methodology, Writing – review & editing
Affiliation: Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
Dongxuan Wu
Roles Resources, Writing – review & editing
Affiliation: Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
Xutu Zhao
Roles Methodology, Writing – review & editing
Affiliation: Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
Zhenzhen Liu
Roles Supervision, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Xiaohang Wu
Roles Supervision, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
ORCID logo http://orcid.org/0000-0002-9398-4330
Yuzhen Jiang
Roles Supervision, Writing – review & editing
Affiliations State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China, UCL Institute of Ophthalmology, University College London and Moorfields Eye Hospital, London, United Kingdom
Xiao Yang
Roles Resources, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Dongmei Cui
Roles Supervision, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Wenyan Liu
Roles Resources, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Yingfeng Zheng
Roles Supervision, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Lixia Luo
Roles Supervision, Writing – review & editing
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
Haibo Wang
Roles Supervision, Writing – review & editing
Affiliation: First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
Chi-Chao Chan
Roles Supervision, Writing – review & editing
Affiliations State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China, Laboratory of Immunology, National Eye Institute, National Institutes of Health, Bethesda, Maryland, United States of America
Ian G. Morgan
Roles Conceptualization, Supervision, Writing – review & editing
Affiliations State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China, ARC Centre of Excellence in Vision Science, Research School of Biology, College of Medicine, Biology and Environment, Australian National University, Canberra, Australian Capital Territory, Australia
Mingguang He
Roles Conceptualization, Data curation, Investigation, Project administration, Resources, Supervision, Writing – original draft, Writing – review & editing
Affiliations State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China, Centre for Eye Research Australia, University of Melbourne, Royal Victorian Eye and Ear Hospital, East Melbourne, Victoria, Australia
ORCID logo http://orcid.org/0000-0002-6664-3442
Yizhi Liu
Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Project administration, Supervision, Validation, Writing – review & editing
* E-mail: [email protected] (YL)
Affiliation: State Key Laboratory of Ophthalmology, Clinical Research Center for Ocular Disease, Zhongshan Ophthalmic Centre, Sun Yat-sen University, Guangzhou, China
ORCID logo http://orcid.org/0000-0003-4108-9593
Introduction
Myopia, the most common visual impairment in children, has increased markedly in Chinese school-aged children in recent years. This “myopia boom” is a significant international public concern, impacting study performance and daily life [1]. The risk of children developing high myopia has become a great concern among parents [2], with thousands of students seeking care at optometric and ophthalmic clinics annually in China. This creates an enormous burden for the healthcare system but provides an unprecedented opportunity to collect large-scale real-world clinical data that are unified and reliable.
Big data available from service providers contain valuable “signals” for authentic disease progression and prognosis; however, the analysis of these data is challenging because such data are often contaminated by various types of “noise”, given that the data are not collected in a controlled research setting [3]. Machine learning offers a ubiquitous and indispensable method to solve these complexities of data noise and heterogeneity, having the capacity to combine enormous numbers of predictors in a non-linear and highly interactive way [4].
This study is a data-and-algorithm-driven analysis of more than half a million optometry records and data derived from long-term population-based cohort studies in China. The goal was to build a prediction algorithm based on machine learning techniques to uncover the key determinants of high myopia and to predict, as early and as accurately as possible, the development of high myopia in adulthood. The performance of the algorithm was validated using multi-source datasets from independent ophthalmic centres and population-based research cohorts. The results provide evidence for health policy-making regarding the practical control of school-age myopia and precise individual interventions.
Methods
Data source
A summary of study procedures is presented in Fig 1. Eight ophthalmic centres were included in the study, including Zhongshan Ophthalmic Centre (ZOC), the Haizhu Optometry Department (HZD), the Huangpu Optometry Department (HPD), the Panyu Optometry Department (PYD), the Dongguan Guangming Ophthalmic Hospital (DGC), the Optometry Centre in Huizhou City (HZC), the Haikou Longhua Optometry Department (LHD), and the Xiuying Optometry Department in Haikou City (HKC). This study also included 2 datasets collected from population-based cohort studies: the Guangzhou Outdoor Activity Longitudinal Trial (GOAL) [5] and the Refractive Error Longitudinal Study (RELS). These 8 ophthalmic centres and 2 cohorts from South China collectively composed a representative medical big data sample for children of Chinese ethnicity. This sample could be generalisable to Chinese children living in Hong Kong, Taiwan, and Singapore, where myopia is similarly a common public health problem in children. The geographical locations and a detailed description of the study population are presented in Fig 2, S1 Table, and S1 Text.
[Figure omitted. See PDF.]
Fig 1. Overall study pipeline.
(A) Eight ophthalmic centres and 2 non-medical (population-based) cohorts were included. To focus on the school-aged population, only individuals aged from 6 to 20 years at the initial examination, with ≥3 visits at ≥1-year intervals were included in the analysis. (B) Candidate predictors included age at examination, SE, and annual progression rate. Using these predictors, the algorithm was used to predict SE and whether patients will progress to high myopia in the subsequent 10 years (with each year as a predictive time point). (C) We used the random forest method of machine learning to establish a prediction algorithm. All records from ZOC were used as the training set. Ten-fold cross-validation and OOB methods were applied for internal validation. The remaining records from the other 7 centres and 2 independent population-based datasets were used for external validation and the multi-resource test, respectively. PR, annual progression rate; OOB, out-of-bag; SE, spherical equivalent; ZOC, Zhongshan Ophthalmic Centre.
https://doi.org/10.1371/journal.pmed.1002674.g001
[Figure omitted. See PDF.]
Fig 2. Overall distribution of the study population.
A total of 132,457 participants were recruited from 8 ophthalmic centres—4 centres in Guangzhou (Zhongshan Ophthalmic Centre [ZOC, 92,484, 69.82%], the Haizhu Optometry Department [HZD, 9,524, 7.19%], the Huangpu Optometry Department [HPD, 2,000,1.52%], and the Panyu Optometry Department [PYD, 3,815, 2.88%]) and 4 centres outside of Guangzhou (the Dongguan Guangming Ophthalmic Hospital [DGC, 12,200, 9.21%], the Optometry Centre in Huizhou City [HZC, 1,854, 1.40%], the Haikou Longhua Optometry Department [LHD, 1,775, 1.35%], and the Xiuying Optometry Department in Haikou City [HKC, 5,590, 4.22%])—and 2 non-medical (population-based) cohorts (the Guangzhou Outdoor Activity Longitudinal Trial [GOAL, 1,525, 1.15%] and the Refractive Error Longitudinal Study [RELS, 1,690, 1.28%]). The 8 ophthalmic centres and 2 cohorts are in South China and Southeast Asia and provided a representative big data sample.
https://doi.org/10.1371/journal.pmed.1002674.g002
Ethics statement
The study adhered to the tenets of the Declaration of Helsinki, and approval for the study protocol was obtained from the Institutional Review Board/Ethics Committee of Sun Yat-sen University (Guangzhou, China). All of the datasets used throughout the study were deidentified prior to transfer to the study investigators.
Dataset preparation
We extracted data from electronic medical record systems collected between January 1, 2005, and December 30, 2015, at the optometry service of 8 participating ophthalmic centres. To focus on the school-aged population, only individuals aged from 6 to 20 years at the initial examination and with ≥3 visits at ≥1-year intervals were included in the current analysis.
Predictors and outcomes
Predictors included age at examination, spherical equivalent (SE), and annual progression rate. Cycloplegic refraction was performed according to a standard protocol in each centre. The right eye was arbitrarily chosen to represent a specific individual.
Using these predictors, we aimed to develop an algorithm to predict SE and presence of high myopia in the subsequent 10 years (with each year as a predictive time point). The presence of high myopia was defined as a SE ≤ −6.0 dioptres.
Algorithm development and validation
Electronic medical records from ZOC were used as the training dataset, and 10-fold cross-validation and out-of-bag (OOB) validation methods [6] were applied for internal validation (details are provided in S2 Text). Meanwhile, a methodological comparison of random forest and other conventional algorithms (generalised estimating equation [7] and mixed-effects model [8]; details are provided in S3 Text) was performed using the average performance of the cross-validation. A complete algorithm was trained on the entire ZOC dataset prior to external validation (variable contributions in S1 Fig).
The refraction data records from the other 7 centres were used for external validation. All individuals from the 7 centres with available refraction records at 18 years of age and with at least 2 visits (≥1-year interval) were included. These records were used to explore the accuracy of prediction at a given time before 18 years of age. Two population-based longitudinal cohorts were used for the multi-resource test.
Random forest algorithm
Random forest is an ensemble learning method that operates by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes (classification) or mean prediction (regression) of the individual trees. Here, we employed the random forest algorithm for the development of the prediction algorithm, which was established in the BrainWave machine learning module [9]. The R randomForest package, which implements Breiman’s classic algorithm, was used to fit the random forest model [10].
Each decision tree in the random forest was built using a bootstrap sample with replacement from the original data. This bootstrap aggregation and random feature selection helped reduce the variance of the algorithm and avoided over-fitting. Consequently, in the random forest algorithm, cross-validation is performed internally, which can be just as effective as using a separate test set to estimate the generalisation error of the training set. Moreover, the random forest algorithm can be used to evaluate the variables in a dataset and to provide a graphical display to assess the importance of each variable.
The 2 random forest parameters, mTry (i.e., the number of input variables randomly chosen at each split) and nTree (i.e., the number of trees to grow for each forest), were set to 2 (square root of 5 features) and 500, respectively. In each tree, each feature received a variable importance (VIMP) score, which can be used to rank and select relatively important features.
Regarding the regression analysis, the most widely used VIMP score of a feature is the average percent increase in the OOB mean square error (MSE) as a result of randomly permuting the OOB feature values [11]. The MSE is the mean of the squared regression residuals, and the VIMP score of a feature indicates its overall predictive ability for the regression. Regarding the classification analysis, the error rate is the proportion of misclassified samples of the total number of samples, and the VIMP score of a feature indicates its overall predictive ability for the classification.
Evaluation metrics
Three evaluation metrics—the coefficient of determination (R2), the root mean square error (RMSE), and the mean absolute error (MAE)—were used to assess the performance of the regression algorithm in predicting a targeted SE [12]. R2 can be expressed aswhere MSE is the same as noted above and Var(y) is the variance of the actual value. The RMSE is the square root of the MSE, which penalises large errors but has the same units as the original response variable being predicted; thus, its magnitude is more easily interpreted. The MAE measures the forecast accuracy by averaging the absolute values of the residuals. The MAE is expressed in the same units as the original response variable and provides an average size of the “miss”, regardless of the direction. This variable can be expressed aswhere yi is the actual value, and is the predicted value. These 3 evaluation metrics were calculated for the different predicted target times of each algorithm.
For classification performance, the receiver operating characteristic (ROC) curves and area under the curve (AUC) values were calculated as a comprehensive evaluation. All analyses were performed using R statistical software version 3.2.4 [13].
Results
Overall study population
A description of the study population is displayed in Table 1. In total, 687,063 longitudinal electronic medical records of 129,242 individuals from 8 ophthalmic centres and 17,113 follow-up records for 3,215 participants in population-based cohorts were included in the analysis. A total of 517,949 records from ZOC were used as the training set (the follow-up duration ranged from 2 to 11 years, mean ± SD 4.6 ± 1.9 years). The datasets of the remaining 7 centres (169,114 records; the follow-up duration ranged from 2 to 11 years, mean ± SD 5.2 ± 2.1 years) were used for external validation, and the records from the 2 population-based cohorts (17,113 records; the follow-up duration ranged from 3 to 5 years, mean ± SD 4.1 ± 1.2 years) were used for multi-resource validation testing.
[Figure omitted. See PDF.]
Table 1. Overall characteristics.
https://doi.org/10.1371/journal.pmed.1002674.t001
Internal validation
For the comparative analysis, the random forest algorithm outperformed the generalised estimating equation and the mixed-effects model in the detection of high myopia (S2 Fig). Therefore, all subsequent analyses were conducted based solely on the random forest algorithm. For classification, AUC values more than 0.9 indicated excellent performance, and values from 0.8 to 0.9 indicated good performance; MAE within ±0.75 dioptres was considered clinically acceptable accuracy (i.e., clinically acceptable prediction) based on the measurement variations of refraction [14].
As presented in Fig 3, our algorithm provided high-precision predictions of high myopia in the cross-validation (the AUC ranged from 0.903 to 0.958 for 3 years, 0.886 to 0.889 for 5 years, and 0.862 to 0.888 for 8 years) and OOB tests (the AUC ranged from 0.934 to 0.986 for 3 years, 0.875 to 0.901 for 5 years, and 0.852 to 0.874 for 8 years). Meanwhile, our algorithm achieved clinically acceptable prediction of the refraction value at each time point (year) after baseline assessment (the MAE ranged from 0.253 to 0.395 for 3 years, 0.394 to 0.496 for 5 years, and 0.503 to 0.799 for 8 years). The regressive performance and calibration curves of the algorithm are presented in Table 2 and S3 Fig, respectively. These calibration results also supported that our algorithm can predict the actual refraction values at time points over 10 years.
[Figure omitted. See PDF.]
Fig 3. Algorithm performance in internal validation.
(A and B) Our algorithm provided high-precision prediction for detecting high myopia in the cross-validation (the AUC ranged from 0.903 to 0.958 for 3 years, 0.886 to 0.889 for 5 years, and 0.862 to 0.888 for 8 years) and OOB tests (the AUC ranged from 0.934 to 0.986 for 3 years, 0.875 to 0.901 for 5 years, and 0.852 to 0.874 for 8 years). (C) AUC values more than 0.9 indicated excellent performance, and AUC values from 0.8 to 0.9 indicated good performance. Our algorithm provided excellent performance in the first 3 years and provided good performance in all 10 years. (D) MAE within 0.75 dioptres was considered a clinically acceptable prediction. Meanwhile, our algorithm achieved clinically acceptable prediction of the refraction value at time points after the initial baseline visit (the MAE ranged from 0.253 to 0.395 for 3 years, 0.394 to 0.496 for 5 years, and 0.503 to 0.799 for 8 years). AUC, area under the curve; D, dioptres; MAE, mean absolute error; OOB, out-of-bag.
https://doi.org/10.1371/journal.pmed.1002674.g003
[Figure omitted. See PDF.]
Table 2. Regressive performance of the algorithm using multi-resource datasets.
https://doi.org/10.1371/journal.pmed.1002674.t002
External validation
The performance of the algorithm in the external validation is presented in Fig 4. Our algorithm achieved stable performance for high myopia detection in the DGC (the AUC ranged from 0.768 to 0.969 for 10 years), the HZD (the AUC ranged from 0.773 to 0.968 for 10 years), the PYD (the AUC ranged from 0.854 to 0.951 for 5 years), the HZC (the AUC ranged from 0.822 to 0.941 for 6 years), the HPD (the AUC ranged from 0.802 to 0.976 for 8 years), the HKC (the AUC ranged from 0.897 to 0.929 for 3 years), and the LHD (the AUC ranged from 0.888 to 0.916 for 2 years). Clinically acceptable prediction of the refraction value was achieved at the majority of the time points examined (the MAE ranged from 0.201 to 0.494 for 3 years, 0.354 to 0.731 for 5 years, and 0.508 to 0.879 for 8 years).
[Figure omitted. See PDF.]
Fig 4. Algorithm performance in external validation.
(A–F) Our algorithm achieved stable performance for high myopia detection in DGC (the AUC ranged from 0.768 to 0.969 for 10 years), the HZD (the AUC ranged from 0.773 to 0.968 for 10 years), the PYD (the AUC ranged from 0.854 to 0.951 for 5 years), the HZC (the AUC ranged from 0.822 to 0.941 for 6 years), the HPD (the AUC ranged from 0.802 to 0.976 for 8 years), the HKC (the AUC ranged from 0.897 to 0.929 for 3 years), and the LHD (the AUC ranged from 0.888 to 0.916 for 2 years). (G) Our algorithm provided excellent performance at 19 time points (43.18%, 19/44) and good performance at 22 time points (50.0%, 22/44). (H) A clinically acceptable prediction of the refraction value was provided at the majority of the time points examined (the MAE ranged from 0.201 to 0.494 for 3 years, 0.354 to 0.731 for 5 years, and 0.508 to 0.879 for 8 years). AUC, area under the curve; D, dioptres; DGC, Dongguan Guangming Ophthalmic Hospital; HKC, Xiuying Optometry Department in Haikou City; HPD, Huangpu Optometry Department; HZC, Optometry Centre in Huizhou City; HZD, Haizhu Optometry Department; LHD, Haikou Longhua Optometry Department; MAE, mean absolute error; PYD, Panyu Optometry Department.
https://doi.org/10.1371/journal.pmed.1002674.g004
With respect to predicting the presence of high myopia (Table 3), our algorithm provided clinically acceptable prediction over 3 years (the AUC ranged from 0.940 to 0.985), 5 years (the AUC ranged from 0.856 to 0.901), and even 8 years (the AUC ranged from 0.801 to 0.837).
[Figure omitted. See PDF.]
Table 3. Algorithm performance in predicting the presence of high myopia at 18 years of age.
https://doi.org/10.1371/journal.pmed.1002674.t003
Multi-resource test
In the multi-resource test (Fig 5), our algorithm presented stable high myopia detection in GOAL (the AUC ranged from 0.784 to 0.869 for 3 years) and RELS (the AUC ranged from 0.752 to 0.845 for 4 years). A clinically acceptable prediction of refraction value was achieved at all time points examined (the MAE ranged from 0.314 to 0.562 for 4 years).
[Figure omitted. See PDF.]
Fig 5. Algorithm performance in the multi-resource test.
(A and B) Our algorithm presented stable high myopia detection in GOAL (the AUC ranged from 0.784 to 0.869 for 3 years) and RELS (the AUC ranged from 0.752 to 0.845 for 4 years). (C) Our algorithm provided excellent performance at 4 time points (57.1%, 4/7) and good performance at 3 time points (42.9%, 3/7). (D) A clinically acceptable prediction of refraction value was achieved at all time points examined (the MAE ranged from 0.314 to 0.562 for 4 years). AUC, area under the curve; D, dioptres; GOAL, Guangzhou Outdoor Activity Longitudinal Trial; MAE, mean absolute error; RELS, Refractive Error Longitudinal Study.
https://doi.org/10.1371/journal.pmed.1002674.g005
Discussion
This study, to our knowledge for the first time, demonstrates the utilisation of large-scale electronic medical record data to generate a random forest algorithm for predicting disease prognosis, which, in our analysis, was the risk of developing high myopia in adulthood. Furthermore, this algorithm exhibited high accuracy in a predicting future trait, i.e., the dioptre value at 18 years of age. Our data suggest that this prediction can be performed as early as 8 years prior to an individual turning 18 years old.
Identifying “severe myopia” in younger children is of major clinical importance but poses a significant challenge. The severity of myopia is often estimated as the degree of SE, with an SE of −6.00 dioptres chosen as the cutoff to define high myopia. High myopia carries a much greater risk of developing other ocular complications, including retinal detachment, glaucoma, and pathological myopia [15,16]. Given that myopia is in the development phase during childhood, it is difficult to choose a specific SE cutoff to define “severe myopia” among children.
A few studies have identified children at a greater risk of progressive myopia [17–19]; however, none to our knowledge has attempted to predict actual SE or risk of high myopia in adulthood. The “risk classification” in previous studies was often inferred from the analysis of short-term longitudinal data, or a control group in the instance of intervention randomised trials. As such, the available data are only generalisable among children who meet the inclusion and exclusion criteria of the specific studies [20]. Due to pragmatic feasibility, most myopia control trials can only run for up to 3 years, and, similarly, longitudinal studies on myopia are often shorter than 5 years. Real-world electronic medical record data from established optometry services in tertiary ophthalmic centres are of considerable advantage in terms of the size of the dataset and the length of follow-up.
A prediction is meaningful only when it is accurate and early enough to provide an added clinical benefit. As demonstrated by our results, the accuracy of prediction is reduced when the targeted prediction time increases. However, interestingly, in our analysis, the accuracy, indicated by the AUC, remained high (0.80–0.90) for up to 8 years in both the internal and external validation. Furthermore, the 95% predicted dioptre of refraction was within 0.5 to 0.8 dioptres of the true value at 8 years. Such accurate “long-term” predictions are critically important given that current treatments for myopia control, including low-dose atropine [21] and orthokeratology lenses [22], are effective but often have potential side effects and therefore must be utilised effectively. In addition, accurate early prediction and timely treatment of myopia in its mild stages are important to maximise the treatment benefits.
Methodologically, the random forest algorithm, which is based on random selection and a combination of predictors [9], achieved superior performance in the current analysis compared to conventional methodologies (i.e., a generalised estimating equation and a mixed-effects model). The added value of the random forest algorithm presented a gradual enlargement after 3 years, indicating that myopia development became increasingly non-linear in long-term range. This advantage can be further appreciated in analyses that require the inclusion of more complex potential predictors in the model.
There are some limitations that must be addressed with the development of a prediction model using large-scale real-world clinical data. First, clinical data collected in real-world settings are often subject to bias, with compromised quality. For instance, although a standard clinical protocol was followed, refraction measurement was performed by a number of different optometrists in the present analysis. Despite this, one may argue that this “noise” can be de-emphasised by a stronger “signal” when the sample size is large enough. This effect has been highlighted in previous myopia genetic studies; for example, the genes identified by the Consortium for Refractive Error and Myopia (CREAM) study [23] (in which the refractive error was measured in every participant) were similar to those reported in the 23andMe study [24] (self-reported refractive error). Second, an algorithm developed from a training set may be subject to over-fitting, whereby the correlation or prediction is spurious [25]. This appears not to be the case in the present study, with our algorithm showing balanced contributions of all involved predictors and robust performance when evaluated in independent, external datasets.
In summary, to our knowledge, this study, for the first time, used large-scale data collected from the electronic health records from the largest ophthalmic centres in China to demonstrate the contribution of big data to the better prediction of disease prognosis. In the context of school-age myopia, the most prevalent eye disease in the Chinese population, our study demonstrates that machine learning prediction algorithms further translate the benefit of big data research into clinical practice. The performance of our predictive algorithm is promising, with large sample sizes and diversified data resources. This work proposes a novel direction for the use of medical big data mining to transform clinical practice and guide health policy-making and precise individualised interventions.
Supporting information
S1 Code. The source code of the algorithm training and validation.
https://doi.org/10.1371/journal.pmed.1002674.s001
(PDF)
S1 Data. The datasets of the Guangzhou Outdoor Activity Longitudinal Trial (GOAL) and the Refractive Error Longitudinal Study (RELS).
https://doi.org/10.1371/journal.pmed.1002674.s002
(XLSX)
S1 Fig. The variable contributions of the trained model on the entire Zhongshan Ophthalmic Centre (ZOC) dataset.
The feature importance of spherical equivalent (SE), age at examination (AAE), and annual progression rate (APR) is presented. IncMSE, increase in mean square error.
https://doi.org/10.1371/journal.pmed.1002674.s003
(TIF)
S2 Fig. Comparative performance of the random forest and other conventional algorithms.
Based on the comparative analysis, the random forest (RF) algorithm outperforms the generalised estimating equation (GEE) and the mixed-effects model (ME) regarding the detection of high myopia.
https://doi.org/10.1371/journal.pmed.1002674.s004
(TIF)
S3 Fig. Calibration curves in cross-validation.
The mean differences of predicted and actual values at each time point are presented, with the interval as 0.25 dioptres (D) (the minimum interval of actual spherical equivalent [SE]). Our algorithm achieved stable prediction of refraction values over 10 years after baseline assessment in cross-validation.
https://doi.org/10.1371/journal.pmed.1002674.s005
(TIF)
S1 Table. Demographic characteristics of the 2 population-based datasets.
https://doi.org/10.1371/journal.pmed.1002674.s006
(DOCX)
S1 Text. Detailed information on cohorts.
https://doi.org/10.1371/journal.pmed.1002674.s007
(DOCX)
S2 Text. Detailed information on out-of-bag validation.
https://doi.org/10.1371/journal.pmed.1002674.s008
(DOCX)
S3 Text. Detailed information on conventional algorithms.
https://doi.org/10.1371/journal.pmed.1002674.s009
(DOCX)
Citation: Lin H, Long E, Ding X, Diao H, Chen Z, Liu R, et al. (2018) Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study. PLoS Med 15(11): e1002674. https://doi.org/10.1371/journal.pmed.1002674
1. Dolgin E. The myopia boom. Nature. 2015;519(7543):276–8. pmid:25788077
2. Morgan IG, Ohno-Matsui K, Saw SM. Myopia. Lancet. 2012;379(9827):1739–48. pmid:22559900
3. Khera R, Krumholz HM. With great power comes great responsibility: big data research from the national inpatient sample. Circ Cardiovasc Qual Outcomes. 2017;10(7):e003846. pmid:28705865
4. Obermeyer Z, Emanuel EJ. Predicting the future—big data, machine learning, and clinical medicine. N Engl J Med. 2016;375(13):1216–9. pmid:27682033
5. He M, Xiang F, Zeng Y, Mai J, Chen Q, Zhang J, et al. Effect of time spent outdoors at school on the development of myopia among children in China: a randomized clinical trial. JAMA 2015;314(11):1142–8. pmid:26372583
6. Willmott CJ. Some comments on the evaluation of model performance. Bull Am Meteorol Soc. 1982;63(11):1309–13.
7. Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika 1986;73(1):13–22.
8. Stiratelli R, Laird N, Ware JH. Random-effects models for serial observations with binary response. Biometrika 1984;40(4):961–71.
9. Breiman L. Random forests. Mach Learn. 2001;45:5–32.
10. Liaw A, Wiener M. Classification and regression by random forest. R News. 2002;2:18–22.
11. Grömping U. Variable importance assessment in regression: linear regression versus random forest. Am Stat. 2009;63(4):308–19.
12. Chai T, Draxler RR. Root mean square error (RMSE) or mean absolute error (MAE)?—arguments against avoiding RMSE in the literature. Geosci Algorithm Dev. 2014;7(3):1525–34.
13. R Core Team. R: a language and environment for statistical computing. Version 3.2.4. Vienna: R Foundation for Statistical Computing; 2015 [cited 2017 Jan 12]. Available from: https://www.R-project.org.
14. Smith G. Refraction and visual acuity measurements: what are their measurement uncertainties? Clin Exp Optom. 2006;89(2):66–72. pmid:16494608
15. Saw SM, Gazzard G, Shih-Yen EC, Chua WH. Myopia and associated pathological complications. Ophthalmic Physiol Opt. 2005;25(5):381–91. pmid:16101943
16. Neelam K, Cheung CM, Ohno-Matsui K, Lai TY, Wong TY. Choroidal neovascularization in pathological myopia. Prog Retin Eye Res. 2012;31:495–525. pmid:22569156
17. Zadnik K, Sinnott LT, Cotter SA. Prediction of juvenile-onset myopia. JAMA Ophthalmol. 2015;133(5):683–9.
18. Chua SY, Sabanayagam C, Cheung YB, Chia A, Valenzuela RK, Tan D, et al. Age of onset of myopia predicts risk of high myopia in later childhood in myopic Singapore children. Ophthalmic Physiol Opt. 2016;36(4):388–94. pmid:27350183
19. Chen Y, Zhang J, Morgan IG, He M. Identifying children at risk of high myopia using population centile curves of refraction. PLoS ONE 2016;11(12):e0167642. pmid:28030593
20. Sherman R, Anderson SA, Dal Pan GJ, Gray GW, Gross T, Hunter NL, et al. Real-world evidence—what is it and what can it tell us? N Engl J Med. 2016;375(23):2293–7. pmid:27959688
21. Chia A, Lu QS, Tan D. Five-year clinical trial on atropine for the treatment of myopia 2: Myopia control with atropine 0.01% eyedrops. Ophthalmology. 2016;123(2):391–9. pmid:26271839
22. Hiraoka T, Kakita T, Okamoto F, Takahashi H, Oshika T. Long-term effect of overnight orthokeratology on axial length elongation in childhood myopia: a 5-year follow-up study. Invest Ophthalmol Vis Sci. 2012;53(7):3913–9. pmid:22577080
23. Fan Q, Verhoeven VJ, Wojciechowski R, Barathi VA, Hysi PG, Guggenheim JA, et al. Meta-analysis of gene-environment-wide association scans accounting for education level identifies additional loci for refractive error. Nat Commun. 2016;7:11008. pmid:27020472
24. Kiefer AK, Tung JY, Do CB, Hinds DA, Mountain JL, Francke U, et al. Genome-wide analysis points to roles for extracellular matrix remodeling, the visual cycle, and neuronal development in myopia. PLoS Genet. 2013;9(2):e1003299. pmid:23468642
25. Hawkins DM. The problem of overfitting. J Chem Info Computer Sci. 2004;44(1):1–12.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2018 Lin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Background
Electronic medical records provide large-scale real-world clinical data for use in developing clinical decision systems. However, sophisticated methodology and analytical skills are required to handle the large-scale datasets necessary for the optimisation of prediction accuracy. Myopia is a common cause of vision loss. Current approaches to control myopia progression are effective but have significant side effects. Therefore, identifying those at greatest risk who should undergo targeted therapy is of great clinical importance. The objective of this study was to apply big data and machine learning technology to develop an algorithm that can predict the onset of high myopia, at specific future time points, among Chinese school-aged children.
Methods and findings
Real-world clinical refraction data were derived from electronic medical record systems in 8 ophthalmic centres from January 1, 2005, to December 30, 2015. The variables of age, spherical equivalent (SE), and annual progression rate were used to develop an algorithm to predict SE and onset of high myopia (SE ≤ −6.0 dioptres) up to 10 years in the future. Random forest machine learning was used for algorithm training and validation. Electronic medical records from the Zhongshan Ophthalmic Centre (a major tertiary ophthalmic centre in China) were used as the training set. Ten-fold cross-validation and out-of-bag (OOB) methods were applied for internal validation. The remaining 7 independent datasets were used for external validation. Two population-based datasets, which had no participant overlap with the ophthalmic-centre-based datasets, were used for multi-resource validation testing. The main outcomes and measures were the area under the curve (AUC) values for predicting the onset of high myopia over 10 years and the presence of high myopia at 18 years of age. In total, 687,063 multiple visit records (≥3 records) of 129,242 individuals in the ophthalmic-centre-based electronic medical record databases and 17,113 follow-up records of 3,215 participants in population-based cohorts were included in the analysis. Our algorithm accurately predicted the presence of high myopia in internal validation (the AUC ranged from 0.903 to 0.986 for 3 years, 0.875 to 0.901 for 5 years, and 0.852 to 0.888 for 8 years), external validation (the AUC ranged from 0.874 to 0.976 for 3 years, 0.847 to 0.921 for 5 years, and 0.802 to 0.886 for 8 years), and multi-resource testing (the AUC ranged from 0.752 to 0.869 for 4 years). With respect to the prediction of high myopia development by 18 years of age, as a surrogate of high myopia in adulthood, the algorithm provided clinically acceptable accuracy over 3 years (the AUC ranged from 0.940 to 0.985), 5 years (the AUC ranged from 0.856 to 0.901), and even 8 years (the AUC ranged from 0.801 to 0.837). Meanwhile, our algorithm achieved clinically acceptable prediction of the actual refraction values at future time points, which is supported by the regressive performance and calibration curves. Although the algorithm achieved balanced and robust performance, concerns about the compromised quality of real-world clinical data and over-fitting issues should be cautiously considered.
Conclusions
To our knowledge, this study, for the first time, used large-scale data collected from electronic health records to demonstrate the contribution of big data and machine learning approaches to improved prediction of myopia prognosis in Chinese school-aged children. This work provides evidence for transforming clinical practice, health policy-making, and precise individualised interventions regarding the practical control of school-aged myopia.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer