It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
Benchmark datasets have a powerful normative influence: by determining how the real world is represented in data, they define which problems will first be solved by algorithms built using the datasets and, by extension, who these algorithms will work for. It is desirable for these datasets to serve four functions: (1) enabling the creation of clinically relevant algorithms; (2) facilitating like-for-like comparison of algorithmic performance; (3) ensuring reproducibility of algorithms; (4) asserting a normative influence on the clinical domains and diversity of patients that will potentially benefit from technological advances. Without benchmark datasets that satisfy these functions, it is impossible to address two perennial concerns of clinicians experienced in computational research: “the data scientists just go where the data is rather than where the needs are,” and, “yes, but will this work for my patients?” If algorithms are to be developed and applied for the care of patients, then it is prudent for the research community to create benchmark datasets proactively, across specialties. As yet, best practice in this area has not been defined. Broadly speaking, efforts will include design of the dataset; compliance and contracting issues relating to the sharing of sensitive data; enabling access and reuse; and planning for translation of algorithms to the clinical environment. If a deliberate and systematic approach is not followed, not only will the considerable benefits of clinical algorithms fail to be realized, but the potential harms may be regressively incurred across existing gradients of social inequity.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Harvard T.H. Chan School of Public Health, Division of Health Policy and Management, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Wellframe Inc., Boston, USA (GRID:grid.38142.3c)
2 Massachusetts Institute of Technology, Laboratory for Computational Physiology, Cambridge, USA (GRID:grid.116068.8) (ISNI:0000 0001 2341 2786)
3 Wellframe Inc., Boston, USA (GRID:grid.116068.8); Harvard T.H. Chan School of Public Health, Department of Biostatistics, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X)
4 Wellframe Inc., Boston, USA (GRID:grid.38142.3c)
5 NIHR Biomedical Research Centre at Moorfields Eye Hospital and UCL Institute of Ophthalmology, London, UK (GRID:grid.451056.3) (ISNI:0000 0001 2116 3923)
6 Massachusetts Institute of Technology, Laboratory for Computational Physiology, Cambridge, USA (GRID:grid.116068.8) (ISNI:0000 0001 2341 2786); Harvard T.H. Chan School of Public Health, Department of Biostatistics, Boston, USA (GRID:grid.38142.3c) (ISNI:000000041936754X); Beth Israel Deaconess Medical Center, Division of Pulmonary, Critical Care and Sleep Medicine, Boston, USA (GRID:grid.239395.7) (ISNI:0000 0000 9011 8547)