Full text

Turn on search term navigation

© 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Classifying images using supervised machine learning (ML) relies on labeled training data—classes or text descriptions, for example, associated with each image. Data‐driven models are only as good as the data used for training, and this points to the importance of high‐quality labeled data for developing a ML model that has predictive skill. Labeling data is typically a time‐consuming, manual process. Here, we investigate the process of labeling data, with a specific focus on coastal aerial imagery captured in the wake of hurricanes that affected the Atlantic and Gulf Coasts of the United States. The imagery data set is a rich observational record of storm impacts and coastal change, but the imagery requires labeling to render that information accessible. We created an online interface that served labelers a stream of images and a fixed set of questions. A total of 1,600 images were labeled by at least two or as many as seven coastal scientists. We used the resulting data set to investigate interrater agreement: the extent to which labelers labeled each image similarly. Interrater agreement scores, assessed with percent agreement and Krippendorff's alpha, are higher when the questions posed to labelers are relatively simple, when the labelers are provided with a user manual, and when images are smaller. Experiments in interrater agreement point toward the benefit of multiple labelers for understanding the uncertainty in labeling data for machine learning research.

Details

Title
Labeling Poststorm Coastal Imagery for Machine Learning: Measurement of Interrater Agreement
Author
Goldstein, Evan B 1   VIAFID ORCID Logo  ; Buscombe, Daniel 2   VIAFID ORCID Logo  ; Lazarus, Eli D 3   VIAFID ORCID Logo  ; Mohanty, Somya D 4   VIAFID ORCID Logo  ; Shah, Nafis Rafique 4   VIAFID ORCID Logo  ; Anarde, Katherine A 5   VIAFID ORCID Logo  ; Ashton, Andrew D 6   VIAFID ORCID Logo  ; Beuzen, Tomas 7   VIAFID ORCID Logo  ; Castagno, Katherine A 8   VIAFID ORCID Logo  ; Cohn, Nicholas 9   VIAFID ORCID Logo  ; Conlin, Matthew P 10   VIAFID ORCID Logo  ; Ellenson, Ashley 11 ; Gillen, Megan 12   VIAFID ORCID Logo  ; Hovenga, Paige A 11   VIAFID ORCID Logo  ; Jin‐Si R. Over 13   VIAFID ORCID Logo  ; Palermo, Rose V 12   VIAFID ORCID Logo  ; Ratliff, Katherine M 14   VIAFID ORCID Logo  ; Reeves, Ian R B 15   VIAFID ORCID Logo  ; Sanborn, Lily H 12   VIAFID ORCID Logo  ; Straub, Jessamin A 9 ; Taylor, Luke A 3   VIAFID ORCID Logo  ; Wallace, Elizabeth J 16   VIAFID ORCID Logo  ; Warrick, Jonathan 17   VIAFID ORCID Logo  ; Wernette, Phillipe 17   VIAFID ORCID Logo  ; Williams, Hannah E 18   VIAFID ORCID Logo 

 Department of Geography, Environment, and Sustainability, University of North Carolina at Greensboro, Greensboro, NC, USA 
 Marda Science LLC, Contracted to USGS Pacific Coastal and Marine Science Center, Santa Cruz, CA, USA 
 Environmental Dynamics Lab, School of Geography and Environmental Science, University of Southampton, Southampton, UK 
 Department of Computer Science, University of North Carolina at Greensboro, Greensboro, NC, USA 
 Department of Civil, Construction, and Environmental Engineering, North Carolina State University, Raleigh, NC, USA 
 Department of Geology and Geophysics, Woods Hole Oceanographic Institution, Woods Hole, MA, USA 
 Department of Statistics, University of British Columbia, Vancouver, BC, Canada 
 Center for Coastal Studies, Provincetown, MA, USA 
 U.S. Army Engineer Research and Development Center, Field Research Facility, Duck, NC, USA 
10  Department of Geological Sciences, University of Florida, Gainesville, FL, USA 
11  College of Engineering, Oregon State University, Corvallis, OR, USA 
12  MIT‐WHOI Joint Program in Oceanography/Applied Ocean Science & Engineering, Cambridge and Woods Hole, MA, USA 
13  U.S. Geological Survey, Coastal and Marine Science Center, Woods Hole, MA, USA 
14  Earth and Ocean Sciences, Duke University, Durham, NC, USA 
15  Department of Geological Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA 
16  Department Earth, Environmental, and Planetary Sciences, Rice University, Houston, TX, USA 
17  U.S. Geological Survey, Pacific Coastal and Marine Science Center, Santa Cruz, CA, USA 
18  Water Engineering, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, UK 
Section
Technical Reports: Methods
Publication year
2021
Publication date
Sep 2021
Publisher
John Wiley & Sons, Inc.
e-ISSN
2333-5084
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2576704753
Copyright
© 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.