1 Introduction
Flood forecasting has a critical importance as it results in decisions of high societal value. In order to produce the most accurate flood predictions, it is essential to provide public authorities with the best combination of data and models, and with a robust knowledge of the model behaviour in terms of reliability and uncertainty. Modellers thus have a responsibility to deeply assess the strengths and limitations of model forcing data.
Within this general picture, the topic of community-based monitoring aimed at providing crowdsourced data (CSD) is gaining increasing attention among hydrologists . For example, the availability of hydrometric data, collected by active citizens in the course of severe flood events, offers a new, exciting chance to improve real-time flood forecasts. However, the use of CSD poses challenges to modellers since their information content, reliability, arrival frequency, and location are a priori unknown . In addition, long time series of CSD are unavailable, thus complicating efforts to assess their effectiveness in improving flood prediction.
In pioneering applications , CSD collected in the upper part of a basin were assimilated into adaptive hydrological models to reduce uncertainty in forecasting flood hydrographs at downstream sections. In this recent work, paid particular attention to the issues of uncertainty and irregular arrival frequency of CSD. Their results showed that assimilation of CSD improves the overall model performance. They also showed that the accuracy of CSD is, in general, more important than their arrival frequency.
In their work, the authors used synthetic (i.e. not actually measured) CSD, because real streamflow CSD were not available at the time of the study. Commenting on this aspect, the authors wrote that “the developed methodology is not tested with data coming from actual social sensors. Therefore, the conclusions need to be confirmed using real crowdsourced observations of water level”. A practical verification of the results by is indeed necessary; furthermore, particular attention has to be paid to possible drawbacks inherent in the use of CSD for operational flood forecasting and related to model structural uncertainty, which are not discussed in their proof-of-concept study.
The comment is outlined as follows. Section presents an
in-depth assessment of the Bacchiglione River case study
2 Specific comments
2.1 The Bacchiglione catchment closed at Ponte degli Angeli (Vicenza)
Figure 1
The catchment of the Bacchiglione River at Ponte degli Angeli, Vicenza (Italy).
[Figure omitted. See PDF]
The catchment of the upper Bacchiglione River, closed at Ponte degli Angeli in the historical centre of Vicenza (Fig. ), is located in the north of the Veneto Region, a plain that is fringed by the Alpine barrier at a distance of less than 100 km north of the Adriatic Sea .
With regard to the precipitation climatology, the southern part of this plain
is the drier one, with approximately 700–1000 mm of mean annual rainfall,
whereas more than 2000 mm are measured close to the pre-Alpine chain due to
the interaction of the southerly warm and humid currents coming from the
Mediterranean Sea with the mountain barrier . A significant
portion of the annual rainfall often concentrates into very short periods of
time in the form of what often turns out to be an extreme event with deep
convection playing a central role .
As a consequence, severe flooding events have threatened agricultural and
urban areas in recent years
Due to the spatial and temporal variability of the rainfall fields, meteorological models are often unable to provide accurate and reliable quantitative precipitation estimates for the upper Bacchiglione catchment. An example of this inadequacy is given, for instance, by Fig. 13 in .
The upper Veneto plain is a highly populated and urbanized area, with extremely complex drainage and irrigation networks that significantly affect both runoff production and propagation . Within this plain, the Bacchiglione River and its tributaries are provided with relatively high levees , which prevent the exchange of water from inside to outside the riverbed (and vice versa) when the inner water levels are relatively high. As a consequence, the minor channel networks are not always allowed to deliver their drainage water towards the nearest tributary, i.e. the inflow points along the main river reaches change during a flood event depending on the instantaneous water level within the river. This occurrence modifies the network connectedness which, in turn, leads to different mechanisms of hydrologic response in the overall catchment.
Just upstream of the city of Vicenza, an area of up to 1 km (the “Viale Diaz” floodplain, Fig. ) is flooded when the Bacchiglione flow rate exceeds m s. Since about m of water can be temporarily stored in this area, a significant flood attenuation can be produced, particularly in the case of hydrographs with a steep rising limb (which is often the case due to the climatic regime and the catchment characteristics).
Moreover, the lower part of the Bacchiglione basin, north of Vicenza, includes a vast groundwater resurgence zone, in which it is difficult to assess both the actual contribution of resurgence to the Bacchiglione streamflow (up to m s) and the time-variable behaviour of soil moisture.
Clearly, such a system is highly non-linear. Nonetheless, significant parts of the Bacchiglione catchment are poorly monitored, and the remaining parts are completely unmonitored. The Leogra subcatchment (blue shaded area in Fig. ) is provided with a pressure transducer for the measurement of water level at Torrebelvicino (Fig. ). A rating curve derived from theoretical considerations is available for this cross section. However, the absence of instrumental measurements of flow discharge limits its reliability. The Leogra–Timonchio subcatchment (orange shaded area in Fig. ) is monitored by an ultrasonic stage sensor located at Ponte Marchese, just upstream of the confluence with the Orolo River. Flow rate measurements at Ponte Marchese refer only to low hydraulic regimes, and show great variability due to the operation of a hydroelectric power plant located just downstream of Ponte Marchese. The Orolo River (green shaded area in Fig. ), with a discharge capacity of more than one-third of the Bacchiglione at Ponte degli Angeli, is one of its major tributaries. Unfortunately, not only is the Orolo subcatchment completely uncovered by meteorological gauging stations, but no hydrometric gauging stations are present along its reach either. Similarly to the Orolo, the Astichello catchment (red shaded area in Fig. ) is unmonitored and, due to backwater effects, significant areas adjacent to the main channel of the Astichello are flooded when water levels in the Bacchiglione are relatively high. Hence, the discharge that effectively flows from the Astichello into the Bacchiglione River may significantly decrease depending on the water stage within the main course of the Bacchiglione River.
Attention must be paid to the fact that the three major tributaries (Orolo, Timonchio, and Astichello) meet just upstream of the gauging station of Ponte degli Angeli (Fig. ), making it difficult to correctly estimate the actual contribution of each single tributary to the total streamflow. By looking at the tree-like structure of the drainage network in an electrical analogy , the major tributaries of the Bacchiglione are in fact “conductors in parallel”.
Certainly, given the irregular topography of the catchments, the heterogeneity of the landscape, and the complexity of the hydraulic network, it can be stated that the Bacchiglione catchment is poorly monitored.
2.2 The semi-distributed model of the Bacchiglione catchmentIn catchments like that of the Bacchiglione River, for all the reasons reported in the previous section, the accurate prediction of flood hydrographs with continuous time simulation is unquestionably a difficult task .
used an available semi-distributed hydrological model coupled with a Muskingum–Cunge scheme for flood propagation within the main river network, which was originally set up to forecast flood hydrographs of the Bacchiglione River at Ponte degli Angeli (Vicenza). Sensibly, the model was calibrated by minimizing the root mean square error between observed and simulated values of water discharge only at Ponte degli Angeli, which is the only hydrometric station provided with a reliable rating curve. The semi-distributed model, although explicitly representing the hydrological processes within the main subcatchments, has to be intended as a lumped model from a practical standpoint, since the discharge in Ponte degli Angeli is its only control point.
Therefore, regardless of the accuracy of streamflow prediction in Ponte degli Angeli, little can be said about the accuracy of the model in describing the internal states of the system, such as the streamflow along upstream tributaries. This limitation has to be ascribed to uncertainty in precipitation fields, to the paucity of (reliable) flow rate data upstream of Vicenza, and to inherent limitations of the model itself.
Indeed, it has to be remarked that the Muskingum–Cunge model for flood propagation used in considers rectangular river cross sections for the estimation of hydraulic radius, wave celerity, and other hydraulic variables . Accordingly, the effects exerted by the Viale Diaz floodplain, which acts as a sort of in-line natural flood control reservoir on flood propagation, can not be properly accounted for. This means that, if the flood hydrograph is correctly modelled at Ponte degli Angeli, it can not be correctly modelled upstream of the Viale Diaz floodplain (and vice versa).
2.3 The use of synthetic CSD in the Bacchiglione case study
In the Bacchiglione case study, calibrated the model using measured rainfall data to reproduce the streamflow hydrograph at the basin outlet well (call this post-event simulation “scenario 1”). Then they forced the model with predicted rainfall fields that were completely different from the actual storm event (“scenario 2”); in this case, the discharge simulated using forecasted input was very different from that obtained using recorded rainfall, with a significant time shift and errors in predicted discharge ranging between 25 and 50 % at the flood peak (and up to 90 % when considering synchronous data). In their “scenario 3”, similarly to the “observing system simulation experiment” (OSSE) approach, synthetic streamflow CSD extracted from “scenario 1” were assimilated into a new run using the same forcing as in “scenario 2”. Not surprisingly, the model performance in “scenario 3” was significantly better than in “scenario 2”, as the synthetic CSD they assimilated were representative of the model internal states in the best-fit scenario.
The authors argued that the synthetic CSD they used are realistic. For this condition to be met, given that these CSD are the results of the model itself, the model must well represent the physics of the real system (i.e. it must be calibrated or, at least, verified) at locations where CSD are first generated and then assimilated; this is a fundamental hypothesis behind the OSSE approach. The synthetic CSD used in for the Bacchiglione case study are drawn from the model internal states under best-fit conditions. Thus, when the model is forced with different (wrong) input data, their assimilation is expected to be as successful as possible in updating the model states toward the best-fit scenario. However, the accuracy of such synthetic CSD is questionable, since they do not refer to model control points (i.e. they are drawn from the semi-distributed model at locations where the model is neither calibrated nor verified), so nothing can actually be said about the model performance at these locations. In a sense, synthetic CSD used by are optimal (in view of assimilation performance) rather than realistic. Since real CSD are likely biased with respect to the synthetic CSD actually used, assimilation of real CSD can not be as effective as that performed in .
From one point of view, it is possible that such an inconsistency could have led to overrate the importance of CSD, as they considered issues related to CSD precision, but not accuracy . Therefore, additional care must be taken in operational flood forecasting when assimilating CSD into (semi-)distributed hydrological models at locations other than model control points.
3 The use of real CSD in operational flood forecasting
As remarked by , the success of assimilating real CSD in hydrological modelling strictly depends on their accuracy, quantity, and spatial–temporal distribution. However, this comment points out that attention must be paid not only to CSD, but also to the model.
In general, historical data recorded by traditional sensors are first used to
calibrate a model; then, in real-time mode, the same sensors provide data
both to force the model and to update the model states
Given the spatially distributed nature of CSD, spatially explicit hydrological models can take major advantage of CSD. On the other hand, particular care has to be taken when dealing with physically based, (semi-)distributed models, which are known to suffer from equifinality and poor identifiability of model parameters .
After the critical work by , detailed investigations were carried out about the model complexity needed to simulate rainfall–runoff processes. Several studies indicated that the information content in a rainfall–runoff record is sufficient to support models of only very limited complexity . This implies that distributed, or semi-distributed, hydrological models are seldom calibrated; rather, they are commonly over-parametrized, since calibration rarely involves their internal states .
In addition, flood routing processes are typically oversimplified in operational models meant for real-time flood forecasting . For instance, significant effects related to either compound sections, large floodplains connected to the main channel, or confluences causing backwater effects are seldom accounted for.
As a consequence, (semi-)distributed rainfall–runoff models may provide accurate predictions of outflow discharge at the basin outlet and, at the same time, poor predictions of internal states of the system (e.g. the soil moisture content, or the relative contribution of upstream tributaries); in other words, one can likely get the correct answer for the wrong reason . Therefore, (semi-)distributed models can be said to be calibrated only at calibration (or control) points, and verified only at locations in which model results are shown to compare favourably with enough (and accurate enough) measured data.
This caveat particularly applies to assimilation of CSD in hydrological modelling for operational, real-time flood forecasting. Indeed, while CSD typically refer to model internal states, they are assimilated in order to improve the accuracy of the main outputs of the model, such as streamflow hydrographs at basin outlets (model internal states are relatively less important in this context).
Recalling that model input, states, parameters, and outputs (or a subset of
them) can be updated using different data assimilation techniques
, assimilation of CSD in operational flood forecasting
can be helpful provided that the model is able to represent the physics of
the system well at locations where CSD are collected. Of course, data
assimilation can contribute, in many cases, to improving such a
representation. However, when only internal states are updated
As an alternative for operational forecasting, ensemble-based data assimilation methods (e.g. the ensemble Kalman filter or the particle filter) can be used to update jointly model states and parameters and to provide a direct measure of uncertainty . In this way, models cope directly with equifinality and problems of over-parametrization, since parameter posterior distributions are represented by ensembles. Note that typical data assimilation algorithms are in principle able to screen out noisy data automatically, but need to be modified to tackle possible data bias, which otherwise leads to poorly calibrated models. Thus, it is important, regardless of the nature of the data, to verify whether such bias exists before any data assimilation is applied.
Nonetheless, such sophisticated tools may also fail if the model has structural deficiencies that make it unable to represent true system states at given locations. As a representative example, consider the Bacchiglione River (Fig. ) and, specifically, the Viale Diaz floodplain described in Sect. . The role played by such an in-line flood control reservoir in flood routing can not be accounted for using a basic Muskingum–Cunge model that considers rectangular cross sections. It follows that the assimilation of accurate streamflow data referring to a section located just upstream of the Viale Diaz floodplain (e.g. Ponte Marchese; see Fig. ) can likely deteriorate the model predictions in Ponte degli Angeli, downstream of the floodplain.
Shortcomings similar to the one described above, which can be found in many different case studies, can be a priori conjectured through a close inspection of both the physical system and the model characteristic. Their quantitative assessment needs an extensive comparison with measured data; of course, a “blind” use of CSD (i.e. their assimilation at locations where the model is neither calibrated nor verified) is at least questionable.
4 Summary
The approach proposed and investigated by , based on the assimilation of crowdsourced data (CSD), can be generally valuable to improve real-time flood forecasts using non-traditional information now available thanks to active citizens and new technologies.
However, it has to be remarked that physically based modelling of rainfall–runoff and flow routing processes faces limitations ascribed to the paucity of measured data, to the complexity of real environments, and to deficiencies in model structure and parametrization. As a consequence, (semi-)distributed rainfall–runoff models used for operational flood forecasting can provide reliable predictions at locations where calibration is performed (i.e. control points) and, at the same time, incorrectly represent system states elsewhere (e.g. discharges in upstream, ungauged tributaries).
In a context of equifinality and simplified representation of real physical
processes, the accurate prediction of outflow hydrographs can be achieved
even though model internal states do not match the true system states. In such
cases, the assimilation of real CSD can lead to a substantially lower
performance than the use of synthetic CSD would suggest, as it corresponds,
in fact, to updating a model using biased data
A possible solution is the use of ensemble-based data assimilation methods to update jointly model states and parameters. An additional pragmatic recommendation is the collection of accurate measured data for a suitable period, for at least two reasons: (i) to develop reliable rating curves at locations where water level CSD are planned to be collected, and (ii) to calibrate and verify the model ability in describing the system states correctly at the locations in which CSD are collected.
It must be observed that, while scarce control on the collection of CSD can be exerted during significant flood events, the locations at which citizens can collect CSD of water levels are always determined a priori, since the availability of rating curves is a necessary condition in order to convert water levels into discharges. The amount of measured data needed to develop reliable rating curves can also be profitably used to calibrate the model at those sections as well.
As a final remark, both modellers and environmental agencies should comprehensively account for the characteristics of the physical system, for model structure and parametrization, for the design of the sensor network, and for data to be used in both calibration and operational mode.
Competing interests
The author declares that he has no conflict of interest.
Acknowledgements
The editor, Maurizio Mazzoleni, and two anonymous reviewers are gratefully acknowledged for providing valuable comments and suggestions that allowed me to significantly improve the manuscript. Edited by: Stacey Archfield Reviewed by: Maurizio Mazzoleni and two anonymous referees
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2018. This work is published under https://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Citizen science and crowdsourcing are gaining increasing attention among hydrologists. In a recent contribution,
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer






