This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
1. Introduction
Crash injury severity analysis plays a crucial role in traffic crash analysis, which can assist traffic management [1–4]. Crash injury severity is defined as the degree of injury and property damage caused by a crash event. The crash injury severity analysis aims to explore the correlation between crash injury severity and various contributing factors, such as road-users-related factors, temporal-related factors, environmental conditions, and crash types. The universal rules support traffic managers to better understand the contributions of factors on crash injury severity and further reduce the crash severity and improve traffic safety by developing countermeasures [5–7].
Currently, the research approaches on crash injury severity can be divided into two categories, which are statistical models and machine learning-based models. The statistical models assume that the contributing factors affecting crash injury severity follow a particular distribution, which needs to be defined carefully for better capturing the relationship between crash injury severity and explanatory variables. The commonly used models contain multivariate Poisson regression model [8, 9], ordered probit model [10, 11], bivariate binary/ordered probit model [12, 13], random parameter probit model [14], etc. Wang et al. focused on mountainous expressways and proposed a partial proportional odds model to determine the determinants of truck-involved crash injury severity [15]. Xu et al. attempted to investigate pedestrian-involved crash injury severity by using geographically and temporally weighted regression model taking into account spatial-temporal correlation [16]. The statistical models could demonstrate and explain clearly the correlation between crash severity and related variables with the help of explainable and logical theoretical deductions. However, due to the nonlinear relationship between crash injury severity and contributing factors, these statistical models difficultly capture the inner and intrinsic correlations [17–19].
Machine-learning-based models have a powerful internal inferential capability, which makes them more flexible by learning without or little prior assumptions of related factors to describe the complex characteristics of crash events. Previous researches employed logistic models (e.g., random parameter logit model, and mixed/ordered logit model) [20, 21], support vector machine (SVM) [22, 23], random forest (RF) [24, 25], Bayesian-related models [26, 27], etc., to explore the complex relationship between crash injury severity and contributing factors and further identify the risk factors on crash injury severity. For comprehensive accounting of the observed heterogeneity, Behnood et al. introduced a random parameter multinomial logit model for comparing the contribution of risk factors to crash injury severity under bicycle-vehicle crashes [28]. Liu et al. introduced an ordinal logistic regression model to examine the risk factors on pedestrian-motor vehicle collisions, taking into account the spatial-temporal correlation [29]. Li et al. introduced SVM model to investigate the potential correlation between external factors and crash injury severity, but the performance was suppressed due to multiclass classification problems [30]. Li et al. analyzed the key factors affecting electric bicycle-related crash injury severity with the help of random forest model [31].
Beyond that, Bayesian approaches, as a classical machine learning model, have been widely used in crash injury severity modelling, which were regarded as Bayesian-related models. For instance, Bayesian binomial logistic model [32, 33], Bayesian multivariate regression model [34, 35], Bayesian spatial model [36, 37], and Bayesian mixed logit model [38] have successfully demonstrated their applicability in crash injury severity-involved correlation issues. Yuan et al. divided crash severity into two categories (property damage only and injury/fatality) and integrated bivariate probit model and Bayesian approach to identify the contributing factors associated with crash injury severity [39]. Haq et al. developed binary logistic model with Bayesian inference approach to investigate the effects on truck-involved crashes, especially on occupant injury severity considering comprehensive factors [40]. Guo et al. proposed a novel random parameter, that is, multivariate Tobit model, to identify risk factors on crash severity under different crash types [41]. Zhang et al. utilized a Bayesian multinomial logit model with conditional autoregression prior to examining the hazardous factors that contributed to freeway crash injury severity [42].
In sum, previous researches focused on identifying risk factors towards traffic injury severity by various statistical models and machine learning-based models, and satisfying results were obtained. However, decision tree ensemble-based models, also a type of machine leaning model, including adaptive boosting (AdaBoost), gradient boost decision tree (GBDT), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost), are less utilized for crash injury severity analysis. Additionally, vulnerable road users (VRUs), as the vulnerable groups of the traffic participants, are prone to fatal injuries in crashes [43]. As the most common VRUs, the pedestrians and bicyclists have been paid much attention to, but decision tree ensembles-based models are rarely utilized to investigate the potential universal rules behind the VRUs-involved crashes. Hence, this paper attempts to describe the characteristics of VRUs-involved crashes and identify the contributing factors associated with crash injury severity. Based on this purpose, 554 VRU-vehicle crashes (contains 232 bicycle-vehicle crashes and 322 pedestrian-vehicle crashes) were collected. Moreover, XGBoost is introduced for crash injury severity modelling and ranking the importance of risk factors on crash injury severity. The contribution of this paper is twofold:
(1) Conduct a descriptive statistical analysis to investigate the characteristics of VRUs-involved crashes from the perspective of six risk factors (i.e., time of day, day of week, rush hour, crash position, weather, and crash involvements), and further transform into universal rules to support traffic management
(2) XGBoost is adopted to identify the risk factors contributing to VRUs-involved crash injury severity with the help of VRUs-involved crashes dataset from policy records, which further determine the real causes to enhance traffic safety
The rest of this paper is organized as follows. Section 2 introduces the data details and candidate variables analyzed in this paper. Section 3 describes the details of XGBoost model. Section 4 provides the experimental results, which consist of crash severity characteristics and identified risk factors. Section 5 briefly concludes the study.
2. Data Description
2.1. Data Source
For exploring the characteristic of VRUs-involved crashes, 554 crash samples were collected from police records on crashes, which have occurred in a city in northern China within about four years. The dataset contains various information, such as crash time, position, involvements, and injury severity, and six factors are extracted to explain the characteristics of VRUs-involved crashes. Vehicles and bicycles or pedestrians were involved in one crash, and bicyclists and pedestrians were defined as VRUs. The crashes dataset contains 323 vehicle-bicycle crashes and 322 vehicle-pedestrian crashes. The property-damage-only crashes are excluded because the vehicle-bicycle or vehicle-pedestrian crashes are prone to injury or death, which belong to injury or fatal accidents. Additionally, the crashes dataset consists of 385 injury crashes and 169 fatal crashes, which caused 517 injuries and 173 deaths.
2.2. Candidate Variables
Generally, if fatal or injured occupants are involved in a crash, it can be regarded as a severe accident. Considering that the dataset only contains fatal accidents and injury accidents, but without property-damage-only accidents, the crash injury severity is divided into two categories: injury accident (only injured occupant involved in the crash), which is coded as 0, and fatal accident (at least one fatality occupant involved in the crash), which is coded as 1. Figure 1 describes the extracted factors related to crashes from the dataset, which are time of day, day of week, rush hour, weather, crash position, and crash involvements. These six factors are extracted to investigate the characteristics of VRUs-involved crashes, divided into two typical injury categories (see Table 1).
[figure omitted; refer to PDF]
Table 1
Data list of main variables related to crashes.
| Variable | Categories | Value | Fatal (n = 169) | Injury (n = 385) |
| Time of day | Day (7:00–19:00) | 1 | 92 (54.4%) | 263 (68.3%) |
| Night (19:00–7:00) | 0 | 77 (45.6%) | 122 (31.7%) | |
| Day of week | Weekday | 1 | 118 (69.8%) | 294 (76.4%) |
| Weekend/holiday | 0 | 51 (30.2%) | 91 (23.6%) | |
| Rush hour | Rush hour (7:00–9:00, 17:00–20:00) | 1 | 49 (29.0%) | 126 (32.7%) |
| Off-peak hour | 0 | 120 (71.0%) | 259 (67.3%) | |
| Weather | Good (sunny, cloudy) | 1 | 138 (81.7%) | 320 (83.1%) |
| Adverse (rainy, snowy, etc.) | 0 | 31 (18.3%) | 65 (16.9%) | |
| Crash position | Road section | 1 | 110 (65.1%) | 266 (69.1%) |
| Intersection | 0 | 59 (34.9%) | 119 (30.9%) | |
| Crash involvements | Vehicle-bicycle | 1 | 53 (31.4%) | 179 (46.5%) |
| Vehicle-pedestrian | 0 | 116 (68.6%) | 206 (53.5%) |
To some extent, time of day reflects the lighting conditions laterally, which is a crucial factor for traveling. Considering that, the crash position is complex, which mainly contains road section and intersection, but less sidewalk, roundabout. For better modelling, the crashes happened on sidewalk were regarded as road section. The weather information is collected from the related website (see http://www.tianqihoubao.com/lishi) based on the date and time of crashes [26]. Noting that this website provides the weather information only in two periods, that is daytime and night, it is not detailed enough to the specific hours. Additionally, due to the various types of weather, some of them have similar impact on traveling environment, for instance, sunny and cloudy, rainy and snowy. Therefore, the weather was divided into two categories: good and adverse.
3. Methodology
Extreme gradient boosting (XGBoost), as a typical decision tree ensemble-based model, was proposed by Chen in 2016 [44]. XGBoost is optimized from GBDT, which introduced second-order derivatives into optimization process. It outperforms with advantages of parallel learning, high flexibility, built-in cross-validation, etc. Previous studies have proved the successful use in traffic crash severity analysis and risk prediction [45, 46].
3.1. Objective Function
Given training data
3.2. Additive Training
In the training process, it is intractable to learn all trees simultaneously. Instead, XGBoost introduces an additive strategy, which corrects what we have learned and adds one new tree at a time. The details of additive strategy are provided as
Generally, the constant terms in the objective function are ignored, and the simplified objective function can be obtained as
3.3. Model Complexity
For defining the complexity of tree, we refine the tree
We define
To measure how good a tree is and find the best tree structure, a greedy algorithm is introduced. It starts from a single leaf and attempts to split each leaf into two leaves and then calculates the information gain in this split process (see equation (14)).
4. Results
Based on the 544 crashes data, the time-related information, crash position, weather, and crash involvements are investigated. In the section, six risk factors are extracted to explore the characteristics of VRUs-involved crashes and further determine the risk factors contributing to crash injury severity.
4.1. Descriptive Statistical Analysis
4.1.1. Temporal Characteristics
Figure 2 illustrates the proportion of different accident types under three time-related factors. From the perspective of the time of day, the VRUs-involved crashes are probable to occur in the daytime, while the proportion of fatal crashes at night is relatively higher than those in the daytime, with values of 38.7% and 25.9% (see Table 2). Maybe most people intend to travel during the daytime, which is prone to cause crashes. But at night, due to the terrible travel environment (i.e., poor light visible condition), the crashes are easy to cause deaths. Additionally, the proportion of crashes on weekdays is larger than that on weekends/holidays, but the fatality rate is the opposite and the values of weekday and weekend/holiday are 28.6% and 35.9%, respectively. The reason may be that people keep a relatively low safety alert when traveling on weekends/holidays than on weekdays. Moreover, the VRUs-involved crashes are prone to happen in off-peak hours than in rush hours due to the longer period of off-peak hours. Similarly, the fatality rate of off-peak hours is higher than those of rush hours (the values are 31.7% and 28.0%, respectively).
[figure omitted; refer to PDF]
Moreover, rush hour, day of week, and crash involvements show relatively similar importance, with the information gain scores as 1.42, 1.32, and 1.11, respectively. The reason may be that the categories of rush hour (i.e., rush hour and off-peak hour) show a minor difference of influence on VRUs-involved crashes, and day of week (i.e., weekday and weekend/holiday) and crash involvements (i.e., vehicle-bicycle and vehicle-pedestrian) are similar. The weather and crash position represent the least importance to VRUs-involved crash injury severity, and the information gain values are 0.43 and 0.20. Therefore, we infer that the people who travel in good or adverse weather show a similar impact on crash injury severity, which is consistent with the result of Section 4.1.3 (the mortalities are close in Table 6). This may be because people do not like to travel in adverse weather and they keep a relatively high safety awareness when traveling. Additionally, the VRUs-involved crashes that happened in different position (i.e., road section and intersection) show semblable result.
5. Conclusions
VRUs-involved crash injury severity analysis transforms the relationship behind the crashes into universal rules and further supports traffic management. This paper demonstrates a descriptive statistical analysis of the characteristics of VRUs-involved crashes based on 554 crashes data collected in a city of northern China and further utilizes XGBoost to identify the risk factors affecting crash injury severity. The important conclusions are summarized as follows. (1) The risk factors (i.e., time of day, day of week, rush hour, crash position, weather, and crash involvements) are closely related to VRUs-involved crash injury severity. More specifically, vehicle-bicycle and vehicle-pedestrian crashes are prone to involve fatalities at intersections on the weekend night in adverse weather. (2) The time of day plays a more important role in VRUs-involved crash injury severity compared with other factors, which reveals that VRUs-involved crashes that happened at night are prone to cause deaths. Additionally, the weather has little effect on VRUs-involved crash injury severity. (3) Compared to vehicle-bicycle crashes, vehicle-pedestrian crashes are prone to happen at intersections (especially at the crosswalk near the intersection), and these crashes readily cause deaths.
Although few factors were analyzed, the AUC and accuracy of XGBoost are 0.675 and 0.706, respectively, and the results still can be accepted and meet the current study. To obtain more accurate and detailed characteristics of VRU-vehicle crash injury severity, several research directions are proposed. (1) More risk factors (e.g., lighting condition, drivers’ age, gender, crash pattern, and crash location related factors) can be considered to better explain the characteristics of VRU-vehicle crash injury severity and further identify the crucial risk factors. The characteristics of VRU-vehicle crash injury severity are not perfectly and accurately exploited due to the limitation of the risk factors. However, abundant risk factors may cause unfaithful characteristics to be described. To this topic, how to extract an appropriate number and precise risk factors is a crucial challenge. (2) Risk factors identified mechanism can be developed with high accuracy and robustness on crash injury severity analysis, such as random forest (RF) and nonparametric Bayesian approach, to better explain the characteristics and determine the real causes of crashes. The XGBoost model facilitates the investigation of crash injury severity issues, but the accuracy is limited due to the small sample size. Therefore, how to develop risk factors identified approach with a small sample size is a hot point. In addition, how to consider the spatial-temporal correlations in modelling process is a crucial challenge.
Acknowledgments
This paper was supported by the National Natural Science Foundation of China (nos. 52102397 and 52072214) and the National Key R & D Program of China (no. 2019YFB1600605).
[1] Y. Yuan, M. Yang, Y. Guo, S. Rasouli, Z. Gan, Y. Ren, "Risk factors associated with truck-involved fatal crash severity: analyzing their impact for different groups of truck drivers," Journal of Safety Research, vol. 76, pp. 154-165, DOI: 10.1016/j.jsr.2020.12.012, 2021.
[2] P. Liu, W. Fan, "Analysis of head-on crash injury severity using a partial proportional odds model," Journal of Transportation Safety & Security, vol. 13 no. 7, pp. 714-734, DOI: 10.1080/19439962.2019.1667933, 2019.
[3] D. Li, P. Ranjitkar, Y. Zhao, H. Yi, S. Rashidi, "Analyzing pedestrian crash injury severity under different weather conditions," Traffic Injury Prevention, vol. 18 no. 4, pp. 427-430, DOI: 10.1080/15389588.2016.1207762, 2017.
[4] X. Chen, Z. Li, Y. Yang, L. Qi, R. Ke, "High-resolution vehicle trajectory extraction and denoising from aerial videos," IEEE Transactions on Intelligent Transportation Systems, vol. 22 no. 5, pp. 3190-3202, DOI: 10.1109/tits.2020.3003782, 2020.
[5] C. Lee, M. Abdel-Aty, "Comprehensive analysis of vehicle-pedestrian crashes at intersections in Florida," Accident Analysis & Prevention, vol. 37 no. 4, pp. 775-786, DOI: 10.1016/j.aap.2005.03.019, 2005.
[6] M. G. Mohamed, N. Saunier, L. F. Miranda-Moreno, S. V. Ukkusuri, "A clustering regression approach: a comprehensive injury severity analysis of pedestrian-vehicle crashes in New York, US and Montreal, Canada," Safety Science, vol. 54, pp. 27-37, DOI: 10.1016/j.ssci.2012.11.001, 2013.
[7] X. Chen, S. Wang, C. Shi, H. Wu, J. Zhao, J. Fu, "Robust ship tracking via multi-view learning and sparse representation," Journal of Navigation, vol. 72 no. 1, pp. 176-192, DOI: 10.1017/s0373463318000504, 2019.
[8] J. Ma, K. M. Kockelman, P. Damien, "A multivariate Poisson-lognormal regression model for prediction of crash counts by severity, using Bayesian methods," Accident Analysis & Prevention, vol. 40 no. 3, pp. 964-975, DOI: 10.1016/j.aap.2007.11.002, 2008.
[9] K. El-Basyouny, S. Barua, M. T. Islam, "Investigation of time and weather effects on crash types using full Bayesian multivariate Poisson lognormal models," Accident Analysis & Prevention, vol. 73, pp. 91-99, DOI: 10.1016/j.aap.2014.08.014, 2014.
[10] A. J. Anarkooli, M. Hosseinpour, A. Kardar, "Investigation of factors affecting the injury severity of single-vehicle rollover crashes: a random-effects generalized ordered probit model," Accident Analysis & Prevention, vol. 106, pp. 399-410, DOI: 10.1016/j.aap.2017.07.008, 2017.
[11] G. Fountas, P. C. Anastasopoulos, "Analysis of accident injury-severity outcomes: the zero-inflated hierarchical ordered probit model with correlated disturbances," Analytic Methods in Accident Research, vol. 20, pp. 30-45, DOI: 10.1016/j.amar.2018.09.002, 2018.
[12] Y. Guo, J. Zhou, Y. Wu, J. Chen, "Evaluation of factors affecting e-bike involved crash and e-bike license plate use in China using a bivariate probit model," Journal of Advanced Transportation, vol. 2017,DOI: 10.1155/2017/2142659, 2017.
[13] F. Chen, M. Song, X. Ma, "Investigation on the injury severity of drivers in rear-end collisions between cars using a random parameters bivariate ordered probit model," International Journal of Environmental Research and Public Health, vol. 16 no. 14,DOI: 10.3390/ijerph16142632, 2019.
[14] S. Darban Khales, M. M. Kunt, B. Dimitrijevic, "Analysis of the impacts of risk factors on teenage and older driver injury severity using random-parameter ordered probit," Canadian Journal of Civil Engineering, vol. 47 no. 11, pp. 1249-1257, DOI: 10.1139/cjce-2019-0394, 2020.
[15] Y. Wang, C. G. Prato, "Determinants of injury severity for truck crashes on mountain expressways in China: a case-study with a partial proportional odds model," Safety Science, vol. 117 no. April, pp. 100-107, DOI: 10.1016/j.ssci.2019.04.011, 2019.
[16] X. Xu, X. Luo, C. Ma, D. Xiao, "Spatial-temporal analysis of pedestrian injury severity with geographically and temporally weighted regression model in Hong Kong," Transportation Research Part F: Traffic Psychology and Behaviour, vol. 69, pp. 286-300, DOI: 10.1016/j.trf.2020.02.003, 2020.
[17] F. L. Mannering, C. R. Bhat, "Analytic methods in accident research: methodological frontier and future directions," Analytic Methods in Accident Research, vol. 1,DOI: 10.1016/j.amar.2013.09.001, 2014.
[18] X. Chen, J. Lu, J. Zhao, Z. Qu, Y. Yang, J. Xian, "Traffic flow prediction at varied time scales via ensemble empirical mode decomposition and artificial neural network," Sustainability, vol. 12 no. 9,DOI: 10.3390/su12093678, 2020.
[19] X. Chen, L. Qi, Y. Yang, Q. Luo, O. Postolache, J. Tang, H. Wu, "Video-based detection infrastructure enhancement for automated ship recognition and behavior analysis," Journal of Advanced Transportation, vol. 2020,DOI: 10.1155/2020/7194342, 2020.
[20] Q. Yuan, H. Yang, J. Huang, S. Kou, Y. Li, A. Theofilatos, "What factors impact injury severity of vehicle to electric bike crashes in China?," Advances in Mechanical Engineering, vol. 9 no. 8,DOI: 10.1177/1687814017700546, 2017.
[21] K. Haleem, A. Gan, "Effect of driver’s age and side of impact on crash severity along urban freeways: a mixed logit approach," Journal of Safety Research, vol. 46, pp. 67-76, DOI: 10.1016/j.jsr.2013.04.002, 2013.
[22] A. Iranitalab, A. Khattak, "Comparison of four statistical and machine learning methods for crash severity prediction," Accident Analysis & Prevention, vol. 108 no. August, pp. 27-36, DOI: 10.1016/j.aap.2017.08.008, 2017.
[23] J. Tang, J. Liang, C. Han, Z. Li, H. Huang, "Crash injury severity analysis using a two-layer Stacking framework," Accident Analysis & Prevention, vol. 122 no. May, pp. 226-238, DOI: 10.1016/j.aap.2018.10.016, 2019.
[24] X. Zhou, P. Lu, Z. Zheng, D. Tolliver, A. Keramati, "Accident prediction accuracy assessment for highway-rail grade crossings using random forest algorithm compared with decision tree," Reliability Engineering & System Safety, vol. 200 no. July,DOI: 10.1016/j.ress.2020.106931, 2020.
[25] L. Li, C. G. Prato, Y. Wang, "Ranking contributors to traffic crashes on mountainous freeways from an incomplete dataset: a sequential approach of multivariate imputation by chained equations and random forest classifier," Accident Analysis & Prevention, vol. 146 no. January,DOI: 10.1016/j.aap.2020.105744, 2020.
[26] Q. Yuan, Z. Liang, W. Hao, Y. Li, "Investigating severity and main weather factors in tricycle crashes in Beijing," Proceedings of the 18th COTA International Conference of Transportation, pp. 1710-1719, .
[27] Y. Guo, Z. Li, Y. Wu, C. Xu, "Exploring unobserved heterogeneity in bicyclists’ red-light running behaviors at different crossing facilities," Accident Analysis & Prevention, vol. 115 no. November, pp. 118-127, DOI: 10.1016/j.aap.2018.03.006, 2018.
[28] A. Behnood, F. Mannering, "Determinants of bicyclist injury severities in bicycle-vehicle crashes: a random parameters approach with heterogeneity in means and variances," Analytic Methods in Accident Research, vol. 16, pp. 35-47, DOI: 10.1016/j.amar.2017.08.001, 2017.
[29] J. Liu, A. Hainen, X. Li, Q. Nie, S. Nambisan, "Pedestrian injury severity in motor vehicle crashes: an integrated spatio-temporal modeling approach," Accident Analysis & Prevention, vol. 132 no. August,DOI: 10.1016/j.aap.2019.105272, 2019.
[30] Z. Li, P. Liu, W. Wang, C. Xu, "Using support vector machine models for crash injury severity analysis," Accident Analysis & Prevention, vol. 45, pp. 478-486, DOI: 10.1016/j.aap.2011.08.016, 2012.
[31] Y. S. Li, X. Zhang, W. J. Wang, X. F. Ju, "Factors affecting electric bicycle rider injury in accident based on random forest model," Journal of Transportation Systems Engineering and Information Technology, vol. 21 no. 1, pp. 196-200, DOI: 10.16097/j.cnki.1009-6744.2021.01.030, 2021.
[32] Y. Guo, Z. Li, Y. Wu, C. Xu, "Evaluating factors affecting electric bike users’ registration of license plate in China using Bayesian approach," Transportation Research Part F: Traffic Psychology and Behaviour, vol. 59, pp. 212-221, DOI: 10.1016/j.trf.2018.09.008, 2018.
[33] R. Yu, X. Wang, K. Yang, M. Abdel-Aty, "Crash risk analysis for Shanghai urban expressways: a Bayesian semi-parametric modeling approach," Accident Analysis & Prevention, vol. 95, pp. 495-502, DOI: 10.1016/j.aap.2015.11.029, 2016.
[34] S. Heydari, L. Fu, L. Joseph, L. F. Miranda-Moreno, "Bayesian nonparametric modeling in transportation safety studies: applications in univariate and multivariate settings," Analytic Methods in Accident Research, vol. 12 no. September, pp. 18-34, DOI: 10.1016/j.amar.2016.09.001, 2016.
[35] Q. Zeng, W. Gu, X. Zhang, H. Wen, J. Lee, W. Hao, "Analyzing freeway crash severity using a Bayesian spatial generalized ordered logit model with conditional autoregressive priors," Accident Analysis & Prevention, vol. 127 no. February, pp. 87-95, DOI: 10.1016/j.aap.2019.02.029, 2019.
[36] X. Xu, S. Xie, S. C. Wong, P. Xu, H. Huang, X. Pei, "Severity of pedestrian injuries due to traffic crashes at signalized intersections in Hong Kong: a Bayesian spatial logit model," Journal of Advanced Transportation, vol. 50 no. 8, pp. 2015-2028, DOI: 10.1002/atr.1442, 2016.
[37] Q. Zeng, H. Wen, H. Huang, M. Abdel-Aty, "A Bayesian spatial random parameters Tobit model for analyzing crash rates on roadway segments," Accident Analysis & Prevention, vol. 100, pp. 37-43, DOI: 10.1016/j.aap.2016.12.023, 2017.
[38] Z. H. Khattak, M. D. Fontaine, "A Bayesian modeling framework for crash severity effects of active traffic management systems," Accident Analysis & Prevention, vol. 145,DOI: 10.1016/j.aap.2020.105544, 2020.
[39] Q. Yuan, X. Xu, M. Xu, J. Zhao, Y. Li, "The role of striking and struck vehicles in side crashes between vehicles: bayesian bivariate probit analysis in China," Accident Analysis & Prevention, vol. 134 no. October,DOI: 10.1016/j.aap.2019.105324, 2020.
[40] M. T. Haq, M. Zlatkovic, K. Ksaibati, "Investigating occupant injury severity of truck-involved crashes based on vehicle types on a mountainous freeway: a hierarchical Bayesian random intercept approach," Accident Analysis & Prevention, vol. 144,DOI: 10.1016/j.aap.2020.105654, 2020.
[41] Y. Guo, Z. Li, P. Liu, Y. Wu, "Modeling correlation and heterogeneity in crash rates by collision types using full bayesian random parameters multivariate Tobit model," Accident Analysis & Prevention, vol. 128, pp. 164-174, DOI: 10.1016/j.aap.2019.04.013, 2019.
[42] X. Zhang, H. Wen, T. Yamamoto, Q. Zeng, "Investigating hazardous factors affecting freeway crash injury severity incorporating real-time weather data: using a Bayesian multinomial logit model with conditional autoregressive priors," Journal of Safety Research, vol. 76, pp. 248-255, DOI: 10.1016/j.jsr.2020.12.014, 2021.
[43] Q. Yuan, H. Chen, "Factor comparison of passenger-vehicle to vulnerable road user crashes in Beijing, China," International Journal of Crashworthiness, vol. 22 no. 3, pp. 260-270, DOI: 10.1080/13588265.2016.1248226, 2017.
[44] T. Chen, C. Guestrin, "XGBoost," Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, vol. 13-17, pp. 785-794, DOI: 10.1145/2939672.2939785, 2016.
[45] X. Shi, Y. D. Wong, M. Z.-F. Li, C. Palanisamy, C. Chai, "A feature learning approach based on XGBoost for driving assessment and risk prediction," Accident Analysis & Prevention, vol. 129 no. March, pp. 170-179, DOI: 10.1016/j.aap.2019.05.005, 2019.
[46] M. Guo, Z. Yuan, B. Janson, Y. Peng, Y. Yang, W. Wang, "Older pedestrian traffic crashes severity analysis based on an emerging machine learning xgboost," Sustainability, vol. 13 no. 2, pp. 926-26, DOI: 10.3390/su13020926, 2021.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2021 Shubo Wu et al. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Vehicle to vulnerable road user (VRU) crashes occupy a large proportion of traffic crashes in China, and crash injury severity analysis can support traffic managers to understand the implicit rules behind the crashes. Therefore, 554 VRUs-involved crashes are collected from January, 2017, to February, 2021, in a city in northern China, including 322 vehicle-pedestrian crashes and 232 vehicle-bicycle crashes. First, a descriptive statistical analysis is conducted to investigate the characteristics of VRUs-involved crashes. Second, the extreme gradient boosting (XGBoost) model is introduced to identify the importance of risk factors (i.e., time of day, day of week, rushing hour, crash position, weather, and crash involvements) of VRUs-involved crashes. The statistical analysis demonstrates that the risk factors are closely related to VRUs-involved crash injury severity. Moreover, the results of XGBoost reveal that time of day has the greatest impact on VRUs-involved crashes, and crash position shows the minimum importance among these risk factors.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer






