This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
In recent years, academia and industry have invested enormous human and material resources in the research and development of AVs. One of their original intentions is to reduce the rate of traffic accidents. Every year, the economic loss caused by traffic accidents is 277 billion dollars, which is twice as much as traffic congestion [1]. Drivers’ errors cause 90% of traffic accidents. More than 40% of fatal crashes are related to alcohol, distraction, drug addiction, and fatigued driving [2]. Even when nonhuman factors primarily cause crashes, they usually include some human factors such as distractions or unfamiliar driving skills. With improved AV technology, fatal accident rates are likely to fall by at least 40%, and human factors may disappear. Many studies have been carried out to explore the features that influence the severity of conventional vehicle crashes [3, 4]. However, the literature review shows that, relatively, few studies have focused on factors influencing the severity of AV-involved crashes. The driving system of AVs is different from conventional vehicles. Therefore, clarifying the mechanism of AV-related crashes is of great importance to improve the safety of AVs.
In the previous literature on the contributing factors of AV-related crashes [5–8], less attention was paid to the correlation between different factors. These research studies have proposed various methods (such as classification and regression trees and neural networks) to explore the features and mechanism of the AV-involved crash. However, these methods are based on the hypotheses that the factors are independent of each other, and it is easy to ignore the causal relationship between factors. Association rule mining is a crucial technology among numerous data mining technologies, especially in analyzing the cause of traffic crashes, because it does not rely on any assumptions and can discover meaningful relationships hidden in large datasets [9]. Therefore, to interpret the interrelationships between factors and further explore the mechanism of AV-involved crashes, the association rule mining algorithms need to be adopted.
The interpretability of model results is another valuable issue. There are many machine learning methods (such as random forest [10], classification and regression tree [11], and gradient boosting model [12]) that have been utilized to study the severity of traffic crashes. These models are more complex and data-driven and have higher accuracy than traditional calculation models [13]. However, these models are usually regarded as a “black box” because the complicated and nonlinear effects of features on the prediction results cannot be explained [14]. The current study uses SHAP (Shapley Additive Explanations) to interpret how a variable affects model prediction results. SHAP, proposed by Lundberg and Lee [15], originated from cooperative game theory. The prediction results are explained by calculating the contribution of individual variables to the results.
The objective of this study is to explore the mechanism of AV-involved crashes and investigate the impact of each feature on accident severity. We adopted 131 AV-involved crash reports received in California from 2019 to October 2020. We used synthetic minority oversampling technique (SMOTE) to balance the dataset. The Apriori algorithm and classification models are used for accident mechanism analysis. We apply the SHAP to interpret how a variable affects model classification results.
2. Literature Review
The method of improving traffic safety through advanced driving assistance system (such as antilock brake system (ABS), electronic stability program (ESP), autonomous emergency braking (AEB), and lane-keeping assist system (LKA)) has been used in the automotive industry for many years. Statistics show that these systems have effectively improved vehicle safety and reduced the rate of traffic accidents [2]. As the level of AV technology increases, the driving tasks gradually transferred from the driver to the autonomous driving system. Since AVs have a strong environmental perception (such as the vehicle to everything (V2X)), data processing, and rapid response capabilities, they can make up for the driver’s inherent shortcomings to a certain extent [13]. It is foreseeable that, with the development of AV technology, traffic safety problems will be further alleviated. In the 1970s, Haddon proposed a theory from the perspective of human-vehicle-environment, which divided the crash into three stages: precrash, crash, and postcrash [16]. The three factors at each stage of the crash process are arranged and combined to form the famous Haddon matrix, shown in Table 1.
Table 1
Haddon matrix.
Phase | Human factors | Vehicle factors | Environment factors |
Precrash | Information | Road worthiness | Road design and road layout |
Attitudes | Lighting | Speed limits | |
Impairment | Braking | Pedestrian facilities | |
Police enforcement | Speed management | Other safety devices | |
Crash | Use of restraints | Occupant restraints | Crash-protective roadside objects |
Impairments | Other safety devices’ crash-protective design | ||
Postcrash | First-aid skills | Ease of access | Rescue facilities |
Access to medics | Fire risk | Congestion |
With the development of AV technology, the causes of traffic accidents are significantly changed. As shown in Figure 1, drivers make driving decisions based on complex environmental information and vehicle status for conventional vehicles. Therefore, if the vehicle is in danger of collision, the driver needs to quickly make a decision based on the current driving scene and combined with his own experience. However, the situation for AVs is different. As shown in Figure 1, the AV technology eliminates the driver’s unstable factor. In the preset operation design domain (ODD), AVs rely on their sensors to perceive all environmental information (including traffic information, environmental conditions, and road conditions) and make driving decisions [16]. Simultaneously, through human-computer interaction techniques, V2X, and other technologies, AVs share some of their driving status with other traffic participants. If an accident occurs in the ODD, the primary responsibility is the AV. Therefore, to protect passengers’ safety and avoid accident liability, the AV manufacturer will carry out comprehensive testing and verification before the AV launch. Thus, based on the perception of the environment and the infrastructure’s support, the traffic accidents caused by human drivers may disappear under ideal circumstances. The Haddon matrix will change accordingly. However, limited by current AV technology and transportation facilities, this goal cannot be achieved in the short term.
[figures omitted; refer to PDF]
The previous studies on the safety of AV technology were carried out in analyzing driver behavior in the driving simulator and testing the autonomous driving system’s stability in closed environments. To avoid potential collisions and improve traffic efficiency, some research studies concentrate on trajectory optimization of AVs. Omidvar et al. [17] designed an AV trajectory optimization algorithm for closed road signalized intersections. The algorithm can optimize the signal control scheme before the AVs arrive at the intersection, control the vehicle’s speed, and ensure that the vehicles can quickly pass the intersection. Li et al. [18] developed an integrated local trajectory optimization scheme and tracking control framework to avoid obstacles in time. With safety and comfort as the objective function, the best trajectory plan is selected. Zhu et al. [19] proposed a speed control method for AVs based on vehicle speed prediction, which improves vehicle operating efficiency and comfort. Many studies use driving simulators to analyze the characteristics of drivers. In the environment of automatic driving, attention should be paid to drivers’ physiological and psychological reactions. Winter et al. [20] found that drivers do not need to monitor the vehicle’s automation process for a long time when driving a highly automated vehicle. They can shift their attention to nondriving-related tasks without affecting the safety of the vehicle. However, the insufficient sample size is one of the limitations of AV safety testing. Both field and driving simulator research attempt to analyze the safety problems of AVs from the perspective of vehicle control and human factors. However, we also need to fully explore the mechanism of AV-involved crashes and analyze the impact of each feature on crash severity. Table 2 lists the studies conducted on AV-involved crashes. This table presents a list of variables used in the studies, analysis methods, and the significant factors obtained from the study.
Table 2
Methods and significant factors in previous studies.
Study | Variables used in the studies | Analysis methods | Significant factors |
Wang and Li [6] | Crash severity, collision type, roadway characteristics | CART models | Location (highway), accident liability |
Xu et al. [5] | Type of intersection, type of collision, roadway characteristics, weather | Binary logistic regression model | Driving mode, collision location, roadside parking |
Boggs et al. [7] | County, test year, time of the day, vehicle state, type of crash, crash severity | Hierarchical Bayesian heterogeneity-based model | Weather condition |
Chen et al. [8] | Crash severity, collision type, weather, vehicle damage, accident location, weather, vehicle state, land-used data | XGBoost model | Weather condition, location (intersection) |
Determining the cause of a traffic crash is the most critical process for taking preventive measures to reduce the severity and traffic crashes. The Apriori algorithm proposed by Agrawal et al. [21] is the most commonly used association rule mining algorithm. It has been widely used in traffic safety analysis [22–24]. Xu et al. [9] used the Apriori algorithm to explore the causes of traffic crashes with heavy casualties and their interdependent relationship in China. The results indicated that serious casualty crashes resulted from complex interactions between traffic participants, vehicle, road, and environmental conditions. Montella et al. [25] applied the Apriori algorithm to analyze the Italian PTW collision to find the interdependencies and differences between the collision features. Yu et al. [23] adopted the Apriori algorithm to recognize risk factors that are prominently linked to the severity of crash accidents. Many studies have begun to use SHAP for model interpretation. In terms of traffic safety, Mihaita et al. [26] applied the SHAP to study the influence of various characteristics on crash duration. Parsa et al. [1] adopted the SHAP to explain the individual features’ importance on accident detection. Zhou et al. [14] applied the SHAP to interpret the influence factors of the severity of car and truck driver injury in the car-truck collision.
By combining the Apriori algorithm and classification models to explore the mechanism of AV-involved crashes, this study provides some useful information for taking preventive measures to promote the safety of AVs.
3. Data Analysis and Feature Extraction
3.1. Data Sources
The Department of Motor Vehicles (DMV) was required to provide AV-related crash reports within ten business days of an accident. This study adopted 131 crash reports received in California from 2019 to October 2020. We extract AV-related information from crash reports such as type of collision, crash severity, vehicle information, and weather. According to the heat map (shown in Figure 2) of the accident location, we find that the crash mainly occurred in Northeast San Francisco because it is the primary test site for AVs.
[figure omitted; refer to PDF]
Obviously, the greater the degree of vehicle damage is, the more likely it is to cause a severe accident. The interpretation of the model results suggests that weather is another key feature. Particularly, in low-visibility conditions, such as fog and snow, injury accidents are more likely to occur [36]. This is probably due to the sensors’ worse perception performance in extreme weather [37]. According to Hasirlioglu et al. [38], the reflector can only detect a short distance in foggy weather, and crashes are more likely to occur in this situation.
The next most important features are accident location and driving mode. The possibility of a crash is higher at intersections [39]. This is probably due to the complex and changeable traffic environment at intersections because vehicles, nonmotor vehicles, and pedestrians are highly mixed [40, 41]. According to crash reports, AVs usually switch to the conventional driving mode when they arrive at intersections because intelligent transportation facilities are not perfect enough now. These infrastructures can increase vehicle stability during driving and improve the safety of all traffic participants [42]. In terms of driving mode, Figure 4 illustrates that the automatic driving mode will increase the risk of injury, surprising. This is probably because the driver diverts their attention to secondary tasks (such as playing mobile phones) in the automatic driving mode, so it is more likely to be injured in an accident.
6. Conclusion
The objective of this study is to explore the mechanism of AV-involved crashes and analyze the impact of each feature on crash severity. We employ 131 accident reports involving AVs received in California from 2019 to October 2020. We use the Apriori algorithm to explore the causal relationship between multiple factors to explore the mechanism of crashes. Given the imbalanced crash severity distribution, we apply the SMOTE to balance the dataset. Three different classification models are used to compare the classification performance: XGBoost, CART, and SVM. The result shows that the XGBoost model can better recognize the injured crashes involving AVs. We apply the SHAP (Shapley Additive Explanations) to interpret how a specific variable influences model classification results. Both the XGBoost and the Apriori algorithm effectively provided meaningful insights about AV-involved crash characteristics and their relationship.
For the analysis of crash mechanisms involving AVs, we use the Apriori algorithm to mine association rules for uninjured and injured crashes, respectively. Among the top ten strong association rules for uninjured crashes, we can find that most rear-end collisions are conventional vehicles bumping into the rear of AVs. It is probably because the AVs have stopped before the collision, while the conventional vehicles are still moving forward (“PMAV = 0 + TOC = 2”⟶ “PMCV = 1”). Among the top ten strong association rules for injured crashes, we can find that the AVs were still moving before the collision on a cloudy day. It could be caused by the detector failing to find anomalies in time when there is insufficient light (“L = 2 + PMAV = 1 + W = 2”⟶ “CS = 1”). Besides, the driver may distract their attention to secondary tasks in the automatic driving mode, so it is more likely to be injured in an accident (“DM = 1 + TOC = 2”⟶ “CS = 1”). For the crash severity analysis, XGBoost generates the best result (overall accuracy = 64.10%, G-mean = 67.82%, and recall = 75%). To make the results of the XGBoost model more informative, we apply the SHAP to analyze the impact of each feature on crash severity. Among all these features, vehicle damage, weather conditions, accident location, and driving mode are the most critical features. The greater the degree of vehicle damage is, the more likely it is to cause a severe accident. Injured accidents are more possible to occur in low-visibility conditions (such as fog and snow). Intersections are more prone to injury accidents, especially in the automatic driving mode. This study may provide some help in reducing the severity of AV-involved crashes. For example, autonomous vehicle drivers should be extremely cautious when driving in low-visibility conditions (e.g., fog and snow). They should be more careful when driving near intersections, especially in the autonomous driving mode. It is recommended to use vehicle sensors with strong stability and high sensitivity.
However, the current study has certain limitations. Firstly, this study’s sample size and variables could be extended to increase the model result’s reliability. In the future, we will collect driver characteristics, traffic flow information before the crash, and vehicle speed to understand the mechanism of AV-involved crashes. Secondly, this study uses only crash reports received in California for modeling. Future research should continue to collect accident data from other countries and regions because driving habits and traffic laws in different countries and regions may be completely different.
Acknowledgments
This work was supported by the 111 Project of Sustainable Transportation for Urban Agglomeration in Western China (no. B20035).
[1] A. B. Parsa, A. Movahedi, H. Taghipour, S. Derrible, A. Mohammadian, "Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis," Accident Analysis & Prevention, vol. 136,DOI: 10.1016/j.aap.2019.105405, 2020.
[2] J. Stilgoe, "Machine learning, social learning and the governance of self-driving cars," Social Studies of Science, vol. 48 no. 1, pp. 25-56, DOI: 10.1177/0306312717741687, 2018.
[3] L.-Y. Chang, J.-T. Chien, "Analysis of driver injury severity in truck-involved accidents using a non-parametric classification tree model," Safety Science, vol. 51 no. 1, pp. 17-22, DOI: 10.1016/j.ssci.2012.06.017, 2013.
[4] H. Jeong, Y. Jang, P. J. Bowman, N. Masoud, "Classification of motor vehicle crash injury severity: a hybrid approach for imbalanced data," Accident Analysis & Prevention, vol. 120, pp. 250-261, DOI: 10.1016/j.aap.2018.08.025, 2018.
[5] C. Xu, Z. Ding, C. Wang, Z. Li, "Statistical analysis of the patterns and characteristics of connected and autonomous vehicle involved crashes," Journal of Safety Research, vol. 71, pp. 41-47, DOI: 10.1016/j.jsr.2019.09.001, 2019.
[6] S. Wang, Z. Li, "Exploring the mechanism of crashes with automated vehicles using statistical modeling approaches," PLoS One, vol. 14,DOI: 10.1371/journal.pone.0214550, 2019.
[7] A. M. Boggs, B. Wali, A. J. Khattak, "Exploratory analysis of automated vehicle crashes in California: a text analytics & hierarchical Bayesian heterogeneity-based approach," Accident Analysis & Prevention, vol. 135,DOI: 10.1016/j.aap.2019.105354, 2020.
[8] H. Chen, H. Chen, Z. Liu, X. Sun, R. Zhou, "Analysis of factors affecting the severity of automated vehicle crashes using XGBoost model combining POI data," Journal of Advanced Transportation, vol. 2020,DOI: 10.1155/2020/8881545, 2020.
[9] C. Xu, J. Bao, C. Wang, P. Liu, "Association rule analysis of factors contributing to extraordinarily severe traffic crashes in China," Journal of Safety Research, vol. 67, pp. 65-75, DOI: 10.1016/j.jsr.2018.09.013, 2018.
[10] Z. Liu, H. Chen, Y. Li, Q. Zhang, "Taxi demand prediction based on a combination forecasting model in hotspots," Journal of Advanced Transportation, vol. 2020,DOI: 10.1155/2020/1302586, 2020.
[11] B. Zhou, Z. Li, S. Zhang, X. Zhang, X. Liu, Q. Ma, "Analysis of factors affecting hit-and-run and non-hit-and-run in vehicle-bicycle crashes: a non-parametric approach incorporating data imbalance treatment," Sustain, vol. 11,DOI: 10.3390/su11051327, 2019.
[12] S. R. Mousa, P. R. Bakhit, S. Ishak, "An extreme gradient boosting method for identifying the factors contributing to crash/near-crash events: a naturalistic driving study," Canadian Journal of Civil Engineering, vol. 46 no. 8, pp. 712-721, DOI: 10.1139/cjce-2018-0117, 2019.
[13] F. Mannering, "Temporal instability and the analysis of highway accident data," Analytic Methods in Accident Research, vol. 17,DOI: 10.1016/j.amar.2017.10.002, 2018.
[14] B. Zhou, X. Wang, S. Zhang, "Comparing factors affecting injury severity of passenger car and truck drivers," IEEE Access, vol. 8, pp. 153849-153861, DOI: 10.1109/access.2020.3018183, 2020.
[15] K. Zhang, Y. Zhang, M. Wang, "A unified approach to interpreting model predictions scott," Nips, vol. 16, pp. 426-430, 2012.
[16] W. Haddon, "The changing approach to the epidemiology, prevention, and amelioration of trauma: the transition to approaches etiologically rather than descriptively based," Injury Prevention, vol. 5 no. 3, pp. 231-235, DOI: 10.1136/ip.5.3.231, 1999.
[17] A. Omidvar, M. Pourmehrab, P. Emami, "Deployment and testing of optimized autonomous and connected vehicle trajectories at a closed-course signalized intersection," Transportation Research Record: Journal of the Transportation Research Board, vol. 2672 no. 19, pp. 45-54, DOI: 10.1177/0361198118782798, 2018.
[18] X. Li, Z. Sun, D. Cao, D. Liu, H. He, "Development of a new integrated local trajectory planning and tracking control framework for autonomous ground vehicles," Mechanical Systems and Signal Processing, vol. 87, pp. 118-137, DOI: 10.1016/j.ymssp.2015.10.021, 2017.
[19] M. Zhu, H. Chen, G. Xiong, "A model predictive speed tracking control approach for autonomous ground vehicles," Mechanical Systems and Signal Processing, vol. 87, pp. 138-152, DOI: 10.1016/j.ymssp.2016.03.003, 2017.
[20] J. C. F. Winter, R. Happee, M. H. Martens, N. A. Stanton, "Effects of adaptive cruise control and highly automated driving on workload and situation awareness : a review of the empirical evidence," Transportation Research Part F: Traffic Psychology and Behaviour, vol. 27, pp. 196-217, DOI: 10.1016/j.trf.2014.06.016, 2014.
[21] A. K. Agarwal, S. Wadhwa, S. Chandra, "Diagnosis of tuberculosis--newer tests," Journal of the Association of Physicians of India, vol. 42 no. 8, 1994.
[22] P. Wu, X. Meng, L. Song, W. Zuo, "Crash risk evaluation and crash severity pattern analysis for different types of urban junctions: fault tree analysis and association rules approaches," Transportation Research Record: Journal of the Transportation Research Board, vol. 2673 no. 1, pp. 403-416, DOI: 10.1177/0361198118822817, 2019.
[23] S. Yu, Y. Jia, D. Sun, "Identifying factors that influence the patterns of road crashes using association rules: a case study from Wisconsin, United States," Sustain, vol. 11,DOI: 10.3390/su11071925, 2019.
[24] J. Weng, G. Li, "Exploring shipping accident contributory factors using association rules," Journal of Transportation Safety & Security, vol. 11 no. 1, pp. 36-57, DOI: 10.1080/19439962.2017.1341440, 2019.
[25] A. Montella, M. Aria, A. D’Ambrosio, F. Mauriello, "Analysis of powered two-wheeler crashes in Italy by classification trees and rules discovery," Accident Analysis & Prevention, vol. 49, pp. 58-72, DOI: 10.1016/j.aap.2011.04.025, 2012.
[26] A. S. Mihǎiţǎ, Z. Liu, C. Cai, M. A. Rizoiu, "Arterial incident duration prediction using a bi-level framework of extreme gradient-tree boosting," 2019. http://arxiv.org/abs/1905.12254
[27] D. A. Belsley, "A Guide to using the collinearity diagnostics," Computer Science in Economics and Management, vol. 4, pp. 33-50, DOI: 10.1007/BF00426854, 1991.
[28] C. H. Mason, W. D. Perreault, "Collinearity, power, and interpretation of multiple regression analysis," Journal of Marketing Research, vol. 28 no. 3,DOI: 10.2307/3172863, 1991.
[29] T. Næs, B. H. Mevik, "Understanding the collinearity problem in regression and discriminant analysis," Journal of Chemometrics, vol. 15, pp. 413-426, DOI: 10.1002/cem.676, 2001.
[30] Y. Pan, S. Chen, T. Li, S. Niu, K. Tang, "Exploring spatial variation of the bus stop influence zone with multi-source data: a case study in Zhenjiang, China," Journal of Transport Geography, vol. 76, pp. 166-177, DOI: 10.1016/j.jtrangeo.2019.03.012, 2019.
[31] M. Petrere, "Pesque-solte," Ciência Hoje, vol. 53, pp. 1189-1232, DOI: 10.1214/aos/1013203451, 2014.
[32] J. H. Friedman, "Stochastic gradient boosting," Computational Statistics & Data Analysis, vol. 38 no. 4, pp. 367-378, DOI: 10.1016/s0167-9473(01)00065-2, 2002.
[33] X. Li, J. Tang, X. Hu, W. Wang, "Assessing intercity multimodal choice behavior in a Touristy City: a factor analysis," Journal of Transport Geography, vol. 86,DOI: 10.1016/j.jtrangeo.2020.102776, 2020.
[34] A. Bradley, "The use OF the area under the roc curve IN the evaluation OF machine learning algorithms," Transportation Research Part A: Policy and Practice, vol. 44, pp. 291-305, DOI: 10.1016/j.tra.2010.02.001, 1997.
[35] S. Wang, Z. Li, "Exploring causes and effects of automated vehicle disengagement using statistical modeling and classification tree based on field test data," Accident Analysis & Prevention, vol. 129, pp. 44-54, DOI: 10.1016/j.aap.2019.04.015, 2019b.
[36] X. Yu, M. Marinov, "A study on recent developments and issues with obstacle detection systems for automated vehicles," Sustain, vol. 12,DOI: 10.3390/su12083281, 2020.
[37] R. H. Rasshofer, M. Spies, H. Spies, "Influences of weather phenomena on automotive laser radar systems," Advances in Radio Science, vol. 9, pp. 49-60, DOI: 10.5194/ars-9-49-2011, 2011.
[38] S. Hasirlioglu, A. Riener, W. Ruber, P. Wintersberger, "Effects of exhaust gases on laser scanner data quality at low ambient temperatures," Proceedings of the IEEE Transactions on Intelligent Transportation Systems, pp. 1708-1713, .
[39] V. Gadepally, A. Krishnamurthy, Ü. Özgüner, "A framework for estimating long term driver behavior," Journal of Advanced Transportation, vol. 2017,DOI: 10.1155/2017/3080859, 2017.
[40] L. Hu, J. Ou, J. Huang, Y. Chen, D. Cao, "A review of research on traffic conflicts based on intelligent vehicles," IEEE Access, vol. 8, pp. 24471-24483, 2020.
[41] M. M. Morando, Q. Tian, L. T. Truong, H. L. Vu, "Studying the safety impact of autonomous vehicles using simulation-based surrogate safety measures," Journal of Advanced Transportation, vol. 2018,DOI: 10.1155/2018/6135183, 2018.
[42] O. Grembek, A. Kurzhanskiy, A. Medury, P. Varaiya, M. Yu, "Making intersections safer with I2V communication," Transportation Research Part C: Emerging Technologies, vol. 102, pp. 396-410, DOI: 10.1016/j.trc.2019.02.017, 2019.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2021 Hengrui Chen et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
The safety issue has become a critical obstacle that cannot be ignored in the marketization of autonomous vehicles (AVs). The objective of this study is to explore the mechanism of AV-involved crashes and analyze the impact of each feature on crash severity. We use the Apriori algorithm to explore the causal relationship between multiple factors to explore the mechanism of crashes. We use various machine learning models, including support vector machine (SVM), classification and regression tree (CART), and eXtreme Gradient Boosting (XGBoost), to analyze the crash severity. Besides, we apply the Shapley Additive Explanations (SHAP) to interpret the importance of each factor. The results indicate that XGBoost obtains the best result (recall = 75%; G-mean = 67.82%). Both XGBoost and Apriori algorithm effectively provided meaningful insights about AV-involved crash characteristics and their relationship. Among all these features, vehicle damage, weather conditions, accident location, and driving mode are the most critical features. We found that most rear-end crashes are conventional vehicles bumping into the rear of AVs. Drivers should be extremely cautious when driving in fog, snow, and insufficient light. Besides, drivers should be careful when driving near intersections, especially in the autonomous driving mode.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer