Deriving Driver Behavioral Pattern Analysis and Performance Using Neural Network Approaches

It has been observed that driver behavior has a direct and considerable impact upon factors like fuel consumption, environmentally harmful emissions, and public safety, making it a key consideration of further research in order to monitor and control such related hazards. This has fueled our decision to conduct a study in order to arrive at an efficient way of analyzing the various parameters of driver behavior and find ways and means of positively impacting such behavior. It has been ascertained that such behavioral patterns can significantly impact the analysis of traffic-related conditions and outcomes. In such cases, the specific vehicular behavior can be detected and related data mined in order to analyze the spatial or temporal patterns of movement patterns as well as to position/track the prominent trends. This analysis seeks to determine the efficacy of such an exercise and whether the various parameters employed can help efficiently determine the various criteria for defining the driver’s style. To that end, the analysis of a driver’s behavioral pattern and performance utilizes a computer modeled application for generating a set of classifications based on the autonomous driving data as well as indicators that are characteristic of driver aggression. In order to draw such insights from the driver’s behavior, the application is modeled using various categories of data, for instance, the steering wheel’s angle, braking conditions, acceleration conditions, the vehicle speed, etc. Unlike the previously developed mechanisms for analyzing the system-based driver behavioral patterns, which were not very efficacious, this endeavor assimilates the contemporary breakthroughs in real-world scenario analysis approaches and driver behavior classification methods. Based on the system capabilities and desired outcomes, distinct strategies can be employed in order to detect the target driver’s behavior. In this specific case, neural network algorithms were utilized in order to conduct an intensive study to determine and analyze the prevailing driver behavior and driving styles. This proposed approach evaluated multiple factors that were determinants in identifying specific driver behavior and driving styles. The results of this experiment that utilized Python, indicated that the driver model in question was successful in achieving a 90% accuracy in terms of logistic regression. This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Intelligent Automation & Soft Computing DOI:10.32604/iasc.2022.020249 Article ech T Press Science


Introduction
Deep learning has pioneered a new era of data and has been heralded as one of the forerunners of new and innovative statistical and computational models. Applied in the context of analyzing driver behavior, such a model can revolutionize the way driver behavior can be understood and various insights generated that can lead to real-life improvements within a short span of time. The very first step towards conducting such an analysis requires a thorough understanding and categorizations of a driver's individual skill levels and habitual patterns of driving. The various parameters that can be used for such a classification and analysis need to be defined first before the actual analysis can be carried out. For this, the availability and quality of the driving data is of utmost significance. To that end, on-board sensors or sensor-based smartphones have sometimes been used. A vehicle controller can be usually used to glean compliance related data and information including the settings of the steering wheel, vehicle speed, position of the brake pedal as well as sensor data related to the speed, direction and velocity of the vehicle.
Contemporary developments have led to such onboard data to be considered as accurate and reliable sources that can be effectively used in drive analysis and also for monitoring the behavior and style of the identified driver in question [1].
Every driver has a unique combination of speed, acceleration and braking habits that can be regarded as a fingerprint and can help identify the unique driving habit of an individual. Such a fingerprint or signature style can be extracted and identified via the analysis of an individual driver's characteristic behavior under driving conditions, such as the speed, the aggression and the focus or the lack of it. Such categorization of driving behaviors can help identify, monitor and reduce the possibility of rash or dangerous driving that can help improve long-term road safety protocols. The correlation between dangerous driving and road accidents has been long observed and is well-studied [2]. Several studies have also emphasized upon the connection between the acceleration mode and the consumption of fuel as well as hazardous fuel emissions. While these association are well accepted, a thorough understanding of driving patterns and styles with respect to various behaviors like the application of brakes under different traffic situations, for instance, have not been well understood due to the paucity of data [3].

Driver Behavior Analysis
The automotive and transportation industries, car insurance agencies, as well as government-controlled public transport infrastructure can all benefit from behavioral modeling that can promote positive driving behaviors, making such analysis an important research topic with far reaching implications. All these stakeholders have an interest in understanding such behaviors in order to be able to create innovative solution in order to improve overall performance. To that end, remedial technologies need to enable effective evaluation of parameters including driving styles as well as the driver's physical characteristics (such as face recognition, monitoring of physical features, etc.), and the use of in-vehicle telematics to monitor the collected navigation data [4].
The classification of driver behaviors is considered a complex undertaking because it is a multifaceted problem that juxtaposes these behaviors with non-uniform traffic conditions. Therefore, such an endeavor requires that variable traffic conditions, actual road conditions, vehicle-specific travel data as well as the driving behavior are effectively grouped. This has in turn necessitated the development of fuzzy sets that can enable recommendations and rules for promoting positive driving behavior specific to the driver and the drive conditions, and therefore enabling remedial operations and tactics [5]. Fig. 1 represents the first module of the proposed design which encapsulates the background data of the driver. Such data includes the driver's gender, age, experience, education level, work status, income level, average annual mileage, the number of drivers and violations of traffic regulations, the previous year's accident rate, etc. Once collected, a statistical learning model is trained with this data to establish a connection between the background data and accident probabilities. The model is then be used to predict the probability of an individual driver causing an accident. Grading rules are applied to background data to provide a specific driver score [6].
In order to determine the driver's effectiveness and driving style, several factors must be evaluated and taken into account, like ecological features, road and vehicle conditions, event classification and identification, as well as natural and functional conditions. Building an accurate and thorough model that can effectively incorporate all the above mentioned aspects is a fairly complex, even unmanageable undertaking. In the past decade, a variety of commercial and research systems have been proposed with respect to driving behavior analysis in order to evaluate driving performance and to assist the driver as well as to monitor various drive systems [7].

Related Work
Taxi services propose numerous benefits and utilities to our daily lives. However, in numerous towns, the supply of taxi services fail to meet the driving demand, especially during peak hours. In this scenario, the dominant player's ambition is to expand their territory and increase mobile payment to protect its dominance. For this, one of the most important factors is to improve the company's mobile Internet for those who wish to book services online. A related example is the competition between Tencent and Alibaba becoming more intense, since their aim was to win over as many people as possible, who would pay money to facilitate taxi bookings using mobile payments [8].
Another example is the design of a dynamic surface adaptive fuzzy controller and a three-phase APF (Active Power Filter). Adaptation support methods combined with fuzzy strategies have been proposed to address the non-linearity of APF, and fuzzy controllers are the main cause for eliminating non-linearity. It is noteworthy that by using the dynamic surface method, the system may decrease the number of fuzzy input variables in modeling, design parameters and cause the controller to be made without complicated calculations [9].
Traditional biometric solutions are either few, inaccurate, slow-moving, or too expensive to put into practical use; these solutions come at the expense of precision, privacy and protection. Similarly, scalability and cost-effectiveness are inversely related. In such a system, a driver identification method may be used to solve these problems. It is usually deployed in car navigation systems to understand people via a global positioning system (GPS) that uses data driving mode. The current market can also use a general-purpose handheld device [10]. Halfway during the trial in each scene, the contributors were asked to accomplish the task or failed to overcome secondary distractions. In some cases, cluster analysis is used to determine a response mode drive. For driving on the highway, they increase (1) fixed variables and the steady-state performance of the eye, and (2) is fixed and constant changes, the average speed increase. Cross drives, which increases the overall speed reduction (mean and variance), then decreased at a constant speed (3) and increasing the speed, adding a fixed change [11].
To drive, a simulator is used for research, an effective tool for developing a car or driver training, which is why it is necessary to cause a similar driving behavior to suit the real car. For example, as a specific task, two minimum steering driver models of this type have been developed and tested in real data sets, a model close to the limit, low friction, and showing a clearer preference than the other. When driven by a closed-loop simulation apparatus, till the vehicle positioning data is reliable, the model parameters' current implementation is commonly used to prove a simple method of adjusting the open-loop undisturbed driver's steering data to estimate [12] accurately. Integrated signal T-intersection clustering and classification system predicts the driver's personalized, seamless control. PRM (Polynomial regression mixture) clusters and Akaike information criterion are applied to the track's driving behavior to drive indepth study. Then various categories are evaluated with respect to low vehicle conditions linked to highlevel driving behavior. CART classification Bayesian offered optimization and with excellent accuracy [13] (see Tab. 1).
First, the design and establishment of a driver behavior data acquisition system is undertaken. A different type of vehicle with a driver input data is collected under a typical operating condition signal. Drivers using PSO (Particle Swarm Optimization) algorithm is optimized to drive and use neural network behavior perception factor analysis and FCM "Fuzzy C-Means" clustering algorithm and backpropagation "BP" classification. Post that, primary caution, preparation and control system of personalized lane changes [14]. Advantages Disadvantages Expectation maximum algorithm [15] It proposes an expressively smaller time outline The gaze behavior is influenced by age variances, gender and driving experimentations. Optimization for Ranking Task [16] Taxi driving behavior to improve the predictability Isn't an explanation of taxi driving pattern evolution K-means cluster analysis [17] Speed measurement is available It does not dispense consistently in all speed assortments Hierarchical Dirichlet process -hidden semi-Markov model [18] It can capture static driving characteristics The test data set is difficult to obtain unknown numbers Principal component analysis [19] Classification of the driving behavior and the warning to the driver The classification evaluates only based on speed Hybrid Convention Neural Network framework (HCF-CNN) [20] The extracted features to obtain comprehensive information This HCF-CNN not trained sufficiently Automatic blink detection algorithm [21] To reduce the dimensionality of the dataset The warnings are often inaccurate and unreliable ANN (Artificial Neural Network) was used to search the composite connections between the generated road situation and the overall graphical representation's of the driving behavior, improving the deep learning model. The model Denoising Stacked Autoencoder (DSAE) laminated unsupervised can provide RGB colors for the output layer. The dataset followed for kinematics measures and is an onboard GPS device utilized for investigational operations come from Graphical results display the capability of methodologies to detect real, simple driving behavior patterns and the complexity of the road situation, environment and few actions faced on the itinerary [22].
Due to the increased number of cars and roads, there is continuing increase in the number of traffic accidents and additionally causes dangerous driver behavior. Few prevailing methodologies trust the driving behavior recognition method for joint point estimates. Still, the key point estimate cannot fully explain the human body. On the other hand, the proposed risk behavior people-oriented analysis of driving behavior recognition method can quickly and accurately identify the driver [23].
This problem targets driving behavior assessment based on extensive data network work vehicle control system to select quantitative analysis. After that, the three main indicators for a detailed description: acceleration and deceleration behavior index, an indicator of stability indicators speeding behavior and actions. Using method clustering GA-FCM "Genetic Algorithm, Fuzzy C-Means clustering", 40-level drivers of dangerous goods transport are classified according to their operating parameters. The outcomes demonstrate that most drivers at high risk of poor driving behavior and driving stability have differential levels [24].
Assessing driving behavior similarity plays an important role in driving style classification and analyzes the important role, which benefits from the design support system driven people-centered. Hidden Markov Model (HMM) is a primitive drive for automatically extracted from the observed sequence, without using the existing knowledge integrated hierarchical Dirichlet process (HDP) Bayesian non-parametric method. A relative entropy grid-based allows us to quantify the probability level of similarity between these primitives [25].
Human Posture restoration and Behavioral Analytics (HuPBA) is a computer vision, pattern analysis and challenging machine learning themes. The application includes public safety, sports, commercial, games, human-computer interaction, assistive technology, rehabilitation, recognition of handwriting and driving assistance technology. In essence, HuPBA properties need to deal with a change in appearance due to the joint body garments, such as artifacts, obstacles, and lighting background noise changes specific to the scene [26].
Machine learning algorithms, most important in revealing latest driver monitoring technology, demonstrate a close examination of basic strengths and weaknesses. Driver drowsiness and distractions: The study also falls into two categories: Driver drowsiness studies were then developed based on either visual features or invisible functions subdivided into two major subgroups [27].
For the turning lane to keep or develop advanced driver assistance systems (ADAS), an important design goal is to share driver's steering control properly. Therefore, the driver's steering behavior should be properly characterized by the designing high-performance ADAS controller. To characterize behavior of the driver's steering that system, two well-known visual-driven model, and conducted a sequence of ground trials/tests to determine model parameters and verify in real-world situations, this model [28].
For lane-keeping and growth of advanced driver assistance systems, a substantial designed goal of steering control with accurate drive share is must. Therefore, the driver's steering behavior should be high enough to characterize the performance of the controller design. In this system, the use of two wellknown video driver models to determine the model parameters and verify in the real-world field this model to characterize the driver's steering behavior [28] did a series of tests (see Tab. 2).
Detection method recommended for the driver and mobile applications are based on behavior analysis, monitoring, analysis, and personal smartphone accident prevention dangerous driving behavior. Built-in sensors (gyroscope, accelerometer, GPS, and microphone) and smartphone cameras are utilized to monitor drive activity. The method comprises the development of hazardous conditions, hazardous conditions detected, and the reference model [34]. Driver attention monitor system status and driver behavior analysis is undertaken to determine whether the driver is sympathetic. Use a front camera mounted on the windshield systems; smartphones can be used in several functions by monitoring the driver's attention. It comes from the state and is based on driver behavior, such as head posture and eye contact, eyes closed, yawning, mobile phones and other fusion using spatial and temporal characteristics of the driver's attention. Data from a metropolitan city with 30 drivers, evaluating driving data from world control, static vehicle settings [35].
First categorized, human driver's negligent driving behavior (HIDB) discussion is divided into two groupings: DD "driver distraction", DF "driven fatigue" or DF "drowsiness". After that, we deliberate the above reasons and the impact of so-called ADB "aggressive driving behavior" on other people's dangerous driving behavior. An aggressive driving style is dangerous when it leads to a large category of serious accidents. Abnormal driving behavior in individuals viz "DD", "DFD", and "ADB" impacted by numerous features which include age of driver, experience of driver, gender or insufficient ailment experience [36].
A variety of factors, fatigue, distraction, drunk, and reckless, is the most common type, resulting in unexpected driving behavior anomalies. Due to advances in technology, modern smartphones the ability to analyze driving behavior. It can only detect abnormal sleepiness and driving behavior, and the use of smartphones is not a Advantages Disadvantages machine-learning algorithm [29] Provides automatic verification of drivers' behavior Maximum times, drivers do not reduce the speed FCW (forwardcollision warning) algorithm [30] It may regulate its cautioning threshold according to the driver's behavior changes.
It does not evaluate the risk perception k-NN classifier [31] Analyze the movement of neighboring vehicles while shifting the driver's sight Use a longer interval without analyzing gaze Advanced driver assistance systems (ADASs) [32] It defines the "braking" behavior as the driver's body movement It is difficult to identify specific drivers and predict their behavior braking Convolutional neural network (CNN) [33] Founded on the analysis of time and space characteristics of the -driving state Adjacent levels of attention rating are not clear Neural network algorithm [5] To be more observant and concurrently conscious of numerous vehicles positions in order to react swiftly The problem of sensing abnormal driver behavior with the support of face netting and analyzation is tough Feedforward neural network (FFNN) [6] Identification task to identify evaluations with different characteristics It is difficult to collect more real-world data sets and identify deeper driver behavior.
comprehensive review. In this article, we discuss different approaches proposed by different authors. This method includes sensing, detection algorithms, and their respective accuracy limitations and [37].
The distracted [38] driving behavior of high frequency is considered to be a high potential risk. Turn right driver distracted driving behavior may greatly upsurge the danger of conflict. Considering complexity of the program, need is there to turn right at the intersection by the movement. Numerous aspects have been examined comprising impact of geometry of road, ecological features and driver to turn right on the traffic conditions of distracted driving activity occurring. Logistic model and RF "Random Forest" implemented for regression analysis. It was noted that lane occupancy and traffic control, two models were ominously associated to distracted driving behavior. Probability odds ratio shows an analysis of the right-turn dedicated lane, and increase production in the logo design can reduce the intersection distracted driving behavior [39].
To collected 14 data on subjective risk assessments by 14 young drivers and risk events and experienced drivers during the driving behavior of older drivers. Expert drivers classify the degree of conflict and subjective emotions in combination with the threat level of the driving sight and use fuzzy signal detection to estimate the driver's risk sensitivity "d" and judgment threshold "β". Judgement threshold, which has largest variance between cluster analysis and driver, is classified into four forms in accordance to the threshold value. In conclusion, the driver classification identification model is founded on Fisher identification analysis [40].
In contrast, popular asymmetric characteristic provides vibrational characteristics for the general class of accurate legally unstable next-linear vehicles that are being considered to explain the recent mathematical framework of micro traffic. It is flowing with increasing interest in quantification. This method provides an angle to understand flow vibrations based on a mathematically asymmetric method, but still, it is unclear that in what way asymmetry happens [41].

Materials and Methodology
Analysis based on variable accuracy periods is based on a pre-training model that can recognize driving behavior. Though pre-trained models too have significant losses, the same cannot recognize distracted-led images without this particular dataset. This research is focused on driver behavior analysis. Initially, the collective driver dataset is pre-processed with several factors that are considered to have attributes. These prompting influences consider non-redundant data with uncovered data-filled or non-filled in a raw dataset and to verify the presence and originality of the values.
The ML algorithms were implemented in python language. The dataset was distributed into 80:20 ratios, demonstrating the training and test sets, correspondingly. Each ML algorithm's working principles are defined in the subsequent sections.

Random Forest
The Random forest classifier generates numerous decision trees from randomly selection subsets of the training dataset as presented in Fig. 2. Afterwards, it goes for the collection of the votes from various decision trees to elect test object's final class [42,43]. A random forest classification was projected in [44] with reduced number of trees. Fig. 2.

Support Vector Machine (SVM)
A SVM [45] is ML algorithm which is performed for both regression and classification tasks but is majorly utilized in classification. This classifier [46] has been used of late in numerous applications due to its excellent classifying capability and presentation excellence, distributing the data linearly into two separate classes (i.e., hyperplanes), with maximum distance between the two classes as Fig. 3 demonstrated below.

Logistic Regression
Logistic regression analysis is utilized to observe the connotation of (categorical or continuous) independent variable(s) with one dichotomous dependent variable. It is in divergence to linear regression analysis where the dependent variable is a continuous variable. Logistic regression [47] is a statistical classification model which counts the association between a categorical-dependent variable (containing two categories only) and one or more independent variables, that are generally (not certainly) continuous, by utilizing probability scores as the predicted values of dependent variable. Logistic regression does not take up a linear relationship between dependent and independent variables. The independent variables require neither to be generally distributed nor linearly related or nor of equal variance within each group.

Gaussian Naive Bayes
Naïve Bayes This classifier superintends ML algorithms utilizing Bayes theorem and acts on foundation whose features are analytically independent. This theorem relies on naïve assumption, where input factors are independent of each other [48][49][50][51]. Naive bayes formula is given below: Here, p H ð Þ = It is the prior means what is believed before seeing the evidence.
p D H j ð Þ = It is the likelihood of seeing that evidence if your hypothesis is correct p D ð Þ = It is the normalizing of that evidence under any circumstances

Results and Discussions
The application of all four approachesi.e., RFT, Gaussian Naïve Bayes, SVM and Logistic Regression -depicted in Tab. 3. Eqs. (2)-(6) below have been used to compute accuracy and precision, recall, error rates and specificity in every confusion matrix.

Accuracy ¼
Sum of diagnols ðTPÞ Total number of instances In which, TP "True positive" stands for Diagonals of matrix FN "False negative" stands for Sum of the consistent row for class Tab. 3 demonstrates the accuracy, precision, recall, error rate, specificity, sensitivity and f1 score attained by the various algorithms. As conclusion of Tab. 3, the uppermost accuracy for all was achieved by logistic regression. In this type of cases, accuracy only is not an adequate degree, and the f1 score counts as a significant measure for decisive selection of the best model. The f1 score is a harmonic mean of precision and recall, having upper value, when both precision and recall are higher. Therefore, the best model in cases of classification imbalance is the one which has higher f1 score even if with lower accuracy. The f1 score of Random Forest was the highest for Gaussian Naïve Bayes whereas it is low for SVM. The specificity of all the algorithms was found to be around 82%-90% or above. Moreover, this is an important parameter. All of the algorithms applied in this study formed highly accurate outcomes. The Tab. 3 above provides the comparison of precision, sensitivity, specificity, accuracy and F1-score. The proposed approach and logistic regression has produced a higher accuracy than any other methods. In the proposed prediction of the driver's behavioral pattern, the best pattern prediction from the driver's activities is made in order to provide the best recommendation for improving driving behaviors.

Conclusion
Driving assistance applications are increasing and driver behavior has currently become the focus of extensive research. As one of the main reasons for pushing accelerators leading to road safety issues, as research is being conducted to solve human behavior quirks and resulting safety issues. Proposing solutions through driving behavior analysis for solving road safety issues and other business problems, driver behavior represents different driving behavior modes of the same drive system, each with a different driving style. An important factor in driving monitoring systems is that, in order to detect driver behavior, we need to find an automatic collection of data and algorithms and model applications that generate classification methods that describe driving performance configuration.
The proposed technology includes the most current trends for analyzing real-world scenarios as well as classifications of driver behavior. Based on the system, various strategies can be employed to determine the target driver's behavior. The present driving behavior and driving style analysis system has been based on extensive research, and by utilizing neural network algorithms. The driver model was observed to have attained a 90% accuracy for logistic regression, according to the test findings using Python. Collecting massive volumes of driving data from various cars and drivers will be of tremendous interest in the future.
Funding Statement: The authors received no specific funding for this study.

Conflicts of Interest:
The authors declare that they have no conflicts of interest to report regarding the present study.