TY - EJOU AU - Dong, Jianhua AU - Wu, Lifeng AU - Liu, Xiaogang AU - Fan, Cheng AU - Leng, Menghui AU - Yang, Qiliang TI - Simulation of Daily Diffuse Solar Radiation Based on Three Machine Learning Models T2 - Computer Modeling in Engineering \& Sciences PY - 2020 VL - 123 IS - 1 SN - 1526-1506 AB - Solar radiation is an important parameter in the fields of computer modeling, engineering technology and energy development. This paper evaluated the ability of three machine learning models, i.e., Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM) and Multivariate Adaptive Regression Splines (MARS), to estimate the daily diffuse solar radiation (Rd). The regular meteorological data of 1966-2015 at five stations in China were taken as the input parameters (including mean average temperature (Ta), theoretical sunshine duration (N), actual sunshine duration (n), daily average air relative humidity (RH), and extra-terrestrial solar radiation (Ra)). And their estimation accuracies were subjected to comparative analysis. The three models were first trained using meteorological data from 1966 to 2000. Then, the 2001-2015 data was used to test the trained machine learning model. The results show that the XGBoost had better accuracy than the other two models in coefficient of determination (R2 ), root mean square error (RMSE), mean bias error (MBE) and normalized root mean square error (NRMSE). The MARS performed better in the training phase than the testing phase, but became less accurate in the testing phase, with the R2 value falling by 2.7-16.9% on average. By contrast, the R2 values of SVM and XGBoost increased by 2.9-12.2% and 1.9-14.3%, respectively. Despite trailing slightly behind the SVM at the Beijing station, the XGBoost showed good performance at the rest of the stations in the two phases. In the training phase, the accuracy growth is small but observable. In addition, the XGBoost had a slightly lower RMSE than the SVM, a signal of its edge in stability. Therefore, the three machine learning models can estimate the daily Rd based on local inputs and the XGBoost stands out for its excellent performance and stability. KW - Diffuse solar radiation KW - extreme gradient boosting KW - multivariate adaptive regression splines KW - statistical indices KW - support vector machine DO - 10.32604/cmes.2020.09014