Open Access iconOpen Access

ARTICLE

crossmark

Research on Interpolation Method for Missing Electricity Consumption Data

Junde Chen1, Jiajia Yuan2, Weirong Chen3, Adnan Zeb4, Md Suzauddola5, Yaser A. Nanehkaran2,*

1 Department of Electronic Commerce, Xiangtan University, Xiangtan, 411105, China
2 School of Information Engineering, Yancheng Teachers University, Yancheng, 224000, China
3 Department of Information and Electrical Engineering, Ningde Normal University, Ningde, 352100, China
4 College of Engineering, Southern University of Science and Technology, Shenzhen, 518005, China
5 School of Informatics, Xiamen University, Xiamen, 361005, China

* Corresponding Author: Yaser A. Nanehkaran. Email: email

(This article belongs to the Special Issue: Industrial Big Data and Artificial Intelligence-Driven Intelligent Perception, Maintenance, and Decision Optimization in Industrial Systems)

Computers, Materials & Continua 2024, 78(2), 2575-2591. https://doi.org/10.32604/cmc.2024.048522

Abstract

Missing value is one of the main factors that cause dirty data. Without high-quality data, there will be no reliable analysis results and precise decision-making. Therefore, the data warehouse needs to integrate high-quality data consistently. In the power system, the electricity consumption data of some large users cannot be normally collected resulting in missing data, which affects the calculation of power supply and eventually leads to a large error in the daily power line loss rate. For the problem of missing electricity consumption data, this study proposes a group method of data handling (GMDH) based data interpolation method in distribution power networks and applies it in the analysis of actually collected electricity data. First, the dependent and independent variables are defined from the original data, and the upper and lower limits of missing values are determined according to prior knowledge or existing data information. All missing data are randomly interpolated within the upper and lower limits. Then, the GMDH network is established to obtain the optimal complexity model, which is used to predict the missing data to replace the last imputed electricity consumption data. At last, this process is implemented iteratively until the missing values do not change. Under a relatively small noise level (α = 0.25), the proposed approach achieves a maximum error of no more than 0.605%. Experimental findings demonstrate the efficacy and feasibility of the proposed approach, which realizes the transformation from incomplete data to complete data. Also, this proposed data interpolation approach provides a strong basis for the electricity theft diagnosis and metering fault analysis of electricity enterprises.

Keywords


Cite This Article

J. Chen, J. Yuan, W. Chen, A. Zeb, M. Suzauddola et al., "Research on interpolation method for missing electricity consumption data," Computers, Materials & Continua, vol. 78, no.2, pp. 2575–2591, 2024. https://doi.org/10.32604/cmc.2024.048522



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 266

    View

  • 115

    Download

  • 1

    Like

Share Link