Open Access iconOpen Access

ARTICLE

crossmark

A Two-Stage Feature Extraction Approach for Green Energy Consumers in Retail Electricity Markets Using Clustering and TF–IDF Algorithms

Wei Yang1, Weicong Tan1, Zhijian Zeng1, Ren Li1, Jie Qin1, Yuting Xie1, Yongjun Zhang2, Runting Cheng2, Dongliang Xiao2,*

1 Guangdong Power Exchange Center Co., Ltd., Guangzhou, 510180, China
2 School of Electric Power, South China University of Technology, Guangzhou, 510641, China

* Corresponding Author: Dongliang Xiao. Email: email

Energy Engineering 2025, 122(5), 1697-1713. https://doi.org/10.32604/ee.2025.060571

Abstract

The rapid development of electricity retail market has prompted an increasing number of electricity consumers to sign green electricity contracts with retail electricity companies, which poses greater challenges for the market service for green energy consumers. This study proposed a two-stage feature extraction approach for green energy consumers leveraging clustering and term frequency-inverse document frequency (TF–IDF) algorithms within a knowledge graph framework to provide an information basis that supports the green development of the retail electricity market. First, the multi-source heterogeneous data of green energy consumers under an actual market environment is systematically introduced and the information is categorized into discrete, interval, and relational features. A clustering algorithm was employed to extract features of the trading behavior of green energy consumers in the first stage using the parameter data of green retail electricity contracts. Then, TF–IDF algorithm was applied in the second stage to extract features for green energy consumers in different clusters. Finally, the effectiveness of the proposed approach was validated based on the actual operational data in a southern province of China. It is shown that the most significant discrepancy between the retail trading behaviors of green energy consumers is the power share of green retail packages, whose averaged values are 25.64%, 50%, 39.66%, and 24.89% in four different clusters, respectively. Additionally, power supply bureaus and electricity retail companies affects the behavior of the green energy consumers most significantly.

Keywords

Green energy consumer; feature extraction; knowledge graph; retail electricity market

Cite This Article

APA Style
Yang, W., Tan, W., Zeng, Z., Li, R., Qin, J. et al. (2025). A Two-Stage Feature Extraction Approach for Green Energy Consumers in Retail Electricity Markets Using Clustering and TF–IDF Algorithms. Energy Engineering, 122(5), 1697–1713. https://doi.org/10.32604/ee.2025.060571
Vancouver Style
Yang W, Tan W, Zeng Z, Li R, Qin J, Xie Y, et al. A Two-Stage Feature Extraction Approach for Green Energy Consumers in Retail Electricity Markets Using Clustering and TF–IDF Algorithms. Energ Eng. 2025;122(5):1697–1713. https://doi.org/10.32604/ee.2025.060571
IEEE Style
W. Yang et al., “A Two-Stage Feature Extraction Approach for Green Energy Consumers in Retail Electricity Markets Using Clustering and TF–IDF Algorithms,” Energ. Eng., vol. 122, no. 5, pp. 1697–1713, 2025. https://doi.org/10.32604/ee.2025.060571



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 292

    View

  • 166

    Download

  • 0

    Like

Share Link