Research and Practice of Telecommunication User Rating Method Based on Machine Learning

Qian Tang; Hao Chen; Yifei Wei

doi:10.32604/jbd.2022.026850

Open Access icon Open Access

ARTICLE

Research and Practice of Telecommunication User Rating Method Based on Machine Learning

Qian Tang, Hao Chen, Yifei Wei^*

Beijing University of Posts and Telecommunications, Beijing, 100876, China

* Corresponding Author: Yifei Wei. Email: email

Journal on Big Data 2022, 4(1), 27-39. https://doi.org/10.32604/jbd.2022.026850

Received 12 January 2022; Accepted 22 February 2022; Issue published 04 May 2022

Abstract

The machine learning model has advantages in multi-category credit rating classification. It can replace discriminant analysis based on statistical methods, greatly helping credit rating reduce human interference and improve rating efficiency. Therefore, we use a variety of machine learning algorithms to study the credit rating of telecom users. This paper conducts data understanding and preprocessing on Operator Telecom user data, and matches the user’s characteristics and tags based on the time sliding window method. In order to deal with the deviation caused by the imbalance of multi-category data, the SMOTE oversampling method is used to balance the data. Using the Removing features with low variance method and packaging method for feature selection, then the basic models are established. The empirical results of the model show that the Random Forest and XGBOOST ensemble models are better than the single models such as Bayes, SVM, KNN, and Decision Tree. The performance of Decision Tree in single models is better. Therefore, Random Forest, XGBOOST and Decision Tree models were selected to debug the hyper parameters to achieve model optimization. Based on the optimized model, the accuracy, recall, precision, confusion matrix and other indicators are evaluated, and it is concluded that low-level recognition is more accurate than high-level recognition and fewer misjudgments. Comparing the evaluation indicators of each level of different models, it is found that the integrated model performs better, indicating that Random Forest and XGBOOST are more suitable for solving the problem of telecommunications user rating. For this reason, this article proposes an implementation plan based on Random Forest and XGBOOST algorithm and model for the problem of telecommunications user rating.

Keywords

Credit rating; model evaluation; random forest; XGBOOST

Cite This Article

APA Style

Tang, Q., Chen, H., Wei, Y. (2022). Research and Practice of Telecommunication User Rating Method Based on Machine Learning. Journal on Big Data, 4(1), 27–39. https://doi.org/10.32604/jbd.2022.026850

Vancouver Style

Tang Q, Chen H, Wei Y. Research and Practice of Telecommunication User Rating Method Based on Machine Learning. J Big Data. 2022;4(1):27–39. https://doi.org/10.32604/jbd.2022.026850

IEEE Style

Q. Tang, H. Chen, and Y. Wei, “Research and Practice of Telecommunication User Rating Method Based on Machine Learning,” J. Big Data, vol. 4, no. 1, pp. 27–39, 2022. https://doi.org/10.32604/jbd.2022.026850

BibTex EndNote RIS

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Research and Practice of Telecommunication User Rating Method Based on Machine Learning

Abstract

Keywords

Cite This Article

2389

1698

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link