Table of Content

Open Access iconOpen Access



A Review of Data Cleaning Methods for Web Information System

Jinlin Wang1, Xing Wang1, *, Yuchen Yang1, Hongli Zhang1, Binxing Fang1

1 School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150006, China.

* Corresponding Author: Xing Wang. Email: email.

Computers, Materials & Continua 2020, 62(3), 1053-1075.


Web information system (WIS) is frequently-used and indispensable in daily social life. WIS provides information services in many scenarios, such as electronic commerce, communities, and edutainment. Data cleaning plays an essential role in various WIS scenarios to improve the quality of data service. In this paper, we present a review of the state-of-the-art methods for data cleaning in WIS. According to the characteristics of data cleaning, we extract the critical elements of WIS, such as interactive objects, application scenarios, and core technology, to classify the existing works. Then, after elaborating and analyzing each category, we summarize the descriptions and challenges of data cleaning methods with sub-elements such as data & user interaction, data quality rule, model, crowdsourcing, and privacy preservation. Finally, we analyze various types of problems and provide suggestions for future research on data cleaning in WIS from the technology and interactive perspective.


Cite This Article

APA Style
Wang, J., Wang, X., Yang, Y., Zhang, H., Fang, B. (2020). A review of data cleaning methods for web information system. Computers, Materials & Continua, 62(3), 1053-1075.
Vancouver Style
Wang J, Wang X, Yang Y, Zhang H, Fang B. A review of data cleaning methods for web information system. Comput Mater Contin. 2020;62(3):1053-1075
IEEE Style
J. Wang, X. Wang, Y. Yang, H. Zhang, and B. Fang "A Review of Data Cleaning Methods for Web Information System," Comput. Mater. Contin., vol. 62, no. 3, pp. 1053-1075. 2020.


cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 13182


  • 12810


  • 1


Related articles

Share Link