TY  - EJOU
AU  - Jiang, Wangdong
AU  - Yang, Taian
AU  - Sun, Guang
AU  - Li, Yucai
AU  - Tang, Yixuan
AU  - Lv, Hongzhang
AU  - Xiang, Wenqian
TI  - The Analysis of China’s Integrity Situation Based on Big Data
T2  - Journal on Big Data
PY  - 2019
VL  - 1
IS  - 3
SN  - 2579-0056
AB  - To study in depth the prominent problems facing China’s clean government work and to put forward effective coping strategies, this article uses big data technology to analyze online information on anti-corruption news events. The data source is news reports from the website of the Communist Party of China (CPC) Central Commission for Discipline Inspection (CCDI). First, the collected text is preprocessed by word segmentation and stop-word removal; the preprocessed data is then vectorized and clustered; finally, the key words of clean government work are derived from a visualization analysis of the clustering results. The results show that China’s clean government work should focus on the ‘four forms of decadence’, and that the relevant departments must strictly crack down on five categories of phenomena: “illegal payment of subsidies or benefits, illegal delivery of gifts and cash gift, illegal use of official vehicles, banquets using public funds, extravagant wedding ceremonies and funeral”. These results are consistent with the official data released on the CCDI’s website, which suggests that the method is feasible and effective.
KW  - Big data
KW  - anti-corruption
KW  - text clustering
KW  - visualization
DO  - 10.32604/jbd.2019.08454