Lubna Alhenak1, Manar Hosny1,*
CMC-Computers, Materials & Continua, Vol.61, No.3, pp. 1045-1074, 2019, DOI:10.32604/cmc.2019.08355
Abstract In recent years, the volume of information in digital form has increased tremendously owing to the increased popularity of the World Wide Web. As a result, the use of techniques for extracting useful information from large collections of data, and particularly documents, has become more necessary and challenging. Text clustering is such a technique; it consists in dividing a set of text documents into clusters (groups), so that documents within the same cluster are closely related, whereas documents in different clusters are as different as possible. Clustering depends on measuring the content (i.e., words) of a document in terms of… More >