Search Results (36)
  • Open Access

    ARTICLE

    Improving Chinese Word Representation with Conceptual Semantics

    Tingxin Wei1, 2, Weiguang Qu2, 3, *, Junsheng Zhou3, Yunfei Long4, Yanhui Gu3, Zhentao Xia3

    CMC-Computers, Materials & Continua, Vol.64, No.3, pp. 1897-1913, 2020, DOI:10.32604/cmc.2020.010813

    Abstract The meaning of a word includes a conceptual meaning and a distributive meaning. Word embedding based on distribution suffers from insufficient conceptual semantic representation caused by data sparsity, especially for low-frequency words. In knowledge bases, manually annotated semantic knowledge is stable and the essential attributes of words are accurately denoted. In this paper, we propose a Conceptual Semantics Enhanced Word Representation (CEWR) model, computing the synset embedding and hypernym embedding of Chinese words based on the Tongyici Cilin thesaurus, and aggregating it with distributed word representation to have both distributed information and the conceptual meaning encoded in the representation of…

  • Open Access

    ARTICLE

    MII: A Novel Text Classification Model Combining Deep Active Learning with BERT

    Anman Zhang1, Bohan Li1, 2, 3, *, Wenhuan Wang1, Shuo Wan1, Weitong Chen4

    CMC-Computers, Materials & Continua, Vol.63, No.3, pp. 1499-1514, 2020, DOI:10.32604/cmc.2020.09962

    Abstract Active learning has been widely utilized to reduce the labeling cost of supervised learning. By selecting specific instances to train the model, the performance of the model was improved within limited steps. However, rare work paid attention to the effectiveness of active learning on it. In this paper, we proposed a deep active learning model with bidirectional encoder representations from transformers (BERT) for text classification. BERT takes advantage of the self-attention mechanism to integrate contextual information, which is beneficial to accelerate the convergence of training. As for the process of active learning, we design an instance selection strategy based on…

  • Open Access

    ARTICLE

    Review of Text Classification Methods on Deep Learning

    Hongping Wu1, Yuling Liu1, *, Jingwen Wang2

    CMC-Computers, Materials & Continua, Vol.63, No.3, pp. 1309-1321, 2020, DOI:10.32604/cmc.2020.010172

    Abstract Text classification has always been an increasingly crucial topic in natural language processing. Traditional text classification methods based on machine learning have many disadvantages such as dimension explosion, data sparsity, limited generalization ability and so on. Based on deep learning text classification, this paper presents an extensive study on the text classification models including Convolutional Neural Network-Based (CNN-Based), Recurrent Neural Network-Based (RNN-based), Attention Mechanisms-Based and so on. Many studies have proved that text classification methods based on deep learning outperform the traditional methods when processing large-scale and complex datasets. The main reasons are text classification methods based on deep learning…

  • Open Access

    ARTICLE

    Research on Privacy Disclosure Detection Method in Social Networks Based on Multi-Dimensional Deep Learning

    Yabin Xu1, 2, *, Xuyang Meng1, Yangyang Li3, Xiaowei Xu4, *

    CMC-Computers, Materials & Continua, Vol.62, No.1, pp. 137-155, 2020, DOI:10.32604/cmc.2020.05825

    Abstract In order to effectively detect the privacy that may be leaked through social networks and avoid unnecessary harm to users, this paper takes microblog as the research object to study the detection of privacy disclosure in social networks. First, we perform fast privacy leak detection on the currently published text based on the fastText model. In the case that the text to be published contains certain private information, we fully consider the aggregation effect of the private information leaked by different channels, and establish a convolution neural network model based on multi-dimensional features (MF-CNN) to detect privacy disclosure comprehensively and…

  • Open Access

    ARTICLE

    Multi-Label Chinese Comments Categorization: Comparison of Multi-Label Learning Algorithms

    Jiahui He1, Chaozhi Wang1, Hongyu Wu1, Leiming Yan1,*, Christian Lu2

    Journal of New Media, Vol.1, No.2, pp. 51-61, 2019, DOI:10.32604/jnm.2019.06238

    Abstract Multi-label text categorization refers to the problem of categorizing text through a multi-label learning algorithm. Text classification for Asian languages such as Chinese is different from work for other languages such as English which use spaces to separate words. Before classifying text, it is necessary to perform a word segmentation operation to convert a continuous language into a list of separate words and then convert it into a vector of a certain dimension. Generally, multi-label learning algorithms can be divided into two categories, problem transformation methods and adapted algorithms. This work will use customer's comments about some hotels as a…

  • Open Access

    ARTICLE

    Cross-Lingual Non-Ferrous Metals Related News Recognition Method Based on CNN with A Limited Bi-Lingual Dictionary

    Xudong Hong1, Xiao Zheng1,*, Jinyuan Xia1, Linna Wei1, Wei Xue1

    CMC-Computers, Materials & Continua, Vol.58, No.2, pp. 379-389, 2019, DOI:10.32604/cmc.2019.04059

    Abstract To acquire non-ferrous metals related news from different countries’ internet, we proposed a cross-lingual non-ferrous metals related news recognition method based on CNN with a limited bilingual dictionary. Firstly, considering the lack of related language resources of non-ferrous metals, we use a limited bilingual dictionary and CCA to learn cross-lingual word vector and to represent news in different languages uniformly. Then, to improve the effect of recognition, we use a variant of the CNN to learn recognition features and construct the recognition model. The experimental results show that our proposed method acquires better results.
