Table of Content

Open Access iconOpen Access

ARTICLE

crossmark

Enhancing Embedding-Based Chinese Word Similarity Evaluation with Concepts and Synonyms Knowledge

Fulian Yin, Yanyan Wang, Jianbo Liu*, Meiqi Ji

Communication University of China, Beijing, 100024, China

* Corresponding Author: Jianbo Liu. Email: email

(This article belongs to this Special Issue: Information Hiding and Multimedia Security)

Computer Modeling in Engineering & Sciences 2020, 124(2), 747-764. https://doi.org/10.32604/cmes.2020.010579

Abstract

Word similarity (WS) is a fundamental and critical task in natural language processing. Existing approaches to WS are mainly to calculate the similarity or relatedness of word pairs based on word embedding obtained by massive and high-quality corpus. However, it may suffer from poor performance for insuf- ficient corpus in some specific fields, and cannot capture rich semantic and sentimental information. To address these above problems, we propose an enhancing embedding-based word similarity evaluation with character-word concepts and synonyms knowledge, namely EWS-CS model, which can provide extra semantic information to enhance word similarity evaluation. The core of our approach contains knowledge encoder and word encoder. In knowledge encoder, we incorporate the semantic knowledge extracted from knowledge resources, including character-word concepts, synonyms and sentiment lexicons, to obtain knowledge representation. Word encoder is to learn enhancing embedding-based word representation from pre-trained model and knowledge representation based on similarity task. Finally, compared with baseline models, the experiments on four similarity evaluation datasets validate the effectiveness of our EWS-CS model in WS task.

Keywords


Cite This Article

Yin, F., Wang, Y., Liu, J., Ji, M. (2020). Enhancing Embedding-Based Chinese Word Similarity Evaluation with Concepts and Synonyms Knowledge. CMES-Computer Modeling in Engineering & Sciences, 124(2), 747–764.

Citations




cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 2716

    View

  • 2353

    Download

  • 0

    Like

Share Link