Educational Videos Subtitles’ Summarization Using Latent Dirichlet Allocation and Length Enhancement

Sarah Alrumiah; Amal Al-Shargabi

doi:10.32604/cmc.2022.021780

Open Access icon Open Access

ARTICLE

Educational Videos Subtitles’ Summarization Using Latent Dirichlet Allocation and Length Enhancement

Sarah S. Alrumiah^*, Amal A. Al-Shargabi

Department of Information Technology, College of Computer, Qassim University, Buraydah, 51452, Saudi Arabia

* Corresponding Author: Sarah S. Alrumiah. Email: email

(This article belongs to the Special Issue: Machine Learning Applications in Medical, Finance, Education and Cyber Security)

Computers, Materials & Continua 2022, 70(3), 6205-6221. https://doi.org/10.32604/cmc.2022.021780

Received 14 July 2021; Accepted 19 August 2021; Issue published 11 October 2021

Abstract

Nowadays, people use online resources such as educational videos and courses. However, such videos and courses are mostly long and thus, summarizing them will be valuable. The video contents (visual, audio, and subtitles) could be analyzed to generate textual summaries, i.e., notes. Videos’ subtitles contain significant information. Therefore, summarizing subtitles is effective to concentrate on the necessary details. Most of the existing studies used Term Frequency–Inverse Document Frequency (TF-IDF) and Latent Semantic Analysis (LSA) models to create lectures’ summaries. This study takes another approach and applies Latent Dirichlet Allocation (LDA), which proved its effectiveness in document summarization. Specifically, the proposed LDA summarization model follows three phases. The first phase aims to prepare the subtitle file for modelling by performing some preprocessing steps, such as removing stop words. In the second phase, the LDA model is trained on subtitles to generate the keywords list used to extract important sentences. Whereas in the third phase, a summary is generated based on the keywords list. The generated summaries by LDA were lengthy; thus, a length enhancement method has been proposed. For the evaluation, the authors developed manual summaries of the existing “EDUVSUM” educational videos dataset. The authors compared the generated summaries with the manual-generated outlines using two methods, (i) Recall-Oriented Understudy for Gisting Evaluation (ROUGE) and (ii) human evaluation. The performance of LDA-based generated summaries outperforms the summaries generated by TF-IDF and LSA. Besides reducing the summaries’ length, the proposed length enhancement method did improve the summaries’ precision rates. Other domains, such as news videos, can apply the proposed method for video summarization.

Keywords

Subtitle summarization; educational videos; topic modelling; LDA; extractive summarization

Cite This Article

APA Style

Alrumiah, S.S., Al-Shargabi, A.A. (2022). Educational Videos Subtitles’ Summarization Using Latent Dirichlet Allocation and Length Enhancement. Computers, Materials & Continua, 70(3), 6205–6221. https://doi.org/10.32604/cmc.2022.021780

Vancouver Style

Alrumiah SS, Al-Shargabi AA. Educational Videos Subtitles’ Summarization Using Latent Dirichlet Allocation and Length Enhancement. Comput Mater Contin. 2022;70(3):6205–6221. https://doi.org/10.32604/cmc.2022.021780

IEEE Style

S. S. Alrumiah and A. A. Al-Shargabi, “Educational Videos Subtitles’ Summarization Using Latent Dirichlet Allocation and Length Enhancement,” Comput. Mater. Contin., vol. 70, no. 3, pp. 6205–6221, 2022. https://doi.org/10.32604/cmc.2022.021780

BibTex EndNote RIS

Citations

1

[click to view]

Copyright © 2022 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Educational Videos Subtitles’ Summarization Using Latent Dirichlet Allocation and Length Enhancement

Abstract

Keywords

Cite This Article

Citations

4279

2508

1

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link