Open Access iconOpen Access

ARTICLE

Syntactic and Socially Responsible Machine Translation: A POS and DEP Integrated Framework for English–Tamil

Rama Sugavanam*, Mythili Ramu

Department of Information Technology, School of Computer Science and Engineering, SRM Institute of Science and Technology, Ramapuram Campus, Chennai, India

* Corresponding Author: Rama Sugavanam. Email: email

Computers, Materials & Continua 2026, 87(1), 97 https://doi.org/10.32604/cmc.2026.071469

Abstract

When performing English-to-Tamil Neural Machine Translation (NMT), end users face several challenges due to Tamil’s rich morphology, free word order, and limited annotated corpora. Although available transformer-based models offer strong baselines, they compromise syntactic awareness and the detection and management of offensive content in cluttered, noisy, and informal text. In this paper, we present POSDEP-Offense-Trans, a multi-task NMT framework that combines Part-of-Speech (POS) and Dependency Parsing (DEP) methods with a robust offensive language classification module. Our architecture enriches the Transformer encoder with syntax-aware embeddings and provides syntax-guided attention mechanisms. The architecture incorporates a structure-aware contrastive loss that reinforces syntactic consistency and deploys auxiliary classification heads for POS tagging, dependency parsing, and multi-class offensive detection. The classifier for offensive words operates at both sentence and token levels and obtains guidance from syntactic features and formal finite automata rules that model offensive language structures-hate speech, profanity, sarcasm, and threats. Using this architecture, we construct a syntactically enriched, socially annotated corpus. Experimental results show improvements in translation quality, with a BLEU score of 33.5, UAS/LAS parsing accuracies of 92.4% and 90%, and a 4.5% F1-score gain in offensive content detection compared with baseline POS + DEP + Offense models. Also, the proposed model achieved 92.3% in offensive content neutralization, as confirmed by ablation studies. This comprehensive English–Tamil NMT model that unifies syntactic modelling and ethical filtering—laying the groundwork for applications in social media moderation, hate speech mitigation, and policy-compliant multilingual content generation.

Keywords

POS-aware NMT; dependency parsing; syntax-guided attention; multi-task learning; offensive language detection; offensive language neutralization; English–Tamil neural machine translation

Cite This Article

APA Style
Sugavanam, R., Ramu, M. (2026). Syntactic and Socially Responsible Machine Translation: A POS and DEP Integrated Framework for English–Tamil. Computers, Materials & Continua, 87(1), 97. https://doi.org/10.32604/cmc.2026.071469
Vancouver Style
Sugavanam R, Ramu M. Syntactic and Socially Responsible Machine Translation: A POS and DEP Integrated Framework for English–Tamil. Comput Mater Contin. 2026;87(1):97. https://doi.org/10.32604/cmc.2026.071469
IEEE Style
R. Sugavanam and M. Ramu, “Syntactic and Socially Responsible Machine Translation: A POS and DEP Integrated Framework for English–Tamil,” Comput. Mater. Contin., vol. 87, no. 1, pp. 97, 2026. https://doi.org/10.32604/cmc.2026.071469



cc Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 122

    View

  • 20

    Download

  • 0

    Like

Share Link