Open Access iconOpen Access

ARTICLE

Interpretable Deep Representation Learning for Pan-Cancer Diagnosis via Pathway-Constrained Transcriptomics

Maram Fahaad Almufareh1,*, Samabia Tehsin2,*

1 Department of Information Systems, College of Computer and Information Sciences, Jouf University, Al-Jawf, Saudi Arabia
2 Center of Excellence–AI, Bahria University, Islamabad, Pakistan

* Corresponding Authors: Maram Fahaad Almufareh. Email: email; Samabia Tehsin. Email: email

(This article belongs to the Special Issue: Mathematical Aspects of Computational Biology and Bioinformatics-III)

Computer Modeling in Engineering & Sciences 2026, 147(3), 29 https://doi.org/10.32604/cmes.2026.081129

Abstract

This article presents a Hierarchical Pathway-Masked Attention Autoencoder (H-PAAE), a biologically inspired representation-learning framework that enables explainable AI-guided cancer diagnosis. The model directly integrates the curated MSigDB Hallmark pathways, introducing pathway-constrained information flow and mechanistic interpretability through multi-level attention mechanisms. Based on TCGA RNA-seq data from 33 tumor types, H-PAAE compresses approximately 20,000 genes into a 128-dimensional latent space while preserving biologically meaningful structure. When used with XGBoost classification, H-PAAE delivers 92.37% test accuracy and 99.38% macro-AUROC with robust cross-validation results (92.5 ± 0.6%). SHAP analysis identifies a small number of key latent features, corresponding to conserved oncogenic processes, and pathway enrichment analysis shows strong overlap with cancer hallmarks. H-PAAE provides a clear and interpretable biological foundation for pan-cancer classification, with well-calibrated posterior probabilities that can be used for clinical decision-making, and can be easily integrated into multimodal diagnostic workflows.

Keywords

Computational biology; bioinformatics; cancer computational biology; transcriptomics; interpretable deep learning; pan-cancer analysis; gene expression analysis; machine learning in genomics

Cite This Article

APA Style
Almufareh, M.F., Tehsin, S. (2026). Interpretable Deep Representation Learning for Pan-Cancer Diagnosis via Pathway-Constrained Transcriptomics. Computer Modeling in Engineering & Sciences, 147(3), 29. https://doi.org/10.32604/cmes.2026.081129
Vancouver Style
Almufareh MF, Tehsin S. Interpretable Deep Representation Learning for Pan-Cancer Diagnosis via Pathway-Constrained Transcriptomics. Comput Model Eng Sci. 2026;147(3):29. https://doi.org/10.32604/cmes.2026.081129
IEEE Style
M. F. Almufareh and S. Tehsin, “Interpretable Deep Representation Learning for Pan-Cancer Diagnosis via Pathway-Constrained Transcriptomics,” Comput. Model. Eng. Sci., vol. 147, no. 3, pp. 29, 2026. https://doi.org/10.32604/cmes.2026.081129



cc Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 303

    View

  • 81

    Download

  • 0

    Like

Share Link