Computational Frameworks for Pathway Analysis: Evaluating Over-representation, Topology-aware Scoring, and Crosstalk Modeling

Document Type : Research Article

Authors

Department of Genetics, Faculty of Basic Sciences, Shahrekord University, Shahrekord, Iran

10.22080/jgr.2026.31513.1460

Abstract

Pathway analysis translates high-throughput molecular measurements into interpretable hypotheses about coordinated biological processes, but its conclusions depend strongly on the statistical and biological assumptions embedded in each method. This narrative review provides a practical, tool-focused update rather than a proposal of a new formal taxonomy. It revisits threshold-dependent over-representation analysis, functional class scoring, topology-aware scoring, and emerging crosstalk-aware approaches, with emphasis on when each is defensible for experimental interpretation. Threshold-dependent enrichment is fast and transparent, but relies on an arbitrary significance cutoff and treats pathway members as exchangeable counts. Functional class scoring retains ranked or continuous measurements and can partially accommodate coordinated intra-pathway behavior, yet most implementations still evaluate pathways independently and do not model inter-pathway dependence. Topology-aware methods add mechanistic structure by incorporating directionality, edge sign or weight, node centrality, or perturbation propagation; their interpretability is constrained by incomplete, static, and context-agnostic pathway maps. Because signaling pathways share components and influence one another through regulatory edges, crosstalk can produce both false pathway prioritization and biologically meaningful inter-pathway signals. We therefore compare four representative crosstalk or pathway-deregulation tools Pathifier, PathTracer, PAGI, and PathwAX/PathwAX II across input requirements, outputs, crosstalk definitions, accessibility, computational burden, strengths, and limitations. Recent multi-omics and single-cell methods are also considered because pathway inference increasingly requires direction-aware, sample-level, and reproducible integration across molecular layers. No method is universally optimal: selection should be driven by data type, biological question, availability of curated topology, tolerance for model assumptions, and intended interpretation. Transparent reporting of input gene universes, pathway databases, software versions, parameter settings, and multiple-testing procedures remains essential for reproducible pathway-level inference.

Keywords

Main Subjects


Ashburner, M., Ball, C. A., Blake, J. A., Botstein, D., Butler, H., Cherry, J. M., ... & Sherlock, G. (2000). Gene ontology: tool for the unification of biology. Nature Genetics, 25(1), 25-29.‏ https://doi.org/10.1038/75556
Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B, 57(1), 289-300.  https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
Bibby, J. A., Agarwal, D., Freiwald, T., Kunz, N., Merle, N. S., West, E. E., ... & Zhang, N. R. (2022). Systematic single-cell pathway analysis to characterize early T cell activation. Cell Reports, 41(8). 111697 https://doi.org/10.1016/j.celrep.2022.111697
De Anda-Jáuregui, G., Guo, K., McGregor, B. A., Feldman, E. L., & Hur, J. (2019). Pathway crosstalk perturbation network modeling for identification of connectivity changes induced by diabetic neuropathy and pioglitazone. BMC Systems Biology, 13(1), 1. https://doi.org/10.1186/s12918-018-0674-7
De Maturana, E., Alonso, L., Alarcon, P., Martin-Antoniano, I. A., Pineda, S., Piorno, L., ... & Malats, N. (2019). Challenges in the integration of omics and non-omics data. Genes, 10(3), 238. https://doi.org/10.3390/genes10030238
Donato, M., Xu, Z., Tomoiaga, A., Granneman, J. G., MacKenzie, R. G., Bao, R., ... & Draghici, S. (2013). Analysis and correction of crosstalk effects in pathway analysis. Genome Research, 23(11), 1885-1893.‏ https://doi.org/10.1101/gr.153551.112
Draghici, S., Khatri, P., Tarca, AL., Amin, K., Done, A., Voichita, C., Georgescu, C., Romero, R. (2007). A systems biology approach for pathway level analysis. Genome Research, 17(10), 1537-1545. https://doi.org/10.1101/gr.6202607
Drier, Y., Sheffer, M., & Domany, E. (2013). Pathway-based personalized analysis of cancer. Proceedings of the National Academy of Sciences of the United States of America, 110(16), 6388-6393. https://doi.org/10.1073/pnas.1219651110
Efron, B., & Tibshirani, R. (2007). On testing the significance of sets of genes. Annals of Applied Statistics, 1(1), 107-129. https://doi.org/10.1214/07-AOAS101
Fa, B., Luo, C., Tang, Z., Yan, Y., Zhang, Y., & Yu, Z. (2019). Pathway-based biomarker identification with crosstalk analysis for robust prognosis prediction in hepatocellular carcinoma. EBioMedicine, 44, 295-303. https://doi.org/10.1016/j.ebiom.2019.05.010
Gatti, D. M., Barry, W. T., Nobel, A. B., Rusyn, I., & Wright, F. A. (2010). Heading down the wrong pathway: On the influence of correlation within gene sets. BMC Genomics, 11, 574. https://doi.org/10.1186/1471-2164-11-574
Han, J., Li, C., Yang, H., Xu, Y., Zhang, C., Ma, J., ...& Li, X.  (2015). A novel dysregulated pathway-identification analysis based on global influence of within-pathway effects and crosstalk between pathways. Journal of the Royal Society Interface, 12(102), 20140937. https://doi.org/10.1098/rsif.2014.0937
Hänzelmann, S., Castelo, R., & Guinney, J. (2013). GSVA: Gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics, 14(1), 7. https://doi.org/10.1186/1471-2105-14-7
Huang, D. W., Sherman, B. T., & Lempicki, R. A. (2009). Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols, 4(1), 44-57. https://doi.org/10.1038/nprot.2008.211
Jaccard, P. (1908). Nouvelles recherches sur la distribution florale. Bulletin de la Societe Vaudoise des Sciences Naturelles, 44(163), 223-270. https://doi.org/10.5169/seals-268384
Jiang, Z., & Gentleman, R. (2007). Extensions to gene set enrichment. Bioinformatics, 23(3), 306-313. https://doi.org/10.1093/bioinformatics/btl599
Khatri, P., Sirota, M., & Butte, A. J. (2012). Ten years of pathway analysis: Current approaches and outstanding challenges. PLOS Computational Biology, 8(2), e1002375. https://doi.org/10.1371/journal.pcbi.1002375
Kim, S. Y., & Volsky, D. J. (2005). PAGE: Parametric analysis of gene set enrichment. BMC Bioinformatics, 6(1), 144. https://doi.org/10.1186/1471-2105-6-144
Liu, H., Yuan, M., Mitra, R., Zhou, X., Long, M., Lei, W., ... & Jiang, W. (2022). CTpathway: a CrossTalk-based pathway enrichment analysis method for cancer research. Genome Medicine, 14(1), 118.‏ https://doi.org/10.1186/s13073-022-01119-6
Mi, H., Muruganujan, A., Casagrande, J. T., & Thomas, P. D. (2013). Large-scale gene function analysis with the PANTHER classification system. Nature Protocols, 8(8), 1551-1566. https://doi.org/10.1038/nprot.2013.092
Nguyen, T. M., Shafi, A., Nguyen, T., & Draghici, S. (2019). Identifying significantly impacted pathways: A comprehensive review and assessment. Genome Biology, 20(1), 203. https://doi.org/10.1186/s13059-019-1790-4
Nygård, S., Lingjærde, O. C., Caldas, C., Hovig, E., Børresen-Dale, A. L., Helland, Å., & Haakensen, V. D. (2019). PathTracer: High-sensitivity detection of differential pathway activity in tumours. Scientific Reports, 9(1), 16332. https://doi.org/10.1038/s41598-019-52529-3
Ogris, C., Castresana-Aguirre, M., & Sonnhammer, E. L. L. (2022). PathwAX II: Network-based pathway analysis with interactive visualization of network crosstalk. Bioinformatics, 38(9), 2659-2660. https://doi.org/10.1093/bioinformatics/btac153
Ogris, C., Guala, D., Helleday, T., & Sonnhammer, E. L. L. (2017). A novel method for crosstalk analysis of biological networks: Improving accuracy of pathway annotation. Nucleic Acids Research, 45(2), e8. https://doi.org/10.1093/nar/gkw849
Ogris, C., Helleday, T., & Sonnhammer, E. L. L. (2016). PathwAX: A web server for network crosstalk-based pathway annotation. Nucleic Acids Research, 44(W1), W105-W109. https://doi.org/10.1093/nar/gkw356
Paczkowska, M., Barenboim, J., Sintupisut, N., Fox, N. S., Zhu, H., Abd-Rabbo, D., ... & Reimand, J. (2020). Integrative pathway enrichment analysis of multivariate omics data. Nature Communications, 11(1), 735.‏ https://doi.org/10.1038/s41467-019-13983-9
Persson, E., Castresana-Aguirre, M., Buzzao, D., Guala, D., & Sonnhammer, E. L. L. (2021). FunCoup 5: Functional association networks in all domains of life, supporting directed links and tissue-specificity. Journal of Molecular Biology, 433(18), 166835. https://doi.org/10.1016/j.jmb.2021.166835
Rowland, M. A., Greenbaum, J. M., & Deeds, E. J. (2017). Crosstalk and the evolvability of intracellular communication. Nature Communications, 8(1), 16009. https://doi.org/10.1038/ncomms16009
Sam, S. A., Teel, J., Tegge, A. N., Bharadwaj, A., & Murali, T. M. (2017). XTALKDB: A database of signaling pathway crosstalk. Nucleic Acids Research, 45(W1), W393-W397. https://doi.org/10.1093/nar/gkw1037
Shojaie, A., & Michailidis, G. (2010). Network enrichment analysis in complex experiments. Statistical Applications in Genetics and Molecular Biology, 9(1), 22. https://doi.org/10.2202/1544-6115.1559
Slobodyanyuk, M., Bahcheli, A. T., Klein, Z. P., Bayati, M., Strug, L. J., & Reimand, J. (2024). Directional integration and pathway enrichment analysis for multi-omics data. Nature Communications, 15(1), 5690. https://doi.org/10.1038/s41467-024-49986-4
Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., & Gillette, M. A. (2005). Gene set enrichment Analysis: A knowledge-based approach for interpreting genome-wide expression profiles, Proceedings of the National Academy of Sciences of the United States of America, 102(43), 15545-15550.‏ https://doi.org/10.1073/pnas.0506580102
Szklarczyk, D., Kirsch, R., Koutrouli, M., Nastou, K., Mehryary, F., Hachilif, R., ... & Von Mering, C. (2023). The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Research, 51(D1), D638-D646.‏ https://doi.org/10.1093/nar/gkac1000
Tarca, A. L., Draghici, S., Khatri, P., Hassan, S. S., Mittal, P., Kim, J. S., ... & Romero, R. (2009). A novel signaling pathway impact analysis. Bioinformatics, 25(1), 75-82. https://doi.org/10.1093/bioinformatics/btn577
Tegge, A. N., Sharp, N., & Murali, T. M. (2016). Xtalk: A path-based approach for identifying crosstalk between signaling pathways. Bioinformatics, 32(12), 2016-2023. https://doi.org/10.1093/bioinformatics/btv549
Tian, L., Greenberg, S. A., Kong, S. W., Altschuler, J., Kohane, I. S., & Park, P. J. (2005). Discovering statistically significant pathways in expression profiling studies. Proceedings of the National Academy of Sciences of the United States of America, 102(38), 13544-13549. https://doi.org/10.1073/pnas.0506577102
Wadi, L., Meyer, M., Weiser, J., Stein, L. D., & Reimand, J. (2016). Impact of outdated gene annotations on pathway enrichment analysis. Nature Methods, 13(9), 705-706. https://doi.org/10.1038/nmeth.3963
Wang, T., Shi, X. Z., & Wu, W. H. (2019). Crosstalk analysis of dysregulated pathways in preeclampsia. Experimental and Therapeutic Medicine, 18(2), 1451-1456. https://doi.org/10.3892/etm.2019.7178
Ziemann, M., Poulain, P., & Bora, A. (2023). The five pillars of computational reproducibility: Bioinformatics and beyond. Briefings in Bioinformatics, 24(6), bbad375. https://doi.org/10.1093/bib/bbad375