Search
2026 Volume 2
Article Contents
INVITED REVIEW   Open Access    

AI-driven deciphering of cell niches with single-cell and spatial omics: a new perspective for traditional Chinese medicine research

More Information
  • Received: 26 December 2025
    Revised: 23 January 2026
    Accepted: 26 January 2026
    Published online: 05 March 2026
    Targetome  2(2) Article number: e009 (2026)  |  Cite this article
  • The therapeutic effects of traditional Chinese medicine (TCM) are characterized by holistic and systematic regulation of the organism, typically achieved through coordinating tissue microenvironments and multicellular interactions. However, conventional molecular biology approaches struggle to precisely characterize cellular composition, spatial architecture, and associated microenvironment (cell niche) at the tissue level, constraining a comprehensive understanding of the mechanism of action of TCM. In recent years, the rapid advancement of single-cell and spatial omics technologies, together with artificial intelligence (AI)-driven computational methods, has enabled systematic deciphering of cell niches within complex tissues, offering new opportunities for TCM research. Centered on the cell niche, this review outlines its conceptual development and research progress, with a particular focus on recent advances in AI-assisted cell niche analysis based on single-cell and spatial omics data. We further summarize representative scenarios of cell niche analysis in TCM research, while discussing current challenges and future directions, highlighting its potential to provide a new perspective and analytical paradigm in TCM.
  • 加载中
  • [1] Li X, Liu Z, Liao J, Chen Q, Lu X, et al. 2023. Network pharmacology approaches for research of Traditional Chinese Medicines. Chinese Journal of Natural Medicines 21:323−332 doi: 10.1016/S1875-5364(23)60429-7

    CrossRef   Google Scholar

    [2] Su X, Yan X, Zhang H. 2025. The tumor microenvironment in hepatocellular carcinoma: mechanistic insights and therapeutic potential of traditional Chinese medicine. Molecular Cancer 24:173 doi: 10.1186/s12943-025-02378-8

    CrossRef   Google Scholar

    [3] Wu L, Wu Q, Du X, Ling M, Tong H. 2025. Revolutionizing pharmacological research of traditional Chinese medicine with single-cell omics technologies. Fitoterapia 186:106846 doi: 10.1016/j.fitote.2025.106846

    CrossRef   Google Scholar

    [4] Wang Z, Zhang T, Wang B, Li S. 2025. TCM network pharmacology: new perspective integrating network target with artificial intelligence and multi-modal multi-omics technologies. Chinese Journal of Natural Medicines 23:1425−1434 doi: 10.1016/S1875-5364(25)60986-1

    CrossRef   Google Scholar

    [5] Zhai Y, Liu L, Zhang F, Chen X, Wang H, et al. 2025. Network pharmacology: a crucial approach in traditional Chinese medicine research. Chinese Medicine 20:8 doi: 10.1186/s13020-024-01056-z

    CrossRef   Google Scholar

    [6] Wang X, Wang ZY, Zheng JH, Li S. 2021. TCM network pharmacology: a new trend towards combining computational, experimental and clinical approaches. Chinese Journal of Natural Medicines 19:1−11 doi: 10.1016/S1875-5364(21)60001-8

    CrossRef   Google Scholar

    [7] Klein AM, Mazutis L, Akartuna I, Tallapragada N, Veres A, et al. 2015. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell 161:1187−1201 doi: 10.1016/j.cell.2015.04.044

    CrossRef   Google Scholar

    [8] Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, et al. 2015. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161:1202−1214 doi: 10.1016/j.cell.2015.05.002

    CrossRef   Google Scholar

    [9] Picelli S, Björklund ÅK, Faridani OR, Sagasser S, Winberg G, et al. 2013. Smart-seq2 for sensitive full-length transcriptome profiling in single cells. Nature Methods 10:1096−1098 doi: 10.1038/nmeth.2639

    CrossRef   Google Scholar

    [10] Ståhl PL, Salmén F, Vickovic S, Lundmark A, Navarro JF, et al. 2016. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science 353:78−82 doi: 10.1126/science.aaf2403

    CrossRef   Google Scholar

    [11] Moffitt JR, Bambah-Mukku D, Eichhorn SW, Vaughn E, Shekhar K, et al. 2018. Molecular, spatial, and functional single-cell profiling of the hypothalamic preoptic region. Science 362:eaau5324 doi: 10.1126/science.aau5324

    CrossRef   Google Scholar

    [12] Stickels RR, Murray E, Kumar P, Li J, Marshall JL, et al. 2021. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nature Biotechnology 39:313−319 doi: 10.1038/s41587-020-0739-1

    CrossRef   Google Scholar

    [13] Chen A, Liao S, Cheng M, Ma K, Wu L, et al. 2022. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell 185:1777−1792.e21 doi: 10.1016/j.cell.2022.04.003

    CrossRef   Google Scholar

    [14] Goltsev Y, Samusik N, Kennedy-Darling J, Bhate S, Hale M, et al. 2018. Deep profiling of mouse splenic architecture with CODEX multiplexed imaging. Cell 174:968−981.e15 doi: 10.1016/j.cell.2018.07.010

    CrossRef   Google Scholar

    [15] Keren L, Bosse M, Marquez D, Angoshtari R, Jain S, et al. 2018. A structured tumor-immune microenvironment in triple negative breast cancer revealed by multiplexed ion beam imaging. Cell 174:1373−1387.e19 doi: 10.1016/j.cell.2018.08.039

    CrossRef   Google Scholar

    [16] Janesick A, Shelansky R, Gottscho AD, Wagner F, Williams SR, et al. 2023. High resolution mapping of the tumor microenvironment using integrated single-cell, spatial and in situ analysis. Nature Communications 14:8353 doi: 10.1038/s41467-023-43458-x

    CrossRef   Google Scholar

    [17] Shao X, Yang H, Zhuang X, Liao J, Yang P, et al. 2021. scDeepSort: a pre-trained cell-type annotation method for single-cell transcriptomics using deep learning with a weighted graph neural network. Nucleic Acids Research 49:e122 doi: 10.1093/nar/gkab775

    CrossRef   Google Scholar

    [18] Qian J, Liao J, Liu Z, Chi Y, Fang Y, et al. 2023. Reconstruction of the cell pseudo-space from single-cell RNA sequencing data with scSpace. Nature Communications 14:2484 doi: 10.1038/s41467-023-38121-4

    CrossRef   Google Scholar

    [19] Lopez R, Regier J, Cole MB, Jordan MI, Yosef N. 2018. Deep generative modeling for single-cell transcriptomics. Nature Methods 15:1053−1058 doi: 10.1038/s41592-018-0229-2

    CrossRef   Google Scholar

    [20] Qian J, Bao H, Shao X, Fang Y, Liao J, et al. 2024. Simulating multiple variability in spatially resolved transcriptomics with scCube. Nature Communications 15:5021 doi: 10.1038/s41467-024-49445-0

    CrossRef   Google Scholar

    [21] Levine D, Rizvi SA, Lévy S, Pallikkavaliyaveetil N, Zhang D, et al. 2023. Cell2Sentence: teaching large language models the language of biology. bioRxiv 00:Preprint doi: 10.1101/2023.09.11.557287

    CrossRef   Google Scholar

    [22] Cui H, Wang C, Maan H, Pang K, Luo F, et al. 2024. scGPT: toward building a foundation model for single-cell multi-omics using generative AI. Nature Methods 21:1470−1480 doi: 10.1038/s41592-024-02201-0

    CrossRef   Google Scholar

    [23] Theodoris CV, Xiao L, Chopra A, Chaffin MD, Al Sayed ZR, et al. 2023. Transfer learning enables predictions in network biology. Nature 618:616−624 doi: 10.1038/s41586-023-06139-9

    CrossRef   Google Scholar

    [24] He S, Zhu Y, Tavakol DN, Ye H, Lao YH, et al. 2026. Squidiff: predicting cellular development and responses to perturbations using a diffusion model. Nature Methods 23:65−77 doi: 10.1038/s41592-025-02877-y

    CrossRef   Google Scholar

    [25] Aivazidis A, Memi F, Kleshchevnikov V, Er S, Clarke B, et al. 2025. Cell2fate infers RNA velocity modules to improve cell fate prediction. Nature Methods 22:698−707 doi: 10.1038/s41592-025-02608-3

    CrossRef   Google Scholar

    [26] Mizukoshi C, Kojima Y, Nomura S, Hayashi S, Abe K, et al. 2024. DeepKINET: a deep generative model for estimating single-cell RNA splicing and degradation rates. Genome Biology 25:229 doi: 10.1186/s13059-024-03367-8

    CrossRef   Google Scholar

    [27] Fan X, Liu J, Yang Y, Gu C, Han Y, et al. 2024. scGraphformer: unveiling cellular heterogeneity and interactions in scRNA-seq data using a scalable graph transformer network. Communications Biology 7:1463 doi: 10.1038/s42003-024-07154-w

    CrossRef   Google Scholar

    [28] Raghavan V, Zheng Y, Li Y, Ding J. 2025. Harnessing agent-based frameworks in CellAgentChat to unravel cell-cell interactions from single-cell and spatial transcriptomics. Genome Research 35:1646−1663 doi: 10.1101/gr.279771.124

    CrossRef   Google Scholar

    [29] Velikic G, Maric DM, Maric DL, Supic G, Puletic M, et al. 2024. Harnessing the stem cell niche in regenerative medicine: innovative avenue to combat neurodegenerative diseases. International Journal of Molecular Sciences 25:993 doi: 10.3390/ijms25020993

    CrossRef   Google Scholar

    [30] Lázár E, Lundeberg J. 2026. Spatial architecture of development and disease. Nature Reviews Genetics 27:118−136 doi: 10.1038/s41576-025-00892-5

    CrossRef   Google Scholar

    [31] Gulati GS, D'Silva JP, Liu Y, Wang L, Newman AM. 2025. Profiling cell identity and tissue architecture with single-cell and spatial transcriptomics. Nature Reviews Molecular Cell Biology 26:11−31 doi: 10.1038/s41580-024-00768-2

    CrossRef   Google Scholar

    [32] Varrone M, Tavernari D, Santamaria-Martínez A, Walsh LA, Ciriello G. 2024. CellCharter reveals spatial cell niches associated with tissue remodeling and cell plasticity. Nature Genetics 56:74−84 doi: 10.1038/s41588-023-01588-4

    CrossRef   Google Scholar

    [33] Rajachandran S, Zhang X, Cao Q, Caldeira-Brant AL, Zhang X, et al. 2023. Dissecting the spermatogonial stem cell niche using spatial transcriptomics. Cell Reports 42:112737 doi: 10.1016/j.celrep.2023.112737

    CrossRef   Google Scholar

    [34] Adams GB, Martin RP, Alley IR, Chabner KT, Cohen KS, et al. 2007. Therapeutic targeting of a stem cell niche. Nature Biotechnology 25:238−243 doi: 10.1038/nbt1281

    CrossRef   Google Scholar

    [35] Lee CYC, McCaffrey J, McGovern D, Clatworthy MR. 2025. Profiling immune cell tissue niches in the spatial-omics era. The Journal of Allergy and Clinical Immunology 155:663−677 doi: 10.1016/j.jaci.2024.11.001

    CrossRef   Google Scholar

    [36] Qian J, Shao X, Bao H, Fang Y, Guo W, et al. 2025. Identification and characterization of cell niches in tissue from spatial omics data at single-cell resolution. Nature Communications 16:1693 doi: 10.1038/s41467-025-57029-9

    CrossRef   Google Scholar

    [37] Rojas-Ríos P, González-Reyes A. 2014. Concise review: The plasticity of stem cell niches: a general property behind tissue homeostasis and repair. Stem Cells 32:852−859 doi: 10.1002/stem.1621

    CrossRef   Google Scholar

    [38] Mendelson A, Frenette PS. 2014. Hematopoietic stem cell niche maintenance during homeostasis and regeneration. Nature Medicine 20:833−846 doi: 10.1038/nm.3647

    CrossRef   Google Scholar

    [39] Kanemaru K, Cranley J, Muraro D, Miranda AMA, Ho SY, et al. 2023. Spatially resolved multiomics of human cardiac niches. Nature 619:801−810 doi: 10.1038/s41586-023-06311-1

    CrossRef   Google Scholar

    [40] Ren Y, Huang Z, Zhou L, Xiao P, Song J, et al. 2023. Spatial transcriptomics reveals niche-specific enrichment and vulnerabilities of radial glial stem-like cells in malignant gliomas. Nature Communications 14:1028 doi: 10.1038/s41467-023-36707-6

    CrossRef   Google Scholar

    [41] Guilliams M, Bonnardel J, Haest B, Vanderborght B, Wagner C, et al. 2022. Spatial proteogenomics reveals distinct and evolutionarily conserved hepatic macrophage niches. Cell 185:379−396.e38 doi: 10.1016/j.cell.2021.12.018

    CrossRef   Google Scholar

    [42] Lake BB, Menon R, Winfree S, Hu Q, Melo Ferreira R, et al. 2023. An atlas of healthy and injured cell states and niches in the human kidney. Nature 619:585−594 doi: 10.1038/s41586-023-05769-3

    CrossRef   Google Scholar

    [43] Pascual-Reguant A, Kroh S, Hauser AE. 2024. Tissue niches and immunopathology through the lens of spatial tissue profiling techniques. European Journal of Immunology 54:e2350484 doi: 10.1002/eji.202350484

    CrossRef   Google Scholar

    [44] Hicks MR, Pyle AD. 2023. The emergence of the stem cell niche. Trends in Cell Biology 33:112−123 doi: 10.1016/j.tcb.2022.07.003

    CrossRef   Google Scholar

    [45] Park SY, Matte A, Jung Y, Ryu J, Anand WB, et al. 2020. Pathologic angiogenesis in the bone marrow of humanized sickle cell mice is reversed by blood transfusion. Blood 135:2071−2084 doi: 10.1182/blood.2019002227

    CrossRef   Google Scholar

    [46] Celià-Terrassa T, Kang Y. 2018. Metastatic niche functions and therapeutic opportunities. Nature Cell Biology 20:868−877 doi: 10.1038/s41556-018-0145-9

    CrossRef   Google Scholar

    [47] Chen Q, Boire A, Jin X, Valiente M, Er EE, et al. 2016. Carcinoma–astrocyte gap junctions promote brain metastasis by cGAMP transfer. Nature 533:493−498 doi: 10.1038/nature18268

    CrossRef   Google Scholar

    [48] Ma Y, Bao Y, Wang H, Jiang H, Zhou L, et al. 2024. 1H-NMR-based metabolomics to dissect the traditional Chinese medicine promotes mesenchymal stem cell homing as intervention in liver fibrosis in mouse model of Wilson's disease. The Journal of Pharmacy and Pharmacology 76:656−671 doi: 10.1093/jpp/rgae016

    CrossRef   Google Scholar

    [49] Voog J, D'Alterio C, Jones DL. 2008. Multipotent somatic stem cells contribute to the stem cell niche in the Drosophila testis. Nature 454:1132−1136 doi: 10.1038/nature07173

    CrossRef   Google Scholar

    [50] Kiel MJ, Yilmaz ÖH, Iwashita T, Yilmaz OH, Terhorst C, et al. 2005. SLAM family receptors distinguish hematopoietic stem and progenitor cells and reveal endothelial niches for stem cells. Cell 121:1109−1121 doi: 10.1016/j.cell.2005.05.026

    CrossRef   Google Scholar

    [51] Ali Abid Al-Juboori A, Ghosh A, Bin Jamaluddin MF, Kumar M, Sahoo SS, et al. 2019. Proteomic analysis of stromal and epithelial cell communications in human endometrial cancer using a unique 3D co-culture model. Proteomics 19:e1800448 doi: 10.1002/pmic.201800448

    CrossRef   Google Scholar

    [52] Bloise N, Giannaccari M, Guagliano G, Peluso E, Restivo E, et al. 2024. Growing role of 3D in vitro cell cultures in the study of cellular and molecular mechanisms: short focus on breast cancer, endometriosis, liver and infectious diseases. Cells 13:1054 doi: 10.3390/cells13121054

    CrossRef   Google Scholar

    [53] Katt ME, Placone AL, Wong AD, Xu ZS, Searson PC. 2016. In vitro tumor models: advantages, disadvantages, variables, and selecting the right platform. Frontiers in Bioengineering and Biotechnology 4:12 doi: 10.3389/fbioe.2016.00012

    CrossRef   Google Scholar

    [54] Shah S, D'Souza GGM. 2025. Modeling tumor microenvironment complexity in vitro: spheroids as physiologically relevant tumor models and strategies for their analysis. Cells 14:732 doi: 10.3390/cells14100732

    CrossRef   Google Scholar

    [55] van Dam S, Baars MJD, Vercoulen Y. 2022. Multiplex tissue imaging: spatial Revelations in the tumor microenvironment. Cancers 14:3170 doi: 10.3390/cancers14133170

    CrossRef   Google Scholar

    [56] Tang F, Barbacioru C, Wang Y, Nordman E, Lee C, et al. 2009. mRNA-Seq whole-transcriptome analysis of a single cell. Nature Methods 6:377−382 doi: 10.1038/nmeth.1315

    CrossRef   Google Scholar

    [57] Qadir MMF, Álvarez-Cubela S, Klein D, van Dijk J, Muñiz-Anquela R, et al. 2020. Single-cell resolution analysis of the human pancreatic ductal progenitor cell niche. Proceedings of the National Academy of Sciences of the United States of America 117:10876−10887 doi: 10.1073/pnas.1918314117

    CrossRef   Google Scholar

    [58] Cheng HW, Mörbe U, Lütge M, Engetschwiler C, Onder L, et al. 2022. Intestinal fibroblastic reticular cell niches control innate lymphoid cell homeostasis and function. Nature Communications 13:2027 doi: 10.1038/s41467-022-29734-2

    CrossRef   Google Scholar

    [59] Shao X, Wang Z, Wang K, Lu X, Zhang P, et al. 2024. A single-cell landscape of human liver transplantation reveals a pathogenic immune niche associated with early allograft dysfunction. Engineering 36:193−208 doi: 10.1016/j.eng.2023.12.004

    CrossRef   Google Scholar

    [60] Xie H, Lu Y, Zhang A, Zheng A, Rao B, et al. 2025. Single-cell transcriptome analyses reveal disturbed decidual microenvironment in women of advanced maternal age. Clinical and Translational Medicine 15:e70541 doi: 10.1002/ctm2.70541

    CrossRef   Google Scholar

    [61] Luca BA, Steen CB, Matusiak M, Azizi A, Varma S, et al. 2021. Atlas of clinically distinct cell states and ecosystems across human solid tumors. Cell 184:5482−5496.e28 doi: 10.1016/j.cell.2021.09.014

    CrossRef   Google Scholar

    [62] Newman AM, Steen CB, Liu CL, Gentles AJ, Chaudhuri AA, et al. 2019. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nature Biotechnology 37:773−782 doi: 10.1038/s41587-019-0114-2

    CrossRef   Google Scholar

    [63] Shi Q, Chen Y, Li Y, Qin S, Yang Y, et al. 2025. Cross-tissue multicellular coordination and its rewiring in cancer. Nature 643:529−538 doi: 10.1038/s41586-025-09053-4

    CrossRef   Google Scholar

    [64] Jerby-Arnon L, Regev A. 2022. DIALOGUE maps multicellular programs in tissue from single-cell or spatial transcriptomics data. Nature Biotechnology 40(10):1467−1477 doi: 10.1038/s41587-022-01288-0

    CrossRef   Google Scholar

    [65] Tian L, Chen F, Macosko EZ. 2023. The expanding vistas of spatial transcriptomics. Nature Biotechnology 41:773−782 doi: 10.1038/s41587-022-01448-2

    CrossRef   Google Scholar

    [66] Liao J, Lu X, Shao X, Zhu L, Fan X. 2021. Uncovering an organ's molecular architecture at single-cell resolution by spatially resolved transcriptomics. Trends in Biotechnology 39:43−58 doi: 10.1016/j.tibtech.2020.05.006

    CrossRef   Google Scholar

    [67] Moffitt JR, Lundberg E, Heyn H. 2022. The emerging landscape of spatial profiling technologies. Nature Reviews Genetics 23:741−759 doi: 10.1038/s41576-022-00515-3

    CrossRef   Google Scholar

    [68] Close JL, Long BR, Zeng H. 2021. Spatially resolved transcriptomics in neuroscience. Nature Methods 18:23−25 doi: 10.1038/s41592-020-01040-z

    CrossRef   Google Scholar

    [69] Liu L, Chen A, Li Y, Mulder J, Heyn H, et al. 2024. Spatiotemporal omics for biology and medicine. Cell 187:4488−4519 doi: 10.1016/j.cell.2024.07.040

    CrossRef   Google Scholar

    [70] de Oliveira MF, Romero JP, Chung M, Williams SR, Gottscho AD, et al. 2025. High-definition spatial transcriptomic profiling of immune cell populations in colorectal cancer. Nature Genetics 57:1512−1523 doi: 10.1038/s41588-025-02193-3

    CrossRef   Google Scholar

    [71] Liu J, Tran V, Vemuri VNP, Byrne A, Borja M, et al. 2022. Concordance of MERFISH spatial transcriptomics with bulk and single-cell RNA sequencing. Life Science Alliance 6:e202201701 doi: 10.26508/lsa.202201701

    CrossRef   Google Scholar

    [72] He S, Bhatt R, Brown C, Brown EA, Buhr DL, et al. 2022. High-plex imaging of RNA and proteins at subcellular resolution in fixed tissue by spatial molecular imaging. Nature Biotechnology 40:1794−1806 doi: 10.1038/s41587-022-01483-z

    CrossRef   Google Scholar

    [73] Milosevic V. 2023. Different approaches to Imaging Mass Cytometry data analysis. Bioinformatics Advances 3:vbad046 doi: 10.1093/bioadv/vbad046

    CrossRef   Google Scholar

    [74] Passarelli MK, Pirkl A, Moellers R, Grinfeld D, Kollmer F, et al. 2017. The 3D OrbiSIMS—label-free metabolic imaging with subcellular lateral resolution and high mass-resolving power. Nature Methods 14:1175−1183 doi: 10.1038/nmeth.4504

    CrossRef   Google Scholar

    [75] Capolupo L, Khven I, Lederer AR, Mazzeo L, Glousker G, et al. 2022. Sphingolipids control dermal fibroblast heterogeneity. Science 376:eabh1623 doi: 10.1126/science.abh1623

    CrossRef   Google Scholar

    [76] Seydel C. 2021. Single-cell metabolomics hits its stride. Nature Methods 18:1452−1456 doi: 10.1038/s41592-021-01333-x

    CrossRef   Google Scholar

    [77] Yuan Z, Zhou Q, Cai L, Pan L, Sun W, et al. 2021. SEAM is a spatial single nuclear metabolomics method for dissecting tissue microenvironment. Nature Methods 18:1223−1232 doi: 10.1038/s41592-021-01276-3

    CrossRef   Google Scholar

    [78] Llorens-Bobadilla E, Zamboni M, Marklund M, Bhalla N, Chen X, et al. 2023. Solid-phase capture and profiling of open chromatin by spatial ATAC. Nature Biotechnology 41:1085−1088 doi: 10.1038/s41587-022-01603-9

    CrossRef   Google Scholar

    [79] Deng Y, Bartosovic M, Ma S, Zhang D, Kukanja P, et al. 2022. Spatial profiling of chromatin accessibility in mouse and human tissues. Nature 609:375−383 doi: 10.1038/s41586-022-05094-1

    CrossRef   Google Scholar

    [80] Deng Y, Bartosovic M, Kukanja P, Zhang D, Liu Y, et al. 2022. Spatial-CUT&Tag: spatially resolved chromatin modification profiling at the cellular level. Science 375:681−686 doi: 10.1126/science.abg7216

    CrossRef   Google Scholar

    [81] Ma Y, Guo W, Mou Q, Shao X, Lyu M, et al. 2024. Spatial imaging of glycoRNA in single cells with ARPLA. Nature Biotechnology 42:608−616 doi: 10.1038/s41587-023-01801-z

    CrossRef   Google Scholar

    [82] Kuppe C, Ramirez Flores RO, Li Z, Hayat S, Levinson RT, et al. 2022. Spatial multi-omic map of human myocardial infarction. Nature 608:766−777 doi: 10.1038/s41586-022-05060-x

    CrossRef   Google Scholar

    [83] Avila Cobos F, Vandesompele J, Mestdagh P, De Preter K. 2018. Computational deconvolution of transcriptomics data from mixed cell populations. Bioinformatics 34:1969−1979 doi: 10.1093/bioinformatics/bty019

    CrossRef   Google Scholar

    [84] Baron M, Veres A, Wolock SL, Faust AL, Gaujoux R, et al. 2016. A single-cell transcriptomic map of the human and mouse pancreas reveals inter- and intra-cell population structure. Cell Systems 3:346−360.e4 doi: 10.1016/j.cels.2016.08.011

    CrossRef   Google Scholar

    [85] Wang X, Park J, Susztak K, Zhang NR, Li M. 2019. Bulk tissue cell type deconvolution with multi-subject single-cell expression reference. Nature Communications 10:380 doi: 10.1038/s41467-018-08023-x

    CrossRef   Google Scholar

    [86] Dong M, Thennavan A, Urrutia E, Li Y, Perou CM, et al. 2021. SCDC bulk gene expression deconvolution by multiple single-cell RNA sequencing references. Briefings in Bioinformatics 22:416−427 doi: 10.1093/bib/bbz166

    CrossRef   Google Scholar

    [87] Gong T, Szustakowski JD. 2013. DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data. Bioinformatics 29:1083−1085 doi: 10.1093/bioinformatics/btt090

    CrossRef   Google Scholar

    [88] Du R, Carey V, Weiss ST. 2019. deconvSeq: deconvolution of cell mixture distribution in sequencing data. Bioinformatics 35:5095−5102 doi: 10.1093/bioinformatics/btz444

    CrossRef   Google Scholar

    [89] Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, et al. 2015. Robust enumeration of cell subsets from tissue expression profiles. Nature Methods 12:453−457 doi: 10.1038/nmeth.3337

    CrossRef   Google Scholar

    [90] Rodriques SG, Stickels RR, Goeva A, Martin CA, Murray E, et al. 2019. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science 363:1463−1467 doi: 10.1126/science.aaw1219

    CrossRef   Google Scholar

    [91] Elosua-Bayes M, Nieto P, Mereu E, Gut I, Heyn H. 2021. SPOTlight: seeded NMF regression to deconvolute spatial transcriptomics spots with single-cell transcriptomes. Nucleic Acids Research 49:e50 doi: 10.1093/nar/gkab043

    CrossRef   Google Scholar

    [92] Zhou Z, Zhong Y, Zhang Z, Ren X. 2023. Spatial transcriptomics deconvolution at single-cell resolution using Redeconve. Nature Communications 14:7930 doi: 10.1038/s41467-023-43600-9

    CrossRef   Google Scholar

    [93] Dong R, Yuan GC. 2021. SpatialDWLS: accurate deconvolution of spatial transcriptomic data. Genome Biology 22:145 doi: 10.1186/s13059-021-02362-7

    CrossRef   Google Scholar

    [94] Danaher P, Kim Y, Nelson B, Griswold M, Yang Z, et al. 2022. Advances in mixed cell deconvolution enable quantification of cell types in spatial transcriptomic data. Nature Communications 13:385 doi: 10.1038/s41467-022-28020-5

    CrossRef   Google Scholar

    [95] Liu Z, Wu D, Zhai W, Ma L. 2023. SONAR enables cell type deconvolution with spatially weighted Poisson-Gamma model for spatial transcriptomics. Nature Communications 14:4727 doi: 10.1038/s41467-023-40458-9

    CrossRef   Google Scholar

    [96] Xun Z, Ding X, Zhang Y, Zhang B, Lai S, et al. 2023. Reconstruction of the tumor spatial microenvironment along the malignant-boundary-nonmalignant axis. Nature Communications 14:933 doi: 10.1038/s41467-023-36560-7

    CrossRef   Google Scholar

    [97] Ru B, Huang J, Zhang Y, Aldape K, Jiang P. 2023. Estimation of cell lineages in tumors from spatial transcriptomics data. Nature Communications 14:568 doi: 10.1038/s41467-023-36062-6

    CrossRef   Google Scholar

    [98] Cable DM, Murray E, Zou LS, Goeva A, Macosko EZ, et al. 2022. Robust decomposition of cell type mixtures in spatial transcriptomics. Nature Biotechnology 40:517−526 doi: 10.1038/s41587-021-00830-w

    CrossRef   Google Scholar

    [99] Andersson A, Bergenstråhle J, Asp M, Bergenstråhle L, Jurek A, et al. 2020. Single-cell and spatial transcriptomics enables probabilistic inference of cell type topography. Communications Biology 3:565 doi: 10.1038/s42003-020-01247-y

    CrossRef   Google Scholar

    [100] Kleshchevnikov V, Shmatko A, Dann E, Aivazidis A, King HW, et al. 2022. Cell2location maps fine-grained cell types in spatial transcriptomics. Nature Biotechnology 40:661−671 doi: 10.1038/s41587-021-01139-4

    CrossRef   Google Scholar

    [101] He S, Jin Y, Nazaret A, Shi L, Chen X, et al. 2025. Starfysh integrates spatial transcriptomic and histologic data to reveal heterogeneous tumor–immune hubs. Nature Biotechnology 43:223−235 doi: 10.1038/s41587-024-02173-8

    CrossRef   Google Scholar

    [102] Miller BF, Huang F, Atta L, Sahoo A, Fan J. 2022. Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data. Nature Communications 13:2339 doi: 10.1038/s41467-022-30033-z

    CrossRef   Google Scholar

    [103] Sun D, Liu Z, Li T, Wu Q, Wang C. 2022. STRIDE: accurately decomposing and integrating spatial transcriptomics using single-cell RNA sequencing. Nucleic Acids Research 50:e42 doi: 10.1093/nar/gkac150

    CrossRef   Google Scholar

    [104] Liao J, Qian J, Fang Y, Chen Z, Zhuang X, et al. 2022. De novo analysis of bulk RNA-seq data at spatially resolved single-cell resolution. Nature Communications 13:6498 doi: 10.1038/s41467-022-34271-z

    CrossRef   Google Scholar

    [105] Lopez R, Li B, Keren-Shaul H, Boyeau P, Kedmi M, et al. 2022. DestVI identifies continuums of cell types in spatial transcriptomics data. Nature Biotechnology 40:1360−1369 doi: 10.1038/s41587-022-01272-8

    CrossRef   Google Scholar

    [106] Xu H, Wang S, Fang M, Luo S, Chen C, et al. 2023. SPACEL: deep learning-based characterization of spatial transcriptome architectures. Nature Communications 14:7603 doi: 10.1038/s41467-023-43220-3

    CrossRef   Google Scholar

    [107] Song Q, Su J. 2021. DSTG: deconvoluting spatial transcriptomics data through graph-based artificial intelligence. Briefings in Bioinformatics 22:bbaa414 doi: 10.1093/bib/bbaa414

    CrossRef   Google Scholar

    [108] Li Y, Luo Y. 2024. STdGCN: spatial transcriptomic cell-type deconvolution using graph convolutional networks. Genome Biology 25:206 doi: 10.1186/s13059-024-03353-0

    CrossRef   Google Scholar

    [109] Li H, Li H, Zhou J, Gao X. 2022. SD2: spatially resolved transcriptomics deconvolution through integration of dropout and spatial information. Bioinformatics 38:4878−4884 doi: 10.1093/bioinformatics/btac605

    CrossRef   Google Scholar

    [110] Bae S, Na KJ, Koh J, Lee DS, Choi H, et al. 2022. CellDART: cell type inference by domain adaptation of single-cell and spatial transcriptomic data. Nucleic Acids Research 50:e57 doi: 10.1093/nar/gkac084

    CrossRef   Google Scholar

    [111] Bae S, Choi H, Lee DS. 2023. spSeudoMap: cell type mapping of spatial transcriptomics using unmatched single-cell RNA-seq data. Genome Medicine 15:19 doi: 10.1186/s13073-023-01168-5

    CrossRef   Google Scholar

    [112] Zhao C, Xu Z, Wang X, Tao S, MacDonald WA, et al. 2024. Innovative super-resolution in spatial transcriptomics: a transformer model exploiting histology images and spatial gene expression. Briefings in Bioinformatics 25:bbae052 doi: 10.1093/bib/bbae052

    CrossRef   Google Scholar

    [113] Lund JB, Lindberg EL, Maatz H, Pottbaecker F, Hübner N, et al. 2022. AntiSplodge: a neural-network-based RNA-profile deconvolution pipeline designed for spatial transcriptomics. NAR Genomics and Bioinformatics 4:lqac073 doi: 10.1093/nargab/lqac073

    CrossRef   Google Scholar

    [114] Charytonowicz D, Brody R, Sebra R. 2023. Interpretable and context-free deconvolution of multi-scale whole transcriptomic data with UniCell deconvolve. Nature Communications 14:1350 doi: 10.1038/s41467-023-36961-8

    CrossRef   Google Scholar

    [115] Zhan Y, Zhang Y, Hu Z, Wang Y, Zhu Z, et al. 2025. LETSmix: a spatially informed and learning-based domain adaptation method for cell-type deconvolution in spatial transcriptomics. Genome Medicine 17:16 doi: 10.1186/s13073-025-01442-8

    CrossRef   Google Scholar

    [116] Biancalani T, Scalia G, Buffoni L, Avasthi R, Lu Z, et al. 2021. Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram. Nature Methods 18:1352−1362 doi: 10.1038/s41592-021-01264-7

    CrossRef   Google Scholar

    [117] Ma Y, Zhou X. 2022. Spatially informed cell-type deconvolution for spatial transcriptomics. Nature Biotechnology 40:1349−1359 doi: 10.1038/s41587-022-01273-7

    CrossRef   Google Scholar

    [118] Georgaka S, Morgans WG, Zhao Q, Martinez DS, Ali A, et al. 2025. CellPie: a scalable spatial transcriptomics factor discovery method via joint non-negative matrix factorization. Nucleic Acids Research 53:gkaf251 doi: 10.1093/nar/gkaf251

    CrossRef   Google Scholar

    [119] Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. 2018. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nature Biotechnology 36:411−420 doi: 10.1038/nbt.4096

    CrossRef   Google Scholar

    [120] Wei R, He S, Bai S, Sei E, Hu M, et al. 2022. Spatial charting of single-cell transcriptomes in tissues. Nature Biotechnology 40:1190−1199 doi: 10.1038/s41587-022-01233-1

    CrossRef   Google Scholar

    [121] Cang Z, Nie Q. 2020. Inferring spatial and signaling relationships between cells from single cell transcriptomic data. Nature Communications 11:2084 doi: 10.1038/s41467-020-15968-5

    CrossRef   Google Scholar

    [122] Nitzan M, Karaiskos N, Friedman N, Rajewsky N. 2019. Gene expression cartography. Nature 576:132−137 doi: 10.1038/s41586-019-1773-3

    CrossRef   Google Scholar

    [123] Mages S, Moriel N, Avraham-Davidi I, Murray E, Watter J, et al. 2023. TACCO unifies annotation transfer and decomposition of cell identities for single-cell and spatial omics. Nature Biotechnology 41:1465−1473 doi: 10.1038/s41587-023-01657-3

    CrossRef   Google Scholar

    [124] Rahimi A, Vale-Silva LA, Fälth Savitski M, Tanevski J, Saez-Rodriguez J. 2024. DOT: a flexible multi-objective optimization framework for transferring features across single-cell and spatial omics. Nature Communications 15:4994 doi: 10.1038/s41467-024-48868-z

    CrossRef   Google Scholar

    [125] Shah S, Lubeck E, Zhou W, Cai L. 2016. In situ transcription profiling of single cells reveals spatial organization of cells in the mouse hippocampus. Neuron 92:342−357 doi: 10.1016/j.neuron.2016.10.001

    CrossRef   Google Scholar

    [126] Wang X, Allen WE, Wright MA, Sylwestrak EL, Samusik N, et al. 2018. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science 361:eaat5691 doi: 10.1126/science.aat5691

    CrossRef   Google Scholar

    [127] Fan J, Lu F, Qin T, Peng W, Zhuang X, et al. 2023. Multiomic analysis of cervical squamous cell carcinoma identifies cellular ecosystems with biological and clinical relevance. Nature Genetics 55:2175−2188 doi: 10.1038/s41588-023-01570-0

    CrossRef   Google Scholar

    [128] Welch JD, Kozareva V, Ferreira A, Vanderburg C, Martin C, et al. 2019. Single-cell multi-omic integration compares and contrasts features of brain cell identity. Cell 177:1873−1887.e17 doi: 10.1016/j.cell.2019.05.006

    CrossRef   Google Scholar

    [129] Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, et al. 2019. Fast, sensitive and accurate integration of single-cell data with Harmony. Nature Methods 16:1289−1296 doi: 10.1038/s41592-019-0619-0

    CrossRef   Google Scholar

    [130] Abdelaal T, Mourragui S, Mahfouz A, Reinders MJT. 2020. SpaGE: spatial gene enhancement using scRNA-seq. Nucleic Acids Research 48:e107 doi: 10.1093/nar/gkaa740

    CrossRef   Google Scholar

    [131] Lopez R, Nazaret A, Langevin M, Samaran J, Regier J, et al. 2019. A joint model of unpaired data from scRNA-seq and spatial transcriptomics for imputing missing gene expression measurements. arXiv 00:1905.02269 doi: 10.48550/arXiv.1905.02269

    CrossRef   Google Scholar

    [132] Chen S, Zhang B, Chen X, Zhang X, Jiang R. 2021. stPlus: a reference-based method for the accurate enhancement of spatial transcriptomics. Bioinformatics 37:i299−i307 doi: 10.1093/bioinformatics/btab298

    CrossRef   Google Scholar

    [133] Wan X, Xiao J, Tam SST, Cai M, Sugimura R, et al. 2023. Integrating spatial and single-cell transcriptomics data using deep generative models with SpatialScope. Nature Communications 14:7848 doi: 10.1038/s41467-023-43629-w

    CrossRef   Google Scholar

    [134] Yang P, Jin L, Liao J, Jin K, Shao X, et al. 2023. Revealing spatial multimodal heterogeneity in tissues with SpaTrio. Cell Genomics 3:100446 doi: 10.1016/j.xgen.2023.100446

    CrossRef   Google Scholar

    [135] Yang P, Jin K, Yao Y, Jin L, Shao X, et al. 2025. Spatial integration of multi-omics single-cell data with SIMO. Nature Communications 16:1265 doi: 10.1038/s41467-025-56523-4

    CrossRef   Google Scholar

    [136] Vahid MR, Brown EL, Steen CB, Zhang W, Jeon HS, et al. 2023. High-resolution alignment of single-cell and spatial transcriptomes with CytoSPACE. Nature Biotechnology 41:1543−1548 doi: 10.1038/s41587-023-01697-9

    CrossRef   Google Scholar

    [137] Zhu Q, Shah S, Dries R, Cai L, Yuan GC. 2018. Identification of spatially associated subpopulations by combining scRNAseq and sequential fluorescence in situ hybridization data. Nature Biotechnology 36:1183−1190 doi: 10.1038/nbt.4260

    CrossRef   Google Scholar

    [138] Liu W, Liao X, Yang Y, Lin H, Yeong J, et al. 2022. Joint dimension reduction and clustering analysis of single-cell RNA-seq and spatial transcriptomics data. Nucleic Acids Research 50:e72 doi: 10.1093/nar/gkac219

    CrossRef   Google Scholar

    [139] Zhao E, Stone MR, Ren X, Guenthoer J, Smythe KS, et al. 2021. Spatial transcriptomics at subspot resolution with BayesSpace. Nature Biotechnology 39:1375−1384 doi: 10.1038/s41587-021-00935-2

    CrossRef   Google Scholar

    [140] Li Z, Zhou X. 2022. BASS: multi-scale and multi-sample analysis enables accurate cell type clustering and spatial domain detection in spatial transcriptomic studies. Genome Biology 23:168 doi: 10.1186/s13059-022-02734-7

    CrossRef   Google Scholar

    [141] Liu W, Liao X, Luo Z, Yang Y, Lau MC, et al. 2023. Probabilistic embedding, clustering, and alignment for integrating spatial transcriptomics data with PRECAST. Nature Communications 14:296 doi: 10.1038/s41467-023-35947-w

    CrossRef   Google Scholar

    [142] Chidester B, Zhou T, Alam S, Ma J. 2023. SpiceMix enables integrative single-cell spatial modeling of cell identity. Nature Genetics 55:78−88 doi: 10.1038/s41588-022-01256-z

    CrossRef   Google Scholar

    [143] Cable DM, Murray E, Shanmugam V, Zhang S, Zou LS, et al. 2022. Cell type-specific inference of differential expression in spatial transcriptomics. Nature Methods 19:1076−1087 doi: 10.1038/s41592-022-01575-3

    CrossRef   Google Scholar

    [144] Mason K, Sathe A, Hess PR, Rong J, Wu CY, et al. 2024. Niche-DE niche-differential gene expression analysis in spatial transcriptomics data identifies context-dependent cell-cell interactions. Genome Biology 25:14 doi: 10.1186/s13059-023-03159-6

    CrossRef   Google Scholar

    [145] Kim J, Rustam S, Mosquera JM, Randell SH, Shaykhiev R, et al. 2022. Unsupervised discovery of tissue architecture in multiplexed imaging. Nature Methods 19:1653−1661 doi: 10.1038/s41592-022-01657-2

    CrossRef   Google Scholar

    [146] He Y, Tang X, Huang J, Ren J, Zhou H, et al. 2021. ClusterMap for multi-scale clustering analysis of spatial gene expression. Nature Communications 12:5909 doi: 10.1038/s41467-021-26044-x

    CrossRef   Google Scholar

    [147] Singhal V, Chou N, Lee J, Yue Y, Liu J, et al. 2024. BANKSY unifies cell typing and tissue domain segmentation for scalable spatial omics data analysis. Nature Genetics 56:431−441 doi: 10.1038/s41588-024-01664-3

    CrossRef   Google Scholar

    [148] Hu J, Li X, Coleman K, Schroeder A, Ma N, et al. 2021. SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nature Methods 18:1342−1351 doi: 10.1038/s41592-021-01255-8

    CrossRef   Google Scholar

    [149] Wang B, Luo J, Liu Y, Shi W, Xiong Z, et al. 2023. Spatial-MGCN: a novel multi-view graph convolutional network for identifying spatial domains with attention mechanism. Briefings in Bioinformatics 24:bbad262 doi: 10.1093/bib/bbad262

    CrossRef   Google Scholar

    [150] Shi X, Zhu J, Long Y, Liang C. 2023. Identifying spatial domains of spatially resolved transcriptomics via multi-view graph convolutional networks. Briefings in Bioinformatics 24:bbad278 doi: 10.1093/bib/bbad278

    CrossRef   Google Scholar

    [151] Dong K, Zhang S. 2022. Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder. Nature Communications 13:1739 doi: 10.1038/s41467-022-29439-6

    CrossRef   Google Scholar

    [152] Zhou X, Dong K, Zhang S. 2023. Integrating spatial transcriptomics data across different conditions, technologies and developmental stages. Nature Computational Science 3:894−906 doi: 10.1038/s43588-023-00528-w

    CrossRef   Google Scholar

    [153] Lin X, Gao L, Whitener N, Ahmed A, Wei Z. 2022. A model-based constrained deep learning clustering approach for spatially resolved single-cell data. Genome Research 32:1906−1917 doi: 10.1101/gr.276477.121

    CrossRef   Google Scholar

    [154] Li J, Chen S, Pan X, Yuan Y, Shen HB. 2022. Cell clustering for spatial transcriptomics data with graph neural networks. Nature Computational Science 2:399−408 doi: 10.1038/s43588-022-00266-5

    CrossRef   Google Scholar

    [155] Ren H, Walker BL, Cang Z, Nie Q. 2022. Identifying multicellular spatiotemporal organization of cells with SpaceFlow. Nature Communications 13:4076 doi: 10.1038/s41467-022-31739-w

    CrossRef   Google Scholar

    [156] Guo T, Yuan Z, Pan Y, Wang J, Chen F, et al. 2023. SPIRAL: integrating and aligning spatially resolved transcriptomics data across different experiments, conditions, and technologies. Genome Biology 24:241 doi: 10.1186/s13059-023-03078-6

    CrossRef   Google Scholar

    [157] Xu H, Fu H, Long Y, Ang KS, Sethi R, et al. 2024. Unsupervised spatially embedded deep representation of spatial transcriptomics. Genome Medicine 16:12 doi: 10.1186/s13073-024-01283-x

    CrossRef   Google Scholar

    [158] Xu C, Jin X, Wei S, Wang P, Luo M, et al. 2022. DeepST: identifying spatial domains in spatial transcriptomics by deep learning. Nucleic Acids Research 50:e131 doi: 10.1093/nar/gkac901

    CrossRef   Google Scholar

    [159] Huo Y, Guo Y, Wang J, Xue H, Feng Y, et al. 2023. Integrating multi-modal information to detect spatial domains of spatial transcriptomics by graph attention network. Journal of Genetics and Genomics 50:720−733 doi: 10.1016/j.jgg.2023.06.005

    CrossRef   Google Scholar

    [160] Wang L, Hu Y, Xiao K, Zhang C, Shi Q, et al. 2024. Multi-modal domain adaptation for revealing spatial functional landscape from spatially resolved transcriptomics. Briefings in Bioinformatics 25:bbae257 doi: 10.1093/bib/bbae257

    CrossRef   Google Scholar

    [161] Long Y, Ang KS, Li M, Chong KLK, Sethi R, et al. 2023. Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphST. Nature Communications 14:1155 doi: 10.1038/s41467-023-36796-3

    CrossRef   Google Scholar

    [162] Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. 2008. Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment 2008:P10008 doi: 10.1088/1742-5468/2008/10/p10008

    CrossRef   Google Scholar

    [163] Traag VA, Waltman L, van Eck NJ. 2019. From Louvain to leiden: guaranteeing well-connected communities. Scientific Reports 9:5233 doi: 10.1038/s41598-019-41695-z

    CrossRef   Google Scholar

    [164] Miller BF, Bambah-Mukku D, Dulac C, Zhuang X, Fan J. 2021. Characterizing spatial gene expression heterogeneity in spatially resolved single-cell transcriptomic data with nonuniform cellular densities. Genome Research 31:1843−1855 doi: 10.1101/gr.271288.120

    CrossRef   Google Scholar

    [165] Wu Z, Kondo A, McGrady M, Baker EAG, Chidester B, et al. 2024. Discovery and generalization of tissue structures from spatial omics data. Cell Reports Methods 4:100838 doi: 10.1016/j.crmeth.2024.100838

    CrossRef   Google Scholar

    [166] Müller-Bötticher N, Sahay S, Eils R, Ishaque N. 2025. SpatialLeiden: spatially aware Leiden clustering. Genome Biology 26:24 doi: 10.1186/s13059-025-03489-7

    CrossRef   Google Scholar

    [167] Schürch CM, Bhate SS, Barlow GL, Phillips DJ, Noti L, et al. 2020. Coordinated cellular neighborhoods orchestrate antitumoral immunity at the colorectal cancer invasive front. Cell 183:838 doi: 10.1016/j.cell.2020.10.021

    CrossRef   Google Scholar

    [168] Chen Z, Soifer I, Hilton H, Keren L, Jojic V. 2020. Modeling multiplexed images with spatial-LDA reveals novel tissue microenvironments. Journal of Computational Biology 27:1204−1218 doi: 10.1089/cmb.2019.0340

    CrossRef   Google Scholar

    [169] Hu Y, Rong J, Xu Y, Xie R, Peng J, et al. 2024. Unsupervised and supervised discovery of tissue cellular neighborhoods from cell phenotypes. Nature Methods 21:267−278 doi: 10.1038/s41592-023-02124-2

    CrossRef   Google Scholar

    [170] Yuan Z, Zhao F, Lin S, Zhao Y, Yao J, et al. 2024. Benchmarking spatial clustering methods with spatially resolved transcriptomics data. Nature Methods 21:712−722 doi: 10.1038/s41592-024-02215-8

    CrossRef   Google Scholar

    [171] Chen R, Yao Y, Qian J, Peng X, Shao X, et al. 2025. A comprehensive benchmarking for spatially resolved transcriptomics clustering methods across variable technologies, organs, and replicates. iMeta 4:e70084 doi: 10.1002/imt2.70084

    CrossRef   Google Scholar

    [172] Asiry S, Kim G, Filippou PS, Sanchez LR, Entenberg D, et al. 2021. The cancer cell dissemination machinery as an immunosuppressive niche: a new obstacle towards the era of cancer immunotherapy. Frontiers in Immunology 12:654877 doi: 10.3389/fimmu.2021.654877

    CrossRef   Google Scholar

    [173] Gatenbee CD, Baker AM, Schenck RO, Strobl M, West J, et al. 2022. Immunosuppressive niche engineering at the onset of human colorectal cancer. Nature Communications 13:1798 doi: 10.1038/s41467-022-29027-8

    CrossRef   Google Scholar

    [174] Zou H, Xiong W, He Y. 2025. Single-cell and spatial transcriptome-based metabolism-immunity interaction network and therapeutic target discovery of matrine in cervical cancer. Naunyn-Schmiedeberg's Archives of Pharmacology doi: 10.1007/s00210-025-04801-9

    CrossRef   Google Scholar

    [175] Yu D, Yang P, Lu X, Huang S, Liu L, et al. 2023. Single-cell RNA sequencing reveals enhanced antitumor immunity after combined application of PD-1 inhibitor and Shenmai injection in non-small cell lung cancer. Cell Communication and Signaling 21:169 doi: 10.1186/s12964-023-01184-3

    CrossRef   Google Scholar

    [176] Lin W, Chen X, Wang D, Lu R, Zhang C, et al. 2023. Single-nucleus ribonucleic acid-sequencing and spatial transcriptomics reveal the cardioprotection of Shexiang Baoxin Pill (SBP) in mice with myocardial ischemia-reperfusion injury. Frontiers in Pharmacology 14:1173649 doi: 10.3389/fphar.2023.1173649

    CrossRef   Google Scholar

    [177] Liu J, Zhang Q, Wong YK, Luo P, Chen J, et al. 2024. Single-cell transcriptomics reveals the ameliorative effect of oridonin on septic liver injury. Advanced Biology 8:2300542 doi: 10.1002/adbi.202300542

    CrossRef   Google Scholar

    [178] Jin K, Gao S, Yang P, Guo R, Li D, et al. 2022. Single-cell RNA sequencing reveals the temporal diversity and dynamics of cardiac immunity after myocardial infarction. Small Methods 6:2100752 doi: 10.1002/smtd.202100752

    CrossRef   Google Scholar

    [179] Shu J, Qin D, Tu W, Zhao D, Yang Q, et al. 2026. Single-cell and spatially resolved transcriptomics elucidate the therapeutic mechanism of Tripterygium wilfordii Polyglycosidium in ulcerative colitis. Phytomedicine 150:157569 doi: 10.1016/j.phymed.2025.157569

    CrossRef   Google Scholar

    [180] Zhu Y, Zhao L, Yan W, Ma H, Zhao W, et al. 2025. Celastrol directly targets LRP1 to inhibit fibroblast-macrophage crosstalk and ameliorates psoriasis progression. Acta Pharmaceutica Sinica B 15:876−891 doi: 10.1016/j.apsb.2024.12.041

    CrossRef   Google Scholar

    [181] Xin J, Yang T, Wu X, Wu Y, Liu Y, et al. 2023. Spatial transcriptomics analysis of zone-dependent hepatic ischemia-reperfusion injury murine model. Communications Biology 6:194 doi: 10.1038/s42003-023-04564-0

    CrossRef   Google Scholar

    [182] Lu R, Sun W, Lin W, Zhu S, Wang D, et al. 2025. Fufang Danshen Pill improves mitochondrial homeostasis by regulating the S100a9/TLR4 axis to alleviate myocardial ischemia-reperfusion injury. Phytomedicine 148:157433 doi: 10.1016/j.phymed.2025.157433

    CrossRef   Google Scholar

    [183] Subramanian A, Nemat-Gorgani N, Ellis-Caleo TJ, van IJzendoorn DGP, Sears TJ, et al. 2024. Sarcoma microenvironment cell states and ecosystems are associated with prognosis and predict response to immunotherapy. Nature Cancer 5:642−658 doi: 10.1038/s43018-024-00743-y

    CrossRef   Google Scholar

    [184] Xiao M, Hong S, Peng P, Cai S, Huang Y, et al. 2024. Co-delivery of protopanaxatriol/icariin into niche cells restores bone marrow niches to rejuvenate HSCs for chemotherapy-induced myelosuppression. Phytomedicine 134:155978 doi: 10.1016/j.phymed.2024.155978

    CrossRef   Google Scholar

    [185] Du J, Yang YC, An ZJ, Zhang MH, Fu XH, et al. 2023. Advances in spatial transcriptomics and related data analysis strategies. Journal of Translational Medicine 21:330 doi: 10.1186/s12967-023-04150-2

    CrossRef   Google Scholar

    [186] De Jonghe J, Opzoomer JW, Vilas-Zornoza A, Nilges BS, Crane P, et al. 2024. scTrends: a living review of commercial single-cell and spatial 'omic technologies. Cell Genomics 4:100723 doi: 10.1016/j.xgen.2024.100723

    CrossRef   Google Scholar

    [187] Zhang D, Schroeder A, Yan H, Yang H, Hu J, et al. 2024. Inferring super-resolution tissue architecture by integrating spatial transcriptomics with histology. Nature Biotechnology 42:1372−1377 doi: 10.1038/s41587-023-02019-9

    CrossRef   Google Scholar

    [188] Yang J, Zheng Z, Jiao Y, Yu K, Bhatara S, et al. 2025. Spotiphy enables single-cell spatial whole transcriptomics across an entire section. Nature Methods 22:724−736 doi: 10.1038/s41592-025-02622-5

    CrossRef   Google Scholar

    [189] He HF, Peng P, Yang ST, Wang MG, Zhang XF, et al. 2026. Unlocking single-cell level and continuous whole-slide insights in spatial transcriptomics with PanoSpace. Nature Computational Science 00: 1−14 doi: 10.1038/s43588-025-00938-y

    CrossRef   Google Scholar

    [190] Ge S, Sun S, Xu H, Cheng Q, Ren Z. 2025. Deep learning in single-cell and spatial transcriptomics data analysis: advances and challenges from a data science perspective. Briefings in Bioinformatics 26:bbaf136 doi: 10.1093/bib/bbaf136

    CrossRef   Google Scholar

    [191] Velten B, Stegle O. 2023. Principles and challenges of modeling temporal and spatial omics data. Nature Methods 20:1462−1474 doi: 10.1038/s41592-023-01992-y

    CrossRef   Google Scholar

    [192] Hou ZK, Hu W, Liu FB, Liang YY, Huang ZY, et al. 2020. Reviews and thoughts on the relevance between qualitative Chinese medicinal properties and quantitative material components. Evidence-Based Complementary and Alternative Medicine 2020:8643746 doi: 10.1155/2020/8643746

    CrossRef   Google Scholar

    [193] Zhou J, Zhou T, Chen M, Jiang M, Wang X, et al. 2014. Research progress on synergistic anti-tumor mechanisms of compounds in Traditional Chinese Medicine. Journal of Traditional Chinese Medicine 34:100−105 doi: 10.1016/S0254-6272(14)60062-5

    CrossRef   Google Scholar

    [194] Zhang L, Yan J, Liu X, Ye Z, Yang X, et al. 2012. Pharmacovigilance practice and risk control of Traditional Chinese Medicine drugs in China: current status and future perspective. Journal of Ethnopharmacology 140:519−525 doi: 10.1016/j.jep.2012.01.058

    CrossRef   Google Scholar

    [195] Miao Z, Humphreys BD, McMahon AP, Kim J. 2021. Multi-omics integration in the age of million single-cell data. Nature Reviews Nephrology 17:710−724 doi: 10.1038/s41581-021-00463-x

    CrossRef   Google Scholar

    [196] Vandereyken K, Sifrim A, Thienpont B, Voet T. 2023. Methods and applications for single-cell and spatial multi-omics. Nature Reviews Genetics 24:494−515 doi: 10.1038/s41576-023-00580-2

    CrossRef   Google Scholar

    [197] Shao X, Lu X, Liao J, Chen H, Fan X. 2020. New avenues for systematically inferring cell-cell communication: through single-cell transcriptomics data. Protein & Cell 11:866−880 doi: 10.1007/s13238-020-00727-5

    CrossRef   Google Scholar

    [198] Liberali P, Snijder B, Pelkmans L. 2015. Single-cell and multivariate approaches in genetic perturbation screens. Nature Reviews Genetics 16:18−32 doi: 10.1038/nrg3768

    CrossRef   Google Scholar

    [199] Bunne C, Roohani Y, Rosen Y, Gupta A, Zhang X, et al. 2024. How to build the virtual cell with artificial intelligence: priorities and opportunities. Cell 187:7045−7063 doi: 10.1016/j.cell.2024.11.015

    CrossRef   Google Scholar

    [200] Ling S, Xu JW. 2013. Model organisms and traditional Chinese medicine syndrome models. Evidence-Based Complementary and Alternative Medicine 2013:761987 doi: 10.1155/2013/761987

    CrossRef   Google Scholar

  • Cite this article

    Qian J, Bao H, Yang H, Zhang J, Shao X, et al. 2026. AI-driven deciphering of cell niches with single-cell and spatial omics: a new perspective for traditional Chinese medicine research. Targetome 2(2): e009 doi: 10.48130/targetome-0026-0009
    Qian J, Bao H, Yang H, Zhang J, Shao X, et al. 2026. AI-driven deciphering of cell niches with single-cell and spatial omics: a new perspective for traditional Chinese medicine research. Targetome 2(2): e009 doi: 10.48130/targetome-0026-0009

Figures(3)  /  Tables(4)

Article Metrics

Article views(995) PDF downloads(115)

INVITED REVIEW   Open Access    

AI-driven deciphering of cell niches with single-cell and spatial omics: a new perspective for traditional Chinese medicine research

Targetome  2 Article number: e009  (2026)  |  Cite this article

Abstract: The therapeutic effects of traditional Chinese medicine (TCM) are characterized by holistic and systematic regulation of the organism, typically achieved through coordinating tissue microenvironments and multicellular interactions. However, conventional molecular biology approaches struggle to precisely characterize cellular composition, spatial architecture, and associated microenvironment (cell niche) at the tissue level, constraining a comprehensive understanding of the mechanism of action of TCM. In recent years, the rapid advancement of single-cell and spatial omics technologies, together with artificial intelligence (AI)-driven computational methods, has enabled systematic deciphering of cell niches within complex tissues, offering new opportunities for TCM research. Centered on the cell niche, this review outlines its conceptual development and research progress, with a particular focus on recent advances in AI-assisted cell niche analysis based on single-cell and spatial omics data. We further summarize representative scenarios of cell niche analysis in TCM research, while discussing current challenges and future directions, highlighting its potential to provide a new perspective and analytical paradigm in TCM.

    • Traditional Chinese medicine (TCM) is a comprehensive medical and ethnopharmacological system formed by the Chinese nation through long-term disease prevention and treatment practices. Unlike the conventional 'one drug, one target, one disease' paradigm, the therapeutic effects of TCM are characterized by holistic and systematic regulation of the organism, which is typically achieved through a 'multi-component, multi-target, and multi-pathway' mode of action[1]. In this process, the global modulation of tissue microenvironments and the coordinated changes among multiple cell types by TCM interventions are widely considered crucial foundations for its efficacy[2,3].

      To address the 'multi-component, multi-target, and multi-pathway' nature of TCM, systems biology approaches such as network pharmacology have been extensively applied in TCM research. For example, by constructing 'herbs-components-targets-pathways-diseases' comprehensive biological networks, network pharmacology has provided valuable insights into the molecular basis of the holistic regulatory effects of TCM[46]. Nevertheless, such approaches primarily focus on distinct signaling pathways or target networks regulated by TCM, exhibiting great limitations in capturing cellular heterogeneity, multicellular coordination, and the dynamic reshaping process of the spatial microenvironment at the tissue level induced by TCM interventions.

      Recent advances in single-cell and spatial omics technologies[716] have provided unprecedented opportunities to characterize cellular composition and spatial architecture within complex tissues. These technologies enable high-resolution profiling of cellular heterogeneity while directly linking cellular states to their spatial contexts. Meanwhile, progress in artificial intelligence (AI) has driven continuous innovation in computational methods, making it possible to systematically decode multi-level heterogeneity from these single-cell and spatial omics data. For instance, Shao et al. developed scDeepSort, a graph neural network (GNN)-based pretrained model for cell type annotation of single-cell RNA sequencing (scRNA-seq) data, which enables robust annotation of cell types across different tissues and conditions, including normal, disease, and drug-treated states[17]. Qian et al. proposed scSpace, a transfer-learning-based method for spatial reconstruction of cells and identification of spatially variated cell subpopulations, providing new insights into cellular spatial organization[18]. Recently, generative AI models, such as variational autoencoders (VAE), diffusion models, and transformer-based large language models (LLMs), have also been integrated into single-cell and spatial omics data analyses, further advancing studies on heterogeneity modeling[1923], trajectory inference[2426], and cell-cell interactions[27,28].

      In this context, increasing attention has been directed toward cell niche analysis. As a fundamental unit that characterizes the cellular composition, spatial architecture, and functional states within specific tissue regions, the cell niche has been increasingly proven to represent a comprehensive intervention unit that transcends individual molecules or cell types, serving as a novel functional multicellular target for disease modulation and intervention[29,30]. This perspective extends the study of single cells to multicellular hubs of interacting cells[31], facilitating a deeper understanding of a wide variety of biological mechanisms such as cell plasticity[32], fate determination[33], and drug actions[34]. Crucially, the emphasis of cell niche analysis on multicellular coordination and spatial microenvironmental dependence is highly consistent with the theoretical basis of holistic regulation and systemic intervention of TCM. Consequently, adopting a cell niche-centered perspective is expected to provide a more physiologically and pathologically relevant analytical framework for elucidating the mechanisms of action of TCM.

      In this review, we describe the conceptual development and research progress of the cell niche, focusing on recent advances in cell niche deciphering based on single-cell and spatial omics data with AI-driven computational methods. We also summarize representative scenarios of cell niche analysis in TCM research and discuss its potential value in elucidating the holistic regulatory mechanisms of TCM. Finally, we address current challenges and future directions, highlighting key limitations at the data, methodology, and application levels, and outlining emerging opportunities to advance interpretable, robust, and translational cell niche analytical frameworks for TCM research.

    • The concept of a niche in science originates from ecology, where it was used to describe the natural environment in which a species thrives, together with the environmental characteristics that support the functional role and long-term maintenance of the species[35]. With the advancement of research, this concept has gradually been extended to the life sciences to characterize the local environment or spatially proximal organization surrounding cells, referred to as the cell niche, also commonly termed the cellular microenvironment or spatial domain[36]. As a higher-order functional unit formed at the tissue level, the cell niche finely regulates essential cellular behaviors, including growth, proliferation, differentiation, and apoptosis, and plays a critical role in determining various biological processes, such as maintaining tissue homeostasis[3739] and shaping disease progression[4042]. Increasing evidence indicates that many complex pathological phenotypes cannot be fully explained by the abnormalities of a single cell type or molecular pathway, but instead arise from the coordinated dysfunction of multiple cellular components and their interactions within a specific cell niche[43]. This understanding positions the cell niche as a potential and an intervenable functional multicellular target, providing a crucial research level for understanding disease mechanisms and drug action.

      The crucial regulatory role of cell niches in various physiological and pathological contexts has been systematically validated. For example, the stem cell niche controls stem cell self-renewal and differentiation precisely by providing specific signaling pathways and structural support, which is essential for proper organogenesis and tissue homeostasis[44]. Dysregulation of this cell niche may lead to tissue-specific dysfunction and disease susceptibility. Specifically, in the hematopoietic system, sickle cell disease (SCD) can cause alterations in the quantity and transcriptional state of various cellular components within the bone marrow vascular niche, thereby impairing hematopoiesis and exacerbating anemia. Studies have shown that intervening in the abnormal state of this cell niche can effectively rescue the vascular defects in SCD, highlighting the potential value of targeting cell niches in disease intervention[45].

      The central role of cell niches in disease pathology further underscores their importance in drug research and therapeutic intervention. Unlike traditional strategies that focus on single molecular targets or individual cell types, targeting disease-associated cell niches offers the potential to globally intervene in multicellular coordination of regulatory networks, thereby enhancing therapeutic efficacy and reducing the risk of drug resistance. For example, in tumor tissues, the metastatic cell niche preserves and protects the cancer stem cell phenotype and cellular plasticity from differentiation cues, thereby promoting immune evasion and undermining the efficacy of conventional therapeutic interventions[46]. Related studies have found that targeting key interactions between tumor cells and astrocytes in brain metastatic cell niches can significantly inhibit metastatic lesion formation, providing direct evidence for cell niche-guided therapeutic strategies[47]. On the other hand, in the context of complex therapeutic interventions such as TCM, cell niche analysis provides a more comprehensive and consistent theoretical framework for elucidating the pharmacological mechanisms underlying such multi-target and multi-pathway synergistic regulation. For example, in the liver fibrosis model, Ma et al.[48] observed that the combination of the TCM formula Bushen Huoxue Huazhuo Formula (BSHXHZF) with bone marrow mesenchymal stem cells (BMSCs) transplantation significantly enhances the colonization of BMSCs in liver tissue, reduces hepatocyte apoptosis, suppresses pathological injury, and decreases fibrosis markers, surpassing monotherapy effects. Mechanistic investigations suggest that this synergistic effect is closely associated with the systematic remodeling of the fibrotic liver niche, including reduction of oxidative stress, inhibition of key pro-fibrotic signaling pathways (such as TGF-β1), and promotion of pro-regenerative chemokines, thus facilitating BMSCs engraftment and paracrine activity.

      Early studies on the cell niche primarily relied on experimental techniques such as in vitro cell co-culture systems and tissue imaging approaches (Fig. 1). Through live imaging of the Drosophila testis, Voog et al. discovered that somatic stem cells in the Drosophila testis not only contribute to the formation of the somatic cyst cell lineage, but also participate in the establishment and maintenance of the cell niche at the apical hub, revealing the dynamic regulatory relationship between stem cells and their niche[49]. Kiel et al. used immunofluorescence analysis to uncover the spatial heterogeneity of hematopoietic stem cells (HSCs) niches within the bone marrow, showing that HSCs preferentially associate with either sinusoidal endothelial cells or the endosteum in distinct anatomical locations[50]. In addition, Al-Juboori et al. employed in vitro co-culture systems of endometrial stromal cells with cancerous epithelial cells, combined with proteomics analysis, to investigate the cell-cell adhesion and reciprocal epithelium-stroma dialogue, and identified associated signaling pathways in the tumor niches of human endometrial cancer[51]. Collectively, these studies advanced our understanding of the structural and functional characteristics of the cell niche, and provided valuable experimental evidence supporting the concept of niche-dependent regulation of cellular behavior. However, owing to intrinsic limitations in experimental design and technical resolution, these approaches generally fail to characterize the cell niches in complex tissues comprehensively. A major limitation of in vitro cell co-culture systems lies in their oversimplification of the complex multicellular composition and tissue architecture present in vivo. Most existing co-culture models involve only two or a limited number of cell types, making it difficult to recapitulate the multicellular complexity of cell niches[52]. In addition, although these systems offer experimental controllability for dissecting cell-cell signaling, they remain limited in capturing spatially constrained, tissue- or microenvironment-specific interactions. Consequently, in vitro cell co-culture systems have a restricted capacity to explain the spatially selective therapeutic effects of drug interventions, such as niche-specific regulatory observed in the tumor microenvironment or ischemic border zones[53,54]. In contrast, tissue imaging approaches preserve spatial context to some extent, but are constrained by limited molecular detection throughput, depth, and resolution. These methods typically rely on a small set of predefined markers[55], making it difficult to characterize the subtle changes of cellular states as well as the complex composition of cell niches comprehensively in physiological or pathological tissues. Moreover, both in vitro cell co-culture systems and tissue imaging approaches are predominantly static, providing limited insight into the spatiotemporal interactions and dynamic remodeling of cell niches during disease progression or therapeutic intervention. Collectively, these limitations hinder the ability of traditional methodologies in cell niche analysis.

      Figure 1. 

      Overview of advances in cell niche research. With the emergence of single-cell and spatial omics technologies, along with advances in AI-driven computational methods, cell niche research has gradually evolved from early qualitative descriptions based on experimental observations toward quantitative and systematic analysis that integrate multidimensional molecular characteristics and spatial information.

      In recent years, the emergence and rapid development of single-cell and spatial omics technologies have enabled the acquisition of molecular profiles at single-cell and spatial resolution, providing critical technical foundations and data resources for cell niche analysis. Leveraging these high-resolution datasets, cell niche research has gradually evolved from largely qualitative, hypothesis-driven investigations toward quantitative and systematic analysis, allowing systematic exploration of cellular states, functions, and interactions in the tissue context at spatiotemporal resolutions. At the same time, the increasing scale and complexity of single-cell and spatial omics data has driven diverse analytical strategies and relevant AI methods in this field. Therefore, the following section focuses on recent advances in deciphering cell niches from single-cell and spatial omics data with AI-driven approaches.

    • Since the establishment of the first scRNA-seq protocol[56] in 2009, single-cell omics technologies have profoundly transformed our understanding of cellular heterogeneity and gene expression dynamics across species, tissues, and developmental stages. These technologies have enabled systematic exploration of cellular diversity and state transitions across a wide range of developmental and disease contexts, providing in-depth insights for understanding complex biological processes. Against this backdrop, early studies primarily employed two strategies to infer cell niches using the single-cell data (Fig. 2a). The first strategy treated the acquired tissue sample as a single cell niche and focused on analyzing its cellular composition and functional characteristics systematically at the single-cell resolution. For example, Qadir et al. examined the single-cell transcriptome of the human ductal tree and revealed lineage-specific differentiation potentials of progenitor-like cells in this cell niche[57]. Cheng et al. performed single-cell transcriptomic analysis of the mouse intestinal lamina propria and identified fibroblast subpopulations in the cell niche that control innate lymphoid cell homeostasis and functionality[58]. However, while this strategy partially deciphers cell niches, it simplifies the entire tissue into a homogeneous cell niche, failing to capture the heterogeneity of cell niches within the tissue.

      Figure 2. 

      Schematic pipeline for inferring cell niches from single-cell and spatial omics data. (a) Cell niche analysis based on single-cell omics data. Given the loss of spatial location information, cell niche inference from single-cell omics data mainly relies on systematic characterization of cellular composition and functional states at single-cell resolution, indirect inference of multicellular coordination states through changes in cell type proportions or CCIs, and machine learning-based identification of recurrent multicellular coordination states across large-scale single-cell cohorts. (b), (c) Cell niche analysis based on spatial omics data. In contrast to single-cell omics, spatial omics technologies explicitly preserve spatial context, enabling direct modeling of spatial proximity, cellular composition, and molecular interactions within tissues to delineate cell niches. Existing approaches can be broadly categorized into two conceptual paradigms, where the first (b), focuses on spatial inference of cell niches based on reconstructed high-resolution cellular composition and molecular features, and the second (c), emphasizes direct modeling and identification of cell niches by jointly utilizing the molecular profiles and spatial information from spatial omics data.

      Another more refined strategy infers multicellular coordination states under different conditions by examining changes in cell type proportions or cell-cell interactions (CCI), thereby identifying potential cell niches. This approach has been widely applied to single-cell datasets spanning distinct disease states or interventions. For example, Shao et al. analyzed single-cell transcriptomics on 12 human liver samples procured from four early allograft dysfunction (EAD) and three non-EAD patients post-liver transplantation surgery. Through cell proportion and CCI analysis, they discovered that MAIT cells, GZMB+ GZMK+ NK cells, and S100A12+ neutrophils were significantly enriched and highly activated in EAD patients, revealing an EAD-associated pathogenic immune cell niche[59]. Similarly, Xie et al. analyzed single-cell transcriptomics of first-trimester decidua from women of advanced maternal age (AMA) and controlled reproductive age, uncovering a pathologically remodeled decidual cell niche in the AMA samples characterized by aberrant activation of fibrotic signaling pathways, immune cell dysfunction, and disrupted intercellular communication[60].

      With the increasing availability of large-scale single-cell datasets containing hundreds to thousands of samples, computational methods specifically designed to identify recurrent multicellular coordination states across samples or individuals have been developed to infer conserved cell niches (Fig. 2a). Specifically, Luca et al. introduced EcoTyper[61], a machine learning framework that combines statistical learning techniques with the gene expression deconvolution algorithm CIBERSORTx[62] to systematically define cell states and potential cell niches from gene expression data. Applying EcoTyper to large-scale scRNA-seq and bulk RNA-seq datasets spanning 12 major cell lineages across 16 types of human carcinoma, the authors identified 69 transcriptionally-defined cell states and further delineated 10 cohesive cell niches conserved across cancer types and patient samples. Importantly, these cell niches exhibited improved predictive power of the clinical outcome compared with conventional biomarkers. Similarly, Shi et al. developed CoVarNet[63], a non-negative matrix factorization (NMF)-based computational method, and applied it to the high-quality scRNA-seq data containing over 2 million cells from 706 healthy samples across 35 human tissues. They identified 12 cross-tissue coordinated cell niches that exhibited distinct patterns of intercellular communication, age-associated dynamic change, and disruption of organization during cancer progression. Meanwhile, Jerby-Arnon et al. proposed DIALOGUE[64], which combines penalized matrix decomposition (PMD) with hierarchical models to identify coordinated cellular programs in different cell types, decoupling niche-dependent programs from intrinsic cell state programs, and revealing cell niches associated with specific phenotypes.

      Collectively, these approaches provide effective computational frameworks for inferring the composition and expression characteristics of potential cell niches in tissues. However, due to the inevitable loss of spatial location information during tissue dissociation and preparation of single-cell suspensions, cell niche analysis based solely on single-cell omics data is inherently indirect. Such analyses rely on quantitative characterization or statistical inference of cellular composition, functional coordination, or interaction patterns, rather than explicitly delineating cell niches within spatial contexts.

    • Spatial omics technologies enable the simultaneous measurement of molecular profiles while preserving the spatial context of tissues, effectively overcoming the intrinsic limitations of single-cell omics approaches. These technologies have demonstrated great potential in elucidating the spatial organization, cellular interactions, and functional characteristics of complex tissues such as the brain or tumor microenvironments. Among them, spatial transcriptomics (SRT), comprehensively reviewed elsewhere[6569], is one of the earliest developed and most widely recognized modalities. Broadly, SRT strategies can be categorized into next-generation sequencing (NGS)-based and imaging-based approaches, which differ in cellular resolution, transcriptomic coverage, cost, and commercial availability. For example, representative commercial platforms include 10X Visium and Visium HD[70], 10X Xenium[16], Vizgen MERSCOPE[71], NanoString CosMx[72], and Stereo-seq[13]. Beyond transcriptomics, an increasing number of spatial omics technologies have emerged in recent years, including spatial proteomics[14,15,73], spatial metabolomics[7477], and spatial epigenomics[7881], further enriching the molecular dimensions available for spatial analysis.

      Compared with single-cell omics, spatial omics technologies preserve spatial information explicitly, offering distinct advantages for cell niche analysis. Leveraging this data, a large number of computational methods have been developed to decipher cell niches by modeling spatial proximity, cellular composition, and molecular interactions within tissues. Broadly, existing approaches for cell niche analysis using spatial omics data can be divided into two conceptual paradigms (Fig. 2b, c). The first paradigm focuses on spatial inference, which aims to reconstruct high-resolution cellular composition and molecular features of cell niches from spatial data with limited resolution or transcriptomic coverage by integrating single-cell omics references. The second paradigm emphasizes direct modeling, in which cell niches are explicitly identified by jointly utilizing the molecular profiles as well as spatial information of spatial omics data.

      In this section, we review recent advances in computational methods related to the first paradigm. These methods primarily include spatial deconvolution and spatial mapping, which try to address the technical limitations of current spatial omics platforms to delineate fine-grained cellular composition and spatial organization of cell niches within specific anatomical regions, such as lesion-adjacent areas or tissue boundaries.

    • Spatial deconvolution methods are designed for low-resolution SRT data, in which each capture spot typically contains transcripts from multiple cells, ranging from several to dozens. These methods aim to decompose the mixed transcriptomic signals of each capture spot into contributions from discrete cell subpopulations based on matched scRNA-seq reference data. By estimating the cellular composition of individual spots, spatial deconvolution methods enable a refined characterization of the spatial distribution and co-localization patterns of different cell subpopulations that underpin cell niche organization. For example, Lake et al. generated 10X Visium spatial transcriptomic profiles of human kidney tissues from healthy reference as well as patients with acute kidney injury to chronic kidney disease[42]. Through spatial deconvolution analysis, they identified fibrotic and inflammatory cell niches specifically enriched in cortical injury regions, characterized by the spatial co-localization of fibroblasts, activated stromal cells, and immune populations. Similarly, Kuppe et al. applied SRT and spatial deconvolution to human heart tissues in homeostasis, and after myocardial infarction, revealing the dynamic remodeling of distinct cell niches within the border zones surrounding necrotic areas[82].

      Based on their mathematical frameworks, spatial deconvolution methods can be broadly categorized into four classes: regression-based, probabilistic, deep learning-based, and other machine learning-based approaches (Table 1). Regression-based methods represent the earliest class of spatial deconvolution approaches. Their conceptual foundation originates from deconvolution methods initially developed for bulk RNA-seq data[62,8389] and subsequently extended to spatial transcriptomics. These methods typically model the mixed gene expression profile of each capture spot as a linear combination of characteristic signatures of different cell types derived from scRNA-seq data, and then infer cell type proportions within each capture spot using regression algorithms such as non-negative least squares (NNLS) or dampened weighted least squares (DWLS). A representative example is NMFreg[90], which first applies the non-negative matrix factorization (NMF) algorithm to extract characteristic signatures of different cell types from scRNA-seq data, followed by the NNLS regression to estimate cell type proportions in SRT data. Regression-based methods are generally highly interpretable and naturally guarantee the non-negativity of cell type contributions, which has motivated the development of numerous extensions that enhance robustness and accuracy by improving signature selection or regression strategies. For instance, SPOTlight[91] and Redeconve[92] both build upon the NNLS framework, with SPOTlight incorporating prior biological knowledge via marker gene-guided NMF initialization to improve sensitivity in detecting rare populations, while Redeconve introduces a regularization term based on cell similarity to mitigate gene expression collinearity among transcriptionally similar cell types. spatialDWLS[93] further improves specificity by integrating enrichment analysis with the DWLS regression, whereas SpatialDecon[94] and SONAR[95] replace NNLS regression with log-normal regression and Poisson-Gamma regression coupled with geographically weighted regression (GWR), respectively, to better accommodate the characteristics of SRT data. For tumor microenvironment analysis, Cottrazm[96] and SpaCET[97] respectively utilize tailored DWLS and NNLS regression strategies to enhance the estimation of cell type proportions in tumor SRT data.

      Table 1.  AI methods for spatial deconvolution.

      Method Strategy Underlying AI algorithm Language Ref.
      NMFreg Regression-based NNLS MATLAB [90]
      SPOTlight Regression-based NNLS R [91]
      Redeconve Regression-based Regularized NNLS R [92]
      spatialDWLS Regression-based DWLS R [93]
      SpatialDecon Regression-based log-normal regression R [94]
      SONAR Regression-based Poisson-Gamma regression + GWR R [95]
      Cottrazm Regression-based DWLS R [96]
      SpaCET Regression-based NNLS R [97]
      RCTD Probabilistic modeling Poisson distribution with MLE R [98]
      stereoscope Probabilistic modeling NB distribution with MAP Python [99]
      cell2location Probabilistic modeling NB distribution with variational Bayesian inference Python [100]
      Starfysh Probabilistic modeling Poisson-Gamma distribution with variational inference Python [101]
      STdeconvolve Probabilistic modeling LDA R [102]
      STRIDE Probabilistic modeling LDA Python [103]
      Bulk2Space Deep learning-based VAE Python [104]
      DestVI Deep learning-based VAE Python [105]
      SPACEL Deep learning-based VAE Python [106]
      DSTG Deep learning-based GCN R and Python [107]
      STdGCN Deep learning-based GCN Python [108]
      SD2 Deep learning-based GCN + Autoencoder R and Python [109]
      CellDART Deep learning-based ADDA Python [110]
      spSeudoMap Deep learning-based ADDA Python [111]
      TransformerST Deep learning-based Transformer Python [112]
      AntiSplodge Deep learning-based Feedforward neural network Python [113]
      UCDbase Deep learning-based Feedforward neural network Python [114]
      LETSmix Deep learning-based Feedforward neural network Python [115]
      Tangram Deep learning-based Nonconvex optimization Python [116]
      CARD Other NMF + CAR R [117]
      CellPie Other NMF + factor analysis Python [118]
      Seurat Other Enrichment score R [119]
      CellTrek Other Random forest R [120]
      SpaOTsc Other OT Python [121]
      NovoSpaRc Other OT Python [122]
      TACCO Other OT Python [123]
      DOT Other OT R [124]

      Compared with regression-based methods, spatial deconvolution methods based on probabilistic models explicitly model gene expression counts, batch effects, platform-specific biases, and unobserved sources of variability within a unified probabilistic framework, often achieving improved accuracy in complex situations at the cost of increased computational complexity. A major subclass of these approaches constructs probability distribution models based on a Bayesian statistical framework, which typically assume Poisson, negative binomial (NB), or Poisson-Gamma distributions for gene expression counts and employ maximum likelihood estimation (MLE), maximum a posteriori (MAP), or variational inference for parameter estimation, as exemplified by RCTD[98], stereoscope[99], cell2location[100], and Starfysh[101]. Notably, RCTD and cell2location explicitly introduce a specific random variable to account for platform effects and other sources of natural variability, demonstrating superior performance in cross-platform inference and fine-grained cell type resolution. Another subclass of probabilistic model-based spatial deconvolution methods mainly employ a topic model framework, such as latent Dirichlet allocation (LDA), in which inferred 'topics' are interpreted as latent cell types or functional programs. For example, STdeconvolve[102] applies LDA to perform unsupervised modeling on SRT data directly, followed by a post hoc topic annotation step. In contrast, STRIDE[103] first learns potential functional topic distributions from scRNA-seq reference data, and then transfers the trained model to SRT data, enabling the end-to-end spatial deconvolution analysis without post-processing. However, it should be noted that the biological interpretability of such approaches largely depend on the choice of feature databases and annotation strategies.

      With advances in deep learning, deep learning-based spatial deconvolution methods have become increasingly prevalent for modeling complex, high-dimensional datasets. These approaches are particularly effective in capturing nonlinear relationships between scRNA-seq and SRT data, but often exhibit reduced interpretability and increased risk of overfitting when applied to small datasets. Most deep learning-based methods extract latent representations from scRNA-seq and SRT data using neural network architectures and train models on 'pseudo-SRT data' with known cellular compositions generated from scRNA-seq references. Representative examples include VAE-based approaches, such as Bulk2Space[104], DestVI[105], and the Spoint module in the SPACEL[106] toolkit, as well as GNN-based methods, including DSTG[107], STdGCN[108], and SD2[109]. Among these, STdGCN explicitly integrates spatial topology, whereas SD2 employs an autoencoder-based dimensionality reduction to mitigate dropout effects. Additional approaches explore adversarial discriminative domain adaptation (ADDA) models[110,111], transformer architectures[112], and lightweight feedforward neural networks[113115], highlighting the diversity of neural network architectures in deep learning-based solutions. In contrast, Tangram[116] adopts a different strategy based on nonconvex optimization, learning an optimal mapping matrix between individual single cells and spatial spots with neural networks. By introducing regularization terms that account for variable cell numbers across spots, Tangram not only performs spot-level deconvolution but also enables reconstruction of gene expression patterns within spatial contexts in single-cell resolution.

      Beyond the major categories described above, several alternative machine learning strategies have also been proposed. For instance, CARD[117] and CellPie[118] extend the NMF algorithm by incorporating the conditional autoregressive (CAR) modeling and factor analysis, respectively. Seurat[119] provides a marker gene-based scoring approach to estimate relative enrichment of specific cell types within each spot. CellTrek[120] combines joint dimensionality reduction of scRNA-seq and SRT data with random forest models to perform spatial deconvolution. In addition, several early methods based on the optimal transport (OT) algorithm, such as SpaOTsc[121] and NovoSpaRc[122], although not specifically designed for spatial deconvolution analysis have also demonstrated the potential to achieve accurate deconvolution of SRT data by constructing the OT mapping between scRNA-seq and SRT data. Therefore, subsequent studies including TACCO[123] and DOT[124] have further extended this framework to enable more unified and flexible information transfer and interpretation between scRNA-seq and SRT data.

      Overall, spatial deconvolution methods provide an effective means to recover cellular composition and spatial co-localization patterns from low-resolution SRT data, laying the foundation for the accurate identification and characterization of cell niches in tissues. However, most existing approaches rely heavily on high-quality matched scRNA-seq reference and primarily focus on estimating cell type proportions rather than reconstructing full molecular profiles at the single-cell resolution. As a result, spatial deconvolution is mainly applicable for characterizing coarse-grained microenvironmental composition, while its ability to resolve fine-scale spatial interactions and dynamic cell niche-associated molecular changes remains limited.

    • Similar to spatial deconvolution methods, spatial mapping methods are developed to enhance the effective resolution and quality of spatial omics data. These approaches are primarily designed for imaging-based SRT technologies, such as MERFISH[11], seqFISH[125], and STARmap[126], which offer subcellular spatial resolution and single-molecule sensitivity but typically profile only a limited number of genes, ranging from several hundred to a few thousand, corresponding to approximately 1%−10% of the whole transcriptome. Spatial mapping methods aim to assign individual cells from single-cell omics data to their most likely spatial locations within spatial reference data. By integrating the full transcriptome characteristics from single-cell omics data and the spatial information from spatial reference data, these methods can predict the expression of unmeasured genes in spatial omics data and further reconstruct the high-resolution, spatially resolved single-cell expression maps. In contrast to spatial deconvolution methods, which estimate cell type proportions at the spot level, spatial mapping methods can, in some cases, recover spatial expression profiles at the single-cell level, providing higher precision for analyzing multicellular interactions and cell niche organization. For example, Fan et al. conducted spatial mapping analysis to develop a high-resolution and spatially resolved map of intratumoral expression heterogeneity in cervical squamous cell carcinoma (CSCC), identifying two spatially distinct cell niches corresponding to different stages of squamous differentiation[127]. These cell niches exhibited opposite spatial associations with immune cells and were characterized by different transcriptional signatures and clinical outcomes. Similarly, Wei et al. applied spatial mapping methods to decipher the spatial tumor-immune cell niches of ductal carcinoma in situ (DCIS) tissues, revealing tumor subclones that were restricted to different ducts, and specific T cell states adjacent to the tumor areas[120].

      Based on their mathematical frameworks, spatial mapping methods can be broadly categorized into two classes: joint dimensionality reduction-based, and mapping matrix-based approaches (Table 2). Joint dimensionality reduction-based methods project single-cell and spatial omics data into a shared low-dimensional latent space, in which batch effects between platforms are usually removed while biologically meaningful variation is preserved. Within this shared latent space, the expression profiles of unmeasured genes in spatial data can be inferred from neighboring single cells of single-cell data. Early approaches rely on linear dimensionality reduction techniques, including Seurat[119], LIGER[128], Harmony[129], and SpaGE[130], which employ canonical correlation analysis (CCA), NMF, principal component analysis (PCA), or singular value decomposition (SVD) for cross-modality alignment, followed by the k-nearest neighbors (KNN)-based gene expression imputation. Recently, more methods based on nonlinear dimensionality reduction techniques have been proposed to better capture the complex relationships between datasets. For instance, gimVI[131] and stPlus[132] employ the joint VAE framework and autoencoder framework to model scRNA-seq and spatial data, respectively, whereas SpatialScope[133] leverages deep generative models to integrate scRNA-seq and SRT data. Notably, SpatialScope enhances NGS-based SRT data to achieve single-cell resolution and can also infer unmeasured gene expressions for imaging-based SRT data.

      Table 2.  AI methods for spatial mapping.

      Method Strategy Underlying AI algorithm Language Ref.
      Seurat Joint dimension reduction-based CCA + KNN MATLAB [119]
      LIGER Joint dimension reduction-based NMF + KNN R [128]
      Harmony Joint dimension reduction-based PCA + KNN R [129]
      SpaGE Joint dimension reduction-based SVD + KNN Python [130]
      gimVI Joint dimension reduction-based Joint VAE Python [131]
      stPlus Joint dimension reduction-based Autoencoder + KNN Python [132]
      SpatialScope Joint dimension reduction-based Deep generative model Python [133]
      SpaTrio Mapping matrix-based OT R and Python [134]
      SIMO Mapping matrix-based OT Python [135]
      Tangram Mapping matrix-based Nonconvex optimization Python [116]
      CellTrek Mapping matrix-based Random forest R [120]
      SpaOTsc Mapping matrix-based OT Python [121]
      novoSpaRc Mapping matrix-based OT Python [122]
      CytoSPACE Mapping matrix-based Convex optimization Python [136]

      Mapping matrix-based methods aim to learn an optimal mapping matrix between individual cells in single-cell data and spatial locations in spatial reference, thereby assigning cells with whole transcriptomes back to their most appropriate positions in space and enabling the estimation of expression profiles for unmeasured genes. Therefore, compared with joint dimensionality reduction-based methods, these approaches provide more explicit cell-to-location assignments and better preserve spatial continuity and tissue architecture. In most cases, the mapping problem is formulated as an optimization task, in which the optimization function (also called the cost function) is defined based on gene expression similarity or distance metrics between cells and spatial locations, such as Euclidean distance, Pearson correlation, or cosine similarity. For example, SpaTrio[134] and SIMO[135] are both based on the fused Gromov-Wasserstein optimal transport algorithm, and employ the Euclidean distance algorithm to calculate the cost matrix. Several spatial deconvolution methods also support spatial mapping analysis, such as Tangram[116], CellTrek[120], SpaOTsc[121], and novoSpaRc[122]. Moreover, CytoSPACE[136], a recently proposed spatial mapping method, formulates spatial mapping as a convex optimization problem and solves it using the Jonker-Volgenant shortest augmenting path algorithm, achieving global optimality and high computational efficiency.

      Collectively, spatial mapping methods establish explicit or implicit correspondences between single-cell and spatial omics data, and offer advantages over spatial deconvolution methods in achieving high-resolution reconstruction of spatial gene expression patterns and cell localization. However, their characterization of cell niches still relies on indirect inference, and the performance is also limited by the quality of the single-cell reference, as well as the intrinsic complexity of tissues.

    • Unlike spatial deconvolution and mapping methods, which infer cell niches indirectly through estimated cell-type composition or reconstructed spatial gene expression, recently developed direct modeling strategies explicitly treat the cell niche as the fundamental unit for analysis. These methods utilize the molecular profiles as well as spatial information of spatial omics data, jointly to model and identify cell niches in a data-driven manner, demonstrating substantial potential for uncovering spatially organized microenvironments. In practice, direct modeling is most commonly implemented through spatial clustering, where niche-specific representations are learned based on the features of the cell itself and its local spatial context extracted from appropriate cellular neighborhoods, and subsequently partitioned using unsupervised algorithms. The choice of the neighborhood scale typically depends on the resolution of the spatial technology and the biological interest, and can be defined using either a fixed radius, a fixed number of nearest neighboring cells, or graph-based neighborhoods computed by Delaunay triangulation[31].

      Based on their modeling principles, spatial clustering methods can be broadly categorized into five classes: probabilistic graphical model (PGM)-based, feature augmentation-based, GNN-based, improved community detection algorithm-based, and neighborhood phenotype-driven approaches (Table 3).

      Table 3.  AI methods for spatial clustering.

      Method Strategy Underlying AI algorithm Language Ref.
      Giotto PGM-based HMRF R and Python [137]
      DR-SC PGM-based HMRF R [138]
      BayesSpace PGM-based Fully Bayesian model R [139]
      BASS PGM-based Potts model with additional hierarchical
      modeling structures
      R [140]
      PRECAST PGM-based Potts model + CAR R [141]
      SPICEMIX PGM-based NMF-HMRF Python [142]
      UTAG Feature augmentation Linear weighting Python [145]
      CellCharter Feature augmentation Neighborhood aggregation Python [32]
      ClusterMap Feature augmentation Neighborhood aggregation Python [146]
      BANKSY Feature augmentation Linear combination R and Python [147]
      SpaGCN GNN-based GCN + DEC Python [148]
      Spatial-MGCN GNN-based Multi-view GCN Python [149]
      STMGCN GNN-based Multi-view GCN Python [150]
      STAGATE GNN-based GATE Python [151]
      STAligner GNN-based GATE Python [152]
      DSSC GNN-based GATE Python [153]
      CCST GNN-based DGI Python [154]
      SpaceFlow GNN-based Spatially regularized DGI Python [155]
      SPIRAL GNN-based GraphSAGE Python [156]
      SEDR GNN-based VGAE + DEC Python [157]
      scNiche GNN-based M-GAE + GFN Python [36]
      SPACEL GNN-based GCN + DANN Python [106]
      DeepST GNN-based Denoising autoencoder + VGAE + DANN Python [158]
      SpaDAC GNN-based graph attention network (GAT) + CNN Python [159]
      stMDA GNN-based GCN + CNN + VAE Python [160]
      GraphST GNN-based GCN with SCL Python [161]
      MERINGUE Improved community detection algorithm-based Expression graph with spatial weighting + Louvain R [164]
      SCGP Improved community detection algorithm-based Spatial graph with expression weighting + Leiden Python [165]
      SpatialLeiden Improved community detection algorithm-based Multiple graph + Leiden Python [166]
      Schürch's method Neighborhood phenotype-driven Neighborhood cell type frequency matrix + k-means Python [167]
      Spatial-LDA Neighborhood phenotype-driven Neighborhood cell type frequency matrix + Spatial-LDA + k-means Python [168]
      CytoCommunity Neighborhood phenotype-driven Neighborhood cell type frequency matrix + GNN Python [169]
    • Early PGM-based spatial clustering methods, such as Giotto[137], DR-SC[138], and BayesSpace[139] commonly assume that spatially proximate cells belong to the same cell niche and employ a Potts model to encourage these cells to have the same label. Among them, both Giotto and DR-SC employ a hidden Markov random field (HMRF) model, whereas BayesSpace implements a fully Bayesian model and performs parameter estimation via the Markov chain Monte Carlo (MCMC) algorithm.

      A key assumption underlying these methods is that cells within the same cell niche exhibit homogeneous gene expression profiles. However, this assumption does not fully reflect the biological reality, as a single cell niche often contains diverse cell types, and the same cell type may also participate in multiple cell niches. To address this limitation, subsequent PGM-based spatial clustering methods extend the framework to accommodate more complex neighborhood structures. For example, BASS[140] introduces additional hierarchical modeling structures on top of the Potts model to explicitly model heterogeneous gene expression of different cell types, thus enabling more flexible and effective modeling of spatial omics data. PRECAST[141] employs a conditional autoregressive model for spatial dependence and uses a Potts model for clustering, making it highly effective in aligning data from different tissue slices and resolving spatial structures. SPICEMIX[142] combines the HMRF model with NMF to propose a novel framework, NMF-HMRF. Compared with the original HMRF model, NMF-HMRF uniquely integrates the modeling process of the spatial topology of cells by HMRF with the dimensionality reduction step of gene expression profiles by NMF into a single model, which can unravel the complex mixing of latent intrinsic and spatial factors of heterogeneous cell identity in complex tissues effectively.

      PGM-based spatial clustering methods typically exhibit strong biological interpretability. However, their computationally intensive parameter estimation procedures often limit scalability to large spatial datasets.

    • Feature augmentation-based spatial clustering methods are motivated by the fundamental biological intuition that a cell's molecular profiles are influenced not only by intrinsic regulation, but also by interactions with neighboring cells[143,144]. Consequently, feature augmentation strategies aim to combine the molecular profiles of the cell itself with those of its neighbors to generate new features that are more representative of the cell niche. Specifically, UTAG[145] integrates the molecular profiles of neighbors into the cell's own molecular profiles using linear weighting, whereas CellCharter[32] and ClusterMap[146] achieve this through neighborhood aggregation. BANKSY[147] adopts a related but distinct strategy by incorporating neighborhood features as additional dimensions rather than direct aggregation, with a specific hyperparameter to tune the contributions of the cells and their local microenvironments.

      Feature augmentation-based spatial clustering methods typically involve fewer parameters and are computationally efficient, enabling robust performance and scalability across large datasets. However, overly simplified assumptions and modeling approaches limit their ability to accurately reveal cell niches within complex tissues.

    • Another widely used strategy for spatial clustering involves utilizing GNN deep learning models. Compared with PGM-based and feature augmentation-based approaches, GNN-based spatial clustering methods provide greater modeling flexibility, but typically involve numerous hyperparameters and are more prone to overfitting. Furthermore, such methods also face limitations in scalability.

      SpaGCN[148] is a representative GNN-based spatial clustering method. It integrates gene expression, spatial location, and histological information using a single-layer graph convolutional network (GCN) coupled with a deep embedding clustering (DEC) model to identify cell niches and spatially variable genes. Subsequent studies have extended this single-layer GCN to more complex GNN architectures, including multi-view graph convolutional networks (Multi-view GCNs)[149,150], graph attention autoencoders (GATE)[151153], unsupervised graph embedding approaches such as Deep Graph Infomax (DGI)[154,155], GraphSAGE[156], and variational graph autoencoders (VGAE)[157], to better capture the complex relationships in spatial omics data. More recently, scNiche[36] further extends GNN architectures by applying a multiple graph autoencoder (M-GAE) coupled with a graph fusion network (GFN) to integrate the multi-view features of the cell into a joint representation of the cell niche, achieving the state-of-the-art niche identification performance across multiple datasets.

      Beyond refining GNN architectures, combining them with other deep learning models represents another promising avenue for improvement. For example, the Splane module in SPACEL[106] and DeepST[158] combine GNNs with domain adversarial neural networks (DANN) to eliminate batch effects across multiple tissue sections, enabling the joint analysis of multiple SRT sections. SpaDAC[159] and stMDA[160] integrate GNNs with CNNs to jointly capture transcriptomic similarity and morphological patterns from histology images. Additionally, GraphST[161] proposes a novel graph self-supervised contrastive learning architecture by combining GCNs with self-supervised contrastive learning (SCL) strategies, enabling multiple tasks for cell niche analysis, such as spatial clustering and spatial deconvolution.

    • Community detection algorithms, such as Louvain[162] and Leiden[163], are widely used in single-cell omics data analysis to identify cell populations or subpopulations with similar molecular profiles. By integrating the spatial information into the cell similarity graph constructed based on the molecular profiles of cells, these improved community detection algorithms can be extended to cell niche identification. For example, MERINGUE[164] first constructs a cell similarity graph based on the gene expression profiles of cells in SRT data, where each node in the graph represents a cell (or capture spot). Subsequently, to incorporate the spatial information into the similarity graph, MERINGUE applies spatial weighting to the edges of the graph based on the physical proximity between cells, thereby transforming the similarity graph into a spatially informed similarity graph. Finally, MERINGUE directly performs the Louvain algorithm on this spatially informed similarity graph to identify cell niches. In contrast, SCGP[165] adopts an inverse strategy by first constructing a spatial topology graph based on the physical proximity between cells, and then incorporating expression similarity as edge weights, followed by Leiden clustering to identify cell niches.

      Beyond edge reweighting strategies, SpatialLeiden[166] further extends the Leiden algorithm by jointly modeling gene expression similarity and spatial adjacency using a multiple graph framework. Specifically, expression similarity and spatial neighborhood relationships are represented as separate graph layers, and clustering is achieved by optimizing a combined modularity objective across layers. This design enables a more flexible balance between transcriptional similarity and spatial continuity, resulting in robust identification of spatial domains across diverse SRT datasets.

    • Neighborhood phenotype-driven spatial clustering methods were initially developed for spatial proteomics data. Compared with the preceding four classes of spatial clustering methods, these approaches exhibit distinct differences in feature utilization, specifically selecting the cellular phenotype (e.g., cell type) labels of cellular neighborhoods as input rather than molecular profiles. Schürch et al.[167] first applied this strategy to the CODEX data from 35 advanced-stage colorectal cancer (CRC) patients with 56 protein markers. Specifically, they annotated 29 cell types based on the expression levels of marker proteins and then quantified the frequency of each cell type within a fixed neighborhood (10 cells) for every cell to construct a 29-dimensional neighborhood cell type frequency matrix. Subsequently, they performed unsupervised clustering on this frequency matrix using the k-means algorithm and identified nine distinct cell niches in the CRC tumor microenvironment. Spatial-LDA[168] improves this framework by applying the LDA model for dimensionality reduction of the frequency matrix, enhancing the robustness of the framework to potential noise. Furthermore, Spatial-LDA modifies the traditional LDA model by introducing spatial proximity priors to promote coherence of microenvironments between nearby cells. More recently, CytoCommunity[169] improves the unsupervised clustering step by innovatively employing a GNN model based on the Minimum Cut (MinCut) algorithm to achieve both unsupervised and supervised identification of cell niches across multiple tissue sections.

      Neighborhood phenotype-driven spatial clustering methods offer excellent interpretability by directly linking cell niches to specific cellular compositions. However, these methods may struggle to identify certain cell niches defined by molecular state changes rather than cellular composition, such as the cell niche located at the tumor-immune boundary, where both tumor and immune cells exhibit altered expression profiles[15].

    • In this section, we provide a systematic comparison of existing AI-driven cell niche analysis methods from multiple perspectives, including data requirement, inference strategy, interpretability, scalability, robustness, and typical application scenarios (Table 4). Overall, no single method is universally optimal. The choice of an appropriate strategy largely depends on the resolution and modality of the data, the availability of matched single-cell references, and the specific biological questions of interest.

      Table 4.  Comparison of AI methods for cell niche analysis.

      Method Data requirement Inference strategy Interpretability Scalability Robustness Typical application scenario
      Single-cell-based indirect inference Single-cell data Cell niches are inferred based on multicellular coordination states, without explicit spatial context High; results are directly linked to cell types, conditions, or samples High; scalable to large cohorts High; relatively stable across datasets and parameter settings Preliminary characterization of cell niches, large-scale cohort studies, and cross-condition comparisons
      Spatial inference strategy: Spatial deconvolution/spatial mapping Single-cell/low-quality spatial data Cell niches are inferred based on reconstructed high-resolution cellular composition and molecular features High; results are driven by biological priors and directly linked to cell types Moderate; depends on inference strategy and
      data size
      Moderate; affected by batch effects, reference quality, and parameter settings Intuitive and refined descriptions of composition and molecular features of cell niches at the spatial level
      PGM-based spatial clustering methods Spatial data Moderate to high; clear modeling assumptions and the strong statistical foundation Low to moderate; generally computationally intensive Moderate to high; relatively stable across parameter settings
      Feature augmentation-based spatial clustering methods Spatial data Moderate to high; interpretability depends on feature design High; generally computationally efficient Moderate to high; simplified modeling assumptions with few parameters, but sensitive to feature design
      GNN-based spatial clustering methods Spatial data Cell niches are inferred based on direct modeling and unsupervised partitioning by utilizing the molecular profiles as well as spatial information of spatial omics data jointly Low to moderate; results are often difficult to interpret Low to moderate; generally limited by graph size and
      GPU memory
      Low to moderate; greater modeling flexibility but typically involve numerous parameters and are more prone to overfitting Discovery of previously uncharacterized cell niches
      in complex tissues in a
      data-driven manner
      Improved community detection algorithm-based spatial clustering methods Spatial data Moderate to high; results are interpretable at the community structure level High; generally computationally efficient High; simplified modeling assumptions with few parameters
      Neighborhood phenotype-driven spatial clustering methods Spatial data High; results are directly linked to cell types and biologically intuitive Moderate to high; depends on inference strategy High; simplified modeling assumptions with few parameters

      From a methodological perspective, indirect inference methods based on single-cell omics data require only single-cell datasets and are therefore broadly applicable. However, due to the lack of explicit spatial context, these approaches provide a relatively coarse and indirect characterization of cell niches. As such, these approaches are more suitable for preliminary characterization of cell niches, large-scale cohort studies, and cross-condition comparisons. Spatial inference strategies, including spatial deconvolution and spatial mapping methods, integrate single-cell and spatial omics data to reconstruct spatial patterns of cellular composition and gene expression. Compared with indirect inference driven by single-cell data, these approaches offer more intuitive and refined descriptions of composition and molecular features of cell niches, with generally strong interpretability and direct links to known biological knowledge. Nevertheless, their performance critically depends on the quality of the single-cell reference, and the delineation of cell niches remains indirect and constrained by existing biological priors. In contrast, direct modeling strategies, represented by spatial clustering methods, explicitly treat the cell niche as the fundamental unit of analysis and identify potential cell niches in a data-driven manner, making them particularly suitable for discovering previously uncharacterized cell niches in complex tissues. However, as these approaches are typically based on unsupervised learning frameworks, their results can be sensitive to parameter selection and data noise, resulting in relatively limited interpretability.

      From an algorithmic perspective, traditional machine learning and statistical models typically possess clear modeling assumptions, the strong statistical foundation, and high interpretability and robustness, while also exhibiting stable computational efficiency and scalability. However, their relatively simplified model structures struggle to fully characterize the complex nonlinear relationships in high-dimensional omics data. Deep learning-based methods, especially GNN models, demonstrate greater flexibility in modeling complex spatial dependencies and multimodal features, achieving leading performance in multiple benchmarking studies[170,171]. However, they also introduce new challenges, including insufficient model interpretability, sensitivity to hyperparameters and random initialization, reproducibility issues across datasets and platforms, overfitting risks, and high computational resource requirements.

      Taken together, each AI-driven cell niche analysis method has unique strengths and limitations, and the most applicable method depends on the specific dataset and research objectives.

    • The cell niche, as the functional unit that characterizes the cellular composition, spatial architecture, and functional states within specific tissue regions, emphasizes multicellular coordination and spatial microenvironmental dependence, which is highly consistent with the theoretical basis of holistic regulation and systemic intervention of TCM. In recent years, with the rapid advancement of single-cell omics, spatial omics, and related AI-driven computational methods, cell niche analysis has been gradually introduced into TCM research, demonstrating unique advantages compared to traditional analytical paradigms. In this section, we summarize recent progress in representative scenarios of cell niche analysis in TCM research, including the identification of intervention-associated targets, delineation of dynamic regulatory processes, and elucidation of integrative mechanisms of action. We further discuss the potential translational implications of niche-level insights in TCM research (Fig. 3).

      Figure 3. 

      Representative scenarios and translational implications of AI-driven cell niche analysis in TCM research. AI-driven cell niche analysis provides a systematic framework for investigating the mechanisms of TCM interventions, with three representative scenarios, including identifying functional multi-cellular targets associated with TCM interventions by resolving coordinated cellular architectures within specific spatial contexts, characterizing the dynamic remodeling of pathological cell niches toward homeostatic or reparative states during TCM treatment, and elucidating the integrative mechanisms of action of TCM from a multi-cellular, multi-target, and multi-pathway perspective. The niche-level insights further demonstrate translational implications, offering guidance for the discovery of functional biomarkers, efficacy evaluation, the design of drug combination strategies, and the optimization of TCM formulas.

    • The 'multi-component, multi-target, and multi-pathway' nature of TCM implies that its therapeutic effects are rarely mediated by a single cell type or molecular pathway, but instead arise from coordinated regulation among multiple cell types within defined spatial contexts. Accordingly, identifying functional cell niches closely associated with TCM interventions has emerged as an important entry point for mechanistic interpretation.

      Facilitated by advanced AI-driven computational methods, studies on anti-tumor targets represent a classic application of cell niche analysis in TCM research. Previous studies indicate that tumor initiation and progression are often accompanied by complex remodeling of cell niches, particularly the formation of immunosuppressive niches that contribute to immune evasion and may serve as potential therapeutic targets[172,173]. By employing spatial deconvolution strategies, Zou et al.[174] integrated scRNA-seq and SRT data to resolve an immunosuppressive cell niche in the cervical cancer microenvironment, characterized by the spatial co-localization and close interaction of macrophages, neutrophils, and CD8+ T cells. Further analysis demonstrated that the natural compound matrine modulates this specific cell niche, alleviating macrophage-mediated suppression of CD8+ T cells, and restoring their cytotoxic activity to exert anti-tumor effects. This case effectively illustrates the application potential of AI-based spatial inference strategies for cell niche analysis in TCM research. By reconstructing high-resolution cellular composition and molecular features of cell niches, this approach enables the identification of niche-level targets that remain inaccessible to conventional bulk analysis, such as the multicellular spatial co-localization patterns and intercellular interactions. In studies lacking spatial omics data, the TCM intervention-related cell niche can also be characterized through indirect inference based on single-cell data. For example, Yu et al.[175] investigated the synergistic effects of Shenmai injection (SMI) and PD-1 inhibitor in non-small cell lung cancer (NSCLC) using single-cell RNA sequencing data. By inferring multicellular coordination states under different conditions through the analysis of changes in cell type proportions and CCIs, their study reported that SMI can reshape the tumor immune cell niches by enhancing the infiltration of NK cells, where NK cells further recruit dendritic cells and T cells via the XCL1 /XCL2-XCR1 axis and the CXCL10-CXCR3 axis, respectively.

      Beyond oncology, cell niche analysis has also been applied to elucidate functional targets of TCM intervention in other disease contexts. For example, using single-nucleus RNA-sequencing (sNuc-seq) and 10X Visium spatial transcriptomics, combined with spatial clustering methods, Lin et al.[176] systematically characterized the cardioprotection of Shexiang Baoxin Pill (SBP) in mice with myocardial ischemia-reperfusion injury, identifying two pathological cell niches located in the infarct and border zones, respectively. The study further revealed that the therapeutic effect of SBP was achieved through the synergistic regulation of endocardial cells, fibroblasts, and macrophages within these niches. Although this work represents an initial attempt; it highlights the potential of data-driven unsupervised clustering strategies for the identification of spatially constrained cell niche targets. In addition, Liu et al.[177] demonstrated via the indirect inference of the cell niche with scRNA-seq data, that Oridonin can remodel the hepatic immune cell niche by suppressing pro-inflammatory characteristics of Kupffer cells, as well as reducing the number of cytotoxic NK cells simultaneously, thereby alleviating septic liver injury.

    • Beyond the identification of functional cell niches targeted by TCM interventions, delineating the dynamic regulatory processes during TCM treatment is equally important for a comprehensive understanding of its therapeutic effects. By combining with trajectory inference and temporal modeling strategies, cell niche analysis offers a powerful analytical framework to characterize the continuous transition of disease-associated cell niches from pathological states toward homeostatic or reparative states.

      For instance, Jin et al.[178] employed scRNA-seq to systematically characterize the temporal diversity and dynamics of cardiac immune cell niches after myocardial infarction (MI). By analyzing the changes in the proportion of various immune cell populations at different time points and integrating trajectory inference approaches, the authors created a dynamic immune landscape after MI. Their results showed that MI induces the coordinated enrichment of multiple pro-inflammatory pathogenic immune cell populations, leading to aberrant remodeling of cardiac immune niches. Treatment with tanshinone IIA markedly mitigated this aberrant remodeling process, inhibiting some detrimental subsets, and progressively shifting the immune niche composition toward a more homeostatic state. This study demonstrates an effective strategy for delineating the dynamic regulation of cell niches under TCM interventions by integrating time-resolved single-cell data with AI-based indirect inference, even in the absence of explicit spatial information.

      In contrast, the integration of single-cell and spatially resolved transcriptomic analysis can usually provide more direct evidence. For example, based on the scRNA-seq and SRT data, Shu et al.[179] employed the spatial mapping computational strategy combined with immunofluorescence experiments to identify a pathogenic inflammatory cell niche located within the submucosa in a ulcerative colitis model, characterized by the spatial co-localization of neutrophils and macrophages. Following Tripterygium wilfordii Polyglycosidium (TWP) intervention, this pathogenic inflammatory cell niche was significantly remodeled, accompanied by a decline of S100a8-mediated intercellular communication. Disruption of this inflammatory circuitry dismantled the microenvironmental basis sustaining chronic inflammation, and facilitated a transition of the niche from an inflammatory state toward a mucosal repair-associated state.

    • Cell niche analysis also provides a useful theoretical and methodological framework for systematically elucidating the integrative mechanisms of action of TCM. Compared with traditional analytical paradigms focused on single molecules or cell types, analyses at the cell niche level are better suited to reveal how TCM interventions achieve therapeutic efficacy through the holistic and systematic regulation of the organism.

      For example, Zhu et al.[180] investigated the therapeutic mechanism of the natural small molecule celastrol in psoriasis using single-cell multi-omics and spatial transcriptomics. By constructing high-resolution spatial expression patterns of genes using spatial mapping approaches, their study identified a fibroblast-driven inflammatory cell niche in psoriatic lesions, characterized by the spatial co-expression of CCL2 and LRP1. Mechanistic analysis revealed that fibroblasts exacerbate disease progression by secreting CCL2 to promote macrophage recruitment, whereas celastrol was found to act directly on this inflammatory niche by binding to the β-chain of LRP1, which in turn inhibited CCL2 production and blocked fibroblast-macrophage crosstalk, thereby ameliorating psoriasis progression. Similarly, Xin et al.[181] employed spatial transcriptomics to reveal the niche-selective regulatory effects of celastrol in the context of hepatic ischemia-reperfusion (I/R) injury. Based on the spatial deconvolution approaches, their results demonstrated that I/R injury predominantly targets the pericentral zone-dependent cell niche, manifested as a significant reduction in hepatocytes and a coordinated enrichment of pro-inflammatory M1 macrophages, plasma cells, and follicular helper T cells. Celastrol preferentially modulated this cell niche by activating HIF1α signaling and restoring microenvironmental homeostasis. This integrative regulation involves enhanced cellular adaptation to hypoxia, suppression of inflammatory pathways, and a shift in macrophage polarization from a pro-inflammatory (M1) to a reparative (M2) phenotype. Collectively, these studies effectively demonstrate the utility of spatial inference strategies in elucidating the integrative mechanisms of action of TCM interventions by reconstructing high-resolution spatial gene expression patterns or cellular compositions.

      Furthermore, Lu et al.[182] investigated the cardioprotective mechanisms of Fufang Danshen Pill (FDP) in myocardial ischemia-reperfusion injury by integrating scRNA-seq and spatial transcriptomics. They comprehensively employed AI-driven cell niche analysis methods, including spatial deconvolution and spatial clustering approaches, to identify and confirm a pathogenic cell niche localized to injured regions from multiple perspectives, providing a complete and thorough analytical workflow for subsequent similar TCM research. Specifically, this pathogenic cell niche was enriched in I/R-injured hearts, and is primarily composed of macrophages with high expression of S100a9. Mechanistic analysis revealed that this cell niche drives mitochondrial dysfunction and sustained inflammatory amplification through the S100a9/TLR4 axis. FDP intervention selectively targeted this cell niche and inhibited the binding of macrophage-derived S100a9 to TLR4 on the surface of cardiomyocytes by reducing the number of macrophages and increasing cardiomyocyte populations. As a result, mitochondrial complexes I and IV activities were restored, reactive oxygen species (ROS) levels were reduced, ATP production was enhanced, and mitochondrial morphology and cristae structure were markedly improved, collectively contributing to alleviating I/R injury.

    • Collectively, cell niche analysis has demonstrated clear utility in elucidating intervention-associated targets, dynamic regulatory processes, and integrative mechanisms of action of TCM. Importantly, by positioning cell niches as the functional multicellular target, this analytical paradigm provides an intuitive and systematic conceptual framework and methodological basis for translating niche-level insights into biomarker discovery, efficacy evaluation, drug combination strategies, and optimization of TCM formulas.

      First, the multi-dimensional characteristics of cell niches, including cellular composition, relative abundance, spatial co-localization patterns, and intercellular interaction networks, are more likely to reflect disease-relevant functional states than single molecules or individual cell types, holding promise as more stable and functionally meaningful biomarkers for patient stratification and therapeutic response assessment. For example, an integrated bulk RNA-seq and scRNA-seq analysis of 299 soft tissue sarcoma samples identified a cell niche composed of activated immune cell states, NOSTRIN/PECAM1/CD36+ normal-like endothelial cells, and malignant cells with activation of the KRAS pathway. This cell niche captured classical 'immune hot' tumor features and robustly stratified patients, with high niche abundance being significantly associated with improved clinical outcomes[183]. Similarly, a large-scale study of 571 patients with advanced metastatic disease prior to receiving immune checkpoint blockade, demonstrated that disease-associated niche abundance outperformed numerous single-molecule and single-cell-state biomarkers in predicting immune checkpoint inhibitor response[61]. Together, these findings highlight the potential of niche-level features as promising functional biomarkers for efficacy evaluation.

      Second, mechanistic insights at the cell niche level provide a rational basis for designing drug combination strategies. Generally, different drugs or natural compounds may preferentially act on distinct cellular components or interaction axes within the same cell niche, resulting in synergistic effects that are difficult to capture at the single-target level. Recently, Xiao et al.[184] elucidated the combinatorial mechanism of ginsenoside protopanaxatriol (PPT) and icariin (ICA) in alleviating chemotherapy-induced myelosuppression. Their results showed that PPT primarily restored interactions between endothelial cells and mesenchymal stem cells within the bone marrow cell niche, whereas ICA repaired the endosteum by reducing osteoblast apoptosis and promoting osteogenesis. This study exemplifies how niche-level insights can guide rational drug combination strategies based on complementary regulatory mechanisms.

      Finally, although studies that optimize TCM formulas based on cell niche analysis remain limited, existing evidence suggests substantial potential. The therapeutic effects of TCM formulas typically arise from coordinated regulation of multiple cell types and interaction networks within specific cell niches. Therefore, systematically delineating the dynamic remodeling of cell niches induced by individual components may facilitate the identification of key functional modules and complementary regulatory axes, thereby providing a mechanism-guided framework for formulation optimization. Such a shift from experience-driven to mechanism-driven design is expected to substantially advance the modernization of TCM.

    • As discussed above, the rapid development of single-cell and spatial omics technologies has substantially advanced cell niche research and provided powerful new analytical perspectives for elucidating the integrative mechanisms underlying the 'multi-component, multi-target, and multi-pathway' action of TCM. However, the systematic application of cell niche analysis in TCM research still faces numerous challenges. These challenges arise not only from inherent technical limitations of single-cell and spatial omics, but also from methodological issues related to robustness and interpretability, which are further amplified by the intrinsic complexity of TCM interventions.

    • At the data level, although various sequencing platforms have been developed to generate single-cell and spatial omics data, substantial trade-offs remain among spatial resolution, molecular coverage, tissue compatibility, cost, and infrastructure requirements[31]. A universally applicable approach enabling spatially resolved single-cell whole molecular profiles across diverse tissues has not yet been fully realized. For example, the widely used commercial SRT platform 10X Visium captures transcripts of multiple cells within each capture spot, resulting in inherently mixed expression signals that limit the precise delineation of fine-grained cellular composition and functional heterogeneity of cell niches. In contrast, imaging-based SRT technologies provide superior spatial resolution but are constrained by limited targeted gene panels[66], hindering comprehensive molecular characterization of complex cell niches. These quality limitations of single-cell and spatial omics data pose a particular challenge for resolving subtle but biologically meaningful variations in molecular expression and intercellular interactions within cell niches.

      Recent advances, including high-resolution sequencing platforms such as 10X Visium HD[70] and Stereo-seq[13], partially alleviate this trade-off by enabling whole-transcriptome profiling at near single-cell spatial resolution. Nevertheless, their high cost, technical complexity, and reliance on specialized equipment still limit their widespread accessibility in pharmacological and translational studies[185,186]. Computation methods for integrating single-cell and spatial omics data, such as iStar[187], Spotiphy[188], and PanoSpace[189], offer additional opportunities to enhance the quality of the original sequencing data, yet their effectiveness in improving the accuracy and robustness of cell niche inference remains to be systematically evaluated.

      On the other hand, single-cell and spatial omics data commonly exhibit high sparsity, dropout effects, and batch effects[190,191]. These data noises further constrain the accurate inference of cell niches. In TCM research, heterogeneous sample sources, and experimental conditions further exacerbate these issues, potentially obscuring the relatively subtle cellular states and transitions induced by TCM's modulatory effects, and reducing reproducibility across studies[3].

      Furthermore, compared with fields such as cancer, immunology, and developmental biology, there is still a notable lack of systematically constructed and publicly available single-cell and spatial omics databases related to TCM. Existing data largely originates from isolated individual studies with limited sample sizes, heterogeneous disease models, intervention strategies, and annotation standards, which substantially hampers cross-study integration and methodological benchmarking. This limitation is particularly pronounced for spatial omics data, as high-quality spatial datasets targeting representative TCM syndromes or compound interventions remain scarce, constituting a major data-level bottleneck for the application of cell niche analysis in TCM research.

    • At the methodology level, existing cell niche analysis methods still face limitations in terms of stability, reproducibility, generalizability, and biological interpretability. As summarized above, spatial clustering methods, which directly model cell niches by utilizing the molecular profiles as well as the spatial information of spatial omics data jointly, have emerged as a primary strategy for niche identification. However, most of them rely on deep learning frameworks and exhibit high sensitivity to hyperparameter selection, neighborhood definition, and random initialization[36,147], leading to substantial variability in inferred cell niches. Most importantly, the robustness and reproducibility of these models across independent datasets, spatial platforms, and tissue contexts have not been systematically evaluated. Many existing methods demonstrate strong performance preferences toward specific datasets but fail to generalize or reproduce consistent results across datasets or platforms[170,171]. These limitations directly compromise the reliability of inferred cell niches as the functional multi-cellular targets and restrict their translational potential in biomarker discovery, drug mechanism elucidation, therapeutic efficacy evaluation, formulation optimization, and other pharmacological applications.

      In parallel, many deep learning-based methods, particularly those based on GNN architectures, offer greater flexibility for modeling complex spatial dependencies but typically involve numerous hyperparameters. In high-dimensional, noisy, and often small-sample spatial omics datasets, such models are more prone to overfitting, which further limits their generalizability to real tissue and disease contexts. Moreover, these methods are often highly dependent on computational resources. Tasks such as large-scale graph construction, complex network training, and hyperparameter optimization frequently require high-performance GPUs, substantial memory, and prolonged computation times, which may constrain their accessibility in ordinary laboratory or multi-center research settings. Consequently, future studies should balance performance optimization with systematic considerations of computational resource requirements, cross-dataset robustness assessment, and the establishment of standardized benchmarking frameworks.

      In addition, the 'black box' nature of deep learning models, coupled with unsupervised clustering-driven cell niche discovery, often struggles in mechanism interpretation. While these methods may achieve excellent performance in spatial pattern recognition on the simulated data, they frequently lack the ability to trace niche formation back to specific cell subpopulations, signaling pathways, or intercellular interactions. From an AI modeling perspective, most existing approaches are primarily optimized for data-driven spatial pattern fitting rather than explicitly modeling the biological processes underlying microenvironment formation, which fundamentally constrains their interpretability. This limitation is particularly problematic in TCM research, where the modern biological interpretation of the theoretical basis of TCM such as holistic regulation, systemic intervention, and 'Gui Jing' (meridian tropism)[192] urgently requires an interpretable multicellular coordination framework. Therefore, establishing quantifiable and experimentally verifiable links between the results of cell niche analysis and the theoretical basis of TCM remains a major methodological bottleneck.

    • At the application level, the complexity of TCM intervention imposes additional demands on cell niche analysis. Unlike gene perturbations or single-target drugs, TCM typically exerts effects through the synergistic regulation of multiple targets and pathways. Such effects are often gradual, distributed, and strongly time-dependent[193], rendering niche-level changes subtle and difficult to capture using static, single-time-point analysis.

      Furthermore, many TCM studies involve compound formulas, whose administration and dosing regimens are often dynamically adjusted based on disease progression stages and individual variations[194]. These characteristics often result in limited availability of longitudinal clinical samples and heterogeneous data quality, constraining systematic characterization of the dynamic remodeling process of cell niches. The current analytical paradigm, which primarily relies on cross-sectional comparisons, struggles to fully delineate the continuous trajectory of the cell niche under TCM intervention. This limitation also hinders a deeper understanding of the systemic mechanism of action of TCM.

    • To facilitate the broader application of cell niche analysis in TCM research, future efforts are expected to prioritize critical bottlenecks that currently limit its translational potential in biomarker discovery, drug mechanism elucidation, therapeutic efficacy evaluation, and other pharmacological applications. Overall, these bottlenecks can be summarized into three key aspects: (1) improving the biological interpretability and experimental verifiability of cell niches as the functional multicellular targets; (2) enhancing the robustness, reproducibility, and generalizability of AI-driven cell niche analysis methods across datasets, platforms, and tissue contexts; and (3) establishing high-quality, publicly available single-cell and spatial omics data resources and analysis frameworks tailored for TCM research.

      First, improving the interpretability of cell niches as the functional multicellular targets is a prerequisite for their meaningful application in pharmacology and TCM research. Currently, most cell niche analyses rely heavily on unsupervised spatial clustering, where the definition of cell niches remains largely at the level of data-driven spatial patterns, failing to directly link to specific cell subpopulations, key signaling pathways, or intercellular interactions. As a result, inferred cell niches are often difficult to validate experimentally and interpret mechanistically, limiting the utility of cell niches as actionable pharmacological targets. Therefore, future studies are supposed to integrate multi-modal information from transcriptomics, proteomics, metabolomics, and epigenomics, to comprehensively characterize the molecular basis of multicellular coordination within cell niches from complementary functional dimensions[195,196]. Meanwhile, methodological development in cell niche analysis is expected to move beyond descriptive paradigms dominated by unsupervised clustering toward frameworks that emphasize mechanistic interpretation and causal inference. Explicitly incorporating cell-cell communication networks[197], pathway activity information, and established biological priors into cell niche modeling, together with perturbation-based experimental designs[198] and causal inference frameworks[191] will facilitate the dissection and validation of key regulatory processes underlying niche formation, evolution, and remodeling. Moreover, cross-scale integration of molecular, cellular, and tissue-level data to construct biologically realistic virtual cell or virtual niche models[199] may help bridge the gap between correlative observations and causal mechanisms, thereby providing more robust and experimentally testable insights into TCM-induced niche remodeling.

      Second, improving the stability, reproducibility, and generalizability across datasets and platforms is a core technical bottleneck currently hindering the reliable application of AI-driven cell niche analysis methods. Currently, variations in single-cell and spatial omics platforms, sample preparation protocols, and analysis pipelines, coupled with the high sensitivity of deep learning models to data noise, hyperparameter selection, and random initialization, often lead to poor reproducibility of cell niche identification across datasets. These limitations directly compromise the reliability of inferred cell niches. Against this backdrop, future AI-driven cell niche analysis methods should prioritize stability and consistency across diverse data contexts rather than focusing solely on performance optimization. Incorporating biological constraints into model design, reducing dependence on sensitive hyperparameters or random factors, and establishing standardized benchmarking frameworks across platforms and tissues will be essential for providing a more reliable methodological foundation for the broader application of cell niche analysis in pharmacological and TCM research.

      Finally, establishing high-quality, publicly available single-cell and spatial omics data resources and analysis frameworks tailored for TCM research will scale up cell niche analysis and enhance its clinical relevance and translational potential. Compared with fields such as cancer, immunology, and developmental biology, TCM-related single-cell and spatial omics databases remain fragmented, with limited standardization in sample sizes, disease models, intervention strategies, and annotation. Future studies may focus on representative TCM syndrome models[200], such as pulmonary Qi-deficiency syndrome, Heart-Blood-Stasis syndrome, spleen-deficient syndrome, and establish large-scale, high-quality single-cell and spatial omics atlases through multi-center collaborations. By implementing standardized and reproducible niche analysis pipelines, it may become possible to quantitatively link cell niche features with experimental models, therapeutic responses, and TCM syndromes. Such efforts could provide a novel technical pathway for the objective and quantifiable characterization of TCM syndromes, ultimately supporting the development of personalized TCM[3].

      • Not applicable.

      • The authors confirm their contributions to the review as follows: conceptualization: Fan X, Shao X; manuscript writing: Qian J, Bao H, Yang H, Zhang J. All authors reviewed the results and approved the final version of the manuscript.

      • Data sharing is not applicable to this review as no datasets were generated or analyzed.

      • This work is supported by New Generation Artificial Intelligence-National Science and Technology Major Project (2025ZD0122805), and the National Natural Science Foundation of China (U23A20513).

      • The authors declare that there is no conflict of interest.

      • Copyright: © 2026 by the author(s). Published by Maximum Academic Press on behalf of China Pharmaceutical University. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
    Figure (3)  Table (4) References (200)
  • About this article
    Cite this article
    Qian J, Bao H, Yang H, Zhang J, Shao X, et al. 2026. AI-driven deciphering of cell niches with single-cell and spatial omics: a new perspective for traditional Chinese medicine research. Targetome 2(2): e009 doi: 10.48130/targetome-0026-0009
    Qian J, Bao H, Yang H, Zhang J, Shao X, et al. 2026. AI-driven deciphering of cell niches with single-cell and spatial omics: a new perspective for traditional Chinese medicine research. Targetome 2(2): e009 doi: 10.48130/targetome-0026-0009

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return