Data-driven design of electromagnetic functional materials: a statistical perspective

Han Zhang; Jincan Che; Han Zhang; Jincan Che

doi:10.48130/stati-0025-0006

2025 Volume 2

Article Contents

Next Previous

REVIEW Open Access

Data-driven design of electromagnetic functional materials: a statistical perspective

Han Zhang^1,2,
Jincan Che^1,3,4, ,

1.
Division of Health Statistics, School of Public Health, Hebei Medical University, Shijiazhuang 050017, PR China
2.
School of Materials Science and Engineering, Beihang University, Beijing 100191, PR China
3.
Hebei Key Laboratory of Environment and Human Health, Shijiazhuang 050017, PR China
4.
Beijing Key Laboratory of Topological Statistics and Applications for Complex Systems, Beijing Institute of Mathematical Sciences and Applications, Beijing 101408, PR China

More Information

Corresponding author: chejincan@bimsa.cn

Received: 03 July 2025
Revised: 13 September 2025
Accepted: 13 October 2025
Published online: 31 October 2025
Statistics Innovation 2, Article number: e006 (2025) | Cite this article

Abstract

The growing demand for high-performance electromagnetic functional materials in radar stealth, 5G communications, and flexible electronics highlights the limitations of traditional empirical methods in addressing multi-physics coupling, high-dimensional optimization, and nonlinear responses. Data-driven approaches based on statistical methodologies provide effective solutions by uncovering complex relationships, quantifying uncertainties, and constructing interpretable models. This review systematically summarizes recent advances in statistical methods for the design and optimization of electromagnetic materials. Key techniques such as supervised learning, Bayesian inference, kernel methods, and deep learning are introduced, with innovative applications in electromagnetic performance prediction, mechanism modeling, and parameter inversion with rapid simulation. Representative case studies demonstrate their effectiveness. Challenges, including limited model generalizability, insufficient integration of physical mechanisms, and difficulties in processing high-dimensional data, are discussed. Future directions may focus on physics-informed statistical modeling, standardized multiscale feature extraction, and the development of intelligent design paradigms. This review aims to provide theoretical guidance and systematic reference for next-generation electromagnetic functional materials.
- Statistical methodologies,
- Electromagnetic functional materials,
- Machine learning,
- Performance prediction
Rights and permissions
Copyright: © 2025 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	Zhang XC, Zhang M, Wang MQ, Chang L, Li L, et al. 2024. Metal single-atoms toward electromagnetic wave-absorbing materials: insights and perspective. Advanced Functional Materials 34:2405972 doi: 10.1002/adfm.202405972 CrossRef Google Scholar
[2]	Tang Z, Xu L, Xie C, Guo L, Zhang L, et al. 2023. Synthesis of CuCo₂S₄@Expanded Graphite with crystal/amorphous heterointerface and defects for electromagnetic wave absorption. Nature Communications 14:5951 doi: 10.1038/s41467-023-41697-6 CrossRef Google Scholar
[3]	Tao J, Xu L, Pei C, Gu Y, He Y, et al. 2023. Catfish effect induced by anion sequential doping for microwave absorption. Advanced Functional Materials 33:2211996 doi: 10.1002/adfm.202211996 CrossRef Google Scholar
[4]	Cao MS, Shu JC, Wen B, Wang XX, Cao WQ. 2021. Genetic dielectric genes inside 2D carbon-based materials with tunable electromagnetic function at elevated temperature. Small Structures 2:2100104 doi: 10.1002/sstr.202100104 CrossRef Google Scholar
[5]	Yuan M, Li B, Du Y, Liu J, Zhou X, et al. 2025. Programmable electromagnetic wave absorption via tailored metal single atom-support interactions. Advanced Materials 37:2417580 doi: 10.1002/adma.202417580 CrossRef Google Scholar
[6]	Kuznetsova V, Coogan Á, Botov D, Gromova Y, Ushakova EV, et al. 2024. Expanding the horizons of machine learning in nanomaterials to chiral nanostructures. Advanced Materials 36:2308912 doi: 10.1002/adma.202308912 CrossRef Google Scholar
[7]	Nguyen TH, Vuong HT, Shiau J, Nguyen-Thoi T, Nguyen DH, et al. 2024. Optimizing flexural strength of RC beams with recycled aggregates and CFRP using machine learning models. Scientific Reports 14:28621 doi: 10.1038/s41598-024-79287-1 CrossRef Google Scholar
[8]	Zong Y, Nian Y, Zhang C, Tang X, Wang L, et al. 2025. Hybrid Grid Search and Bayesian optimization-based random forest regression for predicting material compression pressure in manufacturing processes. Engineering Applications of Artificial Intelligence 141:109580 doi: 10.1016/j.engappai.2024.109580 CrossRef Google Scholar
[9]	Zhu C, Bamidele EA, Shen X, Zhu G, Li B. 2024. Machine learning aided design and optimization of thermal metamaterials. Chemical Reviews 124:4258−331 doi: 10.1021/acs.chemrev.3c00708 CrossRef Google Scholar
[10]	Li C, Bao L, Ji Y, Tian Z, Cui M, et al. 2024. Combining machine learning and metal–organic frameworks research: novel modeling, performance prediction, and materials discovery. Coordination Chemistry Reviews 514:215888 doi: 10.1016/j.ccr.2024.215888 CrossRef Google Scholar
[11]	Ding Z, Su W, Luo Y, Ye L, Wu H, et al. 2023. Design of an ultra-broadband terahertz absorber based on a patterned graphene metasurface with machine learning. Journal of Materials Chemistry C 11:5625−33 doi: 10.1039/D3TC00102D CrossRef Google Scholar
[12]	Gao R, Shang H, Zhou Q, Tan BF, Wei XS, et al. 2025. Machine learning-guided conductivity prediction in 2D organic metal chalcogenides for accelerated electromagnetic wave absorber design. ACS Applied Materials & Interfaces 17:38379−88 doi: 10.1021/acsami.5c07554 CrossRef Google Scholar
[13]	Xu C, Dong H, Yan Z, Wang L, Ning M, et al. 2025. Micromagnetic and quantitative prediction of hardness and impact energy in martensitic stainless steels using mutual information parameter screening and random forest modeling methods. Materials 18:1685 doi: 10.3390/ma18071685 CrossRef Google Scholar
[14]	Lai WWL, Chang RKW, Völker C, Cheung BWY. 2021. GPR wave dispersion for material characterization. Construction and Building Materials 282:122597 doi: 10.1016/j.conbuildmat.2021.122597 CrossRef Google Scholar
[15]	Kim EA, Park JH, Han SH, Lim YY, Kong KJ, et al. 2017. Exploratory factor analysis of fluoride removal efficiency associated with the chemical properties of geomaterials. Journal of Hazardous Materials 334:178−84 doi: 10.1016/j.jhazmat.2017.03.059 CrossRef Google Scholar
[16]	Zhai G, Chen J, Wang S, Li K, Zhang L. 2015. Material identification of loose particles in sealed electronic devices using PCA and SVM. Neurocomputing 148:222−28 doi: 10.1016/j.neucom.2013.10.043 CrossRef Google Scholar
[17]	Rao ARM, Lakshmi K, Kumar SK. 2015. Detection of delamination in laminated composites with limited measurements combining PCA and dynamic QPSO. Advances in Engineering Software 86:85−106 doi: 10.1016/j.advengsoft.2015.04.005 CrossRef Google Scholar
[18]	Li X, Wang S, Hou Q, Dong F. 2024. A stepwise clustering method of rock discontinuities dominated by multivariate parameters based on t-SNE. Rock and Soil Mechanics 45:1540−50 doi: 10.16285/j.rsm.2023.0897 CrossRef Google Scholar
[19]	Emery JM, Grigoriu MD, Field RV Jr. 2016. Bayesian methods for characterizing unknown parameters of material models. Applied Mathematical Modelling 40:6395−411 doi: 10.1016/j.apm.2016.01.046 CrossRef Google Scholar
[20]	Bernstein J, Schmidt K, Rivera D, Barton N, Florando J, et al. 2019. A comparison of material flow strength models using Bayesian cross-validation. Computational Materials Science 169:109098 doi: 10.1016/j.commatsci.2019.109098 CrossRef Google Scholar
[21]	Wang K, Dowling AW. 2022. Bayesian optimization for chemical products and functional materials. Current Opinion in Chemical Engineering 36:100728 doi: 10.1016/j.coche.2021.100728 CrossRef Google Scholar
[22]	Tian Y, Li T, Pang J, Zhou Y, Xue D, et al. 2025. Materials design with target-oriented Bayesian optimization. NPJ Computational Materials 11:209 doi: 10.1038/s41524-025-01704-4 CrossRef Google Scholar
[23]	Pfau D, Jung A. 2024. Engineering trustworthy AI: a developer guide for empirical risk minimization. IEEE Transactions on Artificial Intelligence:Early Access doi: 10.1109/TAI.2025.3617936 CrossRef Google Scholar
[24]	Kang EH, Yoganarasimhan H, Jain L. 2025. An empirical risk minimization approach for offline inverse RL and dynamic discrete choice model. arXiv:2502.14131 doi: 10.48550/arXiv.2502.14131 CrossRef Google Scholar
[25]	Treder MS, Shock JP, Stein DJ, du Plessis S, Seedat S, et al. 2021. Correlation constraints for regression models: controlling bias in brain age prediction. Frontiers in Psychiatry 12:615754 doi: 10.3389/fpsyt.2021.615754 CrossRef Google Scholar
[26]	Jiang Y, He Y, Zhang H. 2016. Variable selection with prior information for generalized linear models via the prior LASSO method. Journal of the American Statistical Association 111:355−76 doi: 10.1080/01621459.2015.1008363 CrossRef Google Scholar
[27]	Teodorescu V, Obreja Brașoveanu L. 2025. Assessing the validity of k-fold cross-validation for model selection: evidence from bankruptcy prediction using random forest and XGBoost. Computation 13:127 doi: 10.3390/computation13050127 CrossRef Google Scholar
[28]	Mohammadagha M. 2025. Hyperparameter optimization strategies for tree-based machine learning models prediction: a comparative study of AdaBoost, decision trees, and random forest. Open Science Framework:xbkr5_v1 doi: 10.31219/osf.io/xbkr5_v1 CrossRef Google Scholar
[29]	Deringer VL, Bartók AP, Bernstein N, Wilkins DM, Ceriotti M, et al. 2021. Gaussian process regression for materials and molecules. Chemical Reviews 121:10073−141 doi: 10.1021/acs.chemrev.1c00022 CrossRef Google Scholar
[30]	Rasmussen CE, Williams CKI. 2005. Gaussian processes for machine learning. US: MIT Press. 266 pp
[31]	Wilson AG, Nickisch H. 2015. Kernel interpolation for scalable structured Gaussian processes (KISS-GP). arXiv:1503.01057 doi: 10.48550/arXiv.1503.01057 CrossRef Google Scholar
[32]	Paun I, Husmeier D, Torney CJ. 2023. Stochastic variational inference for scalable non-stationary Gaussian process regression. Statistics and Computing 33:44 doi: 10.1007/s11222-023-10210-w CrossRef Google Scholar
[33]	Rossi S, Heinonen M, Bonilla EV, Shen Z, Filippone M. 2021. Sparse Gaussian processes revisited: bayesian approaches to inducing-variable approximations. arXiv:2003.03080 doi: 10.48550/arXiv.2003.03080 CrossRef Google Scholar
[34]	Yadav M, Sheldon DR, Musco C. 2022. Kernel interpolation with sparse grids. Advances in Neural Information Processing Systems 35:22883−94 Google Scholar
[35]	Rahaman R. 2021. Uncertainty quantification and deep ensembles. Advances in Neural Information Processing Systems 34:20063−75 Google Scholar
[36]	Zhang J, Kailkhura B, Han TYJ. 2021. Leveraging uncertainty from deep learning for trustworthy material discovery workflows. ACS Omega 6:12711−21 doi: 10.1021/acsomega.1c00975 CrossRef Google Scholar
[37]	Li X, Su M, Zhu Y, Ma S, Liu S, et al. 2025. Evidential interpretation approach for deep neural networks in high-frequency electromagnetic wave processing. Electronics 14:3277 doi: 10.3390/electronics14163277 CrossRef Google Scholar
[38]	Varivoda D, Dong R, Omee SS, Hu J. 2023. Materials property prediction with uncertainty quantification: a benchmark study. Applied Physics Reviews 10:021409 doi: 10.1063/5.0133528 CrossRef Google Scholar
[39]	Novick A, Cai D, Nguyen Q, Garnett R, Adams R, et al. 2024. Probabilistic prediction of material stability: integrating convex hulls into active learning. Materials Horizons 11:5381−93 doi: 10.1039/D4MH00432A CrossRef Google Scholar
[40]	Mamun O, Taufique MFN, Wenzlick M, Hawk J, Devanathan R. 2022. Uncertainty quantification for Bayesian active learning in rupture life prediction of ferritic steels. Scientific Reports 12:2083 doi: 10.1038/s41598-022-06051-8 CrossRef Google Scholar
[41]	Koizumi A, Deffrennes G, Terayama K, Tamura R. 2024. Performance of uncertainty-based active learning for efficient approximation of black-box functions in materials science. Scientific Reports 14:27019 doi: 10.1038/s41598-024-76800-4 CrossRef Google Scholar
[42]	Sain SR. 1996. The nature of statistical learning theory. Technometrics 38:409 doi: 10.1080/00401706.1996.10484565 CrossRef Google Scholar
[43]	Cristianini N, Scholkopf B. 2002. Support vector machines and kernel methods: the new generation of learning machines. AI Magazine 23:31−31 Google Scholar
[44]	Du J, Li T, Xu Z, Tang J, Qi Q, et al. 2023. Structure–activity relationship in microstructure design for electromagnetic wave absorption applications. Small Structures 4:2300152 doi: 10.1002/sstr.202300152 CrossRef Google Scholar
[45]	Jolliffe IT, Cadima J. 2016. Principal component analysis: a review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 374:20150202 doi: 10.1098/rsta.2015.0202 CrossRef Google Scholar
[46]	Maaten Lvd, Hinton G. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9:2579−605 Google Scholar
[47]	Sarmina BG, Sun GH, Dong SH. 2023. Principal component analysis and t-distributed stochastic neighbor embedding analysis in the study of quantum approximate optimization algorithm entangled and non-entangled mixing operators. Entropy 25:1499 doi: 10.3390/e25111499 CrossRef Google Scholar
[48]	Arora S, Hu W, Kothari PK. 2018. An analysis of the t-sne algorithm for data visualization. Proceedings of the 31^st Conference On Learning Theory 75:1455−62 Google Scholar
[49]	Nascimento GM, Ogoshi E, Fazzio A, Acosta CM, Dalpian GM. 2022. High-throughput inverse design and Bayesian optimization of functionalities: spin splitting in two-dimensional compounds. Scientific Data 9:195 doi: 10.1038/s41597-022-01292-8 CrossRef Google Scholar
[50]	Frazier PI. 2018. A tutorial on Bayesian optimization. arXiv:11807.02811 doi: 10.48550/arXiv.1807.02811 CrossRef Google Scholar
[51]	Zuo Y, Qin M, Chen C, Ye W, Li X, et al. 2021. Accelerating materials discovery with Bayesian optimization and graph deep learning. Materials Today 51:126−35 doi: 10.1016/j.mattod.2021.08.012 CrossRef Google Scholar
[52]	Grandis H, Menvielle M, Roussignol M. 1999. Bayesian inversion with Markov chains—I. The magnetotelluric one-dimensional case. Geophysical Journal International 138:757−68 doi: 10.1046/j.1365-246x.1999.00904.x CrossRef Google Scholar
[53]	Malinverno A. 2002. Parsimonious Bayesian Markov chain Monte Carlo inversion in a nonlinear geophysical problem. Geophysical Journal International 151:675−88 doi: 10.1046/j.1365-246X.2002.01847.x CrossRef Google Scholar
[54]	De Iaco S. 2022. New spatio-temporal complex covariance functions for vectorial data through positive mixtures. Stochastic Environmental Research and Risk Assessment 36:2769−87 doi: 10.1007/s00477-022-02171-9 CrossRef Google Scholar
[55]	Zhang R, Yuan Y, Wang X, Sun X, Wang S, et al. 2025. Machine learning-assisted rapid electromagnetic design of flexible graphene-based absorptive composites. Chemical Engineering Journal 511:161634 doi: 10.1016/j.cej.2025.161634 CrossRef Google Scholar
[56]	Wang Q, Wang G, Lu X, Su L, Wang H. 2024. Prediction of broadband and highly-efficient electromagnetic wave-absorbing SiC@SiO₂ nanowire aerogel by genetic algorithm. ACS Applied Materials & Interfaces 16:57972−80 doi: 10.1021/acsami.4c13946 CrossRef Google Scholar
[57]	Bora PJ, Mahanta B, Raghavan N. 2024. Revolutionizing electromagnetic materials: machine learning enabled optimization of polymer nanocomposites for enhanced performance. Advanced Engineering Materials 26:2301518 doi: 10.1002/adem.202301518 CrossRef Google Scholar
[58]	Liu P, Cui Z, Sun Y, Yuan W, Qu L, et al. 2024. Research on high-entropy spinel microwave absorption materials: exploration of machine learning and experimental integration. Ceramics International 50:49906−14 doi: 10.1016/j.ceramint.2024.09.335 CrossRef Google Scholar
[59]	Zhou H, Li X, Xi Z, Li M, Zhang J, et al. 2025. Machine learning-driven interface engineering for enhanced microwave absorption in MXene films. Materials Today Physics 51:101640 doi: 10.1016/j.mtphys.2024.101640 CrossRef Google Scholar
[60]	Waseer WI, Baqir MA, Saqlain M, Mughal MJ, Khan S. 2025. Predictive modeling of MXene-based solar absorbers using a deep neural network. Journal of the Optical Society of America B 42:763−72 doi: 10.1364/JOSAB.550317 CrossRef Google Scholar
[61]	Lu S, Zhou Q, Guo Y, Zhang Y, Wu Y, et al. 2020. Coupling a crystal graph multilayer descriptor to active learning for rapid discovery of 2D ferromagnetic semiconductors/half-metals/metals. Advanced Materials 32:2002658 doi: 10.1002/adma.202002658 CrossRef Google Scholar
[62]	Li X, Qiu J, Cui H, Chen X, Yu J, et al. 2024. Machine learning accelerated discovery of functional MXenes with giant piezoelectric coefficients. ACS Applied Materials & Interfaces 16:12731−43 doi: 10.1021/acsami.3c14610 CrossRef Google Scholar
[63]	Cao M, Wang X, Cao W, Fang X, Wen B, et al. 2018. Thermally driven transport and relaxation switching self-powered electromagnetic energy conversion. Small 14:1800987 doi: 10.1002/smll.201800987 CrossRef Google Scholar
[64]	Cao MS, Wang XX, Zhang M, Cao WQ, Fang XY, et al. 2020. Variable-temperature electron transport and dipole polarization turning flexible multifunctional microsensor beyond electrical and optical energy. Advanced Materials 32:1907156 doi: 10.1002/adma.201907156 CrossRef Google Scholar
[65]	Wang H, Meng F, Huang F, Jing C, Li Y, et al. 2019. Interface modulating CNTs@PANi hybrids by controlled unzipping of the walls of CNTs to achieve tunable high-performance microwave absorption. ACS Applied Materials & Interfaces 11:12142−53 doi: 10.1021/acsami.9b01122 CrossRef Google Scholar
[66]	Shi M, Feng CP, Tu YL, Shi GS, He PY, et al. 2023. Visualization of deep convolutional neural networks to investigate porous nanocomposites for electromagnetic interference shielding. ACS Applied Materials & Interfaces 15:22602−15 doi: 10.1021/acsami.3c04557 CrossRef Google Scholar
[67]	Sun W, Li LS, Yin HC, Chen W. 2024. Study of permeability and permittivity of α-Fe₂O₃ using computer simulation method. Computational Materials Science 233:112756 doi: 10.1016/j.commatsci.2023.112756 CrossRef Google Scholar
[68]	Liu W, McLeod E. 2023. Fast and accurate electromagnetic field calculation for substrate-supported metasurfaces using the discrete dipole approximation. Nanophotonics 12:4157−73 doi: 10.1515/nanoph-2023-0423 CrossRef Google Scholar
[69]	Feng N, Wang H, Zhang Y, Huang Z, Elsherbeni AZ. 2024. Alternative implementation of EM propagation for 3-D layered lossy media by SMM method. IEEE Transactions on Antennas and Propagation 72:6599−613 doi: 10.1109/TAP.2024.3416053 CrossRef Google Scholar
[70]	Abouelyazied A, Dupré L. 2015. A unified electromagnetic inverse problem algorithm for the identification of the magnetic material characteristics of electromagnetic devices including uncertainty analysis: a review and application. IEEE Transactions on Magnetics 51:7300210 doi: 10.1109/TMAG.2014.2332978 CrossRef Google Scholar
[71]	Xu J, Xu P, Yang Z, Liu F, Xu L, et al. 2024. Freeform metasurface design with a conditional generative adversarial network. Applied Physics A 130:530 doi: 10.1007/s00339-024-07694-2 CrossRef Google Scholar
[72]	Zhu R, Wang J, Fu X, Liu X, Liu T, et al. 2022. Deep-learning-empowered holographic metasurface with simultaneously customized phase and amplitude. ACS Applied Materials & Interfaces 14:48303−10 doi: 10.1021/acsami.2c15362 CrossRef Google Scholar
[73]	Cai S, Mao Z, Wang Z, Yin M, Karniadakis GE. 2021. Physics-informed neural networks (PINNs) for fluid mechanics: a review. Acta Mechanica Sinica 37:1727−38 doi: 10.1007/s10409-021-01148-1 CrossRef Google Scholar
[74]	Melching D, Paysan F, Strohmann T, Breitbarth E. 2024. An iterative crack tip correction algorithm discovered by physical deep symbolic regression. International Journal of Fatigue 187:108432 doi: 10.1016/j.ijfatigue.2024.108432 CrossRef Google Scholar
[75]	Liu L, Liu S, Yang Y, Guo X, Sun J. 2024. A generalized grey model with symbolic regression algorithm and its application in predicting aircraft remaining useful life. Engineering Applications of Artificial Intelligence 136:108986 doi: 10.1016/j.engappai.2024.108986 CrossRef Google Scholar

About this article

Cite this article

Zhang H, Che J. 2025. Data-driven design of electromagnetic functional materials: a statistical perspective. Statistics Innovation 2: e006 doi: 10.48130/stati-0025-0006

Zhang H, Che J. 2025. Data-driven design of electromagnetic functional materials: a statistical perspective. Statistics Innovation 2: e006 doi: 10.48130/stati-0025-0006

Figures(6)

Download PDF

Article Metrics

Article views(408) PDF downloads(263)

Other Articles By Authors

on this site
- Han Zhang
- Jincan Che
on Google Scholar
- Han Zhang
- Jincan Che

HTML

Introduction

The rapid development of radar stealth, 5G communications, electromagnetic interference shielding, and flexible electronics has significantly elevated the importance of electromagnetic functional materials in critical domains including information technology, energy, defense, and intelligent manufacturing^[1,2]. These advanced materials must simultaneously demonstrate optimal dielectric properties, magnetic permeability, impedance matching characteristics, and broadband absorption performance while maintaining stable responses under complex multi-physics conditions involving temperature fluctuations, frequency variations, and electric field interactions^[3,4]. Nevertheless, substantial challenges persist due to intricate material architectures, multifactorial performance dependencies, protracted experimental cycles, and extensive parameter spaces^[5]. These complexities have rendered conventional empirical approaches and single-variable analysis methodologies increasingly inadequate for addressing contemporary engineering requirements. Consequently, the development of systematic, precise, and efficient theoretical frameworks for predictive design, mechanistic understanding, and performance optimization has emerged as a paramount research objective in this field.

In recent years, statistical methodologies—have also been referred to as data-driven design approaches^[6]. In the context of this review, statistical methodologies are defined as an overarching framework encompassing classical statistical tools (e.g., regression analysis, variance testing), statistical modeling paradigms (e.g., Bayesian inference, nonparametric kernel methods), and modern machine learning and deep learning algorithms. Machine learning is here regarded as a subset of statistical methodologies, with algorithms such as random forests^[7,8], support vector machines, Gaussian process regression^[9], and gradient boosting trees^[10] widely applied to materials research. More recently, deep learning architectures including convolutional neural networks (CNN), residual networks (ResNet), and generative adversarial networks (GAN) have been introduced for tasks such as property prediction, structure optimization, and rapid inverse design of metamaterials, thereby extending the predictive and design capabilities of data-driven frameworks. These statistical models now enable comprehensive research capabilities spanning material performance prediction, structural optimization, mechanism identification, parameter inversion, and efficient simulation. For electromagnetic functional materials specifically, statistical methodologies have not only substantially enhanced the efficiency of material design and performance modulation but have also created new pathways for deciphering complex physical mechanisms and establishing structure–performance–function relationships^[11,12].

This review is structured around statistical methodologies to systematically survey their applications in the design and performance optimization of electromagnetic functional materials. We begin with an in-depth examination of innovative implementations of statistical methodologies across three core research themes: electromagnetic performance prediction, Electromagnetic Mechanism Modeling, and Parameter Inversion & Rapid Simulation. Subsequently, representative case studies within each thematic direction are systematically compiled, demonstrating the pivotal role of statistics-driven approaches in addressing complex electromagnetic challenges. The review also identifies persisting challenges associated with statistical methodologies, including limitations in model generalizability, cross-system adaptability, and handling of high-dimensional multivariate problems. Through this synthesized overview, we aim it to serve as a systematic reference and provide guidance for performance enhancement, mechanistic interpretation, and inverse design in electromagnetic functional materials.

Statistical theories in electromagnetic materials research

Applications of statistical theories in electromagnetic functional materials research

Challenges in statistical modeling of electromagnetic functional materials

Despite the demonstrated strength of statistical methods in modeling and performance prediction for electromagnetic functional materials, significant challenges and limitations remain. First, model generalizability is often limited, particularly when applied across different material systems or frequency ranges. Many studies rely on models trained under specific experimental conditions without fully capturing the underlying physical principles of the materials, resulting in markedly diminished extrapolation capabilities for novel structures or broader frequency domains. For example, in the design of ultra-broadband terahertz absorbers based on patterned graphene metasurfaces, machine learning models achieved excellent predictive performance within the original metasurface structures and frequency bands; however, their adaptability to other configurations was notably inadequate^[11]. Similarly, another work employing random forest algorithms for rapid prediction of absorption performance in flexible graphene composites demonstrated strong results on the original dataset but relied heavily on fixed experimental parameters, limiting extension to novel composite systems^[55]. These cases highlight a common issue: complex models trained on small samples with high-dimensional features tend to learn 'data representations' rather than 'physical mechanisms', potentially leading to 'high predictive accuracy but poor physical interpretability'.

The field lacks standardized modeling protocols and unified frameworks for multiscale data integration. Divergent practices in feature selection, training–testing splits, and evaluation metrics impede comparability and critical reproducibility of results. For instance, a study integrating crystal graph neural networks with active learning for screening 2D magnetic materials incorporated multi-source data such as structural graphs and electronic density of states; however, the input space employed differs substantially from that used in discrete dipole approximation (DDA) or finite-difference time-domain (FDTD) based structural response simulations, complicating data granularity alignment and cross-method interoperability^[61]. Furthermore, approaches such as spatial mapping methods (SMM) and fast electromagnetic propagation simulations face challenges in synergistically coupling multiscale structural information with modeling variables^[70].

A more pressing issue lies in the limited incorporation of microscopic structural variables in most models. While some studies have attempted to enhance interpretability by integrating factors such as interfacial charge redistribution or defect-induced dipoles—for instance, explaining dielectric loss mechanisms via single-atom configurations and localized charge density variables^[5], or using Cole–Cole analysis to identify multiple Debye relaxation processes and quantify interfacial polarization contributions in hybrid materials^[65]—the majority still rely predominantly on macroscopic input parameters such as volume fraction, thickness, and frequency, neglecting critical microscopic factors like defects, grain boundaries, and functional groups that decisively influence electromagnetic responses. This insufficient dimensionality in characterization hampers the establishment of constitutive mappings among structure, mechanism, and performance, thereby limiting the interpretive depth and physical transferability of statistical models.

The integration of statistical models with fundamental electromagnetic physical laws remains inadequate. Electromagnetic properties are inherently constrained by Maxwell's equations, dispersion relations, and dielectric polarization theories; however, current modeling workflows are predominantly data-driven and lack explicit incorporation of physical priors. To address this limitation, recent advances have explored embedding physical constraints directly into the modeling workflow. Physics-informed neural networks (PINNs) can enforce Maxwell's equations and boundary conditions within the loss function, ensuring predictions respect conservation laws. Symbolic regression or physics-guided regression can generate interpretable expressions linking microstructural descriptors to macroscopic electromagnetic responses. Additionally, input features can be augmented with microscopic and multiscale descriptors—such as local charge distributions, defect types, or functional group densities—providing physical priors that guide learning and improve extrapolation. Regularization techniques and constrained optimization can incorporate energy conservation, symmetry protections, or known dispersion relations, while multi-scale hybrid models can jointly learn from microscopic simulations and macroscopic measurements, bridging data-driven and physics-based approaches. Although recent efforts have begun embedding physics constraints into training processes—such as PINNs^[73] and symbolic regression algorithms^[74,75]—applications in electromagnetic materials research remain nascent. Their effectiveness and scalability in complex wave-absorbing systems have yet to be rigorously validated. Nonetheless, these strategies offer a promising pathway to enhance model interpretability, physical consistency, and generalization across materials and frequency domains.

In summary, key bottlenecks persist for statistical methods in electromagnetic functional materials research, including poor model generalizability, limited structural characterization dimensionality, inconsistent modeling protocols, and weak physical coupling. Future efforts should prioritize embedding physical laws and multiscale descriptors into statistical workflows, standardizing feature engineering, and developing hybrid data–physics models. Such approaches can elevate statistical methods from auxiliary predictive tools to foundational theoretical pillars, enabling mechanistic elucidation, inverse design, and high-fidelity modeling of complex electromagnetic functional materials.

{{lists.name}}

Data-driven design of electromagnetic functional materials: a statistical perspective

Abstract