Genomic mating strategy for papilla number in <i>Apostichopus japonicus</i>

Ping Ni; Mu Nie; Yuan Tian; Kunya Wu; Luo Wang; Lingshu Han; Yaqing Chang; Jun Ding; Yangfan Wang; Lisui Bao; Ping Ni; Mu Nie; Yuan Tian; Kunya Wu; Luo Wang; Lingshu Han; Yaqing Chang; Jun Ding; Yangfan Wang; Lisui Bao

doi:10.48130/gcomm-0026-0013

2026 Volume 3

Article Contents

Next Previous

ARTICLE Open Access

Genomic mating strategy for papilla number in Apostichopus japonicus

1.
MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266000, China
2.
Key Laboratory of Mariculture, Ministry of Education, Ocean University of China, Qingdao 266000, China
3.
Key Laboratory of Mariculture & Stock Enhancement in North China's Sea, Ministry of Agriculture and Rural Affairs, Dalian Ocean University, Dalian 116000, China
4.
Key Laboratory of Northern Aquatic Germplasm Resources and Genetic Breeding in Liaoning Province, Dalian Ocean University, Dalian 116000, China
5.
Institute of Evolution & Marine Biodiversity, Ocean University of China, Qingdao 266000, China
6.
SANYA Oceanographic Laboratory, Sanya 572000, China
^# Authors contributed equally: Ping Ni, Mu Nie, Yuan Tian

More Information

Corresponding authors: dingjun19731119@hotmail.com (Ding J); yfwang@ouc.edu.cn (Wang Y); baolisui@ouc.edu.cn (Bao L)

Received: 22 April 2026
Revised: 12 May 2026
Accepted: 22 May 2026
Published online: 16 June 2026
Genomics Communications 3, Article number: e012 (2026) | Cite this article

Abstract

Genomic mating algorithms, as an optimized mating strategy, aim to balance genetic gain against genetic diversity, compensating for the genetic diversity loss seen in genomic selection after multiple generations. To investigate changes in genetic gain and inbreeding under different schemes, we simulated 20 generations of selective breeding processes for papilla number using three genomic selection schemes (GBLUP, Bayesian Lasso, snnR) and two genomic mating schemes ('maxbv' and 'minskin'), involving over 6,000 real and simulated individuals. Results showed that genomic mating schemes consistently outperformed genomic selection schemes in genetic gain, especially in the long-term selection. On the 20^th generation, the genetic gain of 'maxbv' was higher than that of GBLUP, Bayesian Lasso, snnR, and 'minskin' by 68.99%, 32.28%, 53.07%, and 2.75%, respectively. Meanwhile, both genomic mating schemes exhibited lower inbreeding than the three genomic selection schemes, with 'minskin' reducing inbreeding coefficients by 94.40%, 91.42%, 93.55%, and 44.72% relative to GBLUP, Bayesian Lasso, snnR, and 'maxbv', respectively. These results demonstrated that genomic mating can achieve higher genetic gain while limiting the increase in population inbreeding, providing a sustainable genetic improvement strategy for Apostichopus japonicus and other aquaculture species through molecular breeding.
- Genomic mating,
- Genomic selection,
- Papilla number,
- Apostichopus japonicus

Supplementary information

Supplementary Table S1 Descriptive statistics for papilla number of 215 sea cucumbers from eight populations.
Supplementary Fig. S1 Genomic prediction accuracy at different SNP densities.
Supplementary Fig. S2 Frequency histogram of additive effect value (a) and dominant effect value (b) of SNPs for 10 replicates.
Supplementary Fig. S3 QQ map of simulated phenotype value for 10 replicates.
Supplementary Fig. S4 Genetic structure of the 20^th generation simulated population.

Rights and permissions
Copyright: © 2026 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	Li Y, Wang R, Xun X, Wang J, Bao L, et al. 2018. Sea cucumber genome provides insights into saponin biosynthesis and aestivation regulation. Cell Discovery 4:29 doi: 10.1038/s41421-018-0030-5 CrossRef Google Scholar
[2]	Chang Y, Shi S, Zhao C, Han Z. 2011. Characteristics of papillae in wild, cultivated and hybrid sea cucumbers (Apostichopus japonicus). African Journal of Biotechnology 10(63):13780−13788 doi: 10.5897/AJB11.886 CrossRef Google Scholar
[3]	Lu K, Li F, Sun J, Failler P. 2019. 山东省海参资源开发评价与优化 [The evaluation and optimization of sea cucumber resource exploitation in Shandong]. 中国海洋经济 [Marine Economy in China] 2:16−28 (in Chinese) Google Scholar
[4]	Goddard ME, Hayes BJ. 2007. Genomic selection. Journal of Animal Breeding and Genetics 124(6):323−330 doi: 10.1111/j.1439-0388.2007.00702.x CrossRef Google Scholar
[5]	Wang Y, Ni P, Sturrock M, Zeng Q, Wang B, et al. 2024. Deep learning for genomic selection of aquatic animals. Marine Life Science & Technology 6(4):631−650 doi: 10.1007/s42995-024-00252-y CrossRef Google Scholar
[6]	Akdemir D, Sánchez JI. 2016. Efficient breeding by genomic mating. Frontiers in Genetics 7:210 doi: 10.3389/fgene.2016.00210 CrossRef Google Scholar
[7]	Jannink JL. 2010. Dynamics of long-term genomic selection. Genetics Selection Evolution 42(1):35 doi: 10.1186/1297-9686-42-35 CrossRef Google Scholar
[8]	Wientjes YCJ, Bijma P, Calus MPL, Zwaan BJ, Vitezica ZG, et al. 2022. The long-term effects of genomic selection: 1. response to selection, additive genetic variance, and genetic architecture. Genetics Selection Evolution 54(1):19 doi: 10.1186/s12711-022-00709-7 CrossRef Google Scholar
[9]	Meuwissen TH. 1997. Maximizing the response of selection with a predefined rate of inbreeding. Journal of Animal Science 75(4):934−940 doi: 10.2527/1997.754934x CrossRef Google Scholar
[10]	Zhao F, Zhang P, Wang X, Akdemir D, Garrick D, et al. 2023. Genetic gain and inbreeding from simulation of different genomic mating schemes for pig improvement. Journal of Animal Science and Biotechnology 14(1):87 doi: 10.1186/s40104-023-00872-x CrossRef Google Scholar
[11]	Woolliams J, Thomson R. 1994. A theory of genetic contributions. In Proceedings of the 5^th World Congress on Genetics Applied to Livestock Production, eds. Smith C, Gavora JS, Chesnais J, Fairfull W, Gibson JP, et al. Canada: Organising Committee. pp. 127−134
[12]	He J, Fernando BL, Wu XL. 2019. 动物基因组选配方法与应用 [Methods and applications of animal genomic mating]. 遗传 [Hereditas] 41(6):486−493 (in Chinese) doi: 10.16288/j.yczz.19-053 CrossRef Google Scholar
[13]	Pryce JE, Hayes BJ, Goddard ME. 2012. Novel strategies to minimize progeny inbreeding while maximizing genetic gain using genomic information. Journal of Dairy Science 95(1):377−388 doi: 10.3168/jds.2011-4254 CrossRef Google Scholar
[14]	Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30(15):2114−2120 doi: 10.1093/bioinformatics/btu170 CrossRef Google Scholar
[15]	Sun L, Jiang C, Su F, Cui W, Yang H. 2023. Chromosome-level genome assembly of the sea cucumber Apostichopus japonicus. Scientific Data 10:454 doi: 10.1038/s41597-023-02368-9 CrossRef Google Scholar
[16]	Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows – Wheeler transform. Bioinformatics 25(14):1754−1760 doi: 10.1093/bioinformatics/btp324 CrossRef Google Scholar
[17]	Garrison E, Marth G. 2012. Haplotype-based variant detection from short-read sequencing. arXiv 3907v2 doi: 10.48550/arXiv.1207.3907 CrossRef Google Scholar
[18]	Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, et al. 2021. Twelve years of SAMtools and BCFtools. GigaScience 10(2):giab008 doi: 10.1093/gigascience/giab008 CrossRef Google Scholar
[19]	Danecek P, Auton A, Abecasis G, Albers CA, Banks E, et al. 2011. The variant call format and VCFtools. Bioinformatics 27(15):2156−2158 doi: 10.1093/bioinformatics/btr330 CrossRef Google Scholar
[20]	Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, et al. 2007. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23(19):2633−2635 doi: 10.1093/bioinformatics/btm308 CrossRef Google Scholar
[21]	Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, et al. 2015. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4:s13742–015–0047–8 doi: 10.1186/s13742-015-0047-8 CrossRef Google Scholar
[22]	Alexander DH, Novembre J, Lange K. 2009. Fast model-based estimation of ancestry in unrelated individuals. Genome Research 19(9):1655−1664 doi: 10.1101/gr.094052.109 CrossRef Google Scholar
[23]	Chen C, Wu Y, Li J, Wang X, Zeng Z, et al. 2023. TBtools-II: a 'one for all, all for one' bioinformatics platform for biological big-data mining. Molecular Plant 16(11):1733−1742 doi: 10.1016/j.molp.2023.09.010 CrossRef Google Scholar
[24]	Zhang C, Dong SS, Xu JY, He WM, Yang TL. 2019. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 35(10):1786−1788 doi: 10.1093/bioinformatics/bty875 CrossRef Google Scholar
[25]	Wang X, Yang Z, Xu C. 2015. A comparison of genomic selection methods for breeding value prediction. Science Bulletin 60(10):925−935 doi: 10.1007/s11434-015-0791-2 CrossRef Google Scholar
[26]	Clark SA, van der Werf J. 2013. Genomic best linear unbiased prediction (gBLUP) for the estimation of genomic breeding values. In Genome-Wide Association Studies and Genomic Prediction, eds. Gondro C, van der Werf J, Hayes B. Totowa, NJ: Humana. pp. 321−330 doi: 10.1007/978-1-62703-447-0_13
[27]	Usai MG, Goddard ME, Hayes BJ. 2009. LASSO with cross-validation for genomic selection. Genetics Research 91(6):427−436 doi: 10.1017/S0016672309990334 CrossRef Google Scholar
[28]	Hayes B, Goddard ME. 2001. The distribution of the effects of genes affecting quantitative traits in livestock. Genetics Selection Evolution 33(3):209 doi: 10.1186/1297-9686-33-3-209 CrossRef Google Scholar
[29]	Wang Y, Mi X, Rosa GJM, Chen Z, Lin P, et al. 2018. Technical note: an R package for fitting sparse neural networks with application in animal breeding. Journal of Animal Science 96(5):2016−2026 doi: 10.1093/jas/sky071 CrossRef Google Scholar
[30]	Gianola D, Okut H, Weigel KA, Rosa GJ. 2011. Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat. BMC Genetics 12:87 doi: 10.1186/1471-2156-12-87 CrossRef Google Scholar
[31]	Wellmann R. 2019. Optimum contribution selection for animal breeding and conservation: the R package optiSel. BMC Bioinformatics 20(1):25 doi: 10.1186/s12859-018-2450-5 CrossRef Google Scholar
[32]	Peripolli E, Munari DP, Silva MVGB, Lima ALF, Irgang R, et al. 2017. Runs of homozygosity: current knowledge and applications in livestock. Animal Genetics 48(3):255−271 doi: 10.1111/age.12526 CrossRef Google Scholar
[33]	de Cara MÁR, Villanueva B, Toro MÁ, Fernández J. 2013. Using genomic tools to maintain diversity and fitness in conservation programmes. Molecular Ecology 22(24):6091−6099 doi: 10.1111/mec.12560 CrossRef Google Scholar
[34]	Kang Z, Kong J, Sui J, Dai P, Luo K, et al. 2024. Optimal open nucleus breeding system for long-term genetic gain in the Pacific white shrimp using genomic selection. Aquaculture 586:740760 doi: 10.1016/j.aquaculture.2024.740760 CrossRef Google Scholar
[35]	Technow F. 2011. R package hypred: simulation of genomic data in applied genetics. https://github.com/cran/hypred
[36]	Xiang T, Christensen OF, Vitezica ZG, Legarra A. 2018. Genomic model with correlation between additive and dominance effects. Genetics 209(3):711−723 doi: 10.1534/genetics.118.301015 CrossRef Google Scholar
[37]	Yang J, Lee SH, Goddard ME, Visscher PM. 2011. GCTA: a tool for genome-wide complex trait analysis. The American Journal of Human Genetics 88(1):76−82 doi: 10.1016/j.ajhg.2010.11.011 CrossRef Google Scholar
[38]	Alarcón JA, Magoulas A, Georgakopoulos T, Zouros E, Alvarez MC. 2004. Genetic comparison of wild and cultivated European populations of the gilthead sea bream (Sparus aurata). Aquaculture 230(1−4):65−80 doi: 10.1016/S0044-8486(03)00434-4 CrossRef Google Scholar
[39]	Aliloo H, Pryce JE, González-Recio O, Cocks BG, Goddard ME, et al. 2017. Including nonadditive genetic effects in mating programs to maximize dairy farm profitability. Journal of Dairy Science 100(2):1203−1222 doi: 10.3168/jds.2016-11261 CrossRef Google Scholar
[40]	Smýkal P, Nelson MN, Berger JD, Von Wettberg EJB. 2018. The impact of genetic changes during crop domestication. Agronomy 8(7):119 doi: 10.3390/agronomy8070119 CrossRef Google Scholar
[41]	O'Connor KM, Hayes BJ, Hardner CM, Alam M, Henry R J, et al. 2021. Genomic selection and genetic gain for nut yield in an Australian macadamia breeding population. BMC Genomics 22(1):370 doi: 10.1186/s12864-021-07694-z CrossRef Google Scholar
[42]	Kearney JF, Amer PR, Villanueva B. 2005. Cumulative discounted expressions of sire genotypes for the complex vertebral malformation and β-casein loci in commercial dairy herds. Journal of Dairy Science 88(12):4426−4433 doi: 10.3168/jds.S0022-0302(05)73129-5 CrossRef Google Scholar
[43]	Gorjanc G, Gaynor RC, Hickey JM. 2018. Optimal cross selection for long-term genetic gain in two-part programs with rapid recurrent genomic selection. Theoretical and Applied Genetics 131(9):1953−1966 doi: 10.1007/s00122-018-3125-3 CrossRef Google Scholar
[44]	Pérez-Enciso M, Zingaretti LM. 2019. A guide on using deep learning for complex trait genomic prediction. Genes 10(7):553 doi: 10.3390/genes10070553 CrossRef Google Scholar
[45]	Bellot P, de los Campos G, Pérez-Enciso M. 2018. Can deep learning improve genomic prediction of complex human traits? Genetics 210(3):809−819 doi: 10.1534/genetics.118.301298 CrossRef Google Scholar
[46]	Sonesson AK, Woolliams JA, Meuwissen TH. 2012. Genomic selection requires genomic control of inbreeding. Genetics Selection Evolution 44(1):27 doi: 10.1186/1297-9686-44-27 CrossRef Google Scholar
[47]	Bérodier M, Berg P, Meuwissen T, Boichard D, Brochard M, et al. 2021. Improved dairy cattle mating plans at herd level using genomic information. Animal 15(1):100016 doi: 10.1016/j.animal.2020.100016 CrossRef Google Scholar
[48]	Villanueva B, Dekkers JCM, Woolliams JA, Settar P. 2004. Maximizing genetic gain over multiple generations with quantitative trait locus selection and control of inbreeding. Journal of Animal Science 82(5):1305−1314 doi: 10.2527/2004.8251305x CrossRef Google Scholar
[49]	Soularue JP, Kremer A. 2012. Assortative mating and gene flow generate clinal phenological variation in trees. BMC Evolutionary Biology 12:79 doi: 10.1186/1471-2148-12-79 CrossRef Google Scholar
[50]	De Beukelaer H, Badke Y, Fack V, De Meyer G. 2017. Moving beyond managing realized genomic relationship in long-term genomic selection. Genetics 206(2):1127−1138 doi: 10.1534/genetics.116.194449 CrossRef Google Scholar
[51]	Tang Z, Yin L, Yin D, Zhang H, Fu Y, et al. 2023. Development and application of an efficient genomic mating method to maximize the production performances of three-way crossbred pigs. Briefings in Bioinformatics 24(1):bbac587 doi: 10.1093/bib/bbac587 CrossRef Google Scholar
[52]	González-Diéguez D, Tusell L, Carillier-Jacquin C, Bouquet A, Vitezica ZG. 2019. SNP-based mate allocation strategies to maximize total genetic value in pigs. Genetics Selection Evolution 51(1):55 doi: 10.1186/s12711-019-0498-y CrossRef Google Scholar
[53]	Jighly A, Hayden M, Daetwyler H. 2021. Integrating genomic selection with a genotype plus genotype x environment (GGE) model improves prediction accuracy and computational efficiency. Plant, Cell & Environment 44(10):3459−3470 doi: 10.1111/pce.14145 CrossRef Google Scholar
[54]	Mas-Muñoz J, Blonk R, Schrama JW, van Arendonk J, Komen H. 2013. Genotype by environment interaction for growth of sole (Solea solea) reared in an intensive aquaculture system and in a semi-natural environment. Aquaculture 410−411:230−235 doi: 10.1016/j.aquaculture.2013.06.012 CrossRef Google Scholar
[55]	Sae-Lim P, Kause A, Mulder HA, Martin KE, Barfoot AJ, et al. 2013. Genotype-by-environment interaction of growth traits in rainbow trout (Oncorhynchus mykiss): a continental scale study. Journal of Animal Science 91(12):5572−5581 doi: 10.2527/jas.2012-5949 CrossRef Google Scholar
[56]	González-Diéguez D, Tusell L, Bouquet A, Legarra A, Vitezica ZG. 2020. Purebred and crossbred genomic evaluation and mate allocation strategies to exploit dominance in pig crossbreeding schemes. G3 Genes\|Genomes\|Genetics 10(8):2829−2841 doi: 10.1534/g3.120.401376 CrossRef Google Scholar
[57]	Waples RK, Larson WA, Waples RS. 2016. Estimating contemporary effective population size in non-model species using linkage disequilibrium across thousands of loci. Heredity 117:233−240 doi: 10.1038/hdy.2016.60 CrossRef Google Scholar
[58]	Woolliams JA, Berg P, Dagnachew BS, Meuwissen THE. 2015. Genetic contributions and their optimization. Journal of Animal Breeding and Genetics 132:89−99 doi: 10.1111/jbg.12148 CrossRef Google Scholar
[59]	Houston RD, Bean TP, Macqueen DJ, Gundappa MK, Jin YH, et al. 2020. Harnessing genomics to fast-track genetic improvement in aquaculture. Nature Reviews Genetics 21:389−409 doi: 10.1038/s41576-020-0227-y CrossRef Google Scholar
[60]	Ren W, Liang Z. 2024. Review on GPU accelerated methods for genome-wide SNP-SNP interactions. Molecular Genetics and Genomics 300(1):10 doi: 10.1007/s00438-024-02214-6 CrossRef Google Scholar

About this article

Cite this article

Ni P, Nie M, Tian Y, Wu K, Wang L, et al. 2026. Genomic mating strategy for papilla number in Apostichopus japonicus. Genomics Communications 3: e012 doi: 10.48130/gcomm-0026-0013

Ni P, Nie M, Tian Y, Wu K, Wang L, et al. 2026. Genomic mating strategy for papilla number in Apostichopus japonicus. Genomics Communications 3: e012 doi: 10.48130/gcomm-0026-0013

Figures(5)

Download PDF

Article Metrics

Article views(514) PDF downloads(161)

Other Articles By Authors

on this site
- Ping Ni
- Mu Nie
- Yuan Tian
- Kunya Wu
- Luo Wang
- Lingshu Han
- Yaqing Chang
- Jun Ding
- Yangfan Wang
- Lisui Bao
on Google Scholar
- Ping Ni
- Mu Nie
- Yuan Tian
- Kunya Wu
- Luo Wang
- Lingshu Han
- Yaqing Chang
- Jun Ding
- Yangfan Wang
- Lisui Bao

HTML

Introduction

Sea cucumber Apostichopus japonicus is one of the most economically important aquaculture species in the Western Pacific region, prized for its nutritional and medicinal value^[1]. Since sea cucumbers with more papillae fetch higher market prices, the number of papillae is regarded as a primary selection goal in breeding research^[2]. However, problems such as overfishing and inbreeding depression resulting from the rapid expansion of aquaculture have led to a reduction in effective population size and a significant decline in sea cucumber germplasm resources^[3]. The lag in the collection and protection of sea cucumber germplasm resources has made the problem increasingly prominent, which has diminished their ability to resist diseases and environmental stress, often leading to large-scale mortalities during aquaculture, resulting in significant economic losses, and ultimately, seriously affecting the healthy development of the sea cucumber aquaculture industry. Therefore, there is an urgent need for a breeding method that selects individuals with desirable traits to improve economic benefits and maintain the level of genetic diversity within the population, ensuring the sustainability of A. japonicus breeding industry.

Genomic selection (GS), a breeding technique that utilizes molecular markers covering the whole genome, has emerged with the rapid development and widespread application of high-throughput sequencing and genotyping technologies^[4]. Compared to pedigree-based best linear unbiased prediction (BLUP) and marker-assisted selection (MAS), GS uses genome-wide markers for breeding value estimation, which improves the accuracy of estimating breeding values. Additionally, GS enables early genotyping of individuals, significantly shortening the generation interval and improving breeding efficiency^[5]. However, similar to other breeding methods based on estimated breeding values, GS still involves truncation selection, where individuals below a certain selection threshold are eliminated to form the next generation population^[6]. Jannink^[7] suggested that GS could reduce breeding cycle time and greatly increase early selection gain, but also resulted in the loss of favorable quantitative trait locus (QTL) alleles, leading to a loss of genetic variance and eventual reduction in GS accuracy. Wientjes et al.^[8] simulated long-term effects of GS and found that the accuracy of selection, the rate of genetic gain, the amount of additive genetic and genetic variation, and the number of segregating causal loci decreased after long-term selection. Consequently, while GS can significantly increase genetic gains in the short term, it may lead to the purging of alleles and a reduction in genetic variation within the breeding population, ultimately resulting in inbreeding depression. This limitation hampers the long-term gains in the selected traits and jeopardizes the future reproductive potential of other traits^[7].

Different from randomly mating high breeding value individuals, optimized mating methods based on pedigree relationships preserve the contribution potential of each individual as a parent, allowing for a balance between obtaining genetic gains and controlling the average degree of inbreeding and probability of shared ancestors^[9]. This approach aims to maintain sustainable long-term genetic gains and enables the selection of appropriate mating schemes based on different breeding objectives. With the widespread application of genomic molecular markers, such as single-nucleotide polymorphisms (SNPs), genomic mating (GM), a method that uses genomic information to optimize parental mating combinations, has emerged^[10].

To address the predicaments of GS in long-term selection, the key to GM is to maximize genetic gain while simultaneously controlling the kinship among breeding individuals. The optimum contribution selection (OCS) algorithm was used to solve this problem^[9]. This method controls the genetic contribution of each candidate parent, thereby limiting the accumulation of coancestry among offspring and promoting sustainable long-term genetic gain. The theory of genetic contributions posits that the maximum rate of genetic gain is achieved when an optimal threshold linear relationship exists between the Mendelian sampling of ancestors and their genetic contributions to the descendants, under restricted parental relationships. This theory underpins the OCS algorithm, which has been shown to effectively balance genetic gain and genetic diversity in breeding programs^[11]. With the advent of genome-wide SNPs, tracing of chromosome segments has become feasible, allowing for more accurate estimation of relationships among candidate parents. Genomic optimal contribution selection (GOCS) can achieve a closer approximation to the exact threshold linear relationship by utilizing genomic information, thereby enhancing the accuracy of Mendelian sampling evaluation^[12,13]. At present, GM has been studied in several important livestock species, such as dairy cattle, but there are fewer reports in aquaculture breeding.

In this study, we utilized real genotype and phenotype data from sea cucumber samples collected from eight different geographic regions to simulate breeding for the complex trait of papilla number. To assess the long-term applicability of GM strategies in aquaculture breeding, we conducted simulations comparing average genetic gains and inbreeding coefficients across generations under two GM schemes and three GS schemes. The results demonstrated that, in long-term breeding, GM not only achieved higher genetic gains compared to GS but also effectively controlled the inbreeding coefficient. These findings provide theoretical support for the effective selection and breeding of superior A. japonicus strains.

Discussion

Genetic structure and parameter analysis of the initial population

Population studies can help understand the domestication process of A. japonicus as well as the evolution of population structure and assist in breeding selection. In this study, the initial population was analyzed for genetic structure and parameters. Both PCA result and admixture result reflected that the initial population could be divided into Russian and Chinese populations based on genotypic data, which was consistent with the results of the manual sampling records. LD decay results indicated a high level of genetic diversity in the initial population. The maintenance of genetic variation is essential for the long-term survival of populations, and species with higher levels of variation are most likely to show high additive genetic variation in traits of interest^[38], which would greatly benefit the breeding of A. japonicus. Dominant effects have been demonstrated to have a certain impact on traits^[39]. The reliability of the model for estimating additive and dominant effect values was also demonstrated by the reliable simulated phenotypes obtained after incorporating dominant effect values in the model. Consequently, the initial population in this study had the breeding potential to meet the needs of multigenerational breeding.

Nevertheless, with prolonged artificial selection, genetic variability in breeding populations reduces with decreased genetic diversity^[40]. The core of GM is to achieve, as far as possible, the best possible trade-off between the conflicting goals of maximising genetic gain and maintaining genetic diversity.

GS strategies
GS results indicated that from the initial generation to the 20^th generation, all the schemes showed a trend of increasing followed by decreasing genetic gain. Among them, GS schemes showed a transient rise in genetic gain only in the very early generations, which is in line with current findings that, in the short term, inclusion of GS in breeding programs may accelerate genetic gains^[41]. On the other hand, the majority of the discussion on the long-term effects of GS is still at the stage of modeling studies. As GS ranked candidates on GEBVs and truncated the distribution to select those with the highest GEBVs, genetic gain increased, but so did inbreeding rates per generation. The reduction in genetic variance caused by a high inbreeding rate might result in detrimental long-term consequences, such as the fact that monogenic recessive alleles drift to high frequencies, causing disease^[42]. In general, as a truncation selection approach^[6], GS ignores the role of mating and complementation as evolutionary forces, which can lead to a reduction in genetic diversity and allelic purification after multiple generations of selection, and even inbreeding depression, limiting the long-term gain^[7].

Consequently, the progeny produced by the parents with the highest GEBV selected by the three GS schemes had increased inbreeding coefficients accompanied by a loss of additive genetic variance and reduced genetic diversity, thus preventing a substantial long-term genetic benefit^[43]. Under high-intensity truncation selection in GBLUP, the contribution of a few individuals with high GEBVs was disproportionately large, resulting in high inbreeding levels and low heterozygosity in progeny. The strong degree of LD and the homogeneous combination of alleles diminished the adaptive capacity and evolutionary potential of the population. Bayesian Lasso assumes that the trait-related marker effect variance can occur in the form of maximum and minimum values^[27], thus reducing allelic homogeneity in probability, resulting in a lower inbreeding coefficient and LD than GBLUP, and consequently, higher long-term genetic gain. Neural networks (NNs) are emerging as potentially powerful tools for genomic prediction of complex quantitative traits due to their ability to simultaneously account for non-linear relationships between molecular markers and QTLs^[44]. As a non-linear model with what is currently perceived as black-box behavior^[45], neural networks (NNs) cannot be used to derive interpretable conclusions about the influence of SNPs on phenotypes. The difficulty lies in discerning which features, and which combinations of features, the model has learned^[5]. The snnR model used in this study obtained moderate genetic gain and inbreeding coefficient among the GS schemes, which can be attributed to the fact that a sparsified neural network model is suitable for high-dimensional input variables and large sample sizes^[29], whereas the input genotype value data and sample sizes were low in this study, and the low complexity of the computation of the snnR limits it from capturing all the important genetic variants.

GM strategies
Inbreeding leads to an increased risk of losing favourable alleles and affects the potential for epistasis between dominant effects of heterozygous genes. Reductions in inbreeding can be achieved through simple management practices; however, these practices often come at the cost of reduced genetic progress. The importance of reducing inbreeding for long-term breeding has also been suggested by many investigators^[46]. These studies argued that GS may accelerate the decline in selection response unless new alleles are continually added; this emphasizes the importance of balancing short- and long-term gains by inbreeding in genetic selection. Achieving substantial long-term genetic gains requires a balance between selection and the maintenance of genetic diversity.

Different from GS, GM incorporates parameters such as kinship into breeding considerations, preserves the potential of each individual to contribute as a parent^[9], maintains sustainable genetic gain, and enables the selection of appropriate selection schemes according to different breeding objectives. In this study, we simulated 20 generations of simulated breeding with two breeding strategies, maximising breeding value and minimising kinship, using GM schemes based on the OCS algorithm. The results showed that although the two GM schemes differed slightly due to their respective optimization objectives, no significant differences were observed in either genetic gain or inbreeding coefficient, and both GM strategies outperformed the three GS schemes. The two GM strategies, although manifested in the form of maximising GEBV and minimising inbreeding coefficient, within the algorithms, each imposed restrictions on kinship and genetic contributions so as to achieve the core objective of GM^[31]. Therefore, the parameters employed in the GM models of this study fulfilled the core objective of GM, the balance between genetic gain and inbreeding. Genetic structure analysis of the population after 20 generations of GM breeding suggested that the 20^th-generation simulated population exhibited higher genetic diversity compared to the initial population. This result is consistent with the findings reported by Bérodier et al.^[47], which demonstrated that mating strategies incorporating genomic information can maximize genetic gain while maintaining substantial genetic diversity and minimizing inbreeding. When the breeding objective was to maximise the genetic gain for papilla number, it might lead to a positive gametic phase disequilibrium between QTL and polygenes^[48]. However, when the inbreeding coefficient was controlled simultaneously, selection would preserve the genetic diversity within the population and prevent any subpopulation from becoming dominant due to over-selection. Moreover, in simulated mating, free mating among individuals resulted in unimpeded gene flow, with each individual carrying multiple ancestral components^[49].

It has been reported that GOCS, restricting genomic relationships and weighted genomic selection, amplifying the effect of rare alleles, can enhance genetic gain but fail to prevent inbreeding and loss of rare variants. Maximising a weighted index balancing genetic gain with controlling expected heterozygosity or maintaining rare alleles resulted in superior long-term breeding simulations^[50]. Kang et al.^[34], on the other hand, proposed an open nucleus breeding system and a closed nucleus breeding system for simulating Pacific white shrimp, which increased genetic gain by introducing multiplier population individuals, but could not control the acceleration of inbreeding. The GM simulation breeding schemes proposed in this study achieve a trade-off, and it is believed that through data volume expansion, parameter optimization, and practical validation, GM will provide new insights into the molecular breeding of aquatic organisms.

Despite its potential benefits, GM still faces numerous challenges that need to be addressed. Ongoing research is essential to address the persistent challenges faced by GM and to enhance its applicability and impact. Research on GM in Holstein cows proposed that GM with non-additive genetic effects performed better than models with only additive effects. Mating programs with dominant and heterozygosity effects were better able to improve milk, fat, and protein yields^[39]. Tang et al.^[51] also demonstrated that genetic advantage from dominant effects can be used to maximise offspring performance in mate allocation. However, the dominant effect provided a slight contribution to phenotypic expression and, because of the low magnitude and amount of data available, the estimate of dominance variance was less accurate than that of additive variance^[39]. A significant portion of dominance variance might be confounded with environmental effects when dominant genetic effects were not considered^[52]. Dominance variance estimates might be inflated when models do not include genomic inbreeding, making an accurate estimate of marker effects critical for GM. Models for calculating dominant genetic variance that are closer to the true value in real breeding processes need to be improved. Assigning appropriate weights to markers to balance their genetic contributions, thereby improving the predictive performance of genomic selection, is an approach worth trying in GM as well.

In this study, the genetic architecture was simplified by assuming physically unlinked loci with purely additive effects. While this assumption facilitated the analysis of GM and GS strategies, it likely underestimates the genetic complexity of quantitative traits in reality. QTLs often exhibit LD and non-additive interactions, including epistasis, which can substantially influence trait expression and response to selection. By assuming unlinked loci, the model does not account for the Hill-Robertson effect, in which selection at one locus can interfere with linked loci, potentially accelerating the loss of favorable alleles.

Genotype by environment (G × E) interaction significantly impacted the prediction accuracy of GS models^[53]. Mas-Munoz et al.^[54] investigated the G × E interaction of sole reared in a recirculation aquaculture system and a semi-natural outdoor pond. The observation of heritable variation and low genetic correlation for sole growth across different environments indicates pronounced G × E interaction effects. The accuracy of selection and the predicted genetic gain might vary across different environments. Low genetic correlations implied that the optimal genotypes differed between environments. Therefore, differences among environments may have implications for breeding programs, and optimizing breeding programs in the presence of G × E interactions may maximize genetic gain across all environments^[55]. In this study, the calculation of environmental effects was simplified, which may not fully capture the complex G × E interactions and stochastic environmental noise inherent in real-world breeding systems. In practice, higher environmental variance could potentially reduce the accuracy of GEBVs, thereby impacting the optimization efficiency of GM. Thus, incorporating G × E interaction into GM models is highly anticipated. Introducing heterosis and breed-specific QTL effects in GM is also an interesting strategy^[56].

Furthermore, the simulation in this study did not explicitly incorporate overlapping generations, variation in reproductive success, or skewed sex ratios, which are intrinsic features of many aquaculture species like A. japonicus. In real breeding programs, these factors significantly increase the variance of long-term genetic contributions among individuals, often leading to a more rapid decline in effective population size than observed in discrete-generation models^[57]. For instance, skewed reproductive success can limit the pool of available high-ranking candidates, which may introduce additional constraints on mate allocation and potentially bias the expected genetic gain of GM schemes. While our study demonstrated the inherent advantages of GM in balancing gain and inbreeding, these findings still need to be further verified under more realistic conditions, such as by incorporating these complex biological variables into future stochastic simulation frameworks.

The 'maxbv' scheme, while maximizing immediate gain, may lead to the rapid fixation of alleles in specific family lines, potentially narrowing the genetic base for traits not included in the current selection index^[58]. On the other hand, the practical implementation of 'minskin' requires exhaustive pedigree or genomic kinship tracking of all candidates, which significantly increases genotyping costs. In aquaculture systems with high fecundity but high larval mortality, the actual number of contributing parents often deviates from the optimized mating design, a factor that could erode the theoretical advantages of GM observed in this study^[59].

With the explosive increase of genotype data and a large amount of phenotype data, running efficiency has become a concern for researchers. The results of this study showed that the running efficiencies of GM schemes were generally lower than the GS schemes, which may be due to the fact that the genetic contribution of parents and kinship were calculated in the GM^[32,33], thus increasing the amount of computation. It has been demonstrated that GPU-based computation could improve the runtime performance of the integration of multi-dimensional datasets by almost four orders of magnitude and was able to greatly accelerate the detection of genome-wide SNP-SNP interactions^[60]. Therefore, the use of a GPU for GM program through algorithm design and hardware optimisation may improve computational efficiency.

{{lists.name}}

Genomic mating strategy for papilla number in Apostichopus japonicus