Search
2025 Volume 2
Article Contents
RESEARCH ARTICLE   Open Access    

Genome-wide selection signature analysis identified new candidate genes for the teat number in three distinct strains of Large White pigs

More Information
  • Received: 20 November 2024
    Revised: 03 January 2025
    Accepted: 08 January 2025
    Published online: 24 February 2025
    Animal Advances  2 Article number: e004 (2025)  |  Cite this article
  • The number of teats is a critical reproductive trait in sows, exerting a direct impact on the survival rate of weaned piglets. Existing studies on selection signals associated with teat numbers in different strains of Large White pigs are scarce. We performed selection signal detection on three distinct strains of Large White pigs with 16 or more nipples: the French CG (Choice Genetics), the French Cooperl, and the Dutch Topigs. Our goal was to identify pivotal candidate genes contributing to the higher nipple count in Large White pigs. A comprehensive analysis, including CLR (Composite Likelihood Ratio), iHS (integrated haplotype score), and nSL (number of segregating sites by length) methods, alongside XP-EHH (Cross-population Extended Haplotype Homozygosity), was undertaken to identify genetic positive selection signatures in Large White pigs. Through our analysis, we identified six key candidate genes in the CG Large White pigs: DPP6, COL22A1, TRPC7, PROX1, ROR2, and ABCG1. In the Cooperl Large White pigs, the key candidate genes were PPARA, TRPC7, and NR3C1. In the Topigs Large White pigs, the key candidate genes were DOCK1, PCM1, and MRTFA. Among the aforementioned genes, PCM1 and DOCK1 have been identified as correlated with the number of teats in pigs, while TRPC7, DOCK1, and MRTFA have shown a significant link with teat number based on PheWAS (Phenome-wide association study) analysis. Our findings show that multiple genes associated with teat development in Large White pigs have experienced significant selection pressure throughout the breeding program. These results deepen our understanding of the progress in breeding Large White pigs and provide a valuable framework for their continued selection.
  • 加载中
  • Supplementary Table S1 Summary of data and statistical analysis in this study.
    Supplementary Table S2 Candidate regions and genes detected using all methods.
    Supplementary Table S3 Overlapping gene.
    Supplementary Table S4 Enrichment analysis of the candidate genes within the CG, Cooperl, and Topig groups was performed using the iHS, CLR, nSL, and XP-EHH methods, along with the GO and KEGG databases.
    Supplementary Fig. S1 The linkage disequilibrium (LD) decay analysis.
  • [1] Groenen MAM, Archibald AL, Uenishi H, Tuggle CK, Takeuchi Y, et al. 2012. Analyses of pig genomes provide insight into porcine demography and evolution. Nature 491:393−98 doi: 10.1038/nature11622

    CrossRef   Google Scholar

    [2] Larson G, Albarella U, Dobney K, Rowley-Conwy P, Schibler J, et al. 2007. Ancient DNA, pig domestication, and the spread of the Neolithic into Europe. Proceedings of the National Academy of Sciences of the United States of America 104:15276−81 doi: 10.1073/pnas.0703411104

    CrossRef   Google Scholar

    [3] Larson G, Dobney K, Albarella U, Fang M, Matisoo-Smith E, et al. 2005. Worldwide phylogeography of wild boar reveals multiple centers of pig domestication. Science 307:1618−21 doi: 10.1126/science.1106927

    CrossRef   Google Scholar

    [4] Wang K, Wu P, Chen D, Zhou J, Yang X, et al. 2021. Detecting the selection signatures in Chinese duroc, Landrace, Yorkshire, Liangshan, and Qingyu pigs. Functional & Integrative Genomics 21(5-6):655−64 doi: 10.1007/s10142-021-00809-5

    CrossRef   Google Scholar

    [5] Moon S, Kim TH, Lee KT, Kwak W, Lee T, et al. 2015. A genome-wide scan for signatures of directional selection in domesticated pigs. BMC Genomics 16:130 doi: 10.1186/s12864-015-1330-x

    CrossRef   Google Scholar

    [6] Wang C, Wang H, Zhang Y, Tang Z, Li K, et al. 2015. Genome-wide analysis reveals artificial selection on coat colour and reproductive traits in Chinese domestic pigs. Molecular Ecology Resources 15(2):414−24 doi: 10.1111/1755-0998.12311

    CrossRef   Google Scholar

    [7] Zhuang Z, Ding R, Peng L, Wu J, Ye Y, et al. 2020. Genome-wide association analyses identify known and novel loci for teat number in Duroc pigs using single-locus and multi-locus models. BMC Genomics 21:344 doi: 10.1186/s12864-020-6742-6

    CrossRef   Google Scholar

    [8] Martins TF, Braga Magalhães AF, Verardo LL, Santos GC, Silva Fernandes AA, et al. 2022. Functional analysis of litter size and number of teats in pigs: from GWAS to post-GWAS. Theriogenology 193:157−66 doi: 10.1016/j.theriogenology.2022.09.005

    CrossRef   Google Scholar

    [9] Yin L, Zhang H, Tang Z, Yin D, Fu Y, et al. 2023. HIBLUP: an integration of statistical models on the BLUP framework for efficient genetic evaluation using big genomic data. Nucleic Acids Research 51:3501−12 doi: 10.1093/nar/gkad074

    CrossRef   Google Scholar

    [10] Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, et al. 2015. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4:7 doi: 10.1186/s13742-015-0047-8

    CrossRef   Google Scholar

    [11] Teng J, Gao Y, Yin H, Bai Z, Liu S, et al. 2024. A compendium of genetic regulatory effects across pig tissues. Nature Genetics 56:112−23 doi: 10.1038/s41588-023-01585-7

    CrossRef   Google Scholar

    [12] Liu K, Hou L, Yin Y, Wang B, Liu C, et al. 2023. Genome-wide association study reveals new QTL and functional candidate genes for the number of ribs and carcass length in pigs. Animal Genetics 54(4):435−45 doi: 10.1111/age.13315

    CrossRef   Google Scholar

    [13] Browning SR, Browning BL. 2007. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. The American Journal of Human Genetics 81:1084−97 doi: 10.1086/521987

    CrossRef   Google Scholar

    [14] Browning BL, Zhou Y, Browning SR. 2018. A one-penny imputed genome from next-generation reference panels. American Journal of Human Genetics 103(3):338−48 doi: 10.1016/j.ajhg.2018.07.015

    CrossRef   Google Scholar

    [15] Danecek P, Auton A, Abecasis G, Albers CA, Banks E, et al. 2011. The variant call format and VCFtools. Bioinformatics 27(15):2156−58 doi: 10.1093/bioinformatics/btr330

    CrossRef   Google Scholar

    [16] Zhang C, Dong SS, Xu JY, He WM, Yang TL. 2019. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 35(10):1786−88 doi: 10.1093/bioinformatics/bty875

    CrossRef   Google Scholar

    [17] Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. 2013. MEGA6: molecular evolutionary genetics analysis version 6.0. Molecular Biology and Evolution 30(12):2725−29 doi: 10.1093/molbev/mst197

    CrossRef   Google Scholar

    [18] Ma Y, Ding X, Qanbari S, Weigend S, Zhang Q, et al. 2015. Properties of different selection signature statistics and a new strategy for combining them. Heredity 115(5):426−36 doi: 10.1038/hdy.2015.42

    CrossRef   Google Scholar

    [19] Eriksson A, Fernström P, Mehlig B, Sagitov S. 2008. An accurate model for genetic hitchhiking. Genetics 178:439−51 doi: 10.1534/genetics.107.076018

    CrossRef   Google Scholar

    [20] Rajawat D, Panigrahi M, Kumar H, Nayak SS, Parida S, et al. 2022. Identification of important genomic footprints using eight different selection signature statistics in domestic cattle breeds. Gene 816:146165 doi: 10.1016/j.gene.2021.146165

    CrossRef   Google Scholar

    [21] Voight BF, Kudaravalli S, Wen X, Pritchard JK. 2006. A map of recent positive selection in the human genome. PLoS Biology 4:e72 doi: 10.1371/journal.pbio.0040072

    CrossRef   Google Scholar

    [22] Ferrer-Admetlla A, Liang M, Korneliussen T, Nielsen R. 2014. On detecting incomplete soft or hard selective sweeps using haplotype structure. Molecular Biology and Evolution 31(5):1275−91 doi: 10.1093/molbev/msu077

    CrossRef   Google Scholar

    [23] Sabeti PC, Varilly P, Fry B, Lohmueller J, Hostetter E, et al. 2007. Genome-wide detection and characterization of positive selection in human populations. Nature 449:913−18 doi: 10.1038/nature06250

    CrossRef   Google Scholar

    [24] Szpiech ZA, Hernandez RD. 2014. selscan: an efficient multithreaded program to perform EHH-based scans for positive selection. Molecular Biology and Evolution 31(10):2824−27 doi: 10.1093/molbev/msu211

    CrossRef   Google Scholar

    [25] Bastarache L, Denny JC, Roden DM. 2022. Phenome-wide association studies. JAMA 327(1):75−76 doi: 10.1001/jama.2021.20356

    CrossRef   Google Scholar

    [26] Zeng H, Zhang W, Lin Q, Gao Y, Teng J, et al. 2024. PigBiobank: a valuable resource for understanding genetic and biological mechanisms of diverse complex traits in pigs. Nucleic Acids Research 52:D980−D89 doi: 10.1093/nar/gkad1080

    CrossRef   Google Scholar

    [27] Meng Y, Yuan C, Zhang J, Zhang F, Fu Q, et al. 2017. Stearic acid suppresses mammary gland development by inhibiting PI3K/Akt signaling pathway through GPR120 in pubertal mice. Biochemical and Biophysical Research Communications 491:192−97 doi: 10.1016/j.bbrc.2017.07.075

    CrossRef   Google Scholar

    [28] Rädler PD, Wehde BL, Wagner KU. 2017. Crosstalk between STAT5 activation and PI3K/AKT functions in normal and transformed mammary epithelial cells. Molecular and Cellular Endocrinology 451:31−39 doi: 10.1016/j.mce.2017.04.025

    CrossRef   Google Scholar

    [29] Alexander CM. 2021. Wnt signaling and mammary stem cells. Vitamins and Hormones 116:21−50 doi: 10.1016/bs.vh.2021.02.001

    CrossRef   Google Scholar

    [30] Rubin CJ, Megens HJ, Martinez Barrio A, Maqbool K, Sayyab S, et al. 2012. Strong signatures of selection in the domestic pig genome. Proceedings of the National Academy of Sciences of the United States of America 109:19529−36 doi: 10.1073/pnas.1217149109

    CrossRef   Google Scholar

    [31] Kemper KE, Saxton SJ, Bolormaa S, Hayes BJ, Goddard ME. 2014. Selection for complex traits leaves little or no classic signatures of selection. BMC Genomics 15:246 doi: 10.1186/1471-2164-15-246

    CrossRef   Google Scholar

    [32] Schiavo G, Galimberti G, Calò DG, Samorè AB, Bertolini F, et al. 2016. Twenty years of artificial directional selection have shaped the genome of the Italian Large White pig breed. Animal Genetics 47(2):181−91 doi: 10.1111/age.12392

    CrossRef   Google Scholar

    [33] Biswas S, Akey JM. 2006. Genomic insights into positive selection. Trends in Genetics 22:437−46 doi: 10.1016/j.tig.2006.06.005

    CrossRef   Google Scholar

    [34] Kim ES, Elbeltagy AR, Aboul-Naga AM, Rischkowsky B, Sayre B, et al. 2016. Multiple genomic signatures of selection in goats and sheep indigenous to a hot arid environment. Heredity 116:255−64 doi: 10.1038/hdy.2015.94

    CrossRef   Google Scholar

    [35] López ME, Cádiz MI, Rondeau EB, Koop BF, Yáñez JM. 2021. Detection of selection signatures in farmed coho salmon (Oncorhynchus kisutch) using dense genome-wide information. Scientific Reports 11:9685 doi: 10.1038/s41598-021-86154-w

    CrossRef   Google Scholar

    [36] Lan D, Xiong X, Mipam TD, Fu C, Li Q, et al. 2018. Genetic diversity, molecular phylogeny, and selection evidence of Jinchuan yak revealed by whole-genome resequencing. G3 Genes|Genomes|Genetics 8(3):945−52 doi: 10.1534/g3.118.300572

    CrossRef   Google Scholar

    [37] Biegelmeyer P, Gulias-Gomes CC, Caetano AR, Steibel JP, Cardoso FF. 2016. Linkage disequilibrium, persistence of phase and effective population size estimates in Hereford and Braford cattle. BMC Genetics 17:32 doi: 10.1186/s12863-016-0339-8

    CrossRef   Google Scholar

    [38] Verardo LL, Silva FF, Lopes MS, Madsen O, Bastiaansen JWM, et al. 2016. Revealing new candidate genes for reproductive traits in pigs: combining Bayesian GWAS and functional pathways. Genetics, Selection, Evolution 48:9 doi: 10.1186/s12711-016-0189-x

    CrossRef   Google Scholar

    [39] van Son M, Lopes MS, Martell HJ, Derks MFL, Gangsei LE, et al. 2019. A QTL for number of teats shows breed specific effects on number of vertebrae in pigs: bridging the gap between molecular and quantitative genetics. Frontiers in Genetics 10:272 doi: 10.3389/fgene.2019.00272

    CrossRef   Google Scholar

    [40] Park J, Do KT, Park KD, Lee HK. 2023. Genome-wide association study using a single-step approach for teat number in Duroc, Landrace and Yorkshire pigs in Korea. Animal Genetics 54(6):743−51 doi: 10.1111/age.13357

    CrossRef   Google Scholar

    [41] Verardo LL, Lopes MS, Wijga S, Madsen O, Silva FF, et al. 2016. After genome-wide association studies: Gene networks elucidating candidate genes divergences for number of teats across two pig populations. Journal of Animal Science 94(4):1446−58 doi: 10.2527/jas.2015-9917

    CrossRef   Google Scholar

    [42] Liu Z, Li H, Zhong Z, Jiang S. 2022. A whole genome sequencing-based genome-wide association study reveals the potential associations of teat number in Qingping pigs. Animals 12(9):1057 doi: 10.3390/ani12091057

    CrossRef   Google Scholar

    [43] Pedrosa VB, Schenkel FS, Chen SY, Oliveira HR, Casey TM, et al. 2021. Genomewide association analyses of lactation persistency and milk production traits in Holstein cattle based on imputed whole-genome sequence data. Genes 12(11):1830 doi: 10.3390/genes12111830

    CrossRef   Google Scholar

    [44] Li RR, Hu HH, Feng X, Hu CL, Ma YF, et al. 2024. Polymorphism of ADAM12 DPP6 and PRKN genes and their associations with milk production traits in Holstein. Reproduction in Domestic Animals 59:e14497 doi: 10.1111/rda.14497

    CrossRef   Google Scholar

    [45] Jiang L, Liu J, Sun D, Ma P, Ding X, et al. 2010. Genome wide association studies for milk production traits in Chinese Holstein population. PLoS One 5:e13661 doi: 10.1371/journal.pone.0013661

    CrossRef   Google Scholar

    [46] Anantamongkol U, Charoenphandhu N, Wongdee K, Teerapornpuntakit J, Suthiphongchai T, et al. 2010. Transcriptome analysis of mammary tissues reveals complex patterns of transporter gene expression during pregnancy and lactation. Cell Biology International 34:67−74 doi: 10.1042/CBI20090023

    CrossRef   Google Scholar

    [47] Sheridan JM, Ritchie ME, Best SA, Jiang K, Beck TJ, et al. 2015. A pooled shRNA screen for regulators of primary mammary stem and progenitor cells identifies roles for Asap1 and Prox1. BMC Cancer 15:221 doi: 10.1186/s12885-015-1187-z

    CrossRef   Google Scholar

    [48] Roarty K, Shore AN, Creighton CJ, Rosen JM. 2015. Ror2 regulates branching, differentiation, and actin-cytoskeletal dynamics within the mammary epithelium. The Journal of Cell Biology 208(3):351−66 doi: 10.1083/jcb.201408058

    CrossRef   Google Scholar

    [49] Kessenbrock K, Smith P, Steenbeek SC, Pervolarakis N, Kumar R, et al. 2017. Diverse regulation of mammary epithelial growth and branching morphogenesis through noncanonical Wnt signaling. Proceedings of the National Academy of Sciences of the United States of America 114(12):3121−26 doi: 10.1073/pnas.1701464114

    CrossRef   Google Scholar

    [50] Sharma , Aggarwal , Sodhi , Kishore , Mishra , et al. 2014. Stage specific expression of ATP-binding cassette and solute carrier superfamily of transporter genes in mammary gland of riverine buffalo (Bubalus bubalis). Animal Biotechnology 25(3):200−9 doi: 10.1080/10495398.2013.839949

    CrossRef   Google Scholar

    [51] Wintermantel TM, Bock D, Fleig V, Greiner EF, Schütz G. 2005. The epithelial glucocorticoid receptor is required for the normal timing of cell proliferation during mammary lobuloalveolar development but is dispensable for milk production. Molecular Endocrinology 19(2):340−49 doi: 10.1210/me.2004-0068

    CrossRef   Google Scholar

    [52] Tian H, Luo J, Shi H, Chen X, Wu J, et al. 2020. Role of peroxisome proliferator-activated receptor-α on the synthesis of monounsaturated fatty acids in goat mammary epithelial cells. Journal of Animal Science 98(3):skaa062 doi: 10.1093/jas/skaa062

    CrossRef   Google Scholar

    [53] Bagci H, Laurin M, Huber J, Muller WJ, Côté JF. 2014. Impaired cell death and mammary gland involution in the absence of Dock1 and Rac1 signaling. Cell Death & Disease 5(8):e1375 doi: 10.1038/cddis.2014.338

    CrossRef   Google Scholar

    [54] Seifert A, Posern G. 2017. Tightly controlled MRTF-a activity regulates epithelial differentiation during formation of mammary acini. Breast Cancer Research 19:68 doi: 10.1186/s13058-017-0860-3

    CrossRef   Google Scholar

    [55] Melcher ML, Block I, Kropf K, Singh AK, Posern G. 2022. Interplay of the transcription factor MRTF-A and matrix stiffness controls mammary acinar structure and protrusion formation. Cell Communication and Signaling 20:158 doi: 10.1186/s12964-022-00977-2

    CrossRef   Google Scholar

    [56] Morlon A, Smahi A, Munnich A. 2006. New genes candidates for ectodermal dysplasia: TAB2, TRAF6 and TAK1. Medecine Sciences 22(3):229−30 doi: 10.1051/medsci/2006223229

    CrossRef   Google Scholar

    [57] Wickenden JA, Watson CJ. 2010. Key signalling nodes in mammary gland development and cancer. Signalling downstream of PI3 kinase in mammary epithelium: a play in 3 Akts. Breast Cancer Research 12:202 doi: 10.1186/bcr2558

    CrossRef   Google Scholar

    [58] Khan MZ, Khan A, Xiao J, Ma Y, Ma J, et al. 2020. Role of the JAK-STAT pathway in bovine mastitis and milk production. Animals 10(11):2107 doi: 10.3390/ani10112107

    CrossRef   Google Scholar

    [59] Boras-Granic K, Hamel PA. 2013. Wnt-signalling in the embryonic mammary gland. Journal of Mammary Gland Biology and Neoplasia 18(2):155−63 doi: 10.1007/s10911-013-9280-x

    CrossRef   Google Scholar

  • Cite this article

    Ma J, Liu C, Cai W, Liu Q, Yu J, et al. 2025. Genome-wide selection signature analysis identified new candidate genes for the teat number in three distinct strains of Large White pigs. Animal Advances 2: e004 doi: 10.48130/animadv-0025-0002
    Ma J, Liu C, Cai W, Liu Q, Yu J, et al. 2025. Genome-wide selection signature analysis identified new candidate genes for the teat number in three distinct strains of Large White pigs. Animal Advances 2: e004 doi: 10.48130/animadv-0025-0002

Figures(6)  /  Tables(2)

Article Metrics

Article views(2545) PDF downloads(787)

RESEARCH ARTICLE   Open Access    

Genome-wide selection signature analysis identified new candidate genes for the teat number in three distinct strains of Large White pigs

Animal Advances  2 Article number: e004  (2025)  |  Cite this article

Abstract: The number of teats is a critical reproductive trait in sows, exerting a direct impact on the survival rate of weaned piglets. Existing studies on selection signals associated with teat numbers in different strains of Large White pigs are scarce. We performed selection signal detection on three distinct strains of Large White pigs with 16 or more nipples: the French CG (Choice Genetics), the French Cooperl, and the Dutch Topigs. Our goal was to identify pivotal candidate genes contributing to the higher nipple count in Large White pigs. A comprehensive analysis, including CLR (Composite Likelihood Ratio), iHS (integrated haplotype score), and nSL (number of segregating sites by length) methods, alongside XP-EHH (Cross-population Extended Haplotype Homozygosity), was undertaken to identify genetic positive selection signatures in Large White pigs. Through our analysis, we identified six key candidate genes in the CG Large White pigs: DPP6, COL22A1, TRPC7, PROX1, ROR2, and ABCG1. In the Cooperl Large White pigs, the key candidate genes were PPARA, TRPC7, and NR3C1. In the Topigs Large White pigs, the key candidate genes were DOCK1, PCM1, and MRTFA. Among the aforementioned genes, PCM1 and DOCK1 have been identified as correlated with the number of teats in pigs, while TRPC7, DOCK1, and MRTFA have shown a significant link with teat number based on PheWAS (Phenome-wide association study) analysis. Our findings show that multiple genes associated with teat development in Large White pigs have experienced significant selection pressure throughout the breeding program. These results deepen our understanding of the progress in breeding Large White pigs and provide a valuable framework for their continued selection.

    • The domestication of pigs from wild boars was a consequence of human selection. About one million years ago, the European and Asian boar populations began to diverge[1]. Roughly 9,000 years ago, in the initial phase of pig domestication, distinct populations of domestic pigs in East Asia and Europe started to follow separate breeding and evolutionary paths[2,3]. This divergence resulted in the emergence of unique genetic groups. During this period, extensive artificial selection caused significant phenotypic changes in domestic pigs. Phenotypic changes suggest that specific mutations occurred in genes associated with particular traits. These mutations lead to modifications in the genetic information, known as selection signatures, within the genome.

      Identifying genes under selective pressures is a primary objective in the study of pig domestication. Numerous studies have used genomic methodologies to identify genetic changes underlying phenotypic alterations in pigs during domestication. Wang et al. pinpointed a series of genes that are crucial for the performance of several traits[4]. For example, the PTPRM gene is essential in controlling various cellular functions, while the HACD3 gene is pivotal for aiding the accumulation of body fat. Moon et al.[5] identified several genes, such as ABLIM1, CXADR, INSR, RIMS1, and SYNE1, that are connected to the control of growth during development and the formation of anatomical structures. Wang et al.[6] have shown that the color-associated genes MITF and EDNRB have undergone strong selective breeding in the Chinese White-spotted pig breed.

      In the domain of pig farming, teat count is a key morphological and reproductive trait. Healthy teats yield milk rich in essential nutrients, providing a vital energy source for the growth and development of young piglets. Zhuang et al.[7] have determined that the VRTN gene's influence on the number of teats may be through linkage with the mutation responsible for the trait. Martins et al.[8] constructed a network that incorporates various biological processes and transcription factors, successfully identifying candidate genes linked to teat number. Within this set, GHR, IFT80, FSTL3, SKOR1, SMURF1, and AKT3 have emerged as the most promising candidates.

      The Western pig, characterized by its large White pig breed, has undergone substantial artificial selection during its evolutionary history, leading to marked divergence in teat number breeds. This variation has arisen due to extensive selection processes carried out by companies with unique breeding goals. Several prominent Large White pig strains, such as the French CG, the French Cooperl, and the Dutch Topigs strains, have a higher average teat count of approximately 16 compared to 14 teats of the Danish Large White breed, enabling them to effectively meet the daily lactation requirements of piglets. Numerous studies have explored selection signatures; however, the molecular mechanisms responsible for the higher number of nipples in Large White pig strains under selection pressure need further study.

      The present research focuses on identifying selection regions and candidate genes that contribute to the higher number of teats in three distinct Large White pig strains, through an analysis of their selection signatures. Four methods (CLR, nSL, iHS, and XP-EHH) were employed for detecting selection signals to analyze the complete genomes of three prominent domesticated pig breeds: French CG Large White, French Cooperl Large White, and Dutch Topigs Large White. Danish Large White pigs served as the outgroup for comparative analysis. The present findings will shed light on the genomic traits of Large White pigs, offering valuable insights into their selection and breeding in the context of teat number.

    • Phenotypic data was collected on teat number for various Large White pigs, including CG, Topigs, Cooperl, and DanBred from local Chinese companies, originating from different companies and countries. Shandong Huanshan Group had already introduced Topigs Large White pigs from the Dutch Topigs Company in 2011. In 2020, Henan Minwang Company introduced CG Large White pigs from France, and Henan Fengyuan Hepu Company introduced Cooperl Large White pigs from the French Cooperl Company. Liyuan Group introduced Danish Large White pigs from DanBred. Each strain is a purebred core breeding population without crossbreeding. We have partitioned the samples from each breed into families and utilized HIBLUP[9] software to calculate the estimated breeding values for the total teat number trait within each breed. By integrating family data and estimated breeding values, 50 individuals were meticulously selected with an exceptionally high number of teats from each breed.

    • The Megi Universal Nucleic Acid Extraction Kit was employed to isolate genomic DNA from ear tissue samples. The quantity and quality of the extracted DNA were assessed using a NanoDrop 2000 spectrophotometer, verifying that the OD260/280 ratio was within the range of 1.7 to 2.1 and that the DNA concentration exceeded 50 ng/μL. Genotyping was conducted with the Compass Porcine 50K Plus Breeding Beadchip from Tianjin, China, encompassing 57,466 SNPs across the entire porcine genome. The physical locations of these SNPs were aligned to the Sus scrofa 11.1 assembly (Sscrofa11.1) of the pig reference genome. Quality control measures were implemented using PLINK v1.9 software[10]. Samples with SNP call rates below 90% were excluded. SNPs with genotype-missing rates exceeding 0.1 and a minor allele frequency (MAF) below 0.05 were also discarded. The analysis was restricted to SNPs located on autosomal chromosomes.

      In the present research, a dataset comprising whole-genome resequencing from 1,662 pigs formed the basis of the reference panel. This dataset included resequencing information from 1,602 pigs of diverse breeds sourced through the PigGTEX initiative[11], and 60 Suhuai pigs[12]. Beagle software (version 5.2) with its standard settings was utilized to perform haplotype phasing on the reference panel[13] and to impute SNP-chip data to achieve whole-genome resolution[14]. SNPs were filtered out if they had a dosage R-squared (DR2) value below 0.9 and a minor allele frequency (MAF) less than 0.05. Following stringent quality control assessments, a total of 9,705,084, 9,214,835, 8,642,894, and 6,328,251 SNPs corresponding to CG, Cooperl, Topigs, and Danbred were chosen for analysis.

    • PLINK v1.9[10] was utilized to assess genetic diversity in four distinct Large White pig populations by calculating both observed (Ho), and expected (He) heterozygosity. VCFtools[15] was utilized to calculate nucleotide diversity for each breed using a window of 500 kb with a step of 50 kb. PopLDdecay v3.40 software was employed to evaluate the genome-wide linkage disequilibrium patterns by calculating the r2 values for SNPs pairs across Large White pig populations within a 500 kb window[16].

    • To delve into the genetic connections among cohorts of Large White pigs, we began by refining the marker dataset, eliminating SNPs with substantial relatedness within a 50 SNP window. The window was shifted in intervals of 5 SNPs, with an r2 threshold set at 0.2. After conducting linkage disequilibrium (LD) filtering, PLINK v1.9 software[10] was used to perform principal component analysis (PCA). The PCA plot was then generated using the ggplot2 package in R. The PLINK format file was converted into MEGA format, and a neighbor-joining phylogenetic tree (NJ-tree) was assembled utilizing the p-distance algorithm in MEGA's neighbor-joining module, version 6[17]. The results were visualized using FigTree v1.4.3.

    • The site-frequency spectrum (SFS) is defined by the frequency at which various allele frequencies occur within a specific target area of the genome. Methods based on the SFS are highly effective at detecting selection signals that have become fixed, with the Composite Likelihood Ratio (CLR) method showing greater detection power than other methods based on the SFS[18]. The CLR test compares the maximum likelihood under neutral conditions (absence of selection sweep) with the maximum likelihood under scenarios of selective sweep[19]. Allele frequency data were utilized to compute CLR test values for scanning regions with complete sweeps. Employing the SweeD software, the CLR was calculated with a 50 kb grid dimension. The highest 1% of the CLR value spectrum from empirical data was used to detect potential regions, and genes located within these regions were designated as candidate genes[20].

      Statistical tests based on linkage disequilibrium are highly effective in detecting recent positive selection signatures. These tests are fundamentally rooted in the theory of long-range extended haplotype homozygosity. The construction principles of nSL and iHS (integrated haplotype score) statistics are similar; iHS uses a single marker site to replace the core haplotype in the statistical measure, defining this site as the core site. iHS integrates the genetic distance of the extended haplotypes, where the ancestral alleles are located at the core site and where the newly mutated alleles are located, and calculates the ratio between the two to detect selection signals. The iHS evaluates the relative decay of EHH between ancestral and derived core alleles[21]. Compared to iHS, nSL measures the length of the haplotype homozygosity segment between pairs of haplotypes in the same region of the dataset in terms of the number of mutations, thereby enhancing robustness to recombination and/or variations in mutation rates. The nSL statistic assesses haplotype length based on the number of segregating sites and has shown improved efficacy in detecting soft sweeps[22]. The XP-EHH statistic assesses the extended haplotype homozygosity within the same genomic region across two populations, detecting signals of ongoing or near-fixed selection[23]. Selscan v.1.0.4 software was used to perform nSL, iHS, and XP-EHH scans[24]. The nSL and iHS statistics were calculated with a scaling factor of 20,000, a maximum interval of 200,000, and an EHH threshold of 0.05. The nSL, iHS, and XP-EHH results were normalized by frequency into 100 bins using the 'norm' package included in selscan[24]. The test statistic was computed by calculating the mean normalized value for each 50 kb region. The top 1% of regions exhibiting the highest mean |iHS|, |nSL|, and XP-EHH scores were considered potential targets of positive selection.

    • To investigate the biological roles of the candidate genes, the Ensembl BioMart tool (https://asia.ensembl.org/index.html) was utilized for gene annotation in the selected regions. The Phenome-Wide Association Study (PheWAS) has been widely used to associate genetic variants with phenotypes[25]. The PheWAS module available on the website (http://pigbiobank.farmgtex.org) was used to explore the connections between the genes of interest identified here and the various complex traits present within the PigBiobank[26]. To enhance the comprehension of the functions of the candidate genes and their related signaling pathways, GO and KEGG pathway enrichment analyses were performed via the KOBAS platform.

    • Tthe maximum, minimum, mean, standard error, and coefficient of variation values for teat number across different groups of Large White pigs are presented. Each company collected 50 ear tissue samples, including samples from various family lines (Table 1).

      Table 1.  Descriptive statistics of the number of teats in 50 Large White pigs of each strain.

      Groups N1 Min2 Max3 Mean4 ± SE5
      French CG Large White pigs 50 16 20 18.27 ± 0.10
      French Cooperl Large White pigs 50 18 20 18.32 ± 0.07
      Dutch Topigs Large White pigs 50 18 19 18.26 ± 0.06
      Danish Large White pig 50 10 13 12.40 ± 0.09
      1 Number of individuals with phenotypic records. 2 Minimum of phenotype. 3 Maximum of phenotype. 4 Mean of phenotype. 5 Standard error.
    • After performing quality control procedures, the average Ho, He, and nucleotide diversity were calcuated to evaluate the polymorphism within four Large White pig populations. As shown in Supplementary Table S1, He was significantly lower than the Ho in each population among all the groups analyzed in this study. In the analysis of nucleotide diversity, a substantial difference was identified between the CG and Topigs strains, with the CG strain having the highest nucleotide diversity. The influence of artificial selection on the levels of genome LD within each population was observable. Research indicates that artificial selection increases the degree of linkage disequilibrium in a population. In this study, the LD attenuation rate was highest for the CG strain, followed by the Cooperl and Topigs strains, with the Danbred strain exhibiting the slowest rate (Supplementary Fig. S1).

    • PCA was conducted to explore clustering patterns among the animals included in the study (Fig. 1). The PCA results demonstrated distinct separation among the Danbred and Topigs strains, and the CG and Cooperl strains. Subsequently, an NJ-tree was built from the matrix of pairwise genetic distance among individuals. The phylogenetic revealed that members of the CG and Cooperl groups clustered closely, forming a distinct branch with a high level of genetic coherence, which is consistent with the observations from PCA.

      Figure 1. 

      Analysis of the population structure in four Large White pigs. (a) Principal component analysis. (b) Neighbor-joining tree.

    • Within this research, four methods: iHS, nSL, CLR, and XP-EHH were employed to identify genomic areas that have undergone recent selection in Large White pigs. Figure 2 presents the SNPs that were determined to be under selection pressure in the CG strain through the conducted tests. In the CG strain, 179,186,252, and 86 candidate genes were identified (Supplementary Table S2) that may be subject to selection according to the CLR, iHS, nSL, and XP-EHH tests, respectively. The results of the four tests conducted on the Cooperl group are displayed in Fig. 3. Within the Cooperl strain, 196,188,275, and 94 candidate genes were found (Supplementary Table S2) as potentially under selection through the CLR, iHS, nSL, and XP-EHH tests, respectively. The Manhattan plot shown in Fig. 4 highlights the discovery of SNPs that may have been influenced by selective pressures within the Topigs strain. These SNPs were evaluated using the CLR, iHS, nSL, and XP-EHH tests. The analysis identified 204, 198, 252, and 113 candidate genes (Supplementary Table S2), respectively, suggesting their potential for being under selection. In this study, genes related to teat number were found by utilizing the Pig QTLdb database (Table 2).

      Figure 2. 

      Manhattan plots for selective sweep analyses of CG strain with the (a) CLR, (b) iHS, (c) nSL, and (d) XP-EHH analyses.

      Figure 3. 

      Manhattan plots for selective sweep analyses of Cooperl strain with the (a) CLR, (b) iHS, (c) nSL, and (d) XP-EHH analyses.

      Figure 4. 

      Manhattan plots for selective sweep analyses of Topigs strain with the (a) CLR, (b) iHS, (c) nSL, and (d) XP-EHH analyses.

      Table 2.  Potential candidate genes detected using all methods*.

      Group Methods Genes Trait
      CG nSL PAFAH1B1 Teat number
      UBE2G1 Teat number
      DNAH12 Teat number
      PCM1 Teat number
      PACC1 Teat number
      iHS CDH13 Teat number
      CLR JAZF1 Teat number
      Cooperl CLR ARL4C Teat number
      RAPGEF2 Teat number
      Topig nSL DOCK1 Teat number
      PCM1 Teat number
      CLR DNAH12 Teat number
      DOCK1 Teat number
      PCM1 Teat number
      * This table is based on the information retrieved in the pig QTLdb database (accessed on 2 April 2024).

      The CG strain identified four genes (UMODL1, DPP6, ARHGAP26, and COL22A1) (Fig. 5a, Supplementary Table S3) using CLR, nSL, and iHS methods. Additionally, the CG strain detected the presence of four genes (TRPC7, SMIM32, ARMH1, and PROX1) (Fig. 5a, Supplementary Table S3) by simultaneously employing the nSL, iHS, and XP-EHH methods. The Cooperl strain identified three genes (MACROD2, IGSF3, and ZSWIM5) using CLR, nSL, and iHS (Fig. 5b, Supplementary Table S3). Meanwhile, the Cooper strain also identified two genes, TRPC7 and CDH8, using the nSL, iHS, and XP-EHH analysis methods (Fig. 5b, Supplementary Table S3). In the Topigs strain, the analysis using CLR, nSL, and iHS methods revealed seven overlapping genes (TAB2, USP20, FNBP1, NCS1, DYTN, TTC1, and ADRA1B) (Fig. 5c, Supplementary Table S3). Concurrently, by applying the nSL, iHS, and XP-EHH analytical techniques, the Topigs strain identified three genes, ZC3H12D, TAB2, and GRAP2 (Fig. 5c, Supplementary Table S3).

      Figure 5. 

      The Venn diagram illustrates the overlap of genes between four different approaches. (a) CG, (b) Cooperl, (c) Topigs.

      To further explore the functions of the candidate genes in pigs, PheWAS analysis was conducted for these genes. Based on GWAS summary statistics, the PheWAS analysis revealed significant correlations between the candidate genes TRPC7 (P = 8.62 × 10−9), DOCK1 (P = 1.81 × 10−8), MRTFA (P = 1.34 × 10−6), and the phenotypic trait of teat number in pigs (Fig. 6). The point plot shows gene-trait associations from gene-based analysis. This result suggests that these three genes are likely candidate genes affecting the trait of teat number in pigs.

      Figure 6. 

      Results of the PheWAS analysis based on the identified candidate genes.

      For each population, GO functional annotation and KEGG pathway analysis were performed on the genes identified through all methods. In the CG strain of Large White pigs, GO and KEGG analyses found 35 Gene Ontology (GO) terms and 11 pathways, including those for animal organ morphogenesis (GO:0009887, p = 0.018), PI3K-Akt signaling pathway[27] (KEGG: ssc04151, p = 0.034), and Jak-STAT signaling pathway[28] (KEGG: ssc04630, p = 0.038) (Supplementary Table S4). The Cooperl strain showed enrichment for the Wnt signaling pathway (KEGG: ssc04310, p = 0.01) (Supplementary Table S4). This pathway plays a crucial role in the developmental processes of mammary gland differentiation[29]. Unfortunately, the pathways related to mammary gland development did not show significant enrichment within the Topigs Large White pig population.

    • The Large White pig, also known as the Yorkshire pig, is a prominent breed of domestic pig originating from England. Selective breeding by different companies has led to the diversification of this breed into a range of unique phenotypes and genetic strains. This deliberate breeding has significantly influenced the genetic makeup of the Large White pig, particularly affecting loci regulated by major genes that determine key traits of the breed[3032]. By conducting a comparative analysis of different strains of Large White pigs, we have assessed their genetic diversity, population structure, and phylogenetic relationships. Artificial selection might have etched a distinctive imprint into the genomic sequences of individuals in the population[23,33,34]. In the present investigation, different highly sensitive detection techniques were implemented to detect recent selection signatures[35]. The main objective of this research was to pinpoint the potential genes linked to the higher number of teats in Large White pigs as a result of domestication and selective breeding.

      The nucleotide diversity in the Cooperl strain of Large White pigs (mean θπ = 0.00145) was found to be similar to that of the CG strain (mean θπ = 0.00146) but greater than that of the Topigs strain of Large White pigs (mean θπ = 0.00138). The present analysis discovered characteristics of the Cooperl strain within Large White pigs exhibited levels of expected and observed heterozygosity comparable to those of the CG strain. This similarity can be attributed to their similar genetic background. Artificial selection can cause a rise in linkage disequilibrium within the population[36,37]. It was observed that the CG strain of Large White pigs exhibited a rapid decline in LD compared to the Cooperl and Topigs strains.

      The results of PCA and NJ-tree analysis demonstrate the genetic relationships and differentiation between strains of Large White pigs. In both the PCA and NJ-tree, the CG and Cooperl strains cluster together, which might be associated with their similar genetic backgrounds.

      In this study, several genes have been identified that have been reported to influence the count of nipples in pigs. Specifically, PAFAH1B1, RAPGEF2, JAZF1, and ARL4C are identified as influencing teat number in Large White pigs[3840]. In Landrace pigs, UBE2G1, ARL4C, and CDH13 are associated with teat count[39,41], while DOCK1 affects teat number in Duroc pigs[7]. Additionally, DNAH12, PCM1, and PACC1 are found to influence teat number in Qingping pigs[42]. Notably, this study has identified PCM1 and DOCK1 as potential candidate genes that may affect teat number in Large White pigs.

      Through literature review, it was determined that specific genes within potentially selected regions are associated with essential functional traits. In the CG strain, the DPP6 gene is a significant contributor to lactation performance in dairy cows and is a potential candidate for improving milk production traits[43,44]. COL22A1 was identified as a significant candidate gene for milk yield, fat percentage, and protein yield through GWAS, conducted in Chinese Holstein cattle[45]. The significant up-regulation of TRPC7 channels during lactation indicates that these channels are likely essential for facilitating the transport of calcium ions (Ca2+) across mammary epithelial cells[46]. The PROX1 gene is critical for regulating the activity of mammary basal progenitor cells, acting as a negative regulator of mammary stem/progenitor function[47]. Disruption of ROR2 impacts the branching and differentiation of mammary epithelium[48,49]. ABCG1 is essential for modulating lipid transport within the mammary gland, exhibiting distinct physiological functions that change with the gland's functional stage[50].

      In the Cooperl strain, the TRPC7 gene was also identified. Moreover, NR3C1 was found to play a pivotal role in regulating cell proliferation during mammary lobuloalveolar development[51]. PPARA may boost the production of monounsaturated fatty acids in goat mammary epithelial cells by modulating the mRNA levels of genes involved in fatty acid synthesis, oxidation, transportation, and triacylglycerol synthesis[52].

      In the Topigs strain, several genes have been identified as playing important roles. Notably, DOCK1 plays a significant role in mammary gland involution, and DOCK1-null mammary glands fail to activate Stat3 properly[53]. MRTFA is vital for the proliferation and formation of mammary acini from luminal epithelial cells[54,55]. The TAB2[56] gene is a potential candidate for influencing ectodermal development and is likely to be involved in the early stages of mammary gland development.

      CG and Cooperl are well-known French Large White pig breeds. Using iHS and nSL methods, we identified overlapping genes in CG and Cooperl Large White pigs, including TRPC7, SPOCK1, NR3C1, and ARHGAP26. The TRPC7 gene may serve as a critical component in facilitating the transport of calcium ions (Ca2+) across mammary epithelial cells. The NR3C1 gene is essential for regulating cell proliferation during the development of mammary lobules and alveoli. Phenome-wide association studies (PheWAS) is an analytical method that explores the associations between all phenotypes and a certain exposure within the entire phenome. PigBiobank stands as the most extensive and comprehensive database to date, amassing genotype and phenotype data from 71,885 pigs across more than 100 breeds. We leveraged the PheWAS module within this platform for the present analyses and discovered significant associations between the genes TRPC7, MRTFA, and DOCK1 and the trait of teat number. We hypothesize that the TRPC7 gene is a significant candidate influencing teat number in French Large White pigs. Similarly, we propose that the MRTFA and DOCK1 genes are important candidates influencing teat number in Topigs Large White pigs.

      Significant enrichment of 18, 11, and 14 genes was identified in the PI3K-Akt, Jak-STAT, and Wnt signaling pathways, respectively. The PI3K/Akt signaling pathway is indispensable for managing key cellular processes such as proliferation, differentiation, and apoptosis in mammary epithelial cells[57]. The JAK-STAT pathway is pivotal in signaling cascades within mammary epithelial tissue, significantly contributing to lactation and mammary gland development[28,58]. The Wnt signaling pathway plays a critical role at every phase of embryonic mammary gland development, exerting canonical influence on both mammary epithelial cells and their associated mesenchyme[59].

    • In this study, it was confirmed that the different strains of Large White pigs have developed their distinct genetic structure features, and several candidate genes were identified, including DPP6, COL22A1, TRPC7, PROX1, ROR2, ABCG1, PPARA, NR3C1, DOCK1, PCM1, and MRTFA, that may exert potential influence on the development of the mammary gland. Among these, DPP6, COL22A1, TRPC7, PROX1, ROR2, ABCG1, PPARA, NR3C1, and MRTFA have been newly identified as genes of interest. The present research sheds new light on the distinct genetic characteristics and evolutionary relationships among various strains of Large White pigs. This study offers novel insights into the genetic mechanisms that govern teat number phenotypes in selected populations.

      • This work was supported by the Jiangsu Seed Industry Revitalization Project (JBGS[2021]024; JBGS[2021]098), STI 2030 - Major Projects (2023ZD04045), Postdoctoral Fellowship Program of CPSF under Grant Number 2024M751448, and the Jiangsu Funding Program for Excellent Postdoctoral Talent.

      • All animal experiments were performed according to the Guidelines for the Care and Use of Laboratory Animals prepared by the Institutional Animal Welfare and Ethics Committee of Nanjing Agricultural University, Nanjing, China [Certification No.NJAULLSC2022069], approval date: 2022/7/16. The research followed the 'Replacement, Reduction, and Refinement' principles to minimize harm to animals. This article provides details on the housing conditions, care, and pain management for the animals, ensuring that the impact on the animals is minimized during the experiment.

      • The authors confirm contribution to the paper as follows: study conception and design: Huang R, Li P, Zhao Q, Su G; data collection: Cai W, Yu J, Zhou J; analysis and interpretation of results: Ma J, Liu C, Liu Q; draft manuscript preparation: Ma J, Ai X. All authors reviewed the results and approved the final version of the manuscript.

      • The authors declare that they have no conflict of interest. Ruihuang Huang is the Editorial Board member of Animal Advances who was blinded from reviewing or making decisions on the manuscript. The article was subject to the journal's standard procedures, with peer-review handled independently of this Editorial Board member and the research groups.

      • Copyright: © 2025 by the author(s). Published by Maximum Academic Press on behalf of Nanjing Agricultural University. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
    Figure (6)  Table (2) References (59)
  • About this article
    Cite this article
    Ma J, Liu C, Cai W, Liu Q, Yu J, et al. 2025. Genome-wide selection signature analysis identified new candidate genes for the teat number in three distinct strains of Large White pigs. Animal Advances 2: e004 doi: 10.48130/animadv-0025-0002
    Ma J, Liu C, Cai W, Liu Q, Yu J, et al. 2025. Genome-wide selection signature analysis identified new candidate genes for the teat number in three distinct strains of Large White pigs. Animal Advances 2: e004 doi: 10.48130/animadv-0025-0002

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return