Assembly of a pan-genome for global cattle reveals missing sequence and novel structural variation, providing new insights into their diversity and evolution history
2022
Zhou, Yang | Yang, Lv | Han, Xiaotao | Hu, Yan | Li, Fan | Xia, Han | Han, Jiazheng | Peng, Lingwei | Boschiero, Clarissa | Rosen, Benjamin D. | Bickhart, Derek M. | Zhang, Shujun | Guo, Aizhen | Tassell, Curtis P. | Smith, Timothy P. | Yang, Liguo | Liu, Ge
Using an integrated bioinformatics pipeline, we generated an enhanced structural variation (SV) catalog from the genome sequences of 898 cattle covering 60 breeds worldwide, resulting in ~3.3 million deletions, ~0.13 million duplications and ~0.15 million inversions. In addition, we built a cattle pan-genome, revealing ~74 Mb (~2.3%) of novel sequence absent from the current cattle reference genome assembly, ARS-UCD1.2. After examining the sequence features of deletions near their breakpoints, we performed deletion-based population genetic analyses, producing breed ancestry and hybridization results similar to those derived from single nucleotide polymorphisms (SNPs). We discovered hundreds of deletions with frequency differentiation across subspecies and breeds, including dozens previously reported as the lead variants at their corresponding loci. A Bov-tA1 insertion/deletion event in the first intron of APPL2, potentially affecting immune response and olfactory functions and mediating growth factor–induced cell proliferation and glucose metabolism in muscle, corresponds to the geographic distribution of cattle breeds. We therefore conclude that domestication, breeding, and adaptive introgression have remodeled domestic cattle genomes, and that the pan-genome is a valuable resource for studying their diversity and evolutionary history.
Bibliographic information
This bibliographic record was provided by the National Agricultural Library