Assembly of a pan-genome for global cattle reveals missing sequence and novel structural variation, providing new insights into their diversity and evolution history
2022
Zhou, Yang | Yang, Lv | Han, Xiaotao | Hu, Yan | Li, Fan | Xia, Han | Han, Jiazheng | Peng, Lingwei | Boschiero, Clarissa | Rosen, Benjamin D. | Bickhart, Derek M. | Zhang, Shujun | Guo, Aizhen | Tassell, Curtis P. | Smith, Timothy P. | Yang, Liguo | Liu, Ge
Using an integrated bioinformatics pipeline, we generated an enhanced structural variation (SV) catalog from the genome sequences of 898 cattle covering 60 breeds worldwide, resulting in ~3.3 million deletions, ~0.13 million duplications, and ~0.15 million inversions. In addition, we built a cattle pan-genome, revealing ~74 Mb or ~2.3% novel sequences beyond the current cattle reference genome assembly, ARS-UCD1.2. After examining the sequence features of deletions near their breakpoints, we performed deletion-based population genetic analyses, producing breed ancestry and hybridization results similar to those derived from single nucleotide polymorphisms (SNPs). We discovered hundreds of deletions with frequency differentiation across subspecies and breeds, including dozens previously reported as the lead variants at their corresponding loci. A Bov-tA1 insertion/deletion event in the first intron of APPL2, potentially affecting immune response and olfactory functions and mediating growth factor–induced cell proliferation and glucose metabolism in muscle, corresponds to the geographic distribution of cattle breeds. Therefore, we conclude that domestication, breeding, and adaptive introgression have remodeled the domestic cattle genomes, and that the pan-genome is a valuable resource for studying their diversity and evolutionary history.
Bibliographic information
This record was provided by the National Agricultural Library