DNA Transposons Favor De Novo Transcript Emergence Through Enrichment of Transcription Factor Binding Motifs
2024
Lebherz, Marie Kristin | Fouks, Bertrand | Schmidt, Julian | Bornberg-Bauer, Erich | Grandchamp, Anna | Institute for Evolution and Biodiversity (IEB) ; Westfälische Wilhelms-Universität Münster = University of Münster (WWU) | Centre d’Ecologie Fonctionnelle et Evolutive (CEFE) ; Université Paul-Valéry - Montpellier 3 (UPVM)-École Pratique des Hautes Études (EPHE) ; Université Paris Sciences et Lettres (PSL)-Université Paris Sciences et Lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche pour le Développement (IRD [Occitanie])-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Institut Agro Montpellier ; Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Université de Montpellier (UM) | Amélioration génétique et adaptation des plantes méditerranéennes et tropicales (UMR AGAP) ; Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Institut Agro Montpellier ; Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Université de Montpellier (UM) | Département Systèmes Biologiques (Cirad-BIOS) ; Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad) | Max-Planck-Institut fur Biologie = Max Planck Institute for Biology [Tübingen] ; Max-Planck-Gesellschaft | M.K.L. and A.G. acknowledge funding from the Deutsche Forschungsgemeinschaft priority program “Genomic Basis of Evolutionary Innovations” (SPP 2349), project BO 2544/20-1 awarded to E.B.B. A.G. and E.B.B. 
acknowledge funding from grant 3.3-1213745-Fra-HFST-P from the Alexander von Humboldt-Stiftung. B.F. was funded by the European Union REA through a EU-H2020 Marie Skłodowska-Curie Action, Grant Number 101024100 (TEEPI). We acknowledge support from the Open Access Publication Fund of the University of Münster.
The files containing processed data are available in the Zenodo archive https://doi.org/10.5281/zenodo.8403184, referred to in the main text as “Supplemental Deposit”. Supplemental figures, information, analyses and models are found in the Supplementary Information (SI). All programs are stored on GitHub (https://github.com/MarieLebh). The position frequency matrices (PFMs) of the studied motifs can be downloaded from https://jaspar2020.genereg.net/collection/POLII/ (Pol II database for core motifs) and https://jaspar2022.genereg.net/downloads/ (TFBS motifs; download the insect core non-redundant database).
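As a rough illustration of how a position frequency matrix of the kind distributed by JASPAR can be used to score candidate motifs in an upstream sequence, a minimal Python sketch follows. It is not the authors' pipeline: the toy matrix, the pseudocount, the uniform background frequency and all function names are assumptions made for this example only.

```python
import math

BASES = "ACGT"

def pfm_to_pwm(pfm, pseudocount=0.5, background=0.25):
    """Convert a PFM (dict: base -> list of counts) into log2-odds scores.

    The pseudocount and the uniform background are illustrative assumptions.
    """
    width = len(pfm["A"])
    pwm = []
    for i in range(width):
        total = sum(pfm[b][i] for b in BASES) + 4 * pseudocount
        column = {
            b: math.log2(((pfm[b][i] + pseudocount) / total) / background)
            for b in BASES
        }
        pwm.append(column)
    return pwm

def best_hit(sequence, pwm):
    """Return (score, offset) of the best forward-strand window, or None."""
    width = len(pwm)
    best = None
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        if any(base not in BASES for base in window):
            continue  # skip windows containing N or other ambiguity codes
        score = sum(pwm[i][base] for i, base in enumerate(window))
        if best is None or score > best[0]:
            best = (score, start)
    return best

def relative_score(score, pwm):
    """Rescale a raw score to [0, 1] between the worst and best possible match."""
    max_score = sum(max(col.values()) for col in pwm)
    min_score = sum(min(col.values()) for col in pwm)
    return (score - min_score) / (max_score - min_score)

# Hypothetical 4-bp matrix with consensus TATA (not taken from JASPAR).
toy_pfm = {
    "A": [0, 10, 0, 10],
    "C": [0, 0, 0, 0],
    "G": [0, 0, 0, 0],
    "T": [10, 0, 10, 0],
}
toy_pwm = pfm_to_pwm(toy_pfm)
hit_score, hit_offset = best_hit("GGCTATAGG", toy_pwm)
hit_relative = relative_score(hit_score, toy_pwm)
```

A relative score close to 1 indicates a near-perfect match to the reference motif, while lower values correspond to motifs with lower similarity to the reference. Any threshold applied to such a score would be an analysis choice and is not specified here.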
International audience
English. De novo genes emerge from noncoding regions of genomes via a succession of mutations. Among others, such mutations activate transcription and create a new open reading frame (ORF). Although the mechanisms underlying ORF emergence are well documented, relatively little is known about the mechanisms enabling new transcription events. Yet, in many species a continuum between absent and very prominent transcription has been reported for essentially all regions of the genome. In this study, we searched for de novo transcripts by using newly assembled genomes and transcriptomes of seven inbred lines of Drosophila melanogaster, originating from six European populations and one African population. This setup allowed us to detect sample-specific de novo transcripts and compare them to their homologous nontranscribed regions in other samples, as well as to genic and intergenic control sequences. We studied the association with transposable elements (TEs) and the enrichment of transcription factor motifs upstream of de novo emerged transcripts and compared them with regulatory elements. We found that de novo transcripts overlap with TEs more often than expected by chance. The emergence of new transcripts correlates with regions of high guanine-cytosine content and TE expression. Moreover, upstream regions of de novo transcripts are highly enriched with regulatory motifs. Such motifs are more enriched in new transcripts overlapping with TEs, particularly DNA TEs, and are more conserved upstream of de novo transcripts than upstream of their ‘nontranscribed homologs’. Overall, our study demonstrates that TE insertion is important for transcript emergence, partly by introducing new regulatory motifs from DNA TE families. Significance: In the present study, we used inbred lines of Drosophila melanogaster to detect earlier stages of de novo emerged transcripts in samples.
We determined and studied the impact of transposable elements (TEs) and TFBS motifs on the emergence of de novo transcripts. We show that the insertion of DNA transposons plays a role in de novo transcript emergence. We demonstrate enrichment of transcription factor binding motifs (motifs whose identity to a reference motif is low) upstream of de novo transcripts compared to regions upstream of annotated genes and control nontranscribed intergenic sequences. This enrichment is even more frequent upstream of de novo transcripts overlapping with DNA TEs. Our findings help elucidate the main molecular drivers of transcription gain, namely insertions of DNA TEs and enrichment in transcription factor motifs with lower similarity to the reference. Graphical Abstract: https://academic.oup.com/view-large/figure/476385418/evae134_ga.jpg
Bibliographic information
This bibliographic record was provided by Institut national de la recherche agronomique