RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
2021
Martin, Darren P. | Varsani, Arvind | Roumagnac, Philippe | Botha, Gerrit | Maslamoney, Suresh | Schwab, Tiana | Kelz, Zena | Kumar, Venkatesh | Murrell, Ben | Institute of Infectious Disease and Molecular Medicine (IDM) ; University of Cape Town | Arizona State University [Tempe] (ASU) | University of Cape Town | Plant Health Institute of Montpellier (UMR PHIM) ; Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-Institut de Recherche pour le Développement (IRD)-Université de Montpellier (UM)-Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement (INRAE)-Institut Agro - Montpellier SupAgro ; Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro) | Département Systèmes Biologiques (Cirad-BIOS) ; Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad) | Ecole Polytechnique Fédérale de Lausanne (EPFL) | Karolinska Institutet [Stockholm] | South African National Research Foundation | Swedish Research Council (2018-02381). | H3Africa
International audience
For the past 20 years, the recombination detection program (RDP) project has focused on the development of a fast, flexible, and easy-to-use Windows-based recombination analysis tool. Whereas previous versions of this tool have relied on considerable user-mediated verification of detected recombination events, the latest iteration, RDP5, is automated enough that it can be integrated within analysis pipelines and run without any user input. The main innovation enabling this degree of automation is the implementation of statistical tests to identify recombination signals that could be attributable to evolutionary processes other than recombination. The additional analysis time required for these tests has been offset by algorithmic improvements throughout the program such that, relative to RDP4, RDP5 will still run up to five times faster and be capable of analyzing alignments containing twice as many sequences (up to 5000) that are five times longer (up to 50 million sites). For users wanting to remove signals of recombination from their datasets before using them for downstream phylogenetics-based molecular evolution analyses, RDP5 can disassemble detected recombinant sequences into their constituent parts and output a variety of recombination-free datasets in an array of alignment formats. For users who are interested in exploring the recombination history of their datasets, all the manual verification, data management, and data visualization components of RDP5 have been extensively updated to minimize the time users need to verify and refine the program's interpretation of each recombination event that it detects.
Bibliographic information
This bibliographic record has been provided by Institut national de la recherche agronomique