Massive DNA sequencing has significantly increased the amount of data avail- able for population genetics and molecular ecology studies. However, the paral- lel computation of simple statistics within and between populations from large panels of polymorphic sites is not yet available, making the exploratory analyses of a set or subset of data a very laborious task. Here, we present 4P (parallel processing of polymorphism panels), a stand-alone software program for the rapid computation of genetic variation statistics (including the joint frequency spectrum) from millions of DNA variants in multiple individuals and multiple populations. It handles a standard input file format commonly used to store DNA variation from empirical or simulation experiments. The computational performance of 4P was evaluated using large SNP (single nucleotide polymor- phism) datasets from human genomes or obtained by simulations. 4P was faster or much faster than other comparable programs, and the impact of parallel computing using multicore computers or servers was evident. 4P is a useful tool for biologists who need a simple and rapid computer program to run explor- atory population genetics analyses in large panels of genomic data. It is also particularly suitable to analyze multiple data sets produced in simulation stud- ies. Unix, Windows, and MacOs versions are provided, as well as the source code for easier pipeline implementations.
Citation: | Benazzo, A.; Panziera A.; Bertorelle, G. (2015). 4P: fast computing of population genetics statistics from large DNA polymorphism panels. ECOLOGY AND EVOLUTION, 5 (1): 172-175. doi: 10.1002/ece3.1261 handle: http://hdl.handle.net/10449/25065 |
Internal authors: | |
Organization unit: | Biodiversity and Molecular Ecology Department # CRI_2011-JAN2016 |
Authors: | Benazzo, A.; Panziera A.; Bertorelle, G. |
Title: | 4P: fast computing of population genetics statistics from large DNA polymorphism panels |
Journal: | ECOLOGY AND EVOLUTION |
Issue Date: | 2015 |
Scientific Disciplinary Area: | Settore BIO/11 - Biologia Molecolare |
Keywords ENG: | Allelic spectrum Fst Genetic indicators Genetic variation NGS Software |
Language: | English |
IF: | With Impact Factor ISI |
Publication status: | Published |
Nature of content: | Articolo in rivista/Article |
Digital Object Identifier (DOI): | 10.1002/ece3.1261 |
Appears in Collections: | 01 - Journal article |
Files in This Item:
File | Description | Type | License | |
---|---|---|---|---|
Benazzo et al 2014 4P Panziera 2014.pdf | N/A | ![]() | Open AccessView/Open |