A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae)

Detalhes bibliográficos
Autor(a) principal: Zerbini, F. Murilo
Data de Publicação: 2013
Outros Autores: Muhire, Brejnev, Martin, Darren P., Brown, Judith K., Navas-Castillo, Jesús, Moriones, Enrique, Rivera-Bustamante, Rafael, Malathi, V. G., Briddon, Rob W., Varsani, Arvind
Tipo de documento: Artigo
Idioma: eng
Título da fonte: LOCUS Repositório Institucional da UFV
Texto Completo: http://dx.doi.org/10.1007/s00705-012-1601-7
http://www.locus.ufv.br/handle/123456789/22639
Resumo: Recent advances in the ease with which the genomes of small circular single-stranded DNA viruses can be amplified, cloned, and sequenced have greatly accelerated the rate at which full genome sequences of mastreviruses (family Geminiviridae, genus Mastrevirus) are being deposited in public sequence databases. Although guidelines currently exist for species-level classification of newly determined, complete mastrevirus genome sequences, these are difficult to apply to large sequence datasets and are permissive enough that, effectively, a high degree of leeway exists for the proposal of new species and strains. The lack of a standardised and rigorous method for testing whether a new genome sequence deserves such a classification is resulting in increasing numbers of questionable mastrevirus species proposals. Importantly, the recommended sequence alignment and pairwise identity calculation protocols of the current guidelines could easily be modified to make the classification of newly determined mastrevirus genome sequences significantly more objective. Here, we propose modified versions of these protocols that should substantially minimise the degree of classification inconsistency that is permissible under the current system. To facilitate the objective application of these guidelines for mastrevirus species demarcation, we additionally present a user-friendly computer program, SDT (species demarcation tool), for calculating and graphically displaying pairwise genome identity scores. We apply SDT to the 939 full genome sequences of mastreviruses that were publically available in May 2012, and based on the distribution of pairwise identity scores yielded by our protocol, we propose mastrevirus species and strain demarcation thresholds of >78 % and >94 % identity, respectively.