SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses

Detalhes bibliográficos
Autor(a) principal: Borges, Rafael Junqueira [UNESP]
Data de Publicação: 2020
Outros Autores: Meindl, Kathrin, Triviño, Josep, Sammito, Massimo, Medina, Ana, Millán, Claudia, Alcorlo, Martin, Hermoso, Juan A., Fontes, Marcos Roberto De Mattos [UNESP], Usón, Isabel
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1107/S2059798320000339
http://hdl.handle.net/11449/201603
Resumo: Fragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content.
id UNSP_e6498aa6bbe6ad933ea0e9deb0372f5f
oai_identifier_str oai:repositorio.unesp.br:11449/201603
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypothesesARCIMBOLDOfragment-based molecular replacementmolecular replacementPhaserphasingSEQUENCE SLIDERSHELXEside-chain extensionFragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content.Crystallographic Methods Institute of Molecular Biology of Barcelona (IBMB-CSIC), Baldiri Reixach 15Departamento de Física e Biofísica Instituto de Biociências Universidade Estadual Paulista (UNESP)Department of Haematology Cambridge Institute for Medical Research University of Cambridge, Hills RoadDepartment of Crystallography and Structural Biology Instituto de Química-Física Rocasolano Consejo Superior de Investigaciones Científicas (CSIC)Icrea at IBMB-CSIC, Baldiri Reixach 13-15Departamento de Física e Biofísica Instituto de Biociências Universidade Estadual Paulista (UNESP)Institute of Molecular Biology of Barcelona (IBMB-CSIC)Universidade Estadual Paulista (Unesp)University of CambridgeConsejo Superior de Investigaciones Científicas (CSIC)Icrea at IBMB-CSICBorges, Rafael Junqueira [UNESP]Meindl, KathrinTriviño, JosepSammito, MassimoMedina, AnaMillán, ClaudiaAlcorlo, MartinHermoso, Juan A.Fontes, Marcos Roberto De Mattos [UNESP]Usón, Isabel2020-12-12T02:36:53Z2020-12-12T02:36:53Z2020-03-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article221-237http://dx.doi.org/10.1107/S2059798320000339Acta Crystallographica Section D: Structural Biology, v. 76, p. 221-237.2059-7983http://hdl.handle.net/11449/20160310.1107/S20597983200003392-s2.0-85081043471Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengActa Crystallographica Section D: Structural Biologyinfo:eu-repo/semantics/openAccess2021-10-22T20:36:21Zoai:repositorio.unesp.br:11449/201603Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T14:41:09.905325Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
title SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
spellingShingle SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
Borges, Rafael Junqueira [UNESP]
ARCIMBOLDO
fragment-based molecular replacement
molecular replacement
Phaser
phasing
SEQUENCE SLIDER
SHELXE
side-chain extension
title_short SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
title_full SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
title_fullStr SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
title_full_unstemmed SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
title_sort SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
author Borges, Rafael Junqueira [UNESP]
author_facet Borges, Rafael Junqueira [UNESP]
Meindl, Kathrin
Triviño, Josep
Sammito, Massimo
Medina, Ana
Millán, Claudia
Alcorlo, Martin
Hermoso, Juan A.
Fontes, Marcos Roberto De Mattos [UNESP]
Usón, Isabel
author_role author
author2 Meindl, Kathrin
Triviño, Josep
Sammito, Massimo
Medina, Ana
Millán, Claudia
Alcorlo, Martin
Hermoso, Juan A.
Fontes, Marcos Roberto De Mattos [UNESP]
Usón, Isabel
author2_role author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Institute of Molecular Biology of Barcelona (IBMB-CSIC)
Universidade Estadual Paulista (Unesp)
University of Cambridge
Consejo Superior de Investigaciones Científicas (CSIC)
Icrea at IBMB-CSIC
dc.contributor.author.fl_str_mv Borges, Rafael Junqueira [UNESP]
Meindl, Kathrin
Triviño, Josep
Sammito, Massimo
Medina, Ana
Millán, Claudia
Alcorlo, Martin
Hermoso, Juan A.
Fontes, Marcos Roberto De Mattos [UNESP]
Usón, Isabel
dc.subject.por.fl_str_mv ARCIMBOLDO
fragment-based molecular replacement
molecular replacement
Phaser
phasing
SEQUENCE SLIDER
SHELXE
side-chain extension
topic ARCIMBOLDO
fragment-based molecular replacement
molecular replacement
Phaser
phasing
SEQUENCE SLIDER
SHELXE
side-chain extension
description Fragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content.
publishDate 2020
dc.date.none.fl_str_mv 2020-12-12T02:36:53Z
2020-12-12T02:36:53Z
2020-03-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1107/S2059798320000339
Acta Crystallographica Section D: Structural Biology, v. 76, p. 221-237.
2059-7983
http://hdl.handle.net/11449/201603
10.1107/S2059798320000339
2-s2.0-85081043471
url http://dx.doi.org/10.1107/S2059798320000339
http://hdl.handle.net/11449/201603
identifier_str_mv Acta Crystallographica Section D: Structural Biology, v. 76, p. 221-237.
2059-7983
10.1107/S2059798320000339
2-s2.0-85081043471
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Acta Crystallographica Section D: Structural Biology
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 221-237
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808128400898916352