SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , , , , , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.1107/S2059798320000339 http://hdl.handle.net/11449/201603 |
Resumo: | Fragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content. |
id |
UNSP_e6498aa6bbe6ad933ea0e9deb0372f5f |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/201603 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypothesesARCIMBOLDOfragment-based molecular replacementmolecular replacementPhaserphasingSEQUENCE SLIDERSHELXEside-chain extensionFragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content.Crystallographic Methods Institute of Molecular Biology of Barcelona (IBMB-CSIC), Baldiri Reixach 15Departamento de Física e Biofísica Instituto de Biociências Universidade Estadual Paulista (UNESP)Department of Haematology Cambridge Institute for Medical Research University of Cambridge, Hills RoadDepartment of Crystallography and Structural Biology Instituto de Química-Física Rocasolano Consejo Superior de Investigaciones Científicas (CSIC)Icrea at IBMB-CSIC, Baldiri Reixach 13-15Departamento de Física e Biofísica Instituto de Biociências Universidade Estadual Paulista (UNESP)Institute of Molecular Biology of Barcelona (IBMB-CSIC)Universidade Estadual Paulista (Unesp)University of CambridgeConsejo Superior de Investigaciones Científicas (CSIC)Icrea at IBMB-CSICBorges, Rafael Junqueira [UNESP]Meindl, KathrinTriviño, JosepSammito, MassimoMedina, AnaMillán, ClaudiaAlcorlo, MartinHermoso, Juan A.Fontes, Marcos Roberto De Mattos [UNESP]Usón, Isabel2020-12-12T02:36:53Z2020-12-12T02:36:53Z2020-03-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article221-237http://dx.doi.org/10.1107/S2059798320000339Acta Crystallographica Section D: Structural Biology, v. 76, p. 221-237.2059-7983http://hdl.handle.net/11449/20160310.1107/S20597983200003392-s2.0-85081043471Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengActa Crystallographica Section D: Structural Biologyinfo:eu-repo/semantics/openAccess2021-10-22T20:36:21Zoai:repositorio.unesp.br:11449/201603Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T14:41:09.905325Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses |
title |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses |
spellingShingle |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses Borges, Rafael Junqueira [UNESP] ARCIMBOLDO fragment-based molecular replacement molecular replacement Phaser phasing SEQUENCE SLIDER SHELXE side-chain extension |
title_short |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses |
title_full |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses |
title_fullStr |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses |
title_full_unstemmed |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses |
title_sort |
SEQUENCE SLIDER: Expanding polyalanine fragments for phasing with multiple side-chain hypotheses |
author |
Borges, Rafael Junqueira [UNESP] |
author_facet |
Borges, Rafael Junqueira [UNESP] Meindl, Kathrin Triviño, Josep Sammito, Massimo Medina, Ana Millán, Claudia Alcorlo, Martin Hermoso, Juan A. Fontes, Marcos Roberto De Mattos [UNESP] Usón, Isabel |
author_role |
author |
author2 |
Meindl, Kathrin Triviño, Josep Sammito, Massimo Medina, Ana Millán, Claudia Alcorlo, Martin Hermoso, Juan A. Fontes, Marcos Roberto De Mattos [UNESP] Usón, Isabel |
author2_role |
author author author author author author author author author |
dc.contributor.none.fl_str_mv |
Institute of Molecular Biology of Barcelona (IBMB-CSIC) Universidade Estadual Paulista (Unesp) University of Cambridge Consejo Superior de Investigaciones Científicas (CSIC) Icrea at IBMB-CSIC |
dc.contributor.author.fl_str_mv |
Borges, Rafael Junqueira [UNESP] Meindl, Kathrin Triviño, Josep Sammito, Massimo Medina, Ana Millán, Claudia Alcorlo, Martin Hermoso, Juan A. Fontes, Marcos Roberto De Mattos [UNESP] Usón, Isabel |
dc.subject.por.fl_str_mv |
ARCIMBOLDO fragment-based molecular replacement molecular replacement Phaser phasing SEQUENCE SLIDER SHELXE side-chain extension |
topic |
ARCIMBOLDO fragment-based molecular replacement molecular replacement Phaser phasing SEQUENCE SLIDER SHELXE side-chain extension |
description |
Fragment-based molecular-replacement methods can solve a macromolecular structure quasi-ab initio. ARCIMBOLDO, using a common secondary-structure or tertiary-structure template or a library of folds, locates these with Phaser and reveals the rest of the structure by density modification and autotracing in SHELXE. The latter stage is challenging when dealing with diffraction data at lower resolution, low solvent content, high β-sheet composition or situations in which the initial fragments represent a low fraction of the total scattering or where their accuracy is low. SEQUENCE SLIDER aims to overcome these complications by extending the initial polyalanine fragment with side chains in a multisolution framework. Its use is illustrated on test cases and previously unknown structures. The selection and order of fragments to be extended follows the decrease in log-likelihood gain (LLG) calculated with Phaser upon the omission of each single fragment. When the starting substructure is derived from a remote homolog, sequence assignment to fragments is restricted by the original alignment. Otherwise, the secondary-structure prediction is matched to that found in fragments and traces. Sequence hypotheses are trialled in a brute-force approach through side-chain building and refinement. Scoring the refined models through their LLG in Phaser may allow discrimination of the correct sequence or filter the best partial structures for further density modification and autotracing. The default limits for the number of models to pursue are hardware dependent. In its most economic implementation, suitable for a single laptop, the main-chain trace is extended as polyserine rather than trialling models with different sequence assignments, which requires a grid or multicore machine. SEQUENCE SLIDER has been instrumental in solving two novel structures: that of MltC from 2.7 Å resolution data and that of a pneumococcal lipoprotein with 638 residues and 35% solvent content. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-12-12T02:36:53Z 2020-12-12T02:36:53Z 2020-03-01 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1107/S2059798320000339 Acta Crystallographica Section D: Structural Biology, v. 76, p. 221-237. 2059-7983 http://hdl.handle.net/11449/201603 10.1107/S2059798320000339 2-s2.0-85081043471 |
url |
http://dx.doi.org/10.1107/S2059798320000339 http://hdl.handle.net/11449/201603 |
identifier_str_mv |
Acta Crystallographica Section D: Structural Biology, v. 76, p. 221-237. 2059-7983 10.1107/S2059798320000339 2-s2.0-85081043471 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Acta Crystallographica Section D: Structural Biology |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
221-237 |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808128400898916352 |