NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.

MURAD, A. M.; RECH FILHO, E. L.

NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.

Detalhes bibliográficos
Autor(a) principal:	MURAD, A. M.
Data de Publicação:	2012
Outros Autores:	RECH FILHO, E. L.
Tipo de documento:	Artigo
Idioma:	por
Título da fonte:	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo:	http://www.alice.cnptia.embrapa.br/alice/handle/doc/952421
Resumo:	Background: Recombinant DNA technology has been extensively employed to generate a variety of products from genetically modified organisms (GMOs) over the last decade, and the development of technologies capable of analyzing these products is crucial to understanding gene expression patterns. Liquid chromatography coupled with mass spectrometry is a powerful tool for analyzing protein contents and possible expression modifications in GMOs. Specifically, the NanoUPLC-MSE technique provides rapid protein analyses of complex mixtures with supported steps for high sample throughput, identification and quantization using low sample quantities with outstanding repeatability. Here, we present an assessment of the peptide and protein identification and quantification of soybean seed EMBRAPA BR16 cultivar contents using NanoUPLC-MSE and provide a comparison to the theoretical tryptic digestion of soybean sequences from Uniprot database. Results: The NanoUPLC-MSE peptide analysis resulted in 3,400 identified peptides, 58% of which were identified to have no miscleavages. The experiment revealed that 13% of the peptides underwent in-source fragmentation, and 82% of the peptides were identified with a mass measurement accuracy of less than 5 ppm. More than 75% of the identified proteins have at least 10 matched peptides, 88% of the identified proteins have greater than 30% of coverage, and 87% of the identified proteins occur in all four replicates. 78% of the identified proteins correspond to all glycinin and betaconglycinin chains. The theoretical Uniprot peptide database has 723,749 entries, and 548,336 peptides have molecular weights of greater than 500 Da. Seed proteins represent 0.86% of the protein database entries. At the peptide level, trypsin-digested seed proteins represent only 0.3% of the theoretical Uniprot peptide database. A total of 22% of all database peptides have a pI value of less than 5, and 25% of them have a pI value between 5 and 8. Based on the detection range of typical NanoUPLC-MSE experiments, i.e., 500 to 5000 Da, 64 proteins will not be identified. Conclusions: NanoUPLC-MSE experiments provide good protein coverage within a peptide error of 5 ppm and a wide MW detection range from 500 to 5000 Da. A second digestion enzyme should be used depending on the tissue or proteins to be analyzed. In the case of seed tissue, trypsin protein digestion results offer good databank coverage. The Uniprot database has many duplicate entries that may result in false protein homolog associations when using NanoUPLC-MSE analysis. The proteomic profile of the EMBRAPA BR-16 seed lacks certain described proteins relative to the profiles of transgenic soybeans reported in other works.

Metadados do item

id	EMBR_2ea65878533e3507bc5e78167c0d535f
oai_identifier_str	oai:www.alice.cnptia.embrapa.br:doc/952421
network_acronym_str	EMBR
network_name_str	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str	2154
spelling	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.SoybeanSeed proteomicsNanoUPLC-MSEUniprot databaseSojaBackground: Recombinant DNA technology has been extensively employed to generate a variety of products from genetically modified organisms (GMOs) over the last decade, and the development of technologies capable of analyzing these products is crucial to understanding gene expression patterns. Liquid chromatography coupled with mass spectrometry is a powerful tool for analyzing protein contents and possible expression modifications in GMOs. Specifically, the NanoUPLC-MSE technique provides rapid protein analyses of complex mixtures with supported steps for high sample throughput, identification and quantization using low sample quantities with outstanding repeatability. Here, we present an assessment of the peptide and protein identification and quantification of soybean seed EMBRAPA BR16 cultivar contents using NanoUPLC-MSE and provide a comparison to the theoretical tryptic digestion of soybean sequences from Uniprot database. Results: The NanoUPLC-MSE peptide analysis resulted in 3,400 identified peptides, 58% of which were identified to have no miscleavages. The experiment revealed that 13% of the peptides underwent in-source fragmentation, and 82% of the peptides were identified with a mass measurement accuracy of less than 5 ppm. More than 75% of the identified proteins have at least 10 matched peptides, 88% of the identified proteins have greater than 30% of coverage, and 87% of the identified proteins occur in all four replicates. 78% of the identified proteins correspond to all glycinin and betaconglycinin chains. The theoretical Uniprot peptide database has 723,749 entries, and 548,336 peptides have molecular weights of greater than 500 Da. Seed proteins represent 0.86% of the protein database entries. At the peptide level, trypsin-digested seed proteins represent only 0.3% of the theoretical Uniprot peptide database. A total of 22% of all database peptides have a pI value of less than 5, and 25% of them have a pI value between 5 and 8. Based on the detection range of typical NanoUPLC-MSE experiments, i.e., 500 to 5000 Da, 64 proteins will not be identified. Conclusions: NanoUPLC-MSE experiments provide good protein coverage within a peptide error of 5 ppm and a wide MW detection range from 500 to 5000 Da. A second digestion enzyme should be used depending on the tissue or proteins to be analyzed. In the case of seed tissue, trypsin protein digestion results offer good databank coverage. The Uniprot database has many duplicate entries that may result in false protein homolog associations when using NanoUPLC-MSE analysis. The proteomic profile of the EMBRAPA BR-16 seed lacks certain described proteins relative to the profiles of transgenic soybeans reported in other works.ANDRE MELRO MURAD, CENARGEN; ELIBIO LEOPOLDO RECH FILHO, CENARGEN.MURAD, A. M.RECH FILHO, E. L.2013-03-07T11:11:11Z2013-03-07T11:11:11Z2013-03-0720122018-06-29T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleBMC Biotechnology, v. 12, n. 82, 2012.http://www.alice.cnptia.embrapa.br/alice/handle/doc/952421porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2017-08-15T23:52:38Zoai:www.alice.cnptia.embrapa.br:doc/952421Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542017-08-15T23:52:38falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542017-08-15T23:52:38Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.
title	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.
spellingShingle	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database. MURAD, A. M. Soybean Seed proteomics NanoUPLC-MSE Uniprot database Soja
title_short	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.
title_full	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.
title_fullStr	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.
title_full_unstemmed	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.
title_sort	NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.
author	MURAD, A. M.
author_facet	MURAD, A. M. RECH FILHO, E. L.
author_role	author
author2	RECH FILHO, E. L.
author2_role	author
dc.contributor.none.fl_str_mv	ANDRE MELRO MURAD, CENARGEN; ELIBIO LEOPOLDO RECH FILHO, CENARGEN.
dc.contributor.author.fl_str_mv	MURAD, A. M. RECH FILHO, E. L.
dc.subject.por.fl_str_mv	Soybean Seed proteomics NanoUPLC-MSE Uniprot database Soja
topic	Soybean Seed proteomics NanoUPLC-MSE Uniprot database Soja
description	Background: Recombinant DNA technology has been extensively employed to generate a variety of products from genetically modified organisms (GMOs) over the last decade, and the development of technologies capable of analyzing these products is crucial to understanding gene expression patterns. Liquid chromatography coupled with mass spectrometry is a powerful tool for analyzing protein contents and possible expression modifications in GMOs. Specifically, the NanoUPLC-MSE technique provides rapid protein analyses of complex mixtures with supported steps for high sample throughput, identification and quantization using low sample quantities with outstanding repeatability. Here, we present an assessment of the peptide and protein identification and quantification of soybean seed EMBRAPA BR16 cultivar contents using NanoUPLC-MSE and provide a comparison to the theoretical tryptic digestion of soybean sequences from Uniprot database. Results: The NanoUPLC-MSE peptide analysis resulted in 3,400 identified peptides, 58% of which were identified to have no miscleavages. The experiment revealed that 13% of the peptides underwent in-source fragmentation, and 82% of the peptides were identified with a mass measurement accuracy of less than 5 ppm. More than 75% of the identified proteins have at least 10 matched peptides, 88% of the identified proteins have greater than 30% of coverage, and 87% of the identified proteins occur in all four replicates. 78% of the identified proteins correspond to all glycinin and betaconglycinin chains. The theoretical Uniprot peptide database has 723,749 entries, and 548,336 peptides have molecular weights of greater than 500 Da. Seed proteins represent 0.86% of the protein database entries. At the peptide level, trypsin-digested seed proteins represent only 0.3% of the theoretical Uniprot peptide database. A total of 22% of all database peptides have a pI value of less than 5, and 25% of them have a pI value between 5 and 8. Based on the detection range of typical NanoUPLC-MSE experiments, i.e., 500 to 5000 Da, 64 proteins will not be identified. Conclusions: NanoUPLC-MSE experiments provide good protein coverage within a peptide error of 5 ppm and a wide MW detection range from 500 to 5000 Da. A second digestion enzyme should be used depending on the tissue or proteins to be analyzed. In the case of seed tissue, trypsin protein digestion results offer good databank coverage. The Uniprot database has many duplicate entries that may result in false protein homolog associations when using NanoUPLC-MSE analysis. The proteomic profile of the EMBRAPA BR-16 seed lacks certain described proteins relative to the profiles of transgenic soybeans reported in other works.
publishDate	2012
dc.date.none.fl_str_mv	2012 2013-03-07T11:11:11Z 2013-03-07T11:11:11Z 2013-03-07 2018-06-29T11:11:11Z
dc.type.driver.fl_str_mv	info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	BMC Biotechnology, v. 12, n. 82, 2012. http://www.alice.cnptia.embrapa.br/alice/handle/doc/952421
identifier_str_mv	BMC Biotechnology, v. 12, n. 82, 2012.
url	http://www.alice.cnptia.embrapa.br/alice/handle/doc/952421
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA
instname_str	Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str	EMBRAPA
institution	EMBRAPA
reponame_str	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv	cg-riaa@embrapa.br
_version_	1794503376296214528

NanoUPLC-MSE proteomic data assessment of soybean seeds using the Uniprot database.

Registros relacionados