Teoria de modelos mistos aplicada ao delineamento em blocos aumentados
Autor(a) principal: | |
---|---|
Data de Publicação: | 2000 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Repositório Institucional da UFLA |
Texto Completo: | http://repositorio.ufla.br/jspui/handle/1/33785 |
Resumo: | Augmented designs were proposed to deal with low availability of material to constitute replications and experimental plots, and with large amounts of treatments. Amongst them, augmented block design is largely used. In this design, there are replicated (common) and non-replicated (regular) treatments; in breeding programs the latter are often the selection units and the former standard cultivars. Inference has traditionally been made by means of an intrablock analysis (fixed models). If regular treatments and/or blocks can be considered of random nature, however, a mixed linear model could be used instead, specially if genetic covariance matrix among selection units can be assigned, from pedigree information or molecular marker data. This work aimed at the evaluation of the use of mixed models to describe augmented block designs, and their efficiency over traditional intrablock analysis, using computer simulation. Data were generated considering a fictitious plant species with200 independent genes controlling a trait of interest. Gene effects (that is, half the difference among homozygotes) were random outcomes from an exponential density with mean equal to 1. Frequencies of favorable allele of each locus ranged from 0.2 to 0.8. Populations consisted of sets of randomly generated inbred lines, with different sizes (50, 100 or 200). Molecular data were also generated, considering 100marker loci, in such a way that expected similarities between any pair of lines were the same for trait and marker loci. Such data were used in the estimation of the genetic covariance matrix. Environmental variances were established according to predetermined heritabilities h2 (0.2, 0.5 or 0.8) and split into block and residual components with weights determined by the magnitude of Smith's coefficient b of soil heterogeneity (0.1, 0.5 or 0.9). Two amounts of blocks were considered (0.2 or 0.05 the number of lines). In each combination of such parameters, 100 simulations were made, considering four linear models, varying the nature of block and regular treatment effects (fixed - F, or random - R), respectively FF, FR, RF and RR. The effects of such mixed models were estimated using best linear unbiased prediction (BLUP) and, when at least one of those factors was regarded as random, two variants were considered, assuming or not variance components as known. In the latter, components were estimating by restricted maximum likelihood, using the EM algorithm. The mixed models were evaluated through bias, mean squared error (MSE), Pearson's and Spearman's correlation, and bias on estimating the actual percentage of elite lines, that is, those superior to best common treatment ("elite bias"). Results showed that biases were negligible for all models. Considering the other criteria, for most situations the RR model with known variance components was the best, as theoretically expected. When variance components were estimated, however, correlation of Pearson and that of Spearman were highest with RF model, which is that with recovery of interblock information only. On the other hand, RR model generally showed the least elite biases. Comparing to FF, mixed models (RF or RR) were more efficient specially under lower precision, that is, low to intermediate heritability and high b (high residual variance in relation to block differences). Under such conditions, therefore, results suggested that mixed models could improve inference in breeding programs and that the choice of the model should rely on the kind of selection. If this is truncated, RF model should be preferred; if this is not, such as that based on the performance of check varieties, then RR would be more suitable, justifying the costs of generating molecular data. |
id |
UFLA_d5f4521cd0b9806608f83a0e56542130 |
---|---|
oai_identifier_str |
oai:localhost:1/33785 |
network_acronym_str |
UFLA |
network_name_str |
Repositório Institucional da UFLA |
repository_id_str |
|
spelling |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentadosAnálise de variânciaEstatísticaModelos matemáticosAnálise estatísticaEstatísticaAugmented designs were proposed to deal with low availability of material to constitute replications and experimental plots, and with large amounts of treatments. Amongst them, augmented block design is largely used. In this design, there are replicated (common) and non-replicated (regular) treatments; in breeding programs the latter are often the selection units and the former standard cultivars. Inference has traditionally been made by means of an intrablock analysis (fixed models). If regular treatments and/or blocks can be considered of random nature, however, a mixed linear model could be used instead, specially if genetic covariance matrix among selection units can be assigned, from pedigree information or molecular marker data. This work aimed at the evaluation of the use of mixed models to describe augmented block designs, and their efficiency over traditional intrablock analysis, using computer simulation. Data were generated considering a fictitious plant species with200 independent genes controlling a trait of interest. Gene effects (that is, half the difference among homozygotes) were random outcomes from an exponential density with mean equal to 1. Frequencies of favorable allele of each locus ranged from 0.2 to 0.8. Populations consisted of sets of randomly generated inbred lines, with different sizes (50, 100 or 200). Molecular data were also generated, considering 100marker loci, in such a way that expected similarities between any pair of lines were the same for trait and marker loci. Such data were used in the estimation of the genetic covariance matrix. Environmental variances were established according to predetermined heritabilities h2 (0.2, 0.5 or 0.8) and split into block and residual components with weights determined by the magnitude of Smith's coefficient b of soil heterogeneity (0.1, 0.5 or 0.9). Two amounts of blocks were considered (0.2 or 0.05 the number of lines). In each combination of such parameters, 100 simulations were made, considering four linear models, varying the nature of block and regular treatment effects (fixed - F, or random - R), respectively FF, FR, RF and RR. The effects of such mixed models were estimated using best linear unbiased prediction (BLUP) and, when at least one of those factors was regarded as random, two variants were considered, assuming or not variance components as known. In the latter, components were estimating by restricted maximum likelihood, using the EM algorithm. The mixed models were evaluated through bias, mean squared error (MSE), Pearson's and Spearman's correlation, and bias on estimating the actual percentage of elite lines, that is, those superior to best common treatment ("elite bias"). Results showed that biases were negligible for all models. Considering the other criteria, for most situations the RR model with known variance components was the best, as theoretically expected. When variance components were estimated, however, correlation of Pearson and that of Spearman were highest with RF model, which is that with recovery of interblock information only. On the other hand, RR model generally showed the least elite biases. Comparing to FF, mixed models (RF or RR) were more efficient specially under lower precision, that is, low to intermediate heritability and high b (high residual variance in relation to block differences). Under such conditions, therefore, results suggested that mixed models could improve inference in breeding programs and that the choice of the model should rely on the kind of selection. If this is truncated, RF model should be preferred; if this is not, such as that based on the performance of check varieties, then RR would be more suitable, justifying the costs of generating molecular data.Os delineamentos aumentados foram propostos para lidar com situações de baixa disponibilidade de material para compor repetições e parcelas e com grandes quantidades de tratamentos. Dentre eles, o delineamento em blocos aumentados é amplamente utilizado. Nesse delineamento, existem tratamentos repetidos (comuns) e não-repetidos (regulares); em programas de melhoramento, geralmente os últimos são as unidades de seleção e os primeiros são cultivares comerciais. A inferência com esse delineamento tem tradicionalmente sido feita pela análise intrablocos (modelos fixos). No entanto, se os efeitos de tratamentos regulares e/ou de blocos puderem ser considerados de natureza aleatória, um modelo linear misto poderia ser utilizado, particularmente se a matriz de covariâncias genéticas entre as unidades de seleção puder ser levada em conta, utilizando dados de pedigree ou de marcadores moleculares. Com este estudo objetivu-se a avaliação do uso da teoria de modelos mistos para descrever delineamentos em blocos aumentados, e sua eficiência em relação à análise tradicional intrablocos, por meio de simulação em computador. Para tanto, dados eram gerados considerando uma espécie vegetal fictícia com 200 genes independentes controlando uma característica de interesse. Os efeitos dos genes (isto é, metade da diferença entre os homozigotos) eram realizações aleatórias de uma densidade exponencial com média igual a 1. As freqüências dos alelos favoráveis de cada loco variou de 0,2 a 0,8. As populações consistiam de conjuntos de linhagens endogâmicas geradas aleatoriamente, com diferentes tamanhos (50, 100 ou 200). Dados moleculares também eram gerados, considerando 100 locos marcadores, de tal forma que as similaridades entre qualquer par de linhagens eram as mesmas para os locos da característica e dos marcadores. Esses dados eram utilizados na estimação da matriz de covariâncias genéticas. As variâncias ambientais eram estabelecidas de acordo com valores de herdabilidade h2 predeterminadas (0,2, 0,5 ou 0,8) e particionadas em componentes relativos a blocos e ao resíduo, em frações determinadas pela magnitude do coeficiente b de heterogeneidade de solo de Smith (0,1, 0,5 ou 0,9). Duas quantidades de blocos foram consideradas (0,2 ou 0,05 vezes o número de linhagens). Em cada combinação desses parâmetros, 100 simulações eram feitas, considerando 4 modelos lineares, variando a natureza dos efeitos de bloco e de tratamentos regulares (fixo - F, ou aleatório - A), respectivamente FF, FA, AF e AA. Os efeitos de tais modelos mistos eram estimados utilizando a melhor predição linear não tendenciosa (BLUP) e, quando pelo menos um desses fatores era tido como aleatório, duas variações eram consideradas, assumindo e não assumindo os componentes de variância como conhecidos. Neste último caso, os componentes eram estimados pelo método da máxima verossimilhança restrita, utilizando o algoritmo EM. Os modelos mistos foram avaliados por meio do viés, erro quadrático médio, correlações de Pearson e Spearman, e do viés na estimação da porcentagem de linhagens elite, ou seja, superiores à melhor testemunha ou tratamento comum. Os resultados mostraram que os vieses foram negligíveis em todos os modelos. Considerando os demais critérios, na maioria das situações, o modelo AA com componentes de variância conhecidos mostrou-se como o melhor, como teoricamente esperado. Quando os componentes de variância eram estimados, contudo, as correlações de Pearson e Spearman foram maiores com o modelo AF, que é o que recupera a informação interblocos, somente. Por outro lado, o modelo AA geralmente apresentou os menores vieses na estimação de linhagens elites. Comparando com o modelo FF, os modelos mistos (AF ou AA) foram mais eficientes, especialmente sob baixa precisão, ou seja, com herdabilidade baixaa intermediária e alto b (alta variância residual em relação à variação entre blocos). Sob tais condições, portanto, os resultados sugeriram que modelos mistos podem melhorar a inferência e, portanto, o sucesso de programas de melhoramento, e que a escolha do modelo deve se basear no tipo de seleção. Se esta for truncada, o modelo AF deveria ser preferido; se não for, tal como aquela baseada em relação a uma cultivar comercial, então o modelo AA é mais apropriado, justificando os custos de geração de dados moleculares.Universidade Federal de LavrasPrograma de Pós-Graduação em Estatística e Experimentação AgropecuáriaUFLAbrasilDepartamento EstatísticaBearzoti, EduardoFerreira, Daniel FurtadoBueno Filho, Julio Silvio de SousaDuarte, João BatistaSantos, Aladir Horacio dos2019-04-24T16:39:51Z2019-04-24T16:39:51Z2019-04-092000-08-25info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfSANTOS, A. H. dos. Teoria de modelos mistos aplicada ao delineamento em blocos aumentados. 2000. 138 p. Dissertação (Mestrado em Estatística e Experimentação Agropecuária)-Universidade Federal de Lavras, Lavras, 2000.http://repositorio.ufla.br/jspui/handle/1/33785porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFLAinstname:Universidade Federal de Lavras (UFLA)instacron:UFLA2023-05-11T16:42:24Zoai:localhost:1/33785Repositório InstitucionalPUBhttp://repositorio.ufla.br/oai/requestnivaldo@ufla.br || repositorio.biblioteca@ufla.bropendoar:2023-05-11T16:42:24Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA)false |
dc.title.none.fl_str_mv |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados |
title |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados |
spellingShingle |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados Santos, Aladir Horacio dos Análise de variância Estatística Modelos matemáticos Análise estatística Estatística |
title_short |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados |
title_full |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados |
title_fullStr |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados |
title_full_unstemmed |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados |
title_sort |
Teoria de modelos mistos aplicada ao delineamento em blocos aumentados |
author |
Santos, Aladir Horacio dos |
author_facet |
Santos, Aladir Horacio dos |
author_role |
author |
dc.contributor.none.fl_str_mv |
Bearzoti, Eduardo Ferreira, Daniel Furtado Bueno Filho, Julio Silvio de Sousa Duarte, João Batista |
dc.contributor.author.fl_str_mv |
Santos, Aladir Horacio dos |
dc.subject.por.fl_str_mv |
Análise de variância Estatística Modelos matemáticos Análise estatística Estatística |
topic |
Análise de variância Estatística Modelos matemáticos Análise estatística Estatística |
description |
Augmented designs were proposed to deal with low availability of material to constitute replications and experimental plots, and with large amounts of treatments. Amongst them, augmented block design is largely used. In this design, there are replicated (common) and non-replicated (regular) treatments; in breeding programs the latter are often the selection units and the former standard cultivars. Inference has traditionally been made by means of an intrablock analysis (fixed models). If regular treatments and/or blocks can be considered of random nature, however, a mixed linear model could be used instead, specially if genetic covariance matrix among selection units can be assigned, from pedigree information or molecular marker data. This work aimed at the evaluation of the use of mixed models to describe augmented block designs, and their efficiency over traditional intrablock analysis, using computer simulation. Data were generated considering a fictitious plant species with200 independent genes controlling a trait of interest. Gene effects (that is, half the difference among homozygotes) were random outcomes from an exponential density with mean equal to 1. Frequencies of favorable allele of each locus ranged from 0.2 to 0.8. Populations consisted of sets of randomly generated inbred lines, with different sizes (50, 100 or 200). Molecular data were also generated, considering 100marker loci, in such a way that expected similarities between any pair of lines were the same for trait and marker loci. Such data were used in the estimation of the genetic covariance matrix. Environmental variances were established according to predetermined heritabilities h2 (0.2, 0.5 or 0.8) and split into block and residual components with weights determined by the magnitude of Smith's coefficient b of soil heterogeneity (0.1, 0.5 or 0.9). Two amounts of blocks were considered (0.2 or 0.05 the number of lines). In each combination of such parameters, 100 simulations were made, considering four linear models, varying the nature of block and regular treatment effects (fixed - F, or random - R), respectively FF, FR, RF and RR. The effects of such mixed models were estimated using best linear unbiased prediction (BLUP) and, when at least one of those factors was regarded as random, two variants were considered, assuming or not variance components as known. In the latter, components were estimating by restricted maximum likelihood, using the EM algorithm. The mixed models were evaluated through bias, mean squared error (MSE), Pearson's and Spearman's correlation, and bias on estimating the actual percentage of elite lines, that is, those superior to best common treatment ("elite bias"). Results showed that biases were negligible for all models. Considering the other criteria, for most situations the RR model with known variance components was the best, as theoretically expected. When variance components were estimated, however, correlation of Pearson and that of Spearman were highest with RF model, which is that with recovery of interblock information only. On the other hand, RR model generally showed the least elite biases. Comparing to FF, mixed models (RF or RR) were more efficient specially under lower precision, that is, low to intermediate heritability and high b (high residual variance in relation to block differences). Under such conditions, therefore, results suggested that mixed models could improve inference in breeding programs and that the choice of the model should rely on the kind of selection. If this is truncated, RF model should be preferred; if this is not, such as that based on the performance of check varieties, then RR would be more suitable, justifying the costs of generating molecular data. |
publishDate |
2000 |
dc.date.none.fl_str_mv |
2000-08-25 2019-04-24T16:39:51Z 2019-04-24T16:39:51Z 2019-04-09 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
SANTOS, A. H. dos. Teoria de modelos mistos aplicada ao delineamento em blocos aumentados. 2000. 138 p. Dissertação (Mestrado em Estatística e Experimentação Agropecuária)-Universidade Federal de Lavras, Lavras, 2000. http://repositorio.ufla.br/jspui/handle/1/33785 |
identifier_str_mv |
SANTOS, A. H. dos. Teoria de modelos mistos aplicada ao delineamento em blocos aumentados. 2000. 138 p. Dissertação (Mestrado em Estatística e Experimentação Agropecuária)-Universidade Federal de Lavras, Lavras, 2000. |
url |
http://repositorio.ufla.br/jspui/handle/1/33785 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Universidade Federal de Lavras Programa de Pós-Graduação em Estatística e Experimentação Agropecuária UFLA brasil Departamento Estatística |
publisher.none.fl_str_mv |
Universidade Federal de Lavras Programa de Pós-Graduação em Estatística e Experimentação Agropecuária UFLA brasil Departamento Estatística |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFLA instname:Universidade Federal de Lavras (UFLA) instacron:UFLA |
instname_str |
Universidade Federal de Lavras (UFLA) |
instacron_str |
UFLA |
institution |
UFLA |
reponame_str |
Repositório Institucional da UFLA |
collection |
Repositório Institucional da UFLA |
repository.name.fl_str_mv |
Repositório Institucional da UFLA - Universidade Federal de Lavras (UFLA) |
repository.mail.fl_str_mv |
nivaldo@ufla.br || repositorio.biblioteca@ufla.br |
_version_ |
1815439103361024000 |