Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient

Detalhes bibliográficos
Autor(a) principal: CARVALHO, PRISCILLA R.
Data de Publicação: 2018
Outros Autores: MUNITA, CASIMIRO S., LAPOLLI, ANDRE L., INTERNATIONAL NUCLEAR ATLANTIC CONFERENCE
Tipo de documento: Artigo de conferência
Título da fonte: Repositório Institucional do IPEN
Texto Completo: http://repositorio.ipen.br/handle/123456789/28213
Resumo: The literature presents many methods for partitioning of data base, and is difficult choose which is the most suitable, since the various combinations of methods based on different measures of dissimilarity can lead to different patterns of grouping and false interpretations. Nevertheless, little effort has been expended in evaluating these methods empirically using an archaeological data base. In this way, the objective of this work is make a comparative study of the different cluster analysis methods and identify which is the most appropriate. For this, the study was carried out using a data base of the Archaeometric Studies Group from IPEN-CNEN/SP, in which 45 samples of ceramic fragments from three archaeological sites were analyzed by instrumental neutron activation analysis (INAA) which were determinated the mass fraction of 13 elements (As, Ce, Cr, Eu, Fe, Hf, La, Na, Nd, Sc, Sm, Th, U). The methods used for this study were: single linkage, complete linkage, average linkage, centroid and Ward. The validation was done using the cophenetic correlation coefficient and comparing these values the average linkage method obtained better results. A script of the statistical program R with some functions was created to obtain the cophenetic correlation. By means of these values was possible to choose the most appropriate method to be used in the data base.
id IPEN_221ac2b4e266b39bb6a0b0e5ac55bba4
oai_identifier_str oai:repositorio.ipen.br:123456789/28213
network_acronym_str IPEN
network_name_str Repositório Institucional do IPEN
repository_id_str 4510
spelling 2018-01-03T18:18:08Z2018-01-03T18:18:08ZOctober 22-27, 2017http://repositorio.ipen.br/handle/123456789/28213The literature presents many methods for partitioning of data base, and is difficult choose which is the most suitable, since the various combinations of methods based on different measures of dissimilarity can lead to different patterns of grouping and false interpretations. Nevertheless, little effort has been expended in evaluating these methods empirically using an archaeological data base. In this way, the objective of this work is make a comparative study of the different cluster analysis methods and identify which is the most appropriate. For this, the study was carried out using a data base of the Archaeometric Studies Group from IPEN-CNEN/SP, in which 45 samples of ceramic fragments from three archaeological sites were analyzed by instrumental neutron activation analysis (INAA) which were determinated the mass fraction of 13 elements (As, Ce, Cr, Eu, Fe, Hf, La, Na, Nd, Sc, Sm, Th, U). The methods used for this study were: single linkage, complete linkage, average linkage, centroid and Ward. The validation was done using the cophenetic correlation coefficient and comparing these values the average linkage method obtained better results. A script of the statistical program R with some functions was created to obtain the cophenetic correlation. By means of these values was possible to choose the most appropriate method to be used in the data base.Submitted by Marco Antonio Oliveira da Silva (maosilva@ipen.br) on 2018-01-03T18:18:08Z No. of bitstreams: 1 24038.pdf: 643971 bytes, checksum: 753e1b3feab6cca9f298620adacaede7 (MD5)Made available in DSpace on 2018-01-03T18:18:08Z (GMT). No. of bitstreams: 1 24038.pdf: 643971 bytes, checksum: 753e1b3feab6cca9f298620adacaede7 (MD5)Associa????o Brasileira de Energia Nucleararchaeologyceramicscluster analysiscomparative evaluationselementsmultivariate analysisquality controlvalidationValidity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficientinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjectINACIRio de Janeiro, RJBelo Horizonte, MGCARVALHO, PRISCILLA R.MUNITA, CASIMIRO S.LAPOLLI, ANDRE L.INTERNATIONAL NUCLEAR ATLANTIC CONFERENCEinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional do IPENinstname:Instituto de Pesquisas Energéticas e Nucleares (IPEN)instacron:IPEN240382017CARVALHO, PRISCILLA R.MUNITA, CASIMIRO S.LAPOLLI, ANDRE L.18-01Proceedings142571325154CARVALHO, PRISCILLA R.:14257:320:SMUNITA, CASIMIRO S.:1325:320:NLAPOLLI, ANDRE L.:154:110:NORIGINAL24038.pdf24038.pdfapplication/pdf643971http://repositorio.ipen.br/bitstream/123456789/28213/1/24038.pdf753e1b3feab6cca9f298620adacaede7MD51LICENSElicense.txtlicense.txttext/plain; charset=utf-81748http://repositorio.ipen.br/bitstream/123456789/28213/2/license.txt8a4605be74aa9ea9d79846c1fba20a33MD52123456789/282132022-03-25 17:57:23.416oai:repositorio.ipen.br:123456789/28213Tk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=Repositório InstitucionalPUBhttp://repositorio.ipen.br/oai/requestbibl@ipen.bropendoar:45102022-03-25T17:57:23Repositório Institucional do IPEN - Instituto de Pesquisas Energéticas e Nucleares (IPEN)false
dc.title.pt_BR.fl_str_mv Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
title Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
spellingShingle Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
CARVALHO, PRISCILLA R.
archaeology
ceramics
cluster analysis
comparative evaluations
elements
multivariate analysis
quality control
validation
title_short Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
title_full Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
title_fullStr Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
title_full_unstemmed Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
title_sort Validity studies among hierarchical methods of cluster analysis using cophenetic correlation coefficient
author CARVALHO, PRISCILLA R.
author_facet CARVALHO, PRISCILLA R.
MUNITA, CASIMIRO S.
LAPOLLI, ANDRE L.
INTERNATIONAL NUCLEAR ATLANTIC CONFERENCE
author_role author
author2 MUNITA, CASIMIRO S.
LAPOLLI, ANDRE L.
INTERNATIONAL NUCLEAR ATLANTIC CONFERENCE
author2_role author
author
author
dc.contributor.author.fl_str_mv CARVALHO, PRISCILLA R.
MUNITA, CASIMIRO S.
LAPOLLI, ANDRE L.
INTERNATIONAL NUCLEAR ATLANTIC CONFERENCE
dc.subject.por.fl_str_mv archaeology
ceramics
cluster analysis
comparative evaluations
elements
multivariate analysis
quality control
validation
topic archaeology
ceramics
cluster analysis
comparative evaluations
elements
multivariate analysis
quality control
validation
description The literature presents many methods for partitioning of data base, and is difficult choose which is the most suitable, since the various combinations of methods based on different measures of dissimilarity can lead to different patterns of grouping and false interpretations. Nevertheless, little effort has been expended in evaluating these methods empirically using an archaeological data base. In this way, the objective of this work is make a comparative study of the different cluster analysis methods and identify which is the most appropriate. For this, the study was carried out using a data base of the Archaeometric Studies Group from IPEN-CNEN/SP, in which 45 samples of ceramic fragments from three archaeological sites were analyzed by instrumental neutron activation analysis (INAA) which were determinated the mass fraction of 13 elements (As, Ce, Cr, Eu, Fe, Hf, La, Na, Nd, Sc, Sm, Th, U). The methods used for this study were: single linkage, complete linkage, average linkage, centroid and Ward. The validation was done using the cophenetic correlation coefficient and comparing these values the average linkage method obtained better results. A script of the statistical program R with some functions was created to obtain the cophenetic correlation. By means of these values was possible to choose the most appropriate method to be used in the data base.
publishDate 2018
dc.date.evento.pt_BR.fl_str_mv October 22-27, 2017
dc.date.accessioned.fl_str_mv 2018-01-03T18:18:08Z
dc.date.available.fl_str_mv 2018-01-03T18:18:08Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://repositorio.ipen.br/handle/123456789/28213
url http://repositorio.ipen.br/handle/123456789/28213
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.coverage.pt_BR.fl_str_mv I
dc.publisher.none.fl_str_mv Associa????o Brasileira de Energia Nuclear
publisher.none.fl_str_mv Associa????o Brasileira de Energia Nuclear
dc.source.none.fl_str_mv reponame:Repositório Institucional do IPEN
instname:Instituto de Pesquisas Energéticas e Nucleares (IPEN)
instacron:IPEN
instname_str Instituto de Pesquisas Energéticas e Nucleares (IPEN)
instacron_str IPEN
institution IPEN
reponame_str Repositório Institucional do IPEN
collection Repositório Institucional do IPEN
bitstream.url.fl_str_mv http://repositorio.ipen.br/bitstream/123456789/28213/1/24038.pdf
http://repositorio.ipen.br/bitstream/123456789/28213/2/license.txt
bitstream.checksum.fl_str_mv 753e1b3feab6cca9f298620adacaede7
8a4605be74aa9ea9d79846c1fba20a33
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional do IPEN - Instituto de Pesquisas Energéticas e Nucleares (IPEN)
repository.mail.fl_str_mv bibl@ipen.br
_version_ 1767254243334422528