The impact of feature selection methods on online handwritten signature by using clustering-based analysis

Detalhes bibliográficos
Autor(a) principal: Marques, Julliana Caroline Gonçalves de Araújo Silva
Data de Publicação: 2021
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Repositório Institucional da UFRN
Texto Completo: https://repositorio.ufrn.br/handle/123456789/32052
Resumo: Handwritten signature is one of the oldest and most accepted biometric authentication methods for human identity establishment in society. With the popularisation of computers and, consequently, computational biometric authentication systems, the signature was chosen for being one of the biometric traits of an individual that is likely to be relatively unique for every person. However, when dealing with biometric data, including signature data, problems related to high dimensional space, can be generated. Among other issues, irrelevant, redundant data and noise are the most significant, as they result in a decreased of identification accuracy. Thus, it is necessary to reduce the space by selecting the smallest set of features that contain the most discriminative features, increasing the accuracy of the system. In this way, our proposal in this work is to analyse the impact of feature selection on individuals identification accuracy based on the handwritten online signature. For this, we will use two well-known online signature databases: SVC2004 and xLongSignDB. For the feature selection process, we have applied two filter and one wrapper methods. Then, the resulted datasets are evaluated by classification algorithms and validated with a clustering technique. Besides, we have used a statistical test to corroborate our conclusions. Experiments presented satisfactory results when using a smaller number of features which are more representative, showing that we reached an average accuracy of over 98\% for both datasets which were validated with the clustering methods, which achieved an average accuracy over 80\% (SVC2004) and 70\% (xLongSignDB).
id UFRN_9f869ee48fb2559e6e05069664237dd2
oai_identifier_str oai:https://repositorio.ufrn.br:123456789/32052
network_acronym_str UFRN
network_name_str Repositório Institucional da UFRN
repository_id_str
spelling Marques, Julliana Caroline Gonçalves de Araújo Silvahttp://lattes.cnpq.br/5554033822360657http://lattes.cnpq.br/2234040548103596Carvalho, Bruno Motta dehttp://lattes.cnpq.br/0330924133337698Souza Neto, Plácido Antônio dehttp://lattes.cnpq.br/3641504724164977Abreu, Marjory Cristiany da Costa2021-04-06T19:02:41Z2021-04-06T19:02:41Z2021-01-29MARQUES, Julliana Caroline Gonçalves de Araújo Silva. The impact of feature selection methods on online handwritten signature by using clustering-based analysis. 2021. 68f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.https://repositorio.ufrn.br/handle/123456789/32052Universidade Federal do Rio Grande do NortePROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃOUFRNBrasilOnline handwritten signatureFeature selectionClusteringSVC2004xLongSignDBThe impact of feature selection methods on online handwritten signature by using clustering-based analysisinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisHandwritten signature is one of the oldest and most accepted biometric authentication methods for human identity establishment in society. With the popularisation of computers and, consequently, computational biometric authentication systems, the signature was chosen for being one of the biometric traits of an individual that is likely to be relatively unique for every person. However, when dealing with biometric data, including signature data, problems related to high dimensional space, can be generated. Among other issues, irrelevant, redundant data and noise are the most significant, as they result in a decreased of identification accuracy. Thus, it is necessary to reduce the space by selecting the smallest set of features that contain the most discriminative features, increasing the accuracy of the system. In this way, our proposal in this work is to analyse the impact of feature selection on individuals identification accuracy based on the handwritten online signature. For this, we will use two well-known online signature databases: SVC2004 and xLongSignDB. For the feature selection process, we have applied two filter and one wrapper methods. Then, the resulted datasets are evaluated by classification algorithms and validated with a clustering technique. Besides, we have used a statistical test to corroborate our conclusions. Experiments presented satisfactory results when using a smaller number of features which are more representative, showing that we reached an average accuracy of over 98\% for both datasets which were validated with the clustering methods, which achieved an average accuracy over 80\% (SVC2004) and 70\% (xLongSignDB).info:eu-repo/semantics/openAccessporreponame:Repositório Institucional da UFRNinstname:Universidade Federal do Rio Grande do Norte (UFRN)instacron:UFRNTEXTImpactfeatureselection_Marques_2021.pdf.txtImpactfeatureselection_Marques_2021.pdf.txtExtracted texttext/plain1146https://repositorio.ufrn.br/bitstream/123456789/32052/2/Impactfeatureselection_Marques_2021.pdf.txtb25a673690342a0af1cc294f2d2e7dafMD52THUMBNAILImpactfeatureselection_Marques_2021.pdf.jpgImpactfeatureselection_Marques_2021.pdf.jpgGenerated Thumbnailimage/jpeg1330https://repositorio.ufrn.br/bitstream/123456789/32052/3/Impactfeatureselection_Marques_2021.pdf.jpg15c2f8b51598240c167906eca46bdc81MD53ORIGINALImpactfeatureselection_Marques_2021.pdfapplication/pdf15205966https://repositorio.ufrn.br/bitstream/123456789/32052/1/Impactfeatureselection_Marques_2021.pdf9853cf5d244d9cd7fa2f776801d946c3MD51123456789/320522021-04-11 06:05:09.341oai:https://repositorio.ufrn.br:123456789/32052Repositório de PublicaçõesPUBhttp://repositorio.ufrn.br/oai/opendoar:2021-04-11T09:05:09Repositório Institucional da UFRN - Universidade Federal do Rio Grande do Norte (UFRN)false
dc.title.pt_BR.fl_str_mv The impact of feature selection methods on online handwritten signature by using clustering-based analysis
title The impact of feature selection methods on online handwritten signature by using clustering-based analysis
spellingShingle The impact of feature selection methods on online handwritten signature by using clustering-based analysis
Marques, Julliana Caroline Gonçalves de Araújo Silva
Online handwritten signature
Feature selection
Clustering
SVC2004
xLongSignDB
title_short The impact of feature selection methods on online handwritten signature by using clustering-based analysis
title_full The impact of feature selection methods on online handwritten signature by using clustering-based analysis
title_fullStr The impact of feature selection methods on online handwritten signature by using clustering-based analysis
title_full_unstemmed The impact of feature selection methods on online handwritten signature by using clustering-based analysis
title_sort The impact of feature selection methods on online handwritten signature by using clustering-based analysis
author Marques, Julliana Caroline Gonçalves de Araújo Silva
author_facet Marques, Julliana Caroline Gonçalves de Araújo Silva
author_role author
dc.contributor.authorID.pt_BR.fl_str_mv
dc.contributor.authorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/5554033822360657
dc.contributor.advisorID.pt_BR.fl_str_mv
dc.contributor.advisorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/2234040548103596
dc.contributor.referees1.none.fl_str_mv Carvalho, Bruno Motta de
dc.contributor.referees1ID.pt_BR.fl_str_mv
dc.contributor.referees1Lattes.pt_BR.fl_str_mv http://lattes.cnpq.br/0330924133337698
dc.contributor.referees2.none.fl_str_mv Souza Neto, Plácido Antônio de
dc.contributor.referees2ID.pt_BR.fl_str_mv
dc.contributor.referees2Lattes.pt_BR.fl_str_mv http://lattes.cnpq.br/3641504724164977
dc.contributor.author.fl_str_mv Marques, Julliana Caroline Gonçalves de Araújo Silva
dc.contributor.advisor1.fl_str_mv Abreu, Marjory Cristiany da Costa
contributor_str_mv Abreu, Marjory Cristiany da Costa
dc.subject.por.fl_str_mv Online handwritten signature
Feature selection
Clustering
SVC2004
xLongSignDB
topic Online handwritten signature
Feature selection
Clustering
SVC2004
xLongSignDB
description Handwritten signature is one of the oldest and most accepted biometric authentication methods for human identity establishment in society. With the popularisation of computers and, consequently, computational biometric authentication systems, the signature was chosen for being one of the biometric traits of an individual that is likely to be relatively unique for every person. However, when dealing with biometric data, including signature data, problems related to high dimensional space, can be generated. Among other issues, irrelevant, redundant data and noise are the most significant, as they result in a decreased of identification accuracy. Thus, it is necessary to reduce the space by selecting the smallest set of features that contain the most discriminative features, increasing the accuracy of the system. In this way, our proposal in this work is to analyse the impact of feature selection on individuals identification accuracy based on the handwritten online signature. For this, we will use two well-known online signature databases: SVC2004 and xLongSignDB. For the feature selection process, we have applied two filter and one wrapper methods. Then, the resulted datasets are evaluated by classification algorithms and validated with a clustering technique. Besides, we have used a statistical test to corroborate our conclusions. Experiments presented satisfactory results when using a smaller number of features which are more representative, showing that we reached an average accuracy of over 98\% for both datasets which were validated with the clustering methods, which achieved an average accuracy over 80\% (SVC2004) and 70\% (xLongSignDB).
publishDate 2021
dc.date.accessioned.fl_str_mv 2021-04-06T19:02:41Z
dc.date.available.fl_str_mv 2021-04-06T19:02:41Z
dc.date.issued.fl_str_mv 2021-01-29
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv MARQUES, Julliana Caroline Gonçalves de Araújo Silva. The impact of feature selection methods on online handwritten signature by using clustering-based analysis. 2021. 68f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.
dc.identifier.uri.fl_str_mv https://repositorio.ufrn.br/handle/123456789/32052
identifier_str_mv MARQUES, Julliana Caroline Gonçalves de Araújo Silva. The impact of feature selection methods on online handwritten signature by using clustering-based analysis. 2021. 68f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2021.
url https://repositorio.ufrn.br/handle/123456789/32052
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal do Rio Grande do Norte
dc.publisher.program.fl_str_mv PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO
dc.publisher.initials.fl_str_mv UFRN
dc.publisher.country.fl_str_mv Brasil
publisher.none.fl_str_mv Universidade Federal do Rio Grande do Norte
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFRN
instname:Universidade Federal do Rio Grande do Norte (UFRN)
instacron:UFRN
instname_str Universidade Federal do Rio Grande do Norte (UFRN)
instacron_str UFRN
institution UFRN
reponame_str Repositório Institucional da UFRN
collection Repositório Institucional da UFRN
bitstream.url.fl_str_mv https://repositorio.ufrn.br/bitstream/123456789/32052/2/Impactfeatureselection_Marques_2021.pdf.txt
https://repositorio.ufrn.br/bitstream/123456789/32052/3/Impactfeatureselection_Marques_2021.pdf.jpg
https://repositorio.ufrn.br/bitstream/123456789/32052/1/Impactfeatureselection_Marques_2021.pdf
bitstream.checksum.fl_str_mv b25a673690342a0af1cc294f2d2e7daf
15c2f8b51598240c167906eca46bdc81
9853cf5d244d9cd7fa2f776801d946c3
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFRN - Universidade Federal do Rio Grande do Norte (UFRN)
repository.mail.fl_str_mv
_version_ 1802117597867540480