Predicting direct marketing response in banking: comparison of class imbalance methods

Vera Miguéis; Ana Camanho; José Luís Borges

Bibliographic details
Main author:	Vera Miguéis
Publication date:	2017
Other authors:	Ana Camanho, José Luís Borges
Document type:	Article
Language:	English
Source title:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text:	http://repositorio.inesctec.pt/handle/123456789/5774 http://dx.doi.org/10.1007/s11628-016-0332-3
Abstract:	Customer response is an important topic in direct marketing. This study proposes a data mining response model based on random forests to support the definition of target customers for banking campaigns. Class imbalance is a typical problem in telemarketing that can affect the performance of data mining techniques. This study also contributes to the literature by exploring the use of class imbalance methods in the banking context. The performance of an undersampling method (the EasyEnsemble algorithm) is compared with that of an oversampling method (the Synthetic Minority Oversampling Technique) in order to determine the most appropriate specification. The importance of the features included in the response model is also explored. In particular, discriminative performance was enhanced by the inclusion of demographic information, contact details and socio-economic features. Random forests, supported by an undersampling algorithm, presented very high prediction performance, outperforming the other techniques explored.
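
The two resampling ideas named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy data and the parameter values below are assumptions made for illustration, and only the resampling step is shown (the study pairs it with random forests, which are omitted here). The undersampling function follows the EasyEnsemble idea of drawing several balanced subsets of the majority class; the oversampling function follows the SMOTE idea of interpolating between a minority example and one of its nearest minority neighbours.

```python
import random


def easy_ensemble_subsets(majority, minority, n_subsets, rng):
    """Build n_subsets balanced training sets (EasyEnsemble-style).

    Each set keeps every minority row and pairs it with an equally
    sized random sample of majority rows drawn without replacement.
    A classifier would then be trained on each set and the
    predictions combined; that step is not shown here.
    """
    return [
        (rng.sample(majority, len(minority)), list(minority))
        for _ in range(n_subsets)
    ]


def smote(minority, n_synthetic, k, rng):
    """Create n_synthetic rows by SMOTE-style interpolation.

    For each new row: pick a minority row, pick one of its k nearest
    minority neighbours (squared Euclidean distance), and place the
    new row at a random position on the segment between the two.
    """
    synthetic = []
    for _ in range(n_synthetic):
        base = rng.choice(minority)
        neighbours = sorted(
            (row for row in minority if row is not base),
            key=lambda row: sum((a - b) ** 2 for a, b in zip(base, row)),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, other))
        )
    return synthetic


# Toy data invented for illustration: two numeric features per customer,
# 90 non-responders (majority) and 10 responders (minority).
rng = random.Random(0)
majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(90)]
minority = [(rng.gauss(2, 1), rng.gauss(2, 1)) for _ in range(10)]

subsets = easy_ensemble_subsets(majority, minority, n_subsets=5, rng=rng)
synthetic = smote(minority, n_synthetic=80, k=3, rng=rng)
```

In this sketch undersampling leaves each training set small (10 rows per class) but produces several of them, whereas oversampling grows the minority class to the size of the majority class with synthetic rows.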

Item metadata

id	RCAP_9a8c9f09f91408b52978b7be021cf57d
oai_identifier_str	oai:repositorio.inesctec.pt:123456789/5774
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
dc.title.none.fl_str_mv	Predicting direct marketing response in banking: comparison of class imbalance methods
title	Predicting direct marketing response in banking: comparison of class imbalance methods
author	Vera Miguéis
author_facet	Vera Miguéis Ana Camanho José Luís Borges
author_role	author
author2	Ana Camanho José Luís Borges
author2_role	author author
dc.contributor.author.fl_str_mv	Vera Miguéis Ana Camanho José Luís Borges
publishDate	2017
dc.date.none.fl_str_mv	2017-01-01T00:00:00Z 2017 2018-01-09T16:12:43Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://repositorio.inesctec.pt/handle/123456789/5774 http://dx.doi.org/10.1007/s11628-016-0332-3
url	http://repositorio.inesctec.pt/handle/123456789/5774 http://dx.doi.org/10.1007/s11628-016-0332-3
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv	embargoedAccess
dc.format.none.fl_str_mv	application/pdf
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
_version_	1799131604172406785
