A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features

Detalhes bibliográficos
Autor(a) principal: Moro, S.
Data de Publicação: 2017
Outros Autores: Cortez, P., Rita, P.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10071/15400
Resumo: The need to leverage knowledge through data mining has driven enterprises in a demand for more data. However, there is a gap between the availability of data and the application of extracted knowledge for improving decision support. In fact, more data do not necessarily imply better predictive data-driven marketing models, since it is often the case that the problem domain requires a deeper characterization. Aiming at such characterization, we propose a framework drawn on three feature selection strategies, where the goal is to unveil novel features that can effectively increase the value of data by providing a richer characterization of the problem domain. Such strategies involve encompassing context (e.g., social and economic variables), evaluating past history, and disaggregate the main problem into smaller but interesting subproblems. The framework is evaluated through an empirical analysis for a real bank telemarketing application, with the results proving the benefits of such approach, as the area under the receiver operating characteristic curve increased with each stage, improving previous model in terms of predictive performance.
id RCAP_95e6c68001e3e5cc70884738762a8c8b
oai_identifier_str oai:repositorio.iscte-iul.pt:10071/15400
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel featuresFeature selectionDecision supportData miningTelemarketingBank marketingThe need to leverage knowledge through data mining has driven enterprises in a demand for more data. However, there is a gap between the availability of data and the application of extracted knowledge for improving decision support. In fact, more data do not necessarily imply better predictive data-driven marketing models, since it is often the case that the problem domain requires a deeper characterization. Aiming at such characterization, we propose a framework drawn on three feature selection strategies, where the goal is to unveil novel features that can effectively increase the value of data by providing a richer characterization of the problem domain. Such strategies involve encompassing context (e.g., social and economic variables), evaluating past history, and disaggregate the main problem into smaller but interesting subproblems. The framework is evaluated through an empirical analysis for a real bank telemarketing application, with the results proving the benefits of such approach, as the area under the receiver operating characteristic curve increased with each stage, improving previous model in terms of predictive performance.Springer2018-03-20T10:52:25Z2017-01-01T00:00:00Z20172019-04-05T16:19:41Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10071/15400eng0941-064310.1007/s00521-015-2157-8Moro, S.Cortez, P.Rita, P.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-09T17:41:56Zoai:repositorio.iscte-iul.pt:10071/15400Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T22:19:33.347416Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
title A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
spellingShingle A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
Moro, S.
Feature selection
Decision support
Data mining
Telemarketing
Bank marketing
title_short A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
title_full A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
title_fullStr A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
title_full_unstemmed A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
title_sort A framework for increasing the value of predictive data-driven models by enriching problem domain characterization with novel features
author Moro, S.
author_facet Moro, S.
Cortez, P.
Rita, P.
author_role author
author2 Cortez, P.
Rita, P.
author2_role author
author
dc.contributor.author.fl_str_mv Moro, S.
Cortez, P.
Rita, P.
dc.subject.por.fl_str_mv Feature selection
Decision support
Data mining
Telemarketing
Bank marketing
topic Feature selection
Decision support
Data mining
Telemarketing
Bank marketing
description The need to leverage knowledge through data mining has driven enterprises in a demand for more data. However, there is a gap between the availability of data and the application of extracted knowledge for improving decision support. In fact, more data do not necessarily imply better predictive data-driven marketing models, since it is often the case that the problem domain requires a deeper characterization. Aiming at such characterization, we propose a framework drawn on three feature selection strategies, where the goal is to unveil novel features that can effectively increase the value of data by providing a richer characterization of the problem domain. Such strategies involve encompassing context (e.g., social and economic variables), evaluating past history, and disaggregate the main problem into smaller but interesting subproblems. The framework is evaluated through an empirical analysis for a real bank telemarketing application, with the results proving the benefits of such approach, as the area under the receiver operating characteristic curve increased with each stage, improving previous model in terms of predictive performance.
publishDate 2017
dc.date.none.fl_str_mv 2017-01-01T00:00:00Z
2017
2018-03-20T10:52:25Z
2019-04-05T16:19:41Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10071/15400
url http://hdl.handle.net/10071/15400
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 0941-0643
10.1007/s00521-015-2157-8
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Springer
publisher.none.fl_str_mv Springer
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134755051012096