Decision trees for loss prediction in retail - case of Pingo Doce

Detalhes bibliográficos
Autor(a) principal: Henriques, Mariana Bonito
Data de Publicação: 2020
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10362/100957
Resumo: Project Work presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Knowledge Management and Business Intelligence
id RCAP_335372212f541666077bf903dbf07b20
oai_identifier_str oai:run.unl.pt:10362/100957
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Decision trees for loss prediction in retail - case of Pingo DoceMachine LearningRetailLossClassification AlgorithmClassification TreeDecision TreeProject Work presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Knowledge Management and Business IntelligenceThe use of data mining as a way of solving problems from the widest range of areas with the main purpose of gaining competitive advantage is rising, specially in retail, an extremely competitive sector that requires an even bigger advantage. Additionally, food loss, beyond representing a huge waste of resources, can also be considered a major issue to the retail sector due to the financial losses originated from it. Thus, I proposed to help Pingo Doce, a well-known Portuguese retail company, to solve their food loss issue which, despite being the major cause of a huge drop in the company’s profits, has never been solved till this day. Therefore, this project focuses on the development of a classification algorithm that will allow to predict future significant losses in several fruits sold in certain Pingo Doce stores. To do so I applied a Decision Tree algorithm that, due to its representation in the form of if-then rules, will help to identify the main features that lead to a higher number of losses, namely the period of the year and the category to which each fruit belongs, among others. The dataset provided by the company contains variables that measure the quantity and value of sales, stocks, identified and unidentified losses, over a one-year period, and regarding 81 different fruits and 20 stores from all over the country. Additionally, I created new variables such as the criminality rate of the municipality and the climate class of each store, as well as the seasons and the day of the week in which each observation occurred. All these variables allowed me to create four different datasets that originated four different Classification Trees. The results show that, using a dataset with no information regarding stocks and sales, containing only variables that describe the characteristics of the stores, products and periods of time, as well as the value of product sold per unit of measurement, i.e. the price per unit of measurement of each fruit, it is possible to create a Decision Tree that reaches an accuracy of 74% and correctly predicts 82% of the observations that represent significant losses. The algorithm obtained allowed to identify the variables that are more prone to originate significant losses, namely: the day of the week, the fruit’s category, the season of the year, the position of that week in the respective month and the price at which the product is being sold.Henriques, Roberto André PereiraRUNHenriques, Mariana Bonito2020-07-16T18:36:40Z2020-07-062020-07-06T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/100957TID:202501191enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T04:47:18Zoai:run.unl.pt:10362/100957Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:39:28.273982Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Decision trees for loss prediction in retail - case of Pingo Doce
title Decision trees for loss prediction in retail - case of Pingo Doce
spellingShingle Decision trees for loss prediction in retail - case of Pingo Doce
Henriques, Mariana Bonito
Machine Learning
Retail
Loss
Classification Algorithm
Classification Tree
Decision Tree
title_short Decision trees for loss prediction in retail - case of Pingo Doce
title_full Decision trees for loss prediction in retail - case of Pingo Doce
title_fullStr Decision trees for loss prediction in retail - case of Pingo Doce
title_full_unstemmed Decision trees for loss prediction in retail - case of Pingo Doce
title_sort Decision trees for loss prediction in retail - case of Pingo Doce
author Henriques, Mariana Bonito
author_facet Henriques, Mariana Bonito
author_role author
dc.contributor.none.fl_str_mv Henriques, Roberto André Pereira
RUN
dc.contributor.author.fl_str_mv Henriques, Mariana Bonito
dc.subject.por.fl_str_mv Machine Learning
Retail
Loss
Classification Algorithm
Classification Tree
Decision Tree
topic Machine Learning
Retail
Loss
Classification Algorithm
Classification Tree
Decision Tree
description Project Work presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Knowledge Management and Business Intelligence
publishDate 2020
dc.date.none.fl_str_mv 2020-07-16T18:36:40Z
2020-07-06
2020-07-06T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10362/100957
TID:202501191
url http://hdl.handle.net/10362/100957
identifier_str_mv TID:202501191
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799138010584842240