Using data mining to predict students' academic success

Detalhes bibliográficos
Autor(a) principal: André Filipe Roque Silva
Data de Publicação: 2016
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://hdl.handle.net/10216/110241
Resumo: Currently, universities record large amounts of data about students. Despite its potential to inform decisions regarding the allocation of resources and efforts, this information tends to be overlooked. Educational data mining is a recent research field that focuses on the use of data mining techniques to transform large volumes of educational data into useful and relevant knowledge that can improve the educational processes and decisions. This work intends to propose a set of three models. The first two will use the information available at start of the first semester and second semester, respectively, of the first year of the student's academic path to predict the academic success of the students enrolled at FEUP at the end of that semester. The third model will use the information available at the end of the first year to predict the academic performance of the students enrolled at FEUP at the end of their degree. At the same time, this work also intends to identify the factors that are the most critical to these models. The results of this project could allow college principals to identify students in need of more pedagogical support, as well as students with a high probability of excelling in their studies. It could also allow them to focus their attention on the critical aspects, by implementing mechanisms that tackle students' difficulties. The first step of the developed work consists of data cleaning and preparation processes that normalize the data retrieved from the university's information system and the definition of the derived variables. The second step is an empirical analysis of different algorithms with interpretable models in order to identify the models best suited for the tasks of predicting academic success regarding their performance with the available entry data. Thirdly, an analysis of the generated models will be presented, along with the identification of their main predictive attributes.
id RCAP_6555c27dc6add3a0885e602999fb6667
oai_identifier_str oai:repositorio-aberto.up.pt:10216/110241
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Using data mining to predict students' academic successEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringCurrently, universities record large amounts of data about students. Despite its potential to inform decisions regarding the allocation of resources and efforts, this information tends to be overlooked. Educational data mining is a recent research field that focuses on the use of data mining techniques to transform large volumes of educational data into useful and relevant knowledge that can improve the educational processes and decisions. This work intends to propose a set of three models. The first two will use the information available at start of the first semester and second semester, respectively, of the first year of the student's academic path to predict the academic success of the students enrolled at FEUP at the end of that semester. The third model will use the information available at the end of the first year to predict the academic performance of the students enrolled at FEUP at the end of their degree. At the same time, this work also intends to identify the factors that are the most critical to these models. The results of this project could allow college principals to identify students in need of more pedagogical support, as well as students with a high probability of excelling in their studies. It could also allow them to focus their attention on the critical aspects, by implementing mechanisms that tackle students' difficulties. The first step of the developed work consists of data cleaning and preparation processes that normalize the data retrieved from the university's information system and the definition of the derived variables. The second step is an empirical analysis of different algorithms with interpretable models in order to identify the models best suited for the tasks of predicting academic success regarding their performance with the available entry data. Thirdly, an analysis of the generated models will be presented, along with the identification of their main predictive attributes.2016-02-122016-02-12T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/110241TID:201296535engAndré Filipe Roque Silvainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T13:43:26Zoai:repositorio-aberto.up.pt:10216/110241Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:46:38.299704Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Using data mining to predict students' academic success
title Using data mining to predict students' academic success
spellingShingle Using data mining to predict students' academic success
André Filipe Roque Silva
Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
title_short Using data mining to predict students' academic success
title_full Using data mining to predict students' academic success
title_fullStr Using data mining to predict students' academic success
title_full_unstemmed Using data mining to predict students' academic success
title_sort Using data mining to predict students' academic success
author André Filipe Roque Silva
author_facet André Filipe Roque Silva
author_role author
dc.contributor.author.fl_str_mv André Filipe Roque Silva
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
description Currently, universities record large amounts of data about students. Despite its potential to inform decisions regarding the allocation of resources and efforts, this information tends to be overlooked. Educational data mining is a recent research field that focuses on the use of data mining techniques to transform large volumes of educational data into useful and relevant knowledge that can improve the educational processes and decisions. This work intends to propose a set of three models. The first two will use the information available at start of the first semester and second semester, respectively, of the first year of the student's academic path to predict the academic success of the students enrolled at FEUP at the end of that semester. The third model will use the information available at the end of the first year to predict the academic performance of the students enrolled at FEUP at the end of their degree. At the same time, this work also intends to identify the factors that are the most critical to these models. The results of this project could allow college principals to identify students in need of more pedagogical support, as well as students with a high probability of excelling in their studies. It could also allow them to focus their attention on the critical aspects, by implementing mechanisms that tackle students' difficulties. The first step of the developed work consists of data cleaning and preparation processes that normalize the data retrieved from the university's information system and the definition of the derived variables. The second step is an empirical analysis of different algorithms with interpretable models in order to identify the models best suited for the tasks of predicting academic success regarding their performance with the available entry data. Thirdly, an analysis of the generated models will be presented, along with the identification of their main predictive attributes.
publishDate 2016
dc.date.none.fl_str_mv 2016-02-12
2016-02-12T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/110241
TID:201296535
url https://hdl.handle.net/10216/110241
identifier_str_mv TID:201296535
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799135783915880448