Mineração de dados para identificação de fatores de reprovação no ensino superior

Dutra, Bruna Luiza

Mineração de dados para identificação de fatores de reprovação no ensino superior

Detalhes bibliográficos
Autor(a) principal:	Dutra, Bruna Luiza
Data de Publicação:	2019
Tipo de documento:	Trabalho de conclusão de curso
Idioma:	por
Título da fonte:	Repositório Institucional da UFU
Texto Completo:	https://repositorio.ufu.br/handle/123456789/28415
Resumo:	Brazil and several countries are facing the problem of students dropout in the universities. Particularly, dropout rates of computer science related courses are among the highest ones, with approximately one of three students receiving the diploma. For example, recent studies showed that some classes of the Information Systems course at Federal University of Uberlândia achieved dropout rates greater than 70%. Among the several reasons for students dropout, one of the most cited in the literature is students retention. In this sense, Data Mining methods can be adopted to find indicators that contribute to student dropout or retention, which can be a relevant tool to alleviate this problem. By considering a data set of technical, social and economic features of students from the Information Systems course at Monte Carmelo Campus, this work aims at evaluating data classification techniques in the problem of predicting the students final performance regarding the discipline of Introduction to Computer Programming. The present study also seeks to identify the factors that most contribute to a student's poor performance. The experiments were performed with the Weka tool. A total of six classification techniques were considered, in addition to two database situations: with and without missing attributes. To compare the results we adopted two metrics: accuracy and F1. Statistical tests attested the good results of the JRIP rule induction model in comparison with the other techniques. Among the indicators that stand out are: family per capita income, average math grade in high school, distance from parents city to the university city, distance from where student lives to the campus, type of school attended in high school and with who the student resides. It is expected that such indicators can assist course managers in the early identification of students who are most likely to fail and also in the planning of actions to reduce dropout rates.

Metadados do item

id	UFU_0cfb88c37475de82349a7cd0c7111913
oai_identifier_str	oai:repositorio.ufu.br:123456789/28415
network_acronym_str	UFU
network_name_str	Repositório Institucional da UFU
repository_id_str
spelling	Mineração de dados para identificação de fatores de reprovação no ensino superiorEvasãoMineração de dadosClassificaçãoFatores de reprovaçãoDropoutData miningData classificationRetention factorsCNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAOBrazil and several countries are facing the problem of students dropout in the universities. Particularly, dropout rates of computer science related courses are among the highest ones, with approximately one of three students receiving the diploma. For example, recent studies showed that some classes of the Information Systems course at Federal University of Uberlândia achieved dropout rates greater than 70%. Among the several reasons for students dropout, one of the most cited in the literature is students retention. In this sense, Data Mining methods can be adopted to find indicators that contribute to student dropout or retention, which can be a relevant tool to alleviate this problem. By considering a data set of technical, social and economic features of students from the Information Systems course at Monte Carmelo Campus, this work aims at evaluating data classification techniques in the problem of predicting the students final performance regarding the discipline of Introduction to Computer Programming. The present study also seeks to identify the factors that most contribute to a student's poor performance. The experiments were performed with the Weka tool. A total of six classification techniques were considered, in addition to two database situations: with and without missing attributes. To compare the results we adopted two metrics: accuracy and F1. Statistical tests attested the good results of the JRIP rule induction model in comparison with the other techniques. Among the indicators that stand out are: family per capita income, average math grade in high school, distance from parents city to the university city, distance from where student lives to the campus, type of school attended in high school and with who the student resides. It is expected that such indicators can assist course managers in the early identification of students who are most likely to fail and also in the planning of actions to reduce dropout rates.Trabalho de Conclusão de Curso (Graduação)O Brasil e diversos países vêm enfrentando o problema da evasão no ensino superior. Especificamente, as taxas de evasão dos cursos da área de computação estão entre as maiores, sendo que aproximadamente um a cada três alunos que ingressam recebe o diploma. No curso noturno de Sistemas de Informação da Universidade Federal de Uberlândia, por exemplo, um estudo recente apontou que houve turmas com taxas de evasão maior do que 70%. Entre os vários motivos para a evasão, um dos mais citados na literatura é a reprovação em disciplinas do curso. Nesse sentido, métodos de Mineração de Dados podem ser adotados para encontrar os fatores que contribuem para o aluno evadir ou reprovar, o que pode ser um ferramenta relevante para a amenização desse problema. Desse modo, partindo de uma bases de dados contendo registros técnicos, sociais e econômicos de alunos do curso de Sistemas de Informação do Campus Monte Carmelo, o objetivo deste trabalho é avaliar técnicas de classificação de dados no problema de predizer o desempenho final do aluno em relação à disciplina de Introdução à Programação de Computadores. O presente estudo também busca identificar os fatores que mais contribuem para o desempenho ruim de um aluno. Os experimentos foram realizados com a ferramenta Weka. Um total de seis técnicas de classificação foram consideradas, além de duas situações da base de dados: com e sem atributos faltantes. Para a comparação dos resultados foram utilizadas as métricas acurácia e medida F1. Através de testes estatísticos, é possível afirmar que o modelo de indução de regras JRIP se destacou dentre as demais técnicas. Entre os indicadores que se destacaram estão a renda per capita familiar, nota média em matemática no ensino médio, distância da cidade dos pais até a cidade da universidade, distância da localidade em que o aluno mora até o campus, tipo de escola que cursou no ensino médio e com quem o aluno reside. Espera-se que esses fatores auxiliem os gestores do curso na identificação precoce dos alunos com mais chances de reprovar e permita também o planejamento de ações a fim de reduzir as taxas de evasão.2021-12-18Universidade Federal de UberlândiaBrasilSistemas de InformaçãoCarneiro, Murillo Guimarãeshttp://lattes.cnpq.br/8158868389973535Gabriel, Paulo Henrique Ribeirohttp://lattes.cnpq.br/3181954061121790Paiva, José Gustavo de Souzahttp://lattes.cnpq.br/4981210260282182Dutra, Bruna Luiza2020-01-21T12:39:06Z2020-01-21T12:39:06Z2019-12-18info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bachelorThesisapplication/pdfDUTRA, Bruna Luiza. Mineração de dados para identificação de fatores de reprovação no ensino superior. 2019. 52 f. Trabalho de Conclusão de Curso (Graduação em Sistemas de Informação) – Universidade Federal de Uberlândia, Uberlândia, 2020.https://repositorio.ufu.br/handle/123456789/28415porhttp://creativecommons.org/licenses/by-nc-nd/3.0/us/info:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFUinstname:Universidade Federal de Uberlândia (UFU)instacron:UFU2022-10-13T19:31:17Zoai:repositorio.ufu.br:123456789/28415Repositório InstitucionalONGhttp://repositorio.ufu.br/oai/requestdiinf@dirbi.ufu.bropendoar:2022-10-13T19:31:17Repositório Institucional da UFU - Universidade Federal de Uberlândia (UFU)false
dc.title.none.fl_str_mv	Mineração de dados para identificação de fatores de reprovação no ensino superior
title	Mineração de dados para identificação de fatores de reprovação no ensino superior
spellingShingle	Mineração de dados para identificação de fatores de reprovação no ensino superior Dutra, Bruna Luiza Evasão Mineração de dados Classificação Fatores de reprovação Dropout Data mining Data classification Retention factors CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
title_short	Mineração de dados para identificação de fatores de reprovação no ensino superior
title_full	Mineração de dados para identificação de fatores de reprovação no ensino superior
title_fullStr	Mineração de dados para identificação de fatores de reprovação no ensino superior
title_full_unstemmed	Mineração de dados para identificação de fatores de reprovação no ensino superior
title_sort	Mineração de dados para identificação de fatores de reprovação no ensino superior
author	Dutra, Bruna Luiza
author_facet	Dutra, Bruna Luiza
author_role	author
dc.contributor.none.fl_str_mv	Carneiro, Murillo Guimarães http://lattes.cnpq.br/8158868389973535 Gabriel, Paulo Henrique Ribeiro http://lattes.cnpq.br/3181954061121790 Paiva, José Gustavo de Souza http://lattes.cnpq.br/4981210260282182
dc.contributor.author.fl_str_mv	Dutra, Bruna Luiza
dc.subject.por.fl_str_mv	Evasão Mineração de dados Classificação Fatores de reprovação Dropout Data mining Data classification Retention factors CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
topic	Evasão Mineração de dados Classificação Fatores de reprovação Dropout Data mining Data classification Retention factors CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description	Brazil and several countries are facing the problem of students dropout in the universities. Particularly, dropout rates of computer science related courses are among the highest ones, with approximately one of three students receiving the diploma. For example, recent studies showed that some classes of the Information Systems course at Federal University of Uberlândia achieved dropout rates greater than 70%. Among the several reasons for students dropout, one of the most cited in the literature is students retention. In this sense, Data Mining methods can be adopted to find indicators that contribute to student dropout or retention, which can be a relevant tool to alleviate this problem. By considering a data set of technical, social and economic features of students from the Information Systems course at Monte Carmelo Campus, this work aims at evaluating data classification techniques in the problem of predicting the students final performance regarding the discipline of Introduction to Computer Programming. The present study also seeks to identify the factors that most contribute to a student's poor performance. The experiments were performed with the Weka tool. A total of six classification techniques were considered, in addition to two database situations: with and without missing attributes. To compare the results we adopted two metrics: accuracy and F1. Statistical tests attested the good results of the JRIP rule induction model in comparison with the other techniques. Among the indicators that stand out are: family per capita income, average math grade in high school, distance from parents city to the university city, distance from where student lives to the campus, type of school attended in high school and with who the student resides. It is expected that such indicators can assist course managers in the early identification of students who are most likely to fail and also in the planning of actions to reduce dropout rates.
publishDate	2019
dc.date.none.fl_str_mv	2019-12-18 2020-01-21T12:39:06Z 2020-01-21T12:39:06Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/bachelorThesis
format	bachelorThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	DUTRA, Bruna Luiza. Mineração de dados para identificação de fatores de reprovação no ensino superior. 2019. 52 f. Trabalho de Conclusão de Curso (Graduação em Sistemas de Informação) – Universidade Federal de Uberlândia, Uberlândia, 2020. https://repositorio.ufu.br/handle/123456789/28415
identifier_str_mv	DUTRA, Bruna Luiza. Mineração de dados para identificação de fatores de reprovação no ensino superior. 2019. 52 f. Trabalho de Conclusão de Curso (Graduação em Sistemas de Informação) – Universidade Federal de Uberlândia, Uberlândia, 2020.
url	https://repositorio.ufu.br/handle/123456789/28415
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	http://creativecommons.org/licenses/by-nc-nd/3.0/us/ info:eu-repo/semantics/openAccess
rights_invalid_str_mv	http://creativecommons.org/licenses/by-nc-nd/3.0/us/
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal de Uberlândia Brasil Sistemas de Informação
publisher.none.fl_str_mv	Universidade Federal de Uberlândia Brasil Sistemas de Informação
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFU instname:Universidade Federal de Uberlândia (UFU) instacron:UFU
instname_str	Universidade Federal de Uberlândia (UFU)
instacron_str	UFU
institution	UFU
reponame_str	Repositório Institucional da UFU
collection	Repositório Institucional da UFU
repository.name.fl_str_mv	Repositório Institucional da UFU - Universidade Federal de Uberlândia (UFU)
repository.mail.fl_str_mv	diinf@dirbi.ufu.br
_version_	1813711436484968448

Mineração de dados para identificação de fatores de reprovação no ensino superior

Registros relacionados