Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10773/31377 |
Resumo: | As humans our vision is one of the most developed sense, we are able to identify and recognize intricate patterns in everything we see. We can perceive colours, shapes, textures, luminance and shadows and with these characteristics we are easily able to process images. Analysing images with the machine learning cooperation humans can monitorize, detect and anticipate the anthropogenic influences. For example we can monitorize and prevent the erosion of the portuguese coastline, or predict climate changes or even monitorize the water and beach pollution. This thesis aims to develop algorithms to segment and classify aerial images of the Portuguese coast. The first step was the study and development of a manual image segmentation algorithm. A tool was designed to segment and annotate these regions with the labels: rock, sea, sand and people. Thereafter, several low-level characteristics were extracted locally, such as colour, entropy, edges, local binary pattern, and the intensities of each pixel in different colour scales (RGB, gray scale and HSV). Regarding the classification part, different clustering algorithms were studied, in particular the K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH and their performance was tested with the most common metrics, namely precision, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows and Silhouette so that the classification process for each class is as correct as possible. Finally with the clustering method that obtained the best performance (K-Means) was trained and tested. The train and test process is the most critical factor affecting the success of machine learning. The dataset is divided into two parts, one for training and the other for testing. After the machine learning model is trained according to the training data, it is then tested. The K-Means is trained in order to obtain the centroids of each cluster. These centroids are the crucial element for the final classification. It is then calculated the shortest distance between the centroids and the pixel data of the test images. This distance defines at which cluster the pixel of the test image belongs to, classifying it. In the end this approach revealed satisfying results towards the main goal. With the acquired knowledge I would probably chose different low-level features and more low-level features to obtain a more complete and precise segmentation. |
id |
RCAP_182a7844998b60ede06710261cf021f9 |
---|---|
oai_identifier_str |
oai:ria.ua.pt:10773/31377 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zoneImage segmentationImage classificationMachine learningClusteringLow-level featuresAs humans our vision is one of the most developed sense, we are able to identify and recognize intricate patterns in everything we see. We can perceive colours, shapes, textures, luminance and shadows and with these characteristics we are easily able to process images. Analysing images with the machine learning cooperation humans can monitorize, detect and anticipate the anthropogenic influences. For example we can monitorize and prevent the erosion of the portuguese coastline, or predict climate changes or even monitorize the water and beach pollution. This thesis aims to develop algorithms to segment and classify aerial images of the Portuguese coast. The first step was the study and development of a manual image segmentation algorithm. A tool was designed to segment and annotate these regions with the labels: rock, sea, sand and people. Thereafter, several low-level characteristics were extracted locally, such as colour, entropy, edges, local binary pattern, and the intensities of each pixel in different colour scales (RGB, gray scale and HSV). Regarding the classification part, different clustering algorithms were studied, in particular the K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH and their performance was tested with the most common metrics, namely precision, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows and Silhouette so that the classification process for each class is as correct as possible. Finally with the clustering method that obtained the best performance (K-Means) was trained and tested. The train and test process is the most critical factor affecting the success of machine learning. The dataset is divided into two parts, one for training and the other for testing. After the machine learning model is trained according to the training data, it is then tested. The K-Means is trained in order to obtain the centroids of each cluster. These centroids are the crucial element for the final classification. It is then calculated the shortest distance between the centroids and the pixel data of the test images. This distance defines at which cluster the pixel of the test image belongs to, classifying it. In the end this approach revealed satisfying results towards the main goal. With the acquired knowledge I would probably chose different low-level features and more low-level features to obtain a more complete and precise segmentation.A visão humana é um dos sentidos mais desenvolvido, somos capazes de identificar e reconhecer padrões complexos em tudo o que observamos. Conseguimos distinguir diferentes cores, formas, texturas, intensidades de luz, sombras, entre outras, e com essas características somos facilmente capazes de processar imagens. Ao analisar as imagens com a cooperação da aprendizagem automática os humanos conseguem monitorizar, detectar e antecipar as influências antropogénicas. Por exemplo, podemos monitorizar e prevenir a erosão da costa portuguesa, prever as alterações climáticas ou até mesmo monitorizar a poluição da água e das praias. Esta tese tem como objetivo desenvolver algoritmos para segmentar e classificar imagens aéreas da costa portuguesa. O primeiro passo foi o estudo e desenvolvimento de um algoritmo de segmentação manual de imagens. Foi criada uma ferramenta para segmentar e anotar as regiões com as seguintes etiquetas: rocha, mar, areia e pessoas. De seguida, foram extraídas localmente várias características de baixo nível, como a cor, a entropia, as margens, o padrão binário local e as intensidades de cada pixel nas diferentes escalas de cores (RGB, escala cinzenta e HSV). Em relação à parte de classificação, foram estudados diferentes algoritmos de agregação, em particular o K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH e o seu desempenho foi testado através das métricas mais comuns, nomeadamente a precisão, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows e Silhouette para que o processo de classificação de cada classe fosse o mais correto possível. Finalmente, foi treinado e testado o método de agregação que obteve o melhor desempenho (K-Means). O processo de treino e teste é um fator extremamente crítico que afeta o sucesso da aprendizagem automática. O conjunto de dados é dividido em duas partes, uma para treino e outra para teste. Depois de o modelo de aprendizagem automática ter sido treinado de acordo com os dados de treino, o modelo é testado. O K-Means é treinado para obter os centroids de cada cluster. Estes centroids são um elemento crucial para a classificação final. De seguida é calculada a menor distância entre os centroids e os pixeis das imagens de teste. Esta distância vai definir a que cluster pertence o pixel da imagem teste, classificando-o. Para concluir, esta abordagem revelou resultados satisfatórios relativamente ao objetivo principal. Através do conhecimento adquirido, provavelmente escolheria diferentes características de baixo nível e mais características de baixo nível de modo a tornar a segmentação mais completa e precisa.2021-05-17T09:43:37Z2021-02-19T00:00:00Z2021-02-19info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/31377engBorges, Catarina Maria Pintoinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-05-06T04:31:56Zoai:ria.ua.pt:10773/31377Portal AgregadorONGhttps://www.rcaap.pt/oai/openairemluisa.alvim@gmail.comopendoar:71602024-05-06T04:31:56Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone |
title |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone |
spellingShingle |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone Borges, Catarina Maria Pinto Image segmentation Image classification Machine learning Clustering Low-level features |
title_short |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone |
title_full |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone |
title_fullStr |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone |
title_full_unstemmed |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone |
title_sort |
Image processing algorithms for drone aerial imagery from the Portuguese coastal zone |
author |
Borges, Catarina Maria Pinto |
author_facet |
Borges, Catarina Maria Pinto |
author_role |
author |
dc.contributor.author.fl_str_mv |
Borges, Catarina Maria Pinto |
dc.subject.por.fl_str_mv |
Image segmentation Image classification Machine learning Clustering Low-level features |
topic |
Image segmentation Image classification Machine learning Clustering Low-level features |
description |
As humans our vision is one of the most developed sense, we are able to identify and recognize intricate patterns in everything we see. We can perceive colours, shapes, textures, luminance and shadows and with these characteristics we are easily able to process images. Analysing images with the machine learning cooperation humans can monitorize, detect and anticipate the anthropogenic influences. For example we can monitorize and prevent the erosion of the portuguese coastline, or predict climate changes or even monitorize the water and beach pollution. This thesis aims to develop algorithms to segment and classify aerial images of the Portuguese coast. The first step was the study and development of a manual image segmentation algorithm. A tool was designed to segment and annotate these regions with the labels: rock, sea, sand and people. Thereafter, several low-level characteristics were extracted locally, such as colour, entropy, edges, local binary pattern, and the intensities of each pixel in different colour scales (RGB, gray scale and HSV). Regarding the classification part, different clustering algorithms were studied, in particular the K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH and their performance was tested with the most common metrics, namely precision, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows and Silhouette so that the classification process for each class is as correct as possible. Finally with the clustering method that obtained the best performance (K-Means) was trained and tested. The train and test process is the most critical factor affecting the success of machine learning. The dataset is divided into two parts, one for training and the other for testing. After the machine learning model is trained according to the training data, it is then tested. The K-Means is trained in order to obtain the centroids of each cluster. These centroids are the crucial element for the final classification. It is then calculated the shortest distance between the centroids and the pixel data of the test images. This distance defines at which cluster the pixel of the test image belongs to, classifying it. In the end this approach revealed satisfying results towards the main goal. With the acquired knowledge I would probably chose different low-level features and more low-level features to obtain a more complete and precise segmentation. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-05-17T09:43:37Z 2021-02-19T00:00:00Z 2021-02-19 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10773/31377 |
url |
http://hdl.handle.net/10773/31377 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
mluisa.alvim@gmail.com |
_version_ |
1817543779557572608 |