Image processing algorithms for drone aerial imagery from the Portuguese coastal zone

Detalhes bibliográficos
Autor(a) principal: Borges, Catarina Maria Pinto
Data de Publicação: 2021
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10773/31377
Resumo: As humans our vision is one of the most developed sense, we are able to identify and recognize intricate patterns in everything we see. We can perceive colours, shapes, textures, luminance and shadows and with these characteristics we are easily able to process images. Analysing images with the machine learning cooperation humans can monitorize, detect and anticipate the anthropogenic influences. For example we can monitorize and prevent the erosion of the portuguese coastline, or predict climate changes or even monitorize the water and beach pollution. This thesis aims to develop algorithms to segment and classify aerial images of the Portuguese coast. The first step was the study and development of a manual image segmentation algorithm. A tool was designed to segment and annotate these regions with the labels: rock, sea, sand and people. Thereafter, several low-level characteristics were extracted locally, such as colour, entropy, edges, local binary pattern, and the intensities of each pixel in different colour scales (RGB, gray scale and HSV). Regarding the classification part, different clustering algorithms were studied, in particular the K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH and their performance was tested with the most common metrics, namely precision, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows and Silhouette so that the classification process for each class is as correct as possible. Finally with the clustering method that obtained the best performance (K-Means) was trained and tested. The train and test process is the most critical factor affecting the success of machine learning. The dataset is divided into two parts, one for training and the other for testing. After the machine learning model is trained according to the training data, it is then tested. The K-Means is trained in order to obtain the centroids of each cluster. These centroids are the crucial element for the final classification. It is then calculated the shortest distance between the centroids and the pixel data of the test images. This distance defines at which cluster the pixel of the test image belongs to, classifying it. In the end this approach revealed satisfying results towards the main goal. With the acquired knowledge I would probably chose different low-level features and more low-level features to obtain a more complete and precise segmentation.
id RCAP_182a7844998b60ede06710261cf021f9
oai_identifier_str oai:ria.ua.pt:10773/31377
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Image processing algorithms for drone aerial imagery from the Portuguese coastal zoneImage segmentationImage classificationMachine learningClusteringLow-level featuresAs humans our vision is one of the most developed sense, we are able to identify and recognize intricate patterns in everything we see. We can perceive colours, shapes, textures, luminance and shadows and with these characteristics we are easily able to process images. Analysing images with the machine learning cooperation humans can monitorize, detect and anticipate the anthropogenic influences. For example we can monitorize and prevent the erosion of the portuguese coastline, or predict climate changes or even monitorize the water and beach pollution. This thesis aims to develop algorithms to segment and classify aerial images of the Portuguese coast. The first step was the study and development of a manual image segmentation algorithm. A tool was designed to segment and annotate these regions with the labels: rock, sea, sand and people. Thereafter, several low-level characteristics were extracted locally, such as colour, entropy, edges, local binary pattern, and the intensities of each pixel in different colour scales (RGB, gray scale and HSV). Regarding the classification part, different clustering algorithms were studied, in particular the K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH and their performance was tested with the most common metrics, namely precision, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows and Silhouette so that the classification process for each class is as correct as possible. Finally with the clustering method that obtained the best performance (K-Means) was trained and tested. The train and test process is the most critical factor affecting the success of machine learning. The dataset is divided into two parts, one for training and the other for testing. After the machine learning model is trained according to the training data, it is then tested. The K-Means is trained in order to obtain the centroids of each cluster. These centroids are the crucial element for the final classification. It is then calculated the shortest distance between the centroids and the pixel data of the test images. This distance defines at which cluster the pixel of the test image belongs to, classifying it. In the end this approach revealed satisfying results towards the main goal. With the acquired knowledge I would probably chose different low-level features and more low-level features to obtain a more complete and precise segmentation.A visão humana é um dos sentidos mais desenvolvido, somos capazes de identificar e reconhecer padrões complexos em tudo o que observamos. Conseguimos distinguir diferentes cores, formas, texturas, intensidades de luz, sombras, entre outras, e com essas características somos facilmente capazes de processar imagens. Ao analisar as imagens com a cooperação da aprendizagem automática os humanos conseguem monitorizar, detectar e antecipar as influências antropogénicas. Por exemplo, podemos monitorizar e prevenir a erosão da costa portuguesa, prever as alterações climáticas ou até mesmo monitorizar a poluição da água e das praias. Esta tese tem como objetivo desenvolver algoritmos para segmentar e classificar imagens aéreas da costa portuguesa. O primeiro passo foi o estudo e desenvolvimento de um algoritmo de segmentação manual de imagens. Foi criada uma ferramenta para segmentar e anotar as regiões com as seguintes etiquetas: rocha, mar, areia e pessoas. De seguida, foram extraídas localmente várias características de baixo nível, como a cor, a entropia, as margens, o padrão binário local e as intensidades de cada pixel nas diferentes escalas de cores (RGB, escala cinzenta e HSV). Em relação à parte de classificação, foram estudados diferentes algoritmos de agregação, em particular o K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH e o seu desempenho foi testado através das métricas mais comuns, nomeadamente a precisão, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows e Silhouette para que o processo de classificação de cada classe fosse o mais correto possível. Finalmente, foi treinado e testado o método de agregação que obteve o melhor desempenho (K-Means). O processo de treino e teste é um fator extremamente crítico que afeta o sucesso da aprendizagem automática. O conjunto de dados é dividido em duas partes, uma para treino e outra para teste. Depois de o modelo de aprendizagem automática ter sido treinado de acordo com os dados de treino, o modelo é testado. O K-Means é treinado para obter os centroids de cada cluster. Estes centroids são um elemento crucial para a classificação final. De seguida é calculada a menor distância entre os centroids e os pixeis das imagens de teste. Esta distância vai definir a que cluster pertence o pixel da imagem teste, classificando-o. Para concluir, esta abordagem revelou resultados satisfatórios relativamente ao objetivo principal. Através do conhecimento adquirido, provavelmente escolheria diferentes características de baixo nível e mais características de baixo nível de modo a tornar a segmentação mais completa e precisa.2021-05-17T09:43:37Z2021-02-19T00:00:00Z2021-02-19info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/31377engBorges, Catarina Maria Pintoinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T12:00:34Zoai:ria.ua.pt:10773/31377Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:03:16.547984Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
title Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
spellingShingle Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
Borges, Catarina Maria Pinto
Image segmentation
Image classification
Machine learning
Clustering
Low-level features
title_short Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
title_full Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
title_fullStr Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
title_full_unstemmed Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
title_sort Image processing algorithms for drone aerial imagery from the Portuguese coastal zone
author Borges, Catarina Maria Pinto
author_facet Borges, Catarina Maria Pinto
author_role author
dc.contributor.author.fl_str_mv Borges, Catarina Maria Pinto
dc.subject.por.fl_str_mv Image segmentation
Image classification
Machine learning
Clustering
Low-level features
topic Image segmentation
Image classification
Machine learning
Clustering
Low-level features
description As humans our vision is one of the most developed sense, we are able to identify and recognize intricate patterns in everything we see. We can perceive colours, shapes, textures, luminance and shadows and with these characteristics we are easily able to process images. Analysing images with the machine learning cooperation humans can monitorize, detect and anticipate the anthropogenic influences. For example we can monitorize and prevent the erosion of the portuguese coastline, or predict climate changes or even monitorize the water and beach pollution. This thesis aims to develop algorithms to segment and classify aerial images of the Portuguese coast. The first step was the study and development of a manual image segmentation algorithm. A tool was designed to segment and annotate these regions with the labels: rock, sea, sand and people. Thereafter, several low-level characteristics were extracted locally, such as colour, entropy, edges, local binary pattern, and the intensities of each pixel in different colour scales (RGB, gray scale and HSV). Regarding the classification part, different clustering algorithms were studied, in particular the K-Means, Affinity Propagation, MeanShift, Spectral, Agglomerative, DBSCAN, OPTICS, BIRCH and their performance was tested with the most common metrics, namely precision, Rand Index, Mutual Information, Adjusted and Normalized, Homogeneity, Completeness, V-Measure, Fowlkes-Mallows and Silhouette so that the classification process for each class is as correct as possible. Finally with the clustering method that obtained the best performance (K-Means) was trained and tested. The train and test process is the most critical factor affecting the success of machine learning. The dataset is divided into two parts, one for training and the other for testing. After the machine learning model is trained according to the training data, it is then tested. The K-Means is trained in order to obtain the centroids of each cluster. These centroids are the crucial element for the final classification. It is then calculated the shortest distance between the centroids and the pixel data of the test images. This distance defines at which cluster the pixel of the test image belongs to, classifying it. In the end this approach revealed satisfying results towards the main goal. With the acquired knowledge I would probably chose different low-level features and more low-level features to obtain a more complete and precise segmentation.
publishDate 2021
dc.date.none.fl_str_mv 2021-05-17T09:43:37Z
2021-02-19T00:00:00Z
2021-02-19
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10773/31377
url http://hdl.handle.net/10773/31377
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799137687772332032