Detecting garment and its landmarks
Autor(a) principal: | |
---|---|
Data de Publicação: | 2017 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/107701 |
Resumo: | Garment folding is a task happening daily at our homes, retail and industry. When put into numbers, over a lifetime people spend on average 375 days performing this chore, while employees at a store may fold the same shirt 119 times per day. Despite the associated repetitive characteristics of this task, its automation is still far from being achieved mainly due to the large number of possible configurations that a crumpled piece of clothing may assume. In general, highly deformable objects still present large challenges for both fields of Robotics and Computer Vision. We attempt to offer a contribution to the garment folding automation by addressing the recognition of clothing pieces without much constraints on their pose or wrinkling, mimicking a most realistic scenario as possible. Such capability would enable a folding robot to choose and adapt its execution plan to the current clothing category. Because the considered problem revolves around clothe recognition, this work may also be of the interest of many other clothe related software applications such as recommendation systems existing on e.g., online e-commerce platforms, or intelligent surveillance setups that require tracking of people by their clothing description. Some work has been produced using Machine Learning techniques that, in general, consist on extracting a set of engineered features from the source image and then applying classification algorithms (e.g., Support Vector Machines) to find the associated clothing category and or pose. With the recent success of Convolutional Neural Networks, where features extraction is incorporated in the learning process, on the object classification problem, these have been preferred in favor of the previous pipelines. We apply Deep Learning techniques on images containing a single piece of clothing in a flat, wrinkled and semi-folded pose, existing on a clean background with the goal of classify and localize each piece. Furthermore, its relevant landmarks (shoulders, legs, crotch, etc) are equally treated. We train and evaluate our solution using the datasets produced by CTU at the CloPeMa project. |
id |
RCAP_507bd8a5e391c92094f045507418231a |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/107701 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Detecting garment and its landmarksEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringGarment folding is a task happening daily at our homes, retail and industry. When put into numbers, over a lifetime people spend on average 375 days performing this chore, while employees at a store may fold the same shirt 119 times per day. Despite the associated repetitive characteristics of this task, its automation is still far from being achieved mainly due to the large number of possible configurations that a crumpled piece of clothing may assume. In general, highly deformable objects still present large challenges for both fields of Robotics and Computer Vision. We attempt to offer a contribution to the garment folding automation by addressing the recognition of clothing pieces without much constraints on their pose or wrinkling, mimicking a most realistic scenario as possible. Such capability would enable a folding robot to choose and adapt its execution plan to the current clothing category. Because the considered problem revolves around clothe recognition, this work may also be of the interest of many other clothe related software applications such as recommendation systems existing on e.g., online e-commerce platforms, or intelligent surveillance setups that require tracking of people by their clothing description. Some work has been produced using Machine Learning techniques that, in general, consist on extracting a set of engineered features from the source image and then applying classification algorithms (e.g., Support Vector Machines) to find the associated clothing category and or pose. With the recent success of Convolutional Neural Networks, where features extraction is incorporated in the learning process, on the object classification problem, these have been preferred in favor of the previous pipelines. We apply Deep Learning techniques on images containing a single piece of clothing in a flat, wrinkled and semi-folded pose, existing on a clean background with the goal of classify and localize each piece. Furthermore, its relevant landmarks (shoulders, legs, crotch, etc) are equally treated. We train and evaluate our solution using the datasets produced by CTU at the CloPeMa project.2017-07-182017-07-18T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/107701TID:201796546engDaniel Fernandes Gomesinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T13:34:35Zoai:repositorio-aberto.up.pt:10216/107701Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:43:01.062714Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Detecting garment and its landmarks |
title |
Detecting garment and its landmarks |
spellingShingle |
Detecting garment and its landmarks Daniel Fernandes Gomes Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Detecting garment and its landmarks |
title_full |
Detecting garment and its landmarks |
title_fullStr |
Detecting garment and its landmarks |
title_full_unstemmed |
Detecting garment and its landmarks |
title_sort |
Detecting garment and its landmarks |
author |
Daniel Fernandes Gomes |
author_facet |
Daniel Fernandes Gomes |
author_role |
author |
dc.contributor.author.fl_str_mv |
Daniel Fernandes Gomes |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
Garment folding is a task happening daily at our homes, retail and industry. When put into numbers, over a lifetime people spend on average 375 days performing this chore, while employees at a store may fold the same shirt 119 times per day. Despite the associated repetitive characteristics of this task, its automation is still far from being achieved mainly due to the large number of possible configurations that a crumpled piece of clothing may assume. In general, highly deformable objects still present large challenges for both fields of Robotics and Computer Vision. We attempt to offer a contribution to the garment folding automation by addressing the recognition of clothing pieces without much constraints on their pose or wrinkling, mimicking a most realistic scenario as possible. Such capability would enable a folding robot to choose and adapt its execution plan to the current clothing category. Because the considered problem revolves around clothe recognition, this work may also be of the interest of many other clothe related software applications such as recommendation systems existing on e.g., online e-commerce platforms, or intelligent surveillance setups that require tracking of people by their clothing description. Some work has been produced using Machine Learning techniques that, in general, consist on extracting a set of engineered features from the source image and then applying classification algorithms (e.g., Support Vector Machines) to find the associated clothing category and or pose. With the recent success of Convolutional Neural Networks, where features extraction is incorporated in the learning process, on the object classification problem, these have been preferred in favor of the previous pipelines. We apply Deep Learning techniques on images containing a single piece of clothing in a flat, wrinkled and semi-folded pose, existing on a clean background with the goal of classify and localize each piece. Furthermore, its relevant landmarks (shoulders, legs, crotch, etc) are equally treated. We train and evaluate our solution using the datasets produced by CTU at the CloPeMa project. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017-07-18 2017-07-18T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/107701 TID:201796546 |
url |
https://hdl.handle.net/10216/107701 |
identifier_str_mv |
TID:201796546 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135745726742528 |