Self-supervised learning methods for label-efficient dental caries classification

Detalhes bibliográficos
Autor(a) principal: Taleb, Aiham
Data de Publicação: 2022
Outros Autores: Rohrer, Csaba, Bergner, Benjamin Sebastian, De Leon, Guilherme, Rodrigues, Jonas de Almeida, Schwendicke, Falk, Lippert, Christoph, Krois, Joachim
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UFRGS
Texto Completo: http://hdl.handle.net/10183/242600
Resumo: High annotation costs are a substantial bottleneck in applying deep learning architectures to clinically relevant use cases, substantiating the need for algorithms to learn from unlabeled data. In this work, we propose employing self-supervised methods. To that end, we trained with three selfsupervised algorithms on a large corpus of unlabeled dental images, which contained 38K bitewing radiographs (BWRs). We then applied the learned neural network representations on tooth-level dental caries classification, for which we utilized labels extracted from electronic health records (EHRs). Finally, a holdout test-set was established, which consisted of 343 BWRs and was annotated by three dental professionals and approved by a senior dentist. This test-set was used to evaluate the fine-tuned caries classification models. Our experimental results demonstrate the obtained gains by pretraining models using self-supervised algorithms. These include improved caries classification performance (6 p.p. increase in sensitivity) and, most importantly, improved label-efficiency. In other words, the resulting models can be fine-tuned using few labels (annotations). Our results show that using as few as 18 annotations can produce ě45% sensitivity, which is comparable to human-level diagnostic performance. This study shows that self-supervision can provide gains in medical image analysis, particularly when obtaining labels is costly and expensive.
id UFRGS-2_323fe569fa0410efd6c83ae05bc9cc1d
oai_identifier_str oai:www.lume.ufrgs.br:10183/242600
network_acronym_str UFRGS-2
network_name_str Repositório Institucional da UFRGS
repository_id_str
spelling Taleb, AihamRohrer, CsabaBergner, Benjamin SebastianDe Leon, GuilhermeRodrigues, Jonas de AlmeidaSchwendicke, FalkLippert, ChristophKrois, Joachim2022-07-15T04:49:39Z20222075-4418http://hdl.handle.net/10183/242600001144806High annotation costs are a substantial bottleneck in applying deep learning architectures to clinically relevant use cases, substantiating the need for algorithms to learn from unlabeled data. In this work, we propose employing self-supervised methods. To that end, we trained with three selfsupervised algorithms on a large corpus of unlabeled dental images, which contained 38K bitewing radiographs (BWRs). We then applied the learned neural network representations on tooth-level dental caries classification, for which we utilized labels extracted from electronic health records (EHRs). Finally, a holdout test-set was established, which consisted of 343 BWRs and was annotated by three dental professionals and approved by a senior dentist. This test-set was used to evaluate the fine-tuned caries classification models. Our experimental results demonstrate the obtained gains by pretraining models using self-supervised algorithms. These include improved caries classification performance (6 p.p. increase in sensitivity) and, most importantly, improved label-efficiency. In other words, the resulting models can be fine-tuned using few labels (annotations). Our results show that using as few as 18 annotations can produce ě45% sensitivity, which is comparable to human-level diagnostic performance. This study shows that self-supervision can provide gains in medical image analysis, particularly when obtaining labels is costly and expensive.application/pdfengDiagnostics. Basel. Vol. 12, no. 5 (2022), 1237, 15 p.Cárie dentáriaUnsupervised methodsSelf-supervised learningRepresentation learningDental caries classificationData driven approachesAnnotation efficient deep learningSelf-supervised learning methods for label-efficient dental caries classificationEstrangeiroinfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFRGSinstname:Universidade Federal do Rio Grande do Sul (UFRGS)instacron:UFRGSTEXT001144806.pdf.txt001144806.pdf.txtExtracted Texttext/plain58156http://www.lume.ufrgs.br/bitstream/10183/242600/2/001144806.pdf.txt092ca2216d269144aefa16b80bfaf2ebMD52ORIGINAL001144806.pdfTexto completo (inglês)application/pdf1013883http://www.lume.ufrgs.br/bitstream/10183/242600/1/001144806.pdf0f8835284dd3df1baab33e37231737cbMD5110183/2426002022-07-16 05:05:45.999489oai:www.lume.ufrgs.br:10183/242600Repositório de PublicaçõesPUBhttps://lume.ufrgs.br/oai/requestopendoar:2022-07-16T08:05:45Repositório Institucional da UFRGS - Universidade Federal do Rio Grande do Sul (UFRGS)false
dc.title.pt_BR.fl_str_mv Self-supervised learning methods for label-efficient dental caries classification
title Self-supervised learning methods for label-efficient dental caries classification
spellingShingle Self-supervised learning methods for label-efficient dental caries classification
Taleb, Aiham
Cárie dentária
Unsupervised methods
Self-supervised learning
Representation learning
Dental caries classification
Data driven approaches
Annotation efficient deep learning
title_short Self-supervised learning methods for label-efficient dental caries classification
title_full Self-supervised learning methods for label-efficient dental caries classification
title_fullStr Self-supervised learning methods for label-efficient dental caries classification
title_full_unstemmed Self-supervised learning methods for label-efficient dental caries classification
title_sort Self-supervised learning methods for label-efficient dental caries classification
author Taleb, Aiham
author_facet Taleb, Aiham
Rohrer, Csaba
Bergner, Benjamin Sebastian
De Leon, Guilherme
Rodrigues, Jonas de Almeida
Schwendicke, Falk
Lippert, Christoph
Krois, Joachim
author_role author
author2 Rohrer, Csaba
Bergner, Benjamin Sebastian
De Leon, Guilherme
Rodrigues, Jonas de Almeida
Schwendicke, Falk
Lippert, Christoph
Krois, Joachim
author2_role author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Taleb, Aiham
Rohrer, Csaba
Bergner, Benjamin Sebastian
De Leon, Guilherme
Rodrigues, Jonas de Almeida
Schwendicke, Falk
Lippert, Christoph
Krois, Joachim
dc.subject.por.fl_str_mv Cárie dentária
topic Cárie dentária
Unsupervised methods
Self-supervised learning
Representation learning
Dental caries classification
Data driven approaches
Annotation efficient deep learning
dc.subject.eng.fl_str_mv Unsupervised methods
Self-supervised learning
Representation learning
Dental caries classification
Data driven approaches
Annotation efficient deep learning
description High annotation costs are a substantial bottleneck in applying deep learning architectures to clinically relevant use cases, substantiating the need for algorithms to learn from unlabeled data. In this work, we propose employing self-supervised methods. To that end, we trained with three selfsupervised algorithms on a large corpus of unlabeled dental images, which contained 38K bitewing radiographs (BWRs). We then applied the learned neural network representations on tooth-level dental caries classification, for which we utilized labels extracted from electronic health records (EHRs). Finally, a holdout test-set was established, which consisted of 343 BWRs and was annotated by three dental professionals and approved by a senior dentist. This test-set was used to evaluate the fine-tuned caries classification models. Our experimental results demonstrate the obtained gains by pretraining models using self-supervised algorithms. These include improved caries classification performance (6 p.p. increase in sensitivity) and, most importantly, improved label-efficiency. In other words, the resulting models can be fine-tuned using few labels (annotations). Our results show that using as few as 18 annotations can produce ě45% sensitivity, which is comparable to human-level diagnostic performance. This study shows that self-supervision can provide gains in medical image analysis, particularly when obtaining labels is costly and expensive.
publishDate 2022
dc.date.accessioned.fl_str_mv 2022-07-15T04:49:39Z
dc.date.issued.fl_str_mv 2022
dc.type.driver.fl_str_mv Estrangeiro
info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10183/242600
dc.identifier.issn.pt_BR.fl_str_mv 2075-4418
dc.identifier.nrb.pt_BR.fl_str_mv 001144806
identifier_str_mv 2075-4418
001144806
url http://hdl.handle.net/10183/242600
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartof.pt_BR.fl_str_mv Diagnostics. Basel. Vol. 12, no. 5 (2022), 1237, 15 p.
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFRGS
instname:Universidade Federal do Rio Grande do Sul (UFRGS)
instacron:UFRGS
instname_str Universidade Federal do Rio Grande do Sul (UFRGS)
instacron_str UFRGS
institution UFRGS
reponame_str Repositório Institucional da UFRGS
collection Repositório Institucional da UFRGS
bitstream.url.fl_str_mv http://www.lume.ufrgs.br/bitstream/10183/242600/2/001144806.pdf.txt
http://www.lume.ufrgs.br/bitstream/10183/242600/1/001144806.pdf
bitstream.checksum.fl_str_mv 092ca2216d269144aefa16b80bfaf2eb
0f8835284dd3df1baab33e37231737cb
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFRGS - Universidade Federal do Rio Grande do Sul (UFRGS)
repository.mail.fl_str_mv
_version_ 1815447798082961408