Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides

Detalhes bibliográficos
Autor(a) principal: Oliveira, Sara P.
Data de Publicação: 2020
Outros Autores: Pinto, João Ribeiro, Gonçalves, Tiago, Canas-Marques, Rita, Cardoso, Maria João, Oliveira, Hélder P., Cardoso, Jaime S.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10362/103269
Resumo: Human epidermal growth factor receptor 2 (HER2) evaluation commonly requires immunohistochemistry (IHC) tests on breast cancer tissue, in addition to the standard haematoxylin and eosin (H&E) staining tests. Additional costs and time spent on further testing might be avoided if HER2 overexpression could be effectively inferred from H&E stained slides, as a preliminary indication of the IHC result. In this paper, we propose the first method that aims to achieve this goal. The proposed method is based on multiple instance learning (MIL), using a convolutional neural network (CNN) that separately processes H&E stained slide tiles and outputs an IHC label. This CNN is pretrained on IHC stained slide tiles but does not use these data during inference/testing. H&E tiles are extracted from invasive tumour areas segmented with the HASHI algorithm. The individual tile labels are then combined to obtain a single label for the whole slide. The network was trained on slides from the HER2 Scoring Contest dataset (HER2SC) and tested on two disjoint subsets of slides from the HER2SC database and the TCGA-TCIA-BRCA (BRCA) collection. The proposed method attained 83.3% classification accuracy on the HER2SC test set and 53.8% on the BRCA test set. Although further efforts should be devoted to achieving improved performance, the obtained results are promising, suggesting that it is possible to perform HER2 overexpression classification on H&E stained tissue slides.
id RCAP_7748c5f7f20c14e4c3aa1e20be58dcdd
oai_identifier_str oai:run.unl.pt:10362/103269
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slidesBreast cancerHER2Weakly-supervised learningMaterials Science(all)InstrumentationEngineering(all)Process Chemistry and TechnologyComputer Science ApplicationsFluid Flow and Transfer ProcessesSDG 3 - Good Health and Well-beingHuman epidermal growth factor receptor 2 (HER2) evaluation commonly requires immunohistochemistry (IHC) tests on breast cancer tissue, in addition to the standard haematoxylin and eosin (H&E) staining tests. Additional costs and time spent on further testing might be avoided if HER2 overexpression could be effectively inferred from H&E stained slides, as a preliminary indication of the IHC result. In this paper, we propose the first method that aims to achieve this goal. The proposed method is based on multiple instance learning (MIL), using a convolutional neural network (CNN) that separately processes H&E stained slide tiles and outputs an IHC label. This CNN is pretrained on IHC stained slide tiles but does not use these data during inference/testing. H&E tiles are extracted from invasive tumour areas segmented with the HASHI algorithm. The individual tile labels are then combined to obtain a single label for the whole slide. The network was trained on slides from the HER2 Scoring Contest dataset (HER2SC) and tested on two disjoint subsets of slides from the HER2SC database and the TCGA-TCIA-BRCA (BRCA) collection. The proposed method attained 83.3% classification accuracy on the HER2SC test set and 53.8% on the BRCA test set. Although further efforts should be devoted to achieving improved performance, the obtained results are promising, suggesting that it is possible to perform HER2 overexpression classification on H&E stained tissue slides.NOVA Medical School|Faculdade de Ciências Médicas (NMS|FCM)RUNOliveira, Sara P.Pinto, João RibeiroGonçalves, TiagoCanas-Marques, RitaCardoso, Maria JoãoOliveira, Hélder P.Cardoso, Jaime S.2020-09-03T02:24:20Z2020-072020-07-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10362/103269eng2076-3417PURE: 19666985https://doi.org/10.3390/app10144728info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T04:48:42Zoai:run.unl.pt:10362/103269Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:39:50.181536Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
title Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
spellingShingle Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
Oliveira, Sara P.
Breast cancer
HER2
Weakly-supervised learning
Materials Science(all)
Instrumentation
Engineering(all)
Process Chemistry and Technology
Computer Science Applications
Fluid Flow and Transfer Processes
SDG 3 - Good Health and Well-being
title_short Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
title_full Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
title_fullStr Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
title_full_unstemmed Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
title_sort Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
author Oliveira, Sara P.
author_facet Oliveira, Sara P.
Pinto, João Ribeiro
Gonçalves, Tiago
Canas-Marques, Rita
Cardoso, Maria João
Oliveira, Hélder P.
Cardoso, Jaime S.
author_role author
author2 Pinto, João Ribeiro
Gonçalves, Tiago
Canas-Marques, Rita
Cardoso, Maria João
Oliveira, Hélder P.
Cardoso, Jaime S.
author2_role author
author
author
author
author
author
dc.contributor.none.fl_str_mv NOVA Medical School|Faculdade de Ciências Médicas (NMS|FCM)
RUN
dc.contributor.author.fl_str_mv Oliveira, Sara P.
Pinto, João Ribeiro
Gonçalves, Tiago
Canas-Marques, Rita
Cardoso, Maria João
Oliveira, Hélder P.
Cardoso, Jaime S.
dc.subject.por.fl_str_mv Breast cancer
HER2
Weakly-supervised learning
Materials Science(all)
Instrumentation
Engineering(all)
Process Chemistry and Technology
Computer Science Applications
Fluid Flow and Transfer Processes
SDG 3 - Good Health and Well-being
topic Breast cancer
HER2
Weakly-supervised learning
Materials Science(all)
Instrumentation
Engineering(all)
Process Chemistry and Technology
Computer Science Applications
Fluid Flow and Transfer Processes
SDG 3 - Good Health and Well-being
description Human epidermal growth factor receptor 2 (HER2) evaluation commonly requires immunohistochemistry (IHC) tests on breast cancer tissue, in addition to the standard haematoxylin and eosin (H&E) staining tests. Additional costs and time spent on further testing might be avoided if HER2 overexpression could be effectively inferred from H&E stained slides, as a preliminary indication of the IHC result. In this paper, we propose the first method that aims to achieve this goal. The proposed method is based on multiple instance learning (MIL), using a convolutional neural network (CNN) that separately processes H&E stained slide tiles and outputs an IHC label. This CNN is pretrained on IHC stained slide tiles but does not use these data during inference/testing. H&E tiles are extracted from invasive tumour areas segmented with the HASHI algorithm. The individual tile labels are then combined to obtain a single label for the whole slide. The network was trained on slides from the HER2 Scoring Contest dataset (HER2SC) and tested on two disjoint subsets of slides from the HER2SC database and the TCGA-TCIA-BRCA (BRCA) collection. The proposed method attained 83.3% classification accuracy on the HER2SC test set and 53.8% on the BRCA test set. Although further efforts should be devoted to achieving improved performance, the obtained results are promising, suggesting that it is possible to perform HER2 overexpression classification on H&E stained tissue slides.
publishDate 2020
dc.date.none.fl_str_mv 2020-09-03T02:24:20Z
2020-07
2020-07-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10362/103269
url http://hdl.handle.net/10362/103269
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 2076-3417
PURE: 19666985
https://doi.org/10.3390/app10144728
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799138014950064128