Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10362/103269 |
Resumo: | Human epidermal growth factor receptor 2 (HER2) evaluation commonly requires immunohistochemistry (IHC) tests on breast cancer tissue, in addition to the standard haematoxylin and eosin (H&E) staining tests. Additional costs and time spent on further testing might be avoided if HER2 overexpression could be effectively inferred from H&E stained slides, as a preliminary indication of the IHC result. In this paper, we propose the first method that aims to achieve this goal. The proposed method is based on multiple instance learning (MIL), using a convolutional neural network (CNN) that separately processes H&E stained slide tiles and outputs an IHC label. This CNN is pretrained on IHC stained slide tiles but does not use these data during inference/testing. H&E tiles are extracted from invasive tumour areas segmented with the HASHI algorithm. The individual tile labels are then combined to obtain a single label for the whole slide. The network was trained on slides from the HER2 Scoring Contest dataset (HER2SC) and tested on two disjoint subsets of slides from the HER2SC database and the TCGA-TCIA-BRCA (BRCA) collection. The proposed method attained 83.3% classification accuracy on the HER2SC test set and 53.8% on the BRCA test set. Although further efforts should be devoted to achieving improved performance, the obtained results are promising, suggesting that it is possible to perform HER2 overexpression classification on H&E stained tissue slides. |
id |
RCAP_7748c5f7f20c14e4c3aa1e20be58dcdd |
---|---|
oai_identifier_str |
oai:run.unl.pt:10362/103269 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slidesBreast cancerHER2Weakly-supervised learningMaterials Science(all)InstrumentationEngineering(all)Process Chemistry and TechnologyComputer Science ApplicationsFluid Flow and Transfer ProcessesSDG 3 - Good Health and Well-beingHuman epidermal growth factor receptor 2 (HER2) evaluation commonly requires immunohistochemistry (IHC) tests on breast cancer tissue, in addition to the standard haematoxylin and eosin (H&E) staining tests. Additional costs and time spent on further testing might be avoided if HER2 overexpression could be effectively inferred from H&E stained slides, as a preliminary indication of the IHC result. In this paper, we propose the first method that aims to achieve this goal. The proposed method is based on multiple instance learning (MIL), using a convolutional neural network (CNN) that separately processes H&E stained slide tiles and outputs an IHC label. This CNN is pretrained on IHC stained slide tiles but does not use these data during inference/testing. H&E tiles are extracted from invasive tumour areas segmented with the HASHI algorithm. The individual tile labels are then combined to obtain a single label for the whole slide. The network was trained on slides from the HER2 Scoring Contest dataset (HER2SC) and tested on two disjoint subsets of slides from the HER2SC database and the TCGA-TCIA-BRCA (BRCA) collection. The proposed method attained 83.3% classification accuracy on the HER2SC test set and 53.8% on the BRCA test set. Although further efforts should be devoted to achieving improved performance, the obtained results are promising, suggesting that it is possible to perform HER2 overexpression classification on H&E stained tissue slides.NOVA Medical School|Faculdade de Ciências Médicas (NMS|FCM)RUNOliveira, Sara P.Pinto, João RibeiroGonçalves, TiagoCanas-Marques, RitaCardoso, Maria JoãoOliveira, Hélder P.Cardoso, Jaime S.2020-09-03T02:24:20Z2020-072020-07-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10362/103269eng2076-3417PURE: 19666985https://doi.org/10.3390/app10144728info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T04:48:42Zoai:run.unl.pt:10362/103269Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:39:50.181536Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides |
title |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides |
spellingShingle |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides Oliveira, Sara P. Breast cancer HER2 Weakly-supervised learning Materials Science(all) Instrumentation Engineering(all) Process Chemistry and Technology Computer Science Applications Fluid Flow and Transfer Processes SDG 3 - Good Health and Well-being |
title_short |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides |
title_full |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides |
title_fullStr |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides |
title_full_unstemmed |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides |
title_sort |
Weakly-supervised classification of HER2 expression in breast cancer haematoxylin and eosin stained slides |
author |
Oliveira, Sara P. |
author_facet |
Oliveira, Sara P. Pinto, João Ribeiro Gonçalves, Tiago Canas-Marques, Rita Cardoso, Maria João Oliveira, Hélder P. Cardoso, Jaime S. |
author_role |
author |
author2 |
Pinto, João Ribeiro Gonçalves, Tiago Canas-Marques, Rita Cardoso, Maria João Oliveira, Hélder P. Cardoso, Jaime S. |
author2_role |
author author author author author author |
dc.contributor.none.fl_str_mv |
NOVA Medical School|Faculdade de Ciências Médicas (NMS|FCM) RUN |
dc.contributor.author.fl_str_mv |
Oliveira, Sara P. Pinto, João Ribeiro Gonçalves, Tiago Canas-Marques, Rita Cardoso, Maria João Oliveira, Hélder P. Cardoso, Jaime S. |
dc.subject.por.fl_str_mv |
Breast cancer HER2 Weakly-supervised learning Materials Science(all) Instrumentation Engineering(all) Process Chemistry and Technology Computer Science Applications Fluid Flow and Transfer Processes SDG 3 - Good Health and Well-being |
topic |
Breast cancer HER2 Weakly-supervised learning Materials Science(all) Instrumentation Engineering(all) Process Chemistry and Technology Computer Science Applications Fluid Flow and Transfer Processes SDG 3 - Good Health and Well-being |
description |
Human epidermal growth factor receptor 2 (HER2) evaluation commonly requires immunohistochemistry (IHC) tests on breast cancer tissue, in addition to the standard haematoxylin and eosin (H&E) staining tests. Additional costs and time spent on further testing might be avoided if HER2 overexpression could be effectively inferred from H&E stained slides, as a preliminary indication of the IHC result. In this paper, we propose the first method that aims to achieve this goal. The proposed method is based on multiple instance learning (MIL), using a convolutional neural network (CNN) that separately processes H&E stained slide tiles and outputs an IHC label. This CNN is pretrained on IHC stained slide tiles but does not use these data during inference/testing. H&E tiles are extracted from invasive tumour areas segmented with the HASHI algorithm. The individual tile labels are then combined to obtain a single label for the whole slide. The network was trained on slides from the HER2 Scoring Contest dataset (HER2SC) and tested on two disjoint subsets of slides from the HER2SC database and the TCGA-TCIA-BRCA (BRCA) collection. The proposed method attained 83.3% classification accuracy on the HER2SC test set and 53.8% on the BRCA test set. Although further efforts should be devoted to achieving improved performance, the obtained results are promising, suggesting that it is possible to perform HER2 overexpression classification on H&E stained tissue slides. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-09-03T02:24:20Z 2020-07 2020-07-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10362/103269 |
url |
http://hdl.handle.net/10362/103269 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
2076-3417 PURE: 19666985 https://doi.org/10.3390/app10144728 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799138014950064128 |