Explaining dimensionality reduction results using Shapley values

Marcílio-Jr, Wilson E. [UNESP]; Eler, Danilo M. [UNESP]

Explaining dimensionality reduction results using Shapley values

Detalhes bibliográficos
Autor(a) principal:	Marcílio-Jr, Wilson E. [UNESP]
Data de Publicação:	2021
Outros Autores:	Eler, Danilo M. [UNESP]
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Institucional da UNESP
Texto Completo:	http://dx.doi.org/10.1016/j.eswa.2021.115020 http://hdl.handle.net/11449/206269
Resumo:	Dimensionality reduction (DR) techniques have been consistently supporting high-dimensional data analysis in various applications. Besides the patterns uncovered by these techniques, the interpretation of DR results based on each feature's contribution to the low-dimensional representation supports new finds through exploratory analysis. Current literature approaches designed to interpret DR techniques do not explain the features’ contributions well since they focus only on the low-dimensional representation or do not consider the relationship among features. This paper presents ClusterShapley to address these problems, using Shapley values to generate explanations of dimensionality reduction techniques and interpret these algorithms using a cluster-oriented analysis. ClusterShapley explains the formation of clusters and the meaning of their relationship, which is useful for exploratory data analysis in various domains. We propose novel visualization techniques to guide the interpretation of features’ contributions on clustering formation and validate our methodology through case studies of publicly available datasets. The results demonstrate our approach's interpretability and analysis power to generate insights about pathologies and patients in different conditions using DR results.

Metadados do item

id	UNSP_366fb4f7ecbd6b3909f628895d75d036
oai_identifier_str	oai:repositorio.unesp.br:11449/206269
network_acronym_str	UNSP
network_name_str	Repositório Institucional da UNESP
repository_id_str	2946
spelling	Explaining dimensionality reduction results using Shapley valuesDimensionality reductionExplainabilityShapley valuesVisualizationDimensionality reduction (DR) techniques have been consistently supporting high-dimensional data analysis in various applications. Besides the patterns uncovered by these techniques, the interpretation of DR results based on each feature's contribution to the low-dimensional representation supports new finds through exploratory analysis. Current literature approaches designed to interpret DR techniques do not explain the features’ contributions well since they focus only on the low-dimensional representation or do not consider the relationship among features. This paper presents ClusterShapley to address these problems, using Shapley values to generate explanations of dimensionality reduction techniques and interpret these algorithms using a cluster-oriented analysis. ClusterShapley explains the formation of clusters and the meaning of their relationship, which is useful for exploratory data analysis in various domains. We propose novel visualization techniques to guide the interpretation of features’ contributions on clustering formation and validate our methodology through case studies of publicly available datasets. The results demonstrate our approach's interpretability and analysis power to generate insights about pathologies and patients in different conditions using DR results.Faculty of Sciences and Technology São Paulo State University (UNESP)Faculty of Sciences and Technology São Paulo State University (UNESP)Universidade Estadual Paulista (Unesp)Marcílio-Jr, Wilson E. [UNESP]Eler, Danilo M. [UNESP]2021-06-25T10:29:21Z2021-06-25T10:29:21Z2021-09-15info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1016/j.eswa.2021.115020Expert Systems with Applications, v. 178.0957-4174http://hdl.handle.net/11449/20626910.1016/j.eswa.2021.1150202-s2.0-85105034882Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengExpert Systems with Applicationsinfo:eu-repo/semantics/openAccess2021-10-23T02:54:11Zoai:repositorio.unesp.br:11449/206269Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T20:27:09.803135Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv	Explaining dimensionality reduction results using Shapley values
title	Explaining dimensionality reduction results using Shapley values
spellingShingle	Explaining dimensionality reduction results using Shapley values Marcílio-Jr, Wilson E. [UNESP] Dimensionality reduction Explainability Shapley values Visualization
title_short	Explaining dimensionality reduction results using Shapley values
title_full	Explaining dimensionality reduction results using Shapley values
title_fullStr	Explaining dimensionality reduction results using Shapley values
title_full_unstemmed	Explaining dimensionality reduction results using Shapley values
title_sort	Explaining dimensionality reduction results using Shapley values
author	Marcílio-Jr, Wilson E. [UNESP]
author_facet	Marcílio-Jr, Wilson E. [UNESP] Eler, Danilo M. [UNESP]
author_role	author
author2	Eler, Danilo M. [UNESP]
author2_role	author
dc.contributor.none.fl_str_mv	Universidade Estadual Paulista (Unesp)
dc.contributor.author.fl_str_mv	Marcílio-Jr, Wilson E. [UNESP] Eler, Danilo M. [UNESP]
dc.subject.por.fl_str_mv	Dimensionality reduction Explainability Shapley values Visualization
topic	Dimensionality reduction Explainability Shapley values Visualization
description	Dimensionality reduction (DR) techniques have been consistently supporting high-dimensional data analysis in various applications. Besides the patterns uncovered by these techniques, the interpretation of DR results based on each feature's contribution to the low-dimensional representation supports new finds through exploratory analysis. Current literature approaches designed to interpret DR techniques do not explain the features’ contributions well since they focus only on the low-dimensional representation or do not consider the relationship among features. This paper presents ClusterShapley to address these problems, using Shapley values to generate explanations of dimensionality reduction techniques and interpret these algorithms using a cluster-oriented analysis. ClusterShapley explains the formation of clusters and the meaning of their relationship, which is useful for exploratory data analysis in various domains. We propose novel visualization techniques to guide the interpretation of features’ contributions on clustering formation and validate our methodology through case studies of publicly available datasets. The results demonstrate our approach's interpretability and analysis power to generate insights about pathologies and patients in different conditions using DR results.
publishDate	2021
dc.date.none.fl_str_mv	2021-06-25T10:29:21Z 2021-06-25T10:29:21Z 2021-09-15
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://dx.doi.org/10.1016/j.eswa.2021.115020 Expert Systems with Applications, v. 178. 0957-4174 http://hdl.handle.net/11449/206269 10.1016/j.eswa.2021.115020 2-s2.0-85105034882
url	http://dx.doi.org/10.1016/j.eswa.2021.115020 http://hdl.handle.net/11449/206269
identifier_str_mv	Expert Systems with Applications, v. 178. 0957-4174 10.1016/j.eswa.2021.115020 2-s2.0-85105034882
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	Expert Systems with Applications
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP
instname_str	Universidade Estadual Paulista (UNESP)
instacron_str	UNESP
institution	UNESP
reponame_str	Repositório Institucional da UNESP
collection	Repositório Institucional da UNESP
repository.name.fl_str_mv	Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_	1808129203857522688

Explaining dimensionality reduction results using Shapley values

Registros relacionados