Financial credit risk assessment via learning-based hashing

Detalhes bibliográficos
Autor(a) principal: Ribeiro, Bernardete Martins
Data de Publicação: 2017
Outros Autores: Chen, Ning
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10316/87225
https://doi.org/10.3233/IDT-170286
Resumo: With the increasing amount of financial data produced today, the problem of finding the k-nearest neighbors to the query point in high-dimensional space is itself of importance to access the financial credit risk. Binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The idea is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. By exploring out-of-sample extension to test data it is possible to set forth a go-forward strategy to establish a fast retrieval model of companies' status thereby rendering the stakeholders' evaluation task very efficiently. First, we use semi-supervised learning-based hashing to take into account the pairwise information for constructing the weight adjacency graph matrix needed or building the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash function. Third, the k-bit binary code for the test data is efficiently found in the recall phase. Experimental results on financial data demonstrated the proposed approach showed the applicability and advantages of learning-based hashing to credit risk assessment.
id RCAP_7ffa63bdc98162848aa1500cca003841
oai_identifier_str oai:estudogeral.uc.pt:10316/87225
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Financial credit risk assessment via learning-based hashingHashing methodFinancial credit riskGeneralised regression neural networkBinary embeddingK-bits codeWith the increasing amount of financial data produced today, the problem of finding the k-nearest neighbors to the query point in high-dimensional space is itself of importance to access the financial credit risk. Binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The idea is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. By exploring out-of-sample extension to test data it is possible to set forth a go-forward strategy to establish a fast retrieval model of companies' status thereby rendering the stakeholders' evaluation task very efficiently. First, we use semi-supervised learning-based hashing to take into account the pairwise information for constructing the weight adjacency graph matrix needed or building the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash function. Third, the k-bit binary code for the test data is efficiently found in the recall phase. Experimental results on financial data demonstrated the proposed approach showed the applicability and advantages of learning-based hashing to credit risk assessment.IOS Press2017info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10316/87225http://hdl.handle.net/10316/87225https://doi.org/10.3233/IDT-170286eng1875-8843 (E)1872-4981 (P)https://content.iospress.com/articles/intelligent-decision-technologies/idt286Ribeiro, Bernardete MartinsChen, Ninginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2021-08-18T08:43:24Zoai:estudogeral.uc.pt:10316/87225Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:08:11.760776Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Financial credit risk assessment via learning-based hashing
title Financial credit risk assessment via learning-based hashing
spellingShingle Financial credit risk assessment via learning-based hashing
Ribeiro, Bernardete Martins
Hashing method
Financial credit risk
Generalised regression neural network
Binary embedding
K-bits code
title_short Financial credit risk assessment via learning-based hashing
title_full Financial credit risk assessment via learning-based hashing
title_fullStr Financial credit risk assessment via learning-based hashing
title_full_unstemmed Financial credit risk assessment via learning-based hashing
title_sort Financial credit risk assessment via learning-based hashing
author Ribeiro, Bernardete Martins
author_facet Ribeiro, Bernardete Martins
Chen, Ning
author_role author
author2 Chen, Ning
author2_role author
dc.contributor.author.fl_str_mv Ribeiro, Bernardete Martins
Chen, Ning
dc.subject.por.fl_str_mv Hashing method
Financial credit risk
Generalised regression neural network
Binary embedding
K-bits code
topic Hashing method
Financial credit risk
Generalised regression neural network
Binary embedding
K-bits code
description With the increasing amount of financial data produced today, the problem of finding the k-nearest neighbors to the query point in high-dimensional space is itself of importance to access the financial credit risk. Binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The idea is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. By exploring out-of-sample extension to test data it is possible to set forth a go-forward strategy to establish a fast retrieval model of companies' status thereby rendering the stakeholders' evaluation task very efficiently. First, we use semi-supervised learning-based hashing to take into account the pairwise information for constructing the weight adjacency graph matrix needed or building the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash function. Third, the k-bit binary code for the test data is efficiently found in the recall phase. Experimental results on financial data demonstrated the proposed approach showed the applicability and advantages of learning-based hashing to credit risk assessment.
publishDate 2017
dc.date.none.fl_str_mv 2017
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10316/87225
http://hdl.handle.net/10316/87225
https://doi.org/10.3233/IDT-170286
url http://hdl.handle.net/10316/87225
https://doi.org/10.3233/IDT-170286
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1875-8843 (E)
1872-4981 (P)
https://content.iospress.com/articles/intelligent-decision-technologies/idt286
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv IOS Press
publisher.none.fl_str_mv IOS Press
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799133974226796544