Financial credit risk assessment via learning-based hashing
Autor(a) principal: | |
---|---|
Data de Publicação: | 2017 |
Outros Autores: | |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10316/87225 https://doi.org/10.3233/IDT-170286 |
Resumo: | With the increasing amount of financial data produced today, the problem of finding the k-nearest neighbors to the query point in high-dimensional space is itself of importance to access the financial credit risk. Binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The idea is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. By exploring out-of-sample extension to test data it is possible to set forth a go-forward strategy to establish a fast retrieval model of companies' status thereby rendering the stakeholders' evaluation task very efficiently. First, we use semi-supervised learning-based hashing to take into account the pairwise information for constructing the weight adjacency graph matrix needed or building the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash function. Third, the k-bit binary code for the test data is efficiently found in the recall phase. Experimental results on financial data demonstrated the proposed approach showed the applicability and advantages of learning-based hashing to credit risk assessment. |
id |
RCAP_7ffa63bdc98162848aa1500cca003841 |
---|---|
oai_identifier_str |
oai:estudogeral.uc.pt:10316/87225 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Financial credit risk assessment via learning-based hashingHashing methodFinancial credit riskGeneralised regression neural networkBinary embeddingK-bits codeWith the increasing amount of financial data produced today, the problem of finding the k-nearest neighbors to the query point in high-dimensional space is itself of importance to access the financial credit risk. Binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The idea is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. By exploring out-of-sample extension to test data it is possible to set forth a go-forward strategy to establish a fast retrieval model of companies' status thereby rendering the stakeholders' evaluation task very efficiently. First, we use semi-supervised learning-based hashing to take into account the pairwise information for constructing the weight adjacency graph matrix needed or building the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash function. Third, the k-bit binary code for the test data is efficiently found in the recall phase. Experimental results on financial data demonstrated the proposed approach showed the applicability and advantages of learning-based hashing to credit risk assessment.IOS Press2017info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10316/87225http://hdl.handle.net/10316/87225https://doi.org/10.3233/IDT-170286eng1875-8843 (E)1872-4981 (P)https://content.iospress.com/articles/intelligent-decision-technologies/idt286Ribeiro, Bernardete MartinsChen, Ninginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2021-08-18T08:43:24Zoai:estudogeral.uc.pt:10316/87225Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:08:11.760776Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Financial credit risk assessment via learning-based hashing |
title |
Financial credit risk assessment via learning-based hashing |
spellingShingle |
Financial credit risk assessment via learning-based hashing Ribeiro, Bernardete Martins Hashing method Financial credit risk Generalised regression neural network Binary embedding K-bits code |
title_short |
Financial credit risk assessment via learning-based hashing |
title_full |
Financial credit risk assessment via learning-based hashing |
title_fullStr |
Financial credit risk assessment via learning-based hashing |
title_full_unstemmed |
Financial credit risk assessment via learning-based hashing |
title_sort |
Financial credit risk assessment via learning-based hashing |
author |
Ribeiro, Bernardete Martins |
author_facet |
Ribeiro, Bernardete Martins Chen, Ning |
author_role |
author |
author2 |
Chen, Ning |
author2_role |
author |
dc.contributor.author.fl_str_mv |
Ribeiro, Bernardete Martins Chen, Ning |
dc.subject.por.fl_str_mv |
Hashing method Financial credit risk Generalised regression neural network Binary embedding K-bits code |
topic |
Hashing method Financial credit risk Generalised regression neural network Binary embedding K-bits code |
description |
With the increasing amount of financial data produced today, the problem of finding the k-nearest neighbors to the query point in high-dimensional space is itself of importance to access the financial credit risk. Binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The idea is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. By exploring out-of-sample extension to test data it is possible to set forth a go-forward strategy to establish a fast retrieval model of companies' status thereby rendering the stakeholders' evaluation task very efficiently. First, we use semi-supervised learning-based hashing to take into account the pairwise information for constructing the weight adjacency graph matrix needed or building the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash function. Third, the k-bit binary code for the test data is efficiently found in the recall phase. Experimental results on financial data demonstrated the proposed approach showed the applicability and advantages of learning-based hashing to credit risk assessment. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10316/87225 http://hdl.handle.net/10316/87225 https://doi.org/10.3233/IDT-170286 |
url |
http://hdl.handle.net/10316/87225 https://doi.org/10.3233/IDT-170286 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
1875-8843 (E) 1872-4981 (P) https://content.iospress.com/articles/intelligent-decision-technologies/idt286 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
IOS Press |
publisher.none.fl_str_mv |
IOS Press |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799133974226796544 |