Machine Learning powered serverless fraud detection

Chostak, Christian

Machine Learning powered serverless fraud detection

Detalhes bibliográficos
Autor(a) principal:	Chostak, Christian
Data de Publicação:	2020
Tipo de documento:	Dissertação
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10400.22/17423
Resumo:	Dissertação de Mestrado em Engenharia Informática

Metadados do item

id	RCAP_143d595c26d4998b0deaad4ac6058869
oai_identifier_str	oai:recipp.ipp.pt:10400.22/17423
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Machine Learning powered serverless fraud detectionServerlessMachine LearningLambdaFraudInformáticaDissertação de Mestrado em Engenharia InformáticaThere is an increasing concern about fraud in all market sectors. Although there is a great fuzz about fraud and fraud detection, just a small fraction of it was fully incorporated into real world applications. Counterfeited documents are reproductions or imitations of the originals ones. The present work aims to fulfill a gap in fraud analysis by automating and identifying those documents in seconds. Generally speaking, a payload containing a suspect fraudulent document will reach an Application Programming Interface gateway, which will redirect the request to Lambda functions and based on the event store it on SQS - Simple Queue Service, this queue will trigger a fleet of micro-services powered by Lambda functions as well. The non-exhaustive list of functions will proceed to read this queue and in the first moment create the metadata of the received document, registering on a Serverless Relational Database, whilst storing the document itself on S3 - Simple Storage Service. After that, it will call the second batch that will start the process of machine learning on the already saved image. Triggered by the finished process, a message will go to the SNS - Simple Notification Service - alerting the user. The output of the given analysis contains a sample of the input document showing where the fraud is if there is one. With the percentage and area given, the operator will be able to see what portion of the image was considered a fraud and from that moment forward, the user will have technical basis to accept the document or not.Existe uma preocupação crescente sobre fraude em todos os setores da sociedade. Apesar de existir grande alvoroço sobre fraude e detecção de fraude, apenas uma pequena parte dela foi implementada em aplicações reais e ainda sim, em setor relacionados a streaming de mídia. Documentos falsificados são reproduções ou imitações, inteiras ou parciais de seus originais. O presente trabalho tenta preencher uma lacuna na análise de fraudes automatizando, e identificando-a em segundos. Em termos gerais, um payload contendo um documento fraudulento atingirá uma Interface de Programação Aplicacional - API, que então direcionará os pedidos para funções Lambda, e, baseado no evento, armazenará em SQS - Serviço de Queue Simples. Esta queue iniciará o gatilho para uma frota de micro-serviços, também executados em Lambda. A lista não exaustiva de funções prosseguirá e lerá os eventos da queue que nessa fase contém apenas um identificador único do arquivo, bem como uma breve descrição informada pelo utilizador, e, em primeiro momento criará a metadata do documento recebido, registrando-o em uma base de dados relacional, enquanto armazena o próprio documento no S3 - Serviço de Armazenamento Simples. Depois disso, iniciará o segundo lote de processamento sobre a imagem já salva, neste momento começam algoritmos de Machine Learning, bem como, processamento habitual de imagem. Iniciado pelo fim do processo, uma mensagem irá passar pelo SNS - Sistema de notificação simples, alertando o utilizador final. O relatório da análise conterá uma amostra do documento que foi processado indicando onde está a fraude, se existir uma. Com a percentagem e área indicada, o utilizador poderá ver quais porções do documento foram possivelmente alteradas e poderá considerar ou não o documento, afinal, terá base técnica para fazê-lo.Instituto Politécnico do Porto. Escola Superior de Tecnologia e GestãoCosta, Ricardo André FernandesRepositório Científico do Instituto Politécnico do PortoChostak, Christian2021-03-11T12:31:33Z202020202020-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.22/17423TID:202636267enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-03-13T13:08:11Zoai:recipp.ipp.pt:10400.22/17423Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:37:04.050231Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Machine Learning powered serverless fraud detection
title	Machine Learning powered serverless fraud detection
spellingShingle	Machine Learning powered serverless fraud detection Chostak, Christian Serverless Machine Learning Lambda Fraud Informática
title_short	Machine Learning powered serverless fraud detection
title_full	Machine Learning powered serverless fraud detection
title_fullStr	Machine Learning powered serverless fraud detection
title_full_unstemmed	Machine Learning powered serverless fraud detection
title_sort	Machine Learning powered serverless fraud detection
author	Chostak, Christian
author_facet	Chostak, Christian
author_role	author
dc.contributor.none.fl_str_mv	Costa, Ricardo André Fernandes Repositório Científico do Instituto Politécnico do Porto
dc.contributor.author.fl_str_mv	Chostak, Christian
dc.subject.por.fl_str_mv	Serverless Machine Learning Lambda Fraud Informática
topic	Serverless Machine Learning Lambda Fraud Informática
description	Dissertação de Mestrado em Engenharia Informática
publishDate	2020
dc.date.none.fl_str_mv	2020 2020 2020-01-01T00:00:00Z 2021-03-11T12:31:33Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10400.22/17423 TID:202636267
url	http://hdl.handle.net/10400.22/17423
identifier_str_mv	TID:202636267
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Instituto Politécnico do Porto. Escola Superior de Tecnologia e Gestão
publisher.none.fl_str_mv	Instituto Politécnico do Porto. Escola Superior de Tecnologia e Gestão
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799131461384667136

Machine Learning powered serverless fraud detection

Registros relacionados