FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease

Detalhes bibliográficos
Autor(a) principal: Asif, Muhammad
Data de Publicação: 2019
Outros Autores: Vicente, Astrid M., Couto, Francisco M.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.18/6508
Resumo: In recent years, the technological advances for capturing genetic variation in large populations led to the identification of large numbers of putative or disease-causing variants. However, their mechanistic understanding is lagging far behind and has posed new challenges regarding their relevance for disease phenotypes, particularly for common complex disorders. In this study, we propose a systematic pipeline to infer biological meaning from genetic variants, namely rare Copy Number Variants (CNVs). The pipeline consists of three modules that seek to (1) improve genetic data quality by excluding low confidence CNVs, (2) identify disrupted biological processes, and (3) aggregate similar enriched biological processes terms using semantic similarity. The proposed pipeline was applied to CNVs from individuals diagnosed with Autism Spectrum Disorder (ASD). We found that rare CNVs disrupting brain expressed genes dysregulated a wide range of biological processes, such as nervous system development and protein polyubiquitination. The disrupted biological processes identified in ASD patients were in accordance with previous findings. This coherence with literature indicates the feasibility of the proposed pipeline in interpreting the biological role of genetic variants in complex disease development. The suggested pipeline is easily adjustable at each step and its independence from any specific dataset and software makes it an effective tool in analyzing existing genetic resources. The FunVar pipeline is available at https://github.com/lasigeBioTM/FunVar and includes pre and post processing steps to effectively interpret biological mechanisms of putative disease causing genetic variants.
id RCAP_485dca97832fadbaeee340fb7b899ca7
oai_identifier_str oai:repositorio.insa.pt:10400.18/6508
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex diseaseComplex DiseasesFunctional Enrichment AnalysisGenetic VariantsSemantic SimilarityPerturbações do Desenvolvimento Infantil e Saúde MentalIn recent years, the technological advances for capturing genetic variation in large populations led to the identification of large numbers of putative or disease-causing variants. However, their mechanistic understanding is lagging far behind and has posed new challenges regarding their relevance for disease phenotypes, particularly for common complex disorders. In this study, we propose a systematic pipeline to infer biological meaning from genetic variants, namely rare Copy Number Variants (CNVs). The pipeline consists of three modules that seek to (1) improve genetic data quality by excluding low confidence CNVs, (2) identify disrupted biological processes, and (3) aggregate similar enriched biological processes terms using semantic similarity. The proposed pipeline was applied to CNVs from individuals diagnosed with Autism Spectrum Disorder (ASD). We found that rare CNVs disrupting brain expressed genes dysregulated a wide range of biological processes, such as nervous system development and protein polyubiquitination. The disrupted biological processes identified in ASD patients were in accordance with previous findings. This coherence with literature indicates the feasibility of the proposed pipeline in interpreting the biological role of genetic variants in complex disease development. The suggested pipeline is easily adjustable at each step and its independence from any specific dataset and software makes it an effective tool in analyzing existing genetic resources. The FunVar pipeline is available at https://github.com/lasigeBioTM/FunVar and includes pre and post processing steps to effectively interpret biological mechanisms of putative disease causing genetic variants.Highlights: For functional inference of rare CNVs, FunVar includes pre and post processing of CNVs; Putative disease-causing variants aggregate in disease related biological processes; Rare CNVs from ASD cases disrupt neural mechanisms e.g. nervous system development.The work was supported by Portuguese Fundação para a Ciência e Tecnologia (FCT) through funding grant to BioISI (Ref: UID/MULTI/04046/2019), LASIGE Research Unit (Ref: UID/CEC/00408/2019), and to DeST: Deep Semantic Tagger project (Ref: PTDC/CCI-BIO/28685/2017). MA was the recipient of BioSys PhD programme fellowship from FCT (Portugal) with references: SFRH/BD/52485/2014.Elsevier/ Academic PressRepositório Científico do Instituto Nacional de SaúdeAsif, MuhammadVicente, Astrid M.Couto, Francisco M.2020-04-24T09:41:27Z2019-08-242019-08-24T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.18/6508engJ Biomed Inform . 2019 Oct;98:103273. doi: 10.1016/j.jbi.2019.103273. Epub 2019 Aug 241532-046410.1016/j.jbi.2019.103273info:eu-repo/semantics/embargoedAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-20T15:41:29Zoai:repositorio.insa.pt:10400.18/6508Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T18:41:11.791752Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
title FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
spellingShingle FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
Asif, Muhammad
Complex Diseases
Functional Enrichment Analysis
Genetic Variants
Semantic Similarity
Perturbações do Desenvolvimento Infantil e Saúde Mental
title_short FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
title_full FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
title_fullStr FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
title_full_unstemmed FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
title_sort FunVar: A systematic pipeline to unravel the convergence patterns of genetic variants in ASD, a paradigmatic complex disease
author Asif, Muhammad
author_facet Asif, Muhammad
Vicente, Astrid M.
Couto, Francisco M.
author_role author
author2 Vicente, Astrid M.
Couto, Francisco M.
author2_role author
author
dc.contributor.none.fl_str_mv Repositório Científico do Instituto Nacional de Saúde
dc.contributor.author.fl_str_mv Asif, Muhammad
Vicente, Astrid M.
Couto, Francisco M.
dc.subject.por.fl_str_mv Complex Diseases
Functional Enrichment Analysis
Genetic Variants
Semantic Similarity
Perturbações do Desenvolvimento Infantil e Saúde Mental
topic Complex Diseases
Functional Enrichment Analysis
Genetic Variants
Semantic Similarity
Perturbações do Desenvolvimento Infantil e Saúde Mental
description In recent years, the technological advances for capturing genetic variation in large populations led to the identification of large numbers of putative or disease-causing variants. However, their mechanistic understanding is lagging far behind and has posed new challenges regarding their relevance for disease phenotypes, particularly for common complex disorders. In this study, we propose a systematic pipeline to infer biological meaning from genetic variants, namely rare Copy Number Variants (CNVs). The pipeline consists of three modules that seek to (1) improve genetic data quality by excluding low confidence CNVs, (2) identify disrupted biological processes, and (3) aggregate similar enriched biological processes terms using semantic similarity. The proposed pipeline was applied to CNVs from individuals diagnosed with Autism Spectrum Disorder (ASD). We found that rare CNVs disrupting brain expressed genes dysregulated a wide range of biological processes, such as nervous system development and protein polyubiquitination. The disrupted biological processes identified in ASD patients were in accordance with previous findings. This coherence with literature indicates the feasibility of the proposed pipeline in interpreting the biological role of genetic variants in complex disease development. The suggested pipeline is easily adjustable at each step and its independence from any specific dataset and software makes it an effective tool in analyzing existing genetic resources. The FunVar pipeline is available at https://github.com/lasigeBioTM/FunVar and includes pre and post processing steps to effectively interpret biological mechanisms of putative disease causing genetic variants.
publishDate 2019
dc.date.none.fl_str_mv 2019-08-24
2019-08-24T00:00:00Z
2020-04-24T09:41:27Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.18/6508
url http://hdl.handle.net/10400.18/6508
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv J Biomed Inform . 2019 Oct;98:103273. doi: 10.1016/j.jbi.2019.103273. Epub 2019 Aug 24
1532-0464
10.1016/j.jbi.2019.103273
dc.rights.driver.fl_str_mv info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv embargoedAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Elsevier/ Academic Press
publisher.none.fl_str_mv Elsevier/ Academic Press
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1817554096132980736