Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies

Detalhes bibliográficos
Autor(a) principal: He, Kuo
Data de Publicação: 2022
Outros Autores: Zhao, Liulan, Yuan, Zihao, Canario, Adelino, Liu, Qiao, Chen, Siyi, Guo, Jiazhong, Luo, Wei, Yan, Haoxiao, Zhang, Dongmei, Li, Lisen, Yang, Song
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.1/18660
Resumo: The largemouth bass (Micropterus salmoides) has become a cosmopolitan species due to its widespread introduction as game or domesticated fish. Here a high-quality chromosome-level reference genome of M. salmoides was produced by combining Illumina paired-end sequencing, PacBio single molecule sequencing technique (SMRT) and High-through chromosome conformation capture (Hi-C) technologies. Ultimately, the genome was assembled into 844.88 Mb with a contig N50 of 15.68 Mb and scaffold N50 length of 35.77 Mb. About 99.9% assembly genome sequences (844.00 Mb) could be anchored to 23 chromosomes, and 98.03% assembly genome sequences could be ordered and directed. The genome contained 38.19% repeat sequences and 2693 noncoding RNAs. A total of 26,370 protein-coding genes from 3415 gene families were predicted, of which 97.69% were functionally annotated. The high-quality genome assembly will be a fundamental resource to study and understand how M. salmoides adapt to novel and changing environments around the world, and also be expected to contribute to the genetic breeding and other research.
id RCAP_ebb5dee9bfc7ed360efc283e2e2823c1
oai_identifier_str oai:sapientia.ualg.pt:10400.1/18660
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologiesThe largemouth bass (Micropterus salmoides) has become a cosmopolitan species due to its widespread introduction as game or domesticated fish. Here a high-quality chromosome-level reference genome of M. salmoides was produced by combining Illumina paired-end sequencing, PacBio single molecule sequencing technique (SMRT) and High-through chromosome conformation capture (Hi-C) technologies. Ultimately, the genome was assembled into 844.88 Mb with a contig N50 of 15.68 Mb and scaffold N50 length of 35.77 Mb. About 99.9% assembly genome sequences (844.00 Mb) could be anchored to 23 chromosomes, and 98.03% assembly genome sequences could be ordered and directed. The genome contained 38.19% repeat sequences and 2693 noncoding RNAs. A total of 26,370 protein-coding genes from 3415 gene families were predicted, of which 97.69% were functionally annotated. The high-quality genome assembly will be a fundamental resource to study and understand how M. salmoides adapt to novel and changing environments around the world, and also be expected to contribute to the genetic breeding and other research.Nature PortfolioSapientiaHe, KuoZhao, LiulanYuan, ZihaoCanario, AdelinoLiu, QiaoChen, SiyiGuo, JiazhongLuo, WeiYan, HaoxiaoZhang, DongmeiLi, LisenYang, Song2022-12-19T14:08:47Z20222022-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.1/18660eng10.1038/s41597-022-01601-12052-4463info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-24T10:30:56Zoai:sapientia.ualg.pt:10400.1/18660Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:08:23.891313Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
title Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
spellingShingle Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
He, Kuo
title_short Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
title_full Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
title_fullStr Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
title_full_unstemmed Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
title_sort Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies
author He, Kuo
author_facet He, Kuo
Zhao, Liulan
Yuan, Zihao
Canario, Adelino
Liu, Qiao
Chen, Siyi
Guo, Jiazhong
Luo, Wei
Yan, Haoxiao
Zhang, Dongmei
Li, Lisen
Yang, Song
author_role author
author2 Zhao, Liulan
Yuan, Zihao
Canario, Adelino
Liu, Qiao
Chen, Siyi
Guo, Jiazhong
Luo, Wei
Yan, Haoxiao
Zhang, Dongmei
Li, Lisen
Yang, Song
author2_role author
author
author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Sapientia
dc.contributor.author.fl_str_mv He, Kuo
Zhao, Liulan
Yuan, Zihao
Canario, Adelino
Liu, Qiao
Chen, Siyi
Guo, Jiazhong
Luo, Wei
Yan, Haoxiao
Zhang, Dongmei
Li, Lisen
Yang, Song
description The largemouth bass (Micropterus salmoides) has become a cosmopolitan species due to its widespread introduction as game or domesticated fish. Here a high-quality chromosome-level reference genome of M. salmoides was produced by combining Illumina paired-end sequencing, PacBio single molecule sequencing technique (SMRT) and High-through chromosome conformation capture (Hi-C) technologies. Ultimately, the genome was assembled into 844.88 Mb with a contig N50 of 15.68 Mb and scaffold N50 length of 35.77 Mb. About 99.9% assembly genome sequences (844.00 Mb) could be anchored to 23 chromosomes, and 98.03% assembly genome sequences could be ordered and directed. The genome contained 38.19% repeat sequences and 2693 noncoding RNAs. A total of 26,370 protein-coding genes from 3415 gene families were predicted, of which 97.69% were functionally annotated. The high-quality genome assembly will be a fundamental resource to study and understand how M. salmoides adapt to novel and changing environments around the world, and also be expected to contribute to the genetic breeding and other research.
publishDate 2022
dc.date.none.fl_str_mv 2022-12-19T14:08:47Z
2022
2022-01-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.1/18660
url http://hdl.handle.net/10400.1/18660
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1038/s41597-022-01601-1
2052-4463
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Nature Portfolio
publisher.none.fl_str_mv Nature Portfolio
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799133330444124160