TY - JOUR
T1 - Comparison of different sequencing strategies for assembling chromosome-level genomes of extremophiles with variable GC content
AU - Zhang, Zhidong
AU - Liu, Guilin
AU - Chen, Yao
AU - Xue, Weizhen
AU - Ji, Qianyue
AU - Xu, Qiwu
AU - Zhang, He
AU - Fan, Guangyi
AU - Huang, He
AU - Jiang, Ling
AU - Chen, Jianwei
N1 - Publisher Copyright:
© 2021 The Authors
PY - 2021/3/19
Y1 - 2021/3/19
N2 - In this study, six bacterial isolates with variable GC, including Escherichia coli as mesophilic reference strain, were selected to compare hybrid assembly strategies based on next-generation sequencing (NGS) of short reads, single-tube long-fragment reads (stLFR) sequencing, and Oxford Nanopore Technologies (ONT) sequencing platforms. We obtained the complete genomes using the hybrid assembler Unicycler based on the NGS and ONT reads; others were de novo assembled using NGS, stLFR, and ONT reads by using different strategies. The contiguity, accuracy, completeness, sequencing costs, and DNA material requirements of the investigated strategies were compared systematically. Although all sequencing data could be assembled into accurate whole-genome sequences, the stLFR sequencing data yield a scaffold with more contiguity with more completeness of gene function than NGS sequencing assemblies. Our research provides a low-cost chromosome-level genome assembly strategy for large-scale sequencing of extremophile genomes with different GC contents.
AB - In this study, six bacterial isolates with variable GC, including Escherichia coli as mesophilic reference strain, were selected to compare hybrid assembly strategies based on next-generation sequencing (NGS) of short reads, single-tube long-fragment reads (stLFR) sequencing, and Oxford Nanopore Technologies (ONT) sequencing platforms. We obtained the complete genomes using the hybrid assembler Unicycler based on the NGS and ONT reads; others were de novo assembled using NGS, stLFR, and ONT reads by using different strategies. The contiguity, accuracy, completeness, sequencing costs, and DNA material requirements of the investigated strategies were compared systematically. Although all sequencing data could be assembled into accurate whole-genome sequences, the stLFR sequencing data yield a scaffold with more contiguity with more completeness of gene function than NGS sequencing assemblies. Our research provides a low-cost chromosome-level genome assembly strategy for large-scale sequencing of extremophile genomes with different GC contents.
KW - Microbial Genomics
KW - Microbiology
KW - Omics
UR - http://www.scopus.com/inward/record.url?scp=85102061162&partnerID=8YFLogxK
U2 - 10.1016/j.isci.2021.102219
DO - 10.1016/j.isci.2021.102219
M3 - 文章
AN - SCOPUS:85102061162
SN - 2589-0042
VL - 24
JO - iScience
JF - iScience
IS - 3
M1 - 102219
ER -