Englander Institute for Precision Medicine

BiosyntheticSPAdes: reconstructing biosynthetic gene clusters from assembly graphs.

TitleBiosyntheticSPAdes: reconstructing biosynthetic gene clusters from assembly graphs.
Publication TypeJournal Article
Year of Publication2019
AuthorsMeleshko D, Mohimani H, Tracanna V, Hajirasouliha I, Medema MH, Korobeynikov A, Pevzner PA
JournalGenome Res
Volume29
Issue8
Pagination1352-1362
Date Published2019 Aug
ISSN1549-5469
KeywordsContig Mapping, Datasets as Topic, Dental Plaque, Genes, Bacterial, Gingiva, Humans, Internet, Metagenome, Metagenomics, Mouth Mucosa, Multigene Family, Pharynx, Protein Biosynthesis, Software, Tongue
Abstract

Predicting biosynthetic gene clusters (BGCs) is critically important for discovery of antibiotics and other natural products. While BGC prediction from complete genomes is a well-studied problem, predicting BGCs in fragmented genomic assemblies remains challenging. The existing BGC prediction tools often assume that each BGC is encoded within a single contig in the genome assembly, a condition that is violated for most sequenced microbial genomes where BGCs are often scattered through several contigs, making it difficult to reconstruct them. The situation is even more severe in shotgun metagenomics, where the contigs are often short, and the existing tools fail to predict a large fraction of long BGCs. While it is difficult to assemble BGCs in a single contig, the structure of the genome assembly graph often provides clues on how to combine multiple contigs into segments encoding long BGCs. We describe biosyntheticSPAdes, a tool for predicting BGCs in assembly graphs and demonstrate that it greatly improves the reconstruction of BGCs from genomic and metagenomics data sets.

DOI10.1101/gr.243477.118
Alternate JournalGenome Res
PubMed ID31160374
PubMed Central IDPMC6673720
Grant ListP41 GM103484 / GM / NIGMS NIH HHS / United States
T32 GM083937 / GM / NIGMS NIH HHS / United States

Weill Cornell Medicine Englander Institute for Precision Medicine 413 E 69th Street
Belfer Research Building
New York, NY 10021