中文版 | English
题名

Phage-bacterial contig association prediction with a convolutional neural network

作者
发表日期
2022-06-24
DOI
发表期刊
ISSN
1367-4803
EISSN
1367-4811
卷号38期号:1页码:i45-i52
摘要
MOTIVATION: Phage-host associations play important roles in microbial communities. But in natural communities, as opposed to culture-based lab studies where phages are discovered and characterized metagenomically, their hosts are generally not known. Several programs have been developed for predicting which phage infects which host based on various sequence similarity measures or machine learning approaches. These are often based on whole viral and host genomes, but in metagenomics-based studies, we rarely have whole genomes but rather must rely on contigs that are sometimes as short as hundreds of bp long. Therefore, we need programs that predict hosts of phage contigs on the basis of these short contigs. Although most existing programs can be applied to metagenomic datasets for these predictions, their accuracies are generally low. Here, we develop ContigNet, a convolutional neural network-based model capable of predicting phage-host matches based on relatively short contigs, and compare it to previously published VirHostMatcher (VHM) and WIsH. RESULTS: On the validation set, ContigNet achieves 72-85% area under the receiver operating characteristic curve (AUROC) scores, compared to the maximum of 68% by VHM or WIsH for contigs of lengths between 200 bps to 50 kbps. We also apply the model to the Metagenomic Gut Virus (MGV) catalogue, a dataset containing a wide range of draft genomes from metagenomic samples and achieve 60-70% AUROC scores compared to that of VHM and WIsH of 52%. Surprisingly, ContigNet can also be used to predict plasmid-host contig associations with high accuracy, indicating a similar genetic exchange between mobile genetic elements and their hosts. AVAILABILITY AND IMPLEMENTATION: The source code of ContigNet and related datasets can be downloaded from https://github.com/tianqitang1/ContigNet.
相关链接[Scopus记录]
收录类别
语种
英语
学校署名
其他
资助项目
Gordon and Betty Moore Foundation[3779];National Science Foundation[EF-2125142];National Institutes of Health[R01GM120624];
WOS研究方向
Biochemistry & Molecular Biology ; Biotechnology & Applied Microbiology ; Computer Science ; Mathematical & Computational Biology ; Mathematics
WOS类目
Biochemical Research Methods ; Biotechnology & Applied Microbiology ; Computer Science, Interdisciplinary Applications ; Mathematical & Computational Biology ; Statistics & Probability
WOS记录号
WOS:000817250400010
出版者
ESI学科分类
BIOLOGY & BIOCHEMISTRY
Scopus记录号
2-s2.0-85132961504
来源库
Scopus
引用统计
被引频次[WOS]:3
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/352489
专题工学院_海洋科学与工程系
作者单位
1.Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
2.Department of Ocean Science and Engineering,Southern University of Science and Technology,Shenzhen,518055,China
3.Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
推荐引用方式
GB/T 7714
Tang,Tianqi,Hou,Shengwei,Fuhrman,Jed A.,et al. Phage-bacterial contig association prediction with a convolutional neural network[J]. BIOINFORMATICS,2022,38(1):i45-i52.
APA
Tang,Tianqi,Hou,Shengwei,Fuhrman,Jed A.,&Sun,Fengzhu.(2022).Phage-bacterial contig association prediction with a convolutional neural network.BIOINFORMATICS,38(1),i45-i52.
MLA
Tang,Tianqi,et al."Phage-bacterial contig association prediction with a convolutional neural network".BIOINFORMATICS 38.1(2022):i45-i52.
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Tang,Tianqi]的文章
[Hou,Shengwei]的文章
[Fuhrman,Jed A.]的文章
百度学术
百度学术中相似的文章
[Tang,Tianqi]的文章
[Hou,Shengwei]的文章
[Fuhrman,Jed A.]的文章
必应学术
必应学术中相似的文章
[Tang,Tianqi]的文章
[Hou,Shengwei]的文章
[Fuhrman,Jed A.]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。