题名 | Phage-bacterial contig association prediction with a convolutional neural network |
作者 | |
发表日期 | 2022-06-24
|
DOI | |
发表期刊 | |
ISSN | 1367-4803
|
EISSN | 1367-4811
|
卷号 | 38期号:1页码:i45-i52 |
摘要 | MOTIVATION: Phage-host associations play important roles in microbial communities. But in natural communities, as opposed to culture-based lab studies where phages are discovered and characterized metagenomically, their hosts are generally not known. Several programs have been developed for predicting which phage infects which host based on various sequence similarity measures or machine learning approaches. These are often based on whole viral and host genomes, but in metagenomics-based studies, we rarely have whole genomes but rather must rely on contigs that are sometimes as short as hundreds of bp long. Therefore, we need programs that predict hosts of phage contigs on the basis of these short contigs. Although most existing programs can be applied to metagenomic datasets for these predictions, their accuracies are generally low. Here, we develop ContigNet, a convolutional neural network-based model capable of predicting phage-host matches based on relatively short contigs, and compare it to previously published VirHostMatcher (VHM) and WIsH. RESULTS: On the validation set, ContigNet achieves 72-85% area under the receiver operating characteristic curve (AUROC) scores, compared to the maximum of 68% by VHM or WIsH for contigs of lengths between 200 bps to 50 kbps. We also apply the model to the Metagenomic Gut Virus (MGV) catalogue, a dataset containing a wide range of draft genomes from metagenomic samples and achieve 60-70% AUROC scores compared to that of VHM and WIsH of 52%. Surprisingly, ContigNet can also be used to predict plasmid-host contig associations with high accuracy, indicating a similar genetic exchange between mobile genetic elements and their hosts. AVAILABILITY AND IMPLEMENTATION: The source code of ContigNet and related datasets can be downloaded from https://github.com/tianqitang1/ContigNet. |
相关链接 | [Scopus记录] |
收录类别 | |
语种 | 英语
|
学校署名 | 其他
|
资助项目 | Gordon and Betty Moore Foundation[3779];National Science Foundation[EF-2125142];National Institutes of Health[R01GM120624];
|
WOS研究方向 | Biochemistry & Molecular Biology
; Biotechnology & Applied Microbiology
; Computer Science
; Mathematical & Computational Biology
; Mathematics
|
WOS类目 | Biochemical Research Methods
; Biotechnology & Applied Microbiology
; Computer Science, Interdisciplinary Applications
; Mathematical & Computational Biology
; Statistics & Probability
|
WOS记录号 | WOS:000817250400010
|
出版者 | |
ESI学科分类 | BIOLOGY & BIOCHEMISTRY
|
Scopus记录号 | 2-s2.0-85132961504
|
来源库 | Scopus
|
引用统计 |
被引频次[WOS]:3
|
成果类型 | 期刊论文 |
条目标识符 | http://sustech.caswiz.com/handle/2SGJ60CL/352489 |
专题 | 工学院_海洋科学与工程系 |
作者单位 | 1.Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA 2.Department of Ocean Science and Engineering,Southern University of Science and Technology,Shenzhen,518055,China 3.Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA |
推荐引用方式 GB/T 7714 |
Tang,Tianqi,Hou,Shengwei,Fuhrman,Jed A.,et al. Phage-bacterial contig association prediction with a convolutional neural network[J]. BIOINFORMATICS,2022,38(1):i45-i52.
|
APA |
Tang,Tianqi,Hou,Shengwei,Fuhrman,Jed A.,&Sun,Fengzhu.(2022).Phage-bacterial contig association prediction with a convolutional neural network.BIOINFORMATICS,38(1),i45-i52.
|
MLA |
Tang,Tianqi,et al."Phage-bacterial contig association prediction with a convolutional neural network".BIOINFORMATICS 38.1(2022):i45-i52.
|
条目包含的文件 | 条目无相关文件。 |
|
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论