中文版 | English
题名

Rapid multiple protein sequence search by parallel and heterogeneous computation

作者
通讯作者Fan, Rui; Wang, Zefeng
发表日期
2024-03-29
DOI
发表期刊
ISSN
1367-4803
EISSN
1367-4811
卷号40期号:4
摘要
Motivation Protein sequence database search and multiple sequence alignment generation is a fundamental task in many bioinformatics analyses. As the data volume of sequences continues to grow rapidly, there is an increasing need for efficient and scalable multiple sequence query algorithms for super-large databases without expensive time and computational costs.Results We introduce Chorus, a novel protein sequence query system that leverages parallel model and heterogeneous computation architecture to enable users to query thousands of protein sequences concurrently against large protein databases on a desktop workstation. Chorus achieves over 100x speedup over BLASTP without sacrificing sensitivity. We demonstrate the utility of Chorus through a case study of analyzing a similar to 1.5-TB large-scale metagenomic datasets for novel CRISPR-Cas protein discovery within 30 min.Availability and implementation Chorus is open-source and its code repository is available at https://github.com/Bio-Acc/Chorus.
相关链接[来源记录]
收录类别
语种
英语
学校署名
通讯
资助项目
Strategic Priority Research Program of Chinese Academy of Sciences (CAS)["XDC01000000","XDB38040100"] ; National Natural Science Foundation of China (NSFC)["31730110","91940303","31661143031"] ; Science and Technology Commission of Shanghai Municipality[18XD1404400] ; National Key Research and Development Program of China[2018YFA0107602] ; ShanghaiTech University[2017F0203-000-05] ; GHfund A["ghfund202202013878","ghfund202302012917"]
WOS研究方向
Biochemistry & Molecular Biology ; Biotechnology & Applied Microbiology ; Computer Science ; Mathematical & Computational Biology ; Mathematics
WOS类目
Biochemical Research Methods ; Biotechnology & Applied Microbiology ; Computer Science, Interdisciplinary Applications ; Mathematical & Computational Biology ; Statistics & Probability
WOS记录号
WOS:001203339000002
出版者
ESI学科分类
BIOLOGY & BIOCHEMISTRY
来源库
Web of Science
引用统计
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/788635
专题生命科学学院
作者单位
1.Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Computat Biol, 320 Yueyang Rd, Shanghai 200031, Peoples R China
2.ShanghaiTech Univ, Sch Informat Sci & Technol, 393 Middle Huaxia Rd, Shanghai 201210, Peoples R China
3.Chinese Acad Sci, Inst Intelligent Comp Technol, 88 Jinjihu Ave, Suzhou 215000, Jiangsu, Peoples R China
4.Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr, 320 Yueyang Rd, Shanghai 200031, Peoples R China
5.Southern Univ Sci & Technol, Sch Life Sci, 1088 Xueyuan Ave, Shenzhen 518055, Guangdong, Peoples R China
通讯作者单位生命科学学院
推荐引用方式
GB/T 7714
Li, Jiefu,Wang, Ziyuan,Fan, Xuwei,et al. Rapid multiple protein sequence search by parallel and heterogeneous computation[J]. BIOINFORMATICS,2024,40(4).
APA
Li, Jiefu.,Wang, Ziyuan.,Fan, Xuwei.,Yao, Ruijie.,Zhang, Guoqing.,...&Wang, Zefeng.(2024).Rapid multiple protein sequence search by parallel and heterogeneous computation.BIOINFORMATICS,40(4).
MLA
Li, Jiefu,et al."Rapid multiple protein sequence search by parallel and heterogeneous computation".BIOINFORMATICS 40.4(2024).
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Li, Jiefu]的文章
[Wang, Ziyuan]的文章
[Fan, Xuwei]的文章
百度学术
百度学术中相似的文章
[Li, Jiefu]的文章
[Wang, Ziyuan]的文章
[Fan, Xuwei]的文章
必应学术
必应学术中相似的文章
[Li, Jiefu]的文章
[Wang, Ziyuan]的文章
[Fan, Xuwei]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。