Parallel divide and conquer bio-sequence comparison based on Smith-Waterman algorithm

Source: Science in China Series F (English edition)
Tools for pairwise bio-sequence alignment have long played a central role in computational biology, and several alignment algorithms have been developed. The Smith-Waterman algorithm, based on dynamic programming, is considered the most fundamental alignment algorithm in bioinformatics. However, the existing parallel Smith-Waterman algorithm requires a large amount of memory, which limits the size of the sequences it can handle. As biological sequence data expand rapidly, this memory requirement has become a critical problem. To solve it, we develop a new parallel bio-sequence alignment algorithm based on the divide-and-conquer strategy, named the PSW-DC algorithm. The algorithm works in three steps. First, we partition the query sequence into several subsequences and distribute them among the processors. Second, each processor compares its subsequence with the whole subject sequence using the Smith-Waterman algorithm, in parallel with the others, and produces an interim result. Finally, we obtain the optimal alignment between the query sequence and the subject sequence through a special combination and extension method. We develop this key technique, named the C&E method, to manipulate the interim results and obtain the final sequence alignment. The memory space required by our algorithm is significantly reduced in comparison with existing ones. We implement the new parallel bio-sequence alignment algorithm, PSW-DC, on a cluster parallel system.
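The first two steps described above (partition the query, then run Smith-Waterman on each piece against the whole subject sequence) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The scoring values (`match=2`, `mismatch=-1`, `gap=-2`), the linear gap penalty, and the function names `sw_score`, `split_query`, and `interim_scores` are all assumptions made for illustration. A thread pool stands in for the cluster processors, and the C&E step is omitted because the abstract does not specify it.

```python
from concurrent.futures import ThreadPoolExecutor


def sw_score(query, subject, match=2, mismatch=-1, gap=-2):
    """Best Smith-Waterman local alignment score (linear gap penalty).

    Only two rows of the DP matrix are kept, so memory grows with
    len(subject) rather than len(query) * len(subject).
    Scoring values are illustrative assumptions, not from the paper.
    """
    prev = [0] * (len(subject) + 1)
    best = 0
    for q in query:
        curr = [0]  # column 0 is always 0 in local alignment
        for j, s in enumerate(subject, start=1):
            diag = prev[j - 1] + (match if q == s else mismatch)
            cell = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(cell)
            if cell > best:
                best = cell
        prev = curr
    return best


def split_query(query, parts):
    """Cut the query into at most `parts` contiguous pieces of near-equal size."""
    size = max(1, -(-len(query) // parts))  # ceiling division
    return [query[i:i + size] for i in range(0, len(query), size)]


def interim_scores(query, subject, parts):
    """Score each query piece against the whole subject sequence.

    Threads are a stand-in for the cluster processors (assumption);
    combining these interim results is the job of the C&E method,
    which is not reproduced here.
    """
    chunks = split_query(query, parts)
    with ThreadPoolExecutor(max_workers=parts) as pool:
        return list(pool.map(lambda c: sw_score(c, subject), chunks))


if __name__ == "__main__":
    print(interim_scores("ACGTACGT", "ACGT", parts=2))
```

Each worker in this sketch holds only two rows of length `len(subject) + 1`, which is one simple way to keep per-processor memory small. Whether PSW-DC achieves its memory reduction the same way is not stated in the abstract.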