Grouping of amino acids and recognition of protein structurally conserved regions by reduced alphabe

来源 :中国科学C辑(英文版) | 被引量 : 0次 | 上传用户:sheeperds
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Sequence alignment is a common method for finding protein structurally conserved/similar regions.However, sequence alignment is often not accurate if sequence identities between to-be-aligned sequences are less than 30%. This is because that for these sequences, different residues may play similar structural roles and they are incorrectly aligned during the sequence alignment using substitution matrix consisting of 20 types of residues. Based on the similarity of physicochemical features,residues can be clustered into a few groups. Using such simplified alphabets, the complexity of protein sequences is reduced and at the same time the key information encoded in the sequences remains. As a result, the accuracy of sequence alignment might be improved if the residues are properly clustered.Here, by using a database of aligned protein structures (DAPS), a new clustering method based on the substitution scores is proposed for the grouping of residues, and substitution matrices of residues at different levels of simplification are constructed. The validity of the reduced alphabets is confirmed by relative entropy analysis. The reduced alphabets are applied to recognition of protein structurally conserved/similar regions by sequence alignment. The results indicate that the accuracy or efficiency of sequence alignment can be improved with the optimal reduced alphabet with N around 9.
其他文献
针对北京地区冬小麦种植面积减少、产量下降的现状,从种植结构调整、自然资源限制、土地规模、劳动者素质、推广体系现状等多个方面进行了分析,并从政策制定、行政推动、土地
以Ⅱ优7号为供试水稻品种,通过精确计量稻田进出水量,研究了四川典型丘陵地区稻田耗水量。结果表明,丘陵地区降水利用率为71.59%,但降水与稻田需水不完全同步;全生育期总灌水
综述了心理健康概念和护理本科生的心理健康现状,以及目前对护生心理健康干预的研究进展,分析了影响护理本科生心理健康的因素,指出了素质教育体系下推进护理本科生心理健康教育
Obiective:This study aimed to explore the effects of different types of palatal lateral excisions on the growth and deveiopment of the maxilla and dental arch.a
以玉米骨干自交系黄早四与掖478分别作为轮回亲本构建的双向BC3回交群体为试验材料,系统分析在不同遗传背景下的玉米子粒淀粉、蛋白质、油分以及赖氨酸含量的变化及其相关性
应用主成分和隶属函数分析对国内20种大豆基因型的耐低磷能力进行评价,结果表明:11个差异显著的耐低磷评价指标通过主成分分析归纳为地上部分生物量因子、磷因子、根系因子3
目的:通过评估2种常规检测系统高密度脂蛋白-C (HDL-C)测定结果的正确性,为临床检测提供指导。方法宁波市医疗中心李惠利医院检验科联合宁波美康生物科技股份有限公司参考实验室
湘丰优186是湖南隆平超级杂交稻工程研究中心有限责任公司选育的三系杂交晚稻新组合。该组合2010和2011年连续2 a在醴陵市示范种植,2 a平均产量9.90 t/hm2,表现出生育期适中
Trichophyton rubrum is a dominating superficial dermatophyte, whose conidial germination is correlated to pathopoiesis and a highly important developmental proc
随着转基因技术的快速发展及其在农业生产领域的广泛应用,消费者对转基因技术的争论也日趋激烈.消费者的态度是影响农业转基因技术发展的关键因素.近几年国内外学术界对消费