Specifictome: Towards a Knowledge Atlas for Tissue Specificity of Gene Expressions and Regulations

来源 :第五届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:alicial
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  The identification and analysis of tissue specificity of genes and gene expressions have a direct and profound impacton the further understanding of a wide array of problems of much significance.Despite its significance, our understanding of gene tissue specificity is still quite fragmented and incomplete.We developeda large-scale and extensible knowledge acquisition and representation system-the Specifictome-to acquire, organizeand disseminate tissue specificity knowledge for humanand other model organisms such as mouse, rat, and so onusing novel statistical machine learning and Semantic Web technologies.Background:An important part of understanding the human genome and biology is the study of tissue-specific gene expressionsand regulations, which is a complex phenomenon.It is a result of a large number of interacting factors and can beexhibited in a number of ways, including developmental stages, gene expression levels, a genes structural characteristics (promoters, etc.) and gene regulatory and interaction networks.The inherent complexity makes tissue specificityresearch a very challenging problem.Also severaltissue specific databases such as TiGER,TiProD, TissueInfo,TissueDistributionDBsandTiSGeDprovide web services to extract the TS and HK genes and their genomic features by predefinedthresholdvalues.However, there is no widely acceptedcriterion to judge which gene is expressedin a specific tissue only by the expression patterns.Methods: In this research, we 1) uncover the co-expression pattems of TS genes using constrained Bayesian mixture models, 2) discover and evaluate the sequence significant patterns of TS genes using Bayesian factor analysis, 3) build stable and reliable TS gene regulatory networks, and 4) develop a comprehensive knowledge atlas for the expression and regulations of TS genes using semantic web technologies.Results: We have integrated three types of expression data sets: microarray, EST and SAGE for identifying TS and housekeeping genes and used pubmed literatures to validate them.Also, some significant DNA binding motifs are discovered using our motif discovery pipeline.A prototype of the knowledge platform has been developed.Conclusions: Tissue specificity of gene expressions and regulations is a fundamental event during the cell division and development in biology.The knowledge and algorithms for gene tissue specificity prediction could be used in the evaluation of individual tcchniques and the knowledge organization framework as a whole.The platform of the knowledge atlas can not only be used for acquiring corresponding information about TS genes, but also used as a portal to share knowledge among bioinformatics research community .
其他文献
Background: Apoptosis proteins is a kind of protein with specific functions, play an important role in the growth and homeostasis of organisms.Since the function of apoptosis proteins correlates with
Background: By studying the correlation of histone modifications and the process of transcription, it has been showed that there is very universal correlation between histone modification and gene exp
Background: IL-13 which is produced by a variety of cells, mainly by activated type Ⅱ T helper cells, is a multi-effectiveness of cytokines.It has confirmed that IL-13 is the primary cytokine that ind
Ubiquitylation is one of the most popular post-translational modifications (PTM), which plays important roles in directing the protein degradation.Therefore, identification of ubiquitylation sites is
With the development of the visualization technology, varieties of protein molecular 3D visualization software have been developed and applied for molecular modeling.However, most software can not rea
Background: Inflammation plays an important role in lung cancer development and cancer therapy.To identify potential protein markers for prognosis in non-small cell lung cancer (NSCLC) patients receiv
Background: With the exponentially exploding volume of scientific literatures available, traditional expert curation becomes increasingly ineffective to keep biological knowledge up-to-date, comprehen
Background: Bioinformatics data, such as Genomics and metabolomics data are usually high dimensional and small sample size.This brings difficulty for researchers to explain the data and understand the
Background: The general framework of promoter mechanism is dependent on protein-DNA interaction in which the different transcription factors (TFs) bind on the promoter region to enable its activity.Al
Background: With the advance of experimental technologies, large-scale protein quantification is widely applied to sample analysis in proteomics.Different stable isotope labeling methods have been dev