Statistical power of three model-free metrics for detecting gene-gene interactions

来源 :第五届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:w34gss
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Background: it is increasingly recognized that gene-gene interaction is an important genetic component for complex diseases.Although several metrics for detecting gene-gene interactions have been developed, there is dearth of knowledge regarding their statistical power under different scenarios of interactions.This study thus aimed at systematic evaluation of the merits of three data mining metrics for detecting gene-gene interactions under various genetic models.Methods: In this article, we compared three different metrics of non-linear dependence between two genetic loci for detecting gene-gene interactions.The three metrics were two-loci Fst, mutual information and linkage disequilibrium, respectively.We generated case-control data by using simulation software genomeSIMLA based on 17 genetic models (including both epistatic and multiplicative interaction models).For each model, two different sets of disease allele frequencies and five different interaction effects were explored.The statistical power was computed from 500 simulated datasets.Results: In eight epistatic models, except for the model of recessive versus recessive, two-loci Fst and mutual information had higher statistical power than the linkagedise-quilibrium based metric.In nine multiplicative models, two-loci Fst and mutual information also had higher statistical power.Conclusions: The evolution-based metric, Fst, and information-based metric, mutual information, are more powerful and robust statistic for detecting both epistatic and multiplicative gene-gene interactions .
其他文献
Background: IL-13 which is produced by a variety of cells, mainly by activated type Ⅱ T helper cells, is a multi-effectiveness of cytokines.It has confirmed that IL-13 is the primary cytokine that ind
Ubiquitylation is one of the most popular post-translational modifications (PTM), which plays important roles in directing the protein degradation.Therefore, identification of ubiquitylation sites is
With the development of the visualization technology, varieties of protein molecular 3D visualization software have been developed and applied for molecular modeling.However, most software can not rea
Background: Inflammation plays an important role in lung cancer development and cancer therapy.To identify potential protein markers for prognosis in non-small cell lung cancer (NSCLC) patients receiv
Background: With the exponentially exploding volume of scientific literatures available, traditional expert curation becomes increasingly ineffective to keep biological knowledge up-to-date, comprehen
Background: Bioinformatics data, such as Genomics and metabolomics data are usually high dimensional and small sample size.This brings difficulty for researchers to explain the data and understand the
Background: The general framework of promoter mechanism is dependent on protein-DNA interaction in which the different transcription factors (TFs) bind on the promoter region to enable its activity.Al
Background: With the advance of experimental technologies, large-scale protein quantification is widely applied to sample analysis in proteomics.Different stable isotope labeling methods have been dev
The identification and analysis of tissue specificity of genes and gene expressions have a direct and profound impacton the further understanding of a wide array of problems of much significance.Despi
Background: With the avalanche of protein sequences generated in the post-genome era, it is very important to develop computational methods for fast annotating the subnuclear locations for numerous no