Physical Examination Data Based Cataract Risk Analysis

Source: Journal of Systems Science and Systems Engineering (English edition)
Cataract is a very common eye disease and the most significant cause of blindness. Given its burden on society, this study focuses on testing risk factors for cataract and building robust machine learning models that use these factors to predict cataract risk. The data were collected by a physical examination center in Shanghai, China, and cover more than 120,000 examinees and about 500 physical examination metrics. First, association rules were used to select 39 abnormalities that are more likely to be associated with cataract risk, and the significance of these abnormalities was tested with univariate and multivariate analysis. The test results indicate that age, diabetes, refractive error, retinal arteriosclerosis, thyroid nodules, and incomplete mammary gland degeneration significantly increase the likelihood of cataract. Various machine learning models were then compared on their performance in predicting cataract risk from these six factors; the logistic regression model and the decision-tree-based ensemble methods outperform the others, and their test-set AUC can reach 0.84.
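The two quantitative ideas in the abstract, screening an abnormality with association-rule metrics and scoring a risk model by AUC, can be illustrated with a minimal sketch. This is not the authors' code: the function names `rule_metrics` and `auc`, the eight toy records, and the risk scores are all hypothetical and chosen only for illustration. The sketch computes support, confidence, and lift for a rule of the form "abnormality → cataract", and computes AUC as the fraction of (positive, negative) pairs that the scores rank correctly.

```python
from typing import Dict, List, Sequence


def rule_metrics(records: List[Dict[str, bool]],
                 antecedent: str, consequent: str) -> Dict[str, float]:
    """Support, confidence, and lift for the rule antecedent -> consequent."""
    n = len(records)
    n_a = sum(1 for r in records if r[antecedent])
    n_c = sum(1 for r in records if r[consequent])
    n_ac = sum(1 for r in records if r[antecedent] and r[consequent])
    support = n_ac / n                           # P(A and C)
    confidence = n_ac / n_a if n_a else 0.0      # P(C | A)
    base_rate = n_c / n                          # P(C)
    lift = confidence / base_rate if base_rate else 0.0
    return {"support": support, "confidence": confidence, "lift": lift}


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of positive/negative pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, NOT taken from the study.
records = [
    {"diabetes": True,  "cataract": True},
    {"diabetes": True,  "cataract": True},
    {"diabetes": True,  "cataract": False},
    {"diabetes": False, "cataract": True},
    {"diabetes": False, "cataract": False},
    {"diabetes": False, "cataract": False},
    {"diabetes": False, "cataract": False},
    {"diabetes": False, "cataract": False},
]
metrics = rule_metrics(records, "diabetes", "cataract")

# Hypothetical model scores for the same eight examinees.
labels = [int(r["cataract"]) for r in records]
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.2, 0.1]
toy_auc = auc(labels, scores)

print(metrics)
print(round(toy_auc, 3))
```

A lift above 1 means the abnormality co-occurs with cataract more often than the overall cataract rate would predict, which is the kind of signal an association-rule screen looks for before formal significance testing. The pairwise AUC shown here is quadratic in the number of examinees, so on a data set of the size described above a rank-based routine from a statistics library would be the more practical choice.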