Performance comparison between Logistic regression, decision trees, and multilayer perceptron in pre

来源 :中华医学杂志(英文版) | 被引量 : 0次 | 上传用户:judge119
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Background Various methods can be applied to build predictive models for the clinical data with binary outcome variable.This research aims to explore the process of constructing common predictive models,Logistic regression (LR),decision tree (DT) and multilayer perceptron (MLP),as well as focus on specific details when applying the methods mentioned above:what preconditions should be satisfied,how to set parameters of the model,how to screen variables and build accuracy models quickly and efficiently,and how to assess the generalization ability (that is,prediction performance) reliably by Monte Carlo method in the case of small sample size.Methods All the 274 patients (include 137 type 2 diabetes mellitus with diabetic peripheral neuropathy and 137 type 2 diabetes mellitus without diabetic peripheral neuropathy) from the Metabolic Disease Hospital in Tianjin participated in the study.There were 30 variables such as sex,age,glycosylated hemoglobin,etc.On account of small sample size,the classification and regression tree (CART) with the chi-squared automatic interaction detector tree (CHAID) were combined by means of the 100 times 5-7 fold stratified cross-validation to build DT.The MLP was constructed by Schwarz Bayes Criterion to choose the number of hidden layers and hidden layer units,alone with levenberg-marquardt (L-M) optimization algorithm,weight decay and preliminary training method.Subsequently,LR was applied by the best subset method with the Akaike Information Criterion (AIC) to make the best used of information and avoid overfitting.Eventually,a 10 to 100 times 3-10 fold stratified cross-validation method was used to compare the generalization ability of DT,MLP and LR in view of the areas under the receiver operating characteristic (ROC) curves (AUC).Results The AUC of DT,MLP and LR were 0.8863,0.8536 and 0.8802,respectively.As the larger the AUC of a specific prediction model is,the higher diagnostic ability presents,MLP performed optimally,and then followed by LR and DT in terms of 10-100 times 2-10 fold stratified cross-validation in our study.Neural network model is a preferred option for the data.However,the best subset of multiple LR would be a better choice in view of efficiency and accuracy.Conclusion When dealing with data from small size sample,multiple independent variables and a dichotomous outcome variable,more strategies and statistical techniques (such as AIC criteria,L-M optimization algorithm,the best subset,etc.) should be considered to build a forecast model and some available methods (such as cross-validation,AUC,etc.) could be used for evaluation.
其他文献
The precessing vortex core (PVC) in a cyclone separator plays an important role in the separation performance and in further understanding of the general law of
In this paper, selective oxidation of n-butane to maleic anhydride (MA) and partial oxidation of methane to synthesis gas with lattice oxygen instead of molecul
The interface structure, work of adhesion, and bonding character of the polar TiC/Ti interface have been examined by the first-principles density functional pla
The structure and microstructure of constituent phases in annealed IQC100-x DQCx alloys,made from mixtures of Al62 Cu25.5 Fe12.5 icosahedral quasicrystal(IQC)an
Toluene insoluble matter (TIM) in coker heavy gas oil (CHGO) from oil sands bitumen is harmful to the downstream hydrotreating, and it may be difficult to be re
The nano-TiO2/unsaturated polyester resin (referred to as nano-TiO2/UPR hereafter) was prepared with the "reaction method", by which a chemical bond generated b
Typical cationic and anionic surfactants were chosen and their interactions were calculated by quantum ular pairs with fluocarbon and hydrocarbon chain: C4H10/C
2024 aluminum alloy was implanted with nitrogen then titanium at different titanium target sputtering currents by plasma-based ion implantation(PBII).The appear
Y2O3:Eu nanotubes were synthesized by a surfactant assembly mechanism. Under ultraviolet-light excitation, the nanotubes present luminescence properties differe
This paper studies the influence of feed injection on the hydrodynamic behavior of fluid catalytic cracking riser reactors. Experiments were conducted in a cold