论文部分内容阅读
目的:采用主客观评价系统分别评价深度学习方法在动静脉期CT上的分割效果,并探索影响动静脉期胰腺分割差异的因素及影响静脉期胰腺分割的相关因素。方法:回顾性收集2019年1至11月北京协和医院放射科218例胰腺CT扫描数据,每例均包含动脉期和静脉期图像,并按照训练集+验证集与测试集为7∶3的比例将数据随机划分为训练集(139例)、验证集(20例)及测试集(59例),使用训练集训练二阶段全局局部渐进融合网络,在验证集上寻找最优分割效果的模型参数,对测试集进行预测并对结果进行主观及客观评价。主观评价基于胰腺与周围器官的临界区域,采用李克特5分量表;客观评价采用Dice相似系数(DSC)。采用配对n t检验或Wilcoxon配对秩检验比较动静脉期主客观评分的差异。n 结果:在十二指肠、十二指肠空肠曲、左肾上腺、门脉、肠系膜上静脉、脾动脉及脾静脉处胰腺临界区域动脉期主观评分[n M(n Q1, n Q3)]分别为4(4, 5)、5(4, 5)、5(4, 5)、4(4, 5)、5(4, 5)、5(5, 5)及4(3, 5)分,静脉期主观评分[n M(n Q1, n Q3)]分别为4(4, 4)、5(4, 5)、5(4, 5)、5(4, 5)、5(5, 5)、4(3, 4)、5(5, 5)分,以上临界区域的胰腺动、静脉期主观评分差异均有统计学意义(均n P<0.05);静脉期DSC略高于动脉期,差异无统计学意义(DSC:0.923比0.921,n P=0.952)。胰腺与十二指肠空肠曲、胃、左肾上腺存在脂肪间隙组在静脉期主观评分分别为4.64、4.68及4.63分,无脂肪间隙组的主观评分分别为4.56、4.62及4.56分,胰腺与十二指肠空肠曲、胃、左肾上腺有、无脂肪间隙两组间的主观评分差异均有统计学意义(n t=2.147、2.112、2.277,均n P<0.05)。除外脾,胰腺临界区域与其余周围器官的密度差在动静脉分割的差异均有统计学意义(均n P<0.05)。n 结论:利用双期CT构建深度学习胰腺自动分割模型,并对分割效果进行主客观评价,主观评价可以提高今后胰腺临界区域的分割能力。“,”Objective:To investigate the segmentation effects of the deep learning method on CT in the arterial phase and venous phase respectively by using subjective and objective evaluation system, and to investigate the factors that affect the difference between arterial phase and venous phase pancreas segmentation and the related factors affecting the venous pancreas segmentation.Method:A total of 218 cases of pancreatic CT scan data in the Department of Radiology of Peking Union Medical College Hospital from January to November 2019 were retrospectively collected. Each case contained images of arterial and venous phases, and the data were randomly divided into training set (139 cases), validation set (20 cases) and test set (59 cases) according to the ratio of the training and verification set to the test set of 7∶3. The two-stage global local progressive fusion network was trained on the training set, the model parameters of the optimal segmentation effect were found on the validation set, and the test set was predicted and the results were evaluated subjectively and objectively. The Likert 5-point scale was used for subjective evaluation based on the critical regions between pancreas and peripheral organs, while the Dice similarity coefficient (DSC) was used for objective evaluation. The paired n t test or Wilcoxon paired rank test was used to compare the differences of subjective and objective scores of the arterial phase and venous phase.n Results:For the critical regions of the pancreas at the duodenum, duodenal jejunal flexure, left adrenal gland, portal vein, superior mesenteric vein, splenic artery and splenic vein, the median number of subjective scores in arterial phase were 4(4, 5), 5(4, 5), 5(4, 5), 4(4, 5), 5(4, 5), 5(5, 5) and 4(3, 5)points respectively, the median number(first quartile, third quartile) of subjective scores in venous phase were 4(4, 4), 5(4, 5), 5(4, 5), 5(4, 5), 5(5, 5), 4(3, 4) and 5(5, 5) points respectively,there were statistically significant differences of the median number(first quartile, third quartile) of the subjective scores between the arterial and venous phase for the critical regions of the pancreas at the organs described above (all n P<0.05). DSC in the venous phase was slightly higher than that in the arterial phase and the difference was not statistically significant (DSC: 0.932 vs 0.921,n P=0.952). Subjective scores in venous phase of the pancreas and duodenal jejunum, stomach, and left adrenal gland with fat gaps were 4.64,4.68 and 4.63 points respectively, and those of the group without fat gaps were 4.56,4.62 and 4.56 points respectively, there were statistically significant differences of the subjective scores in venous phase of the groups with fat gaps or not between the pancreas and the organs described above (n t=2.147, 2.112, 2.277, all n P<0.05). Except the spleen, the density differences between the critical regions of the pancreas and other surrounding organs were statistically significant in arterial phase and venous phase segmentation (alln P<0.05).n Conclusion:Dual-phase CT was used to construct a deep learning automatic pancreas segmentation model, and the segmentation effect was evaluated subjectively and objectively. Subjective evaluation was helpful to improve the ability to segment the critical regions of the pancreas in the future.