论文部分内容阅读
目的探索运用文献计量和数据挖掘技术分析肿瘤和网页相关研究的特征与趋势。方法在2002-2011年PubMed中下载符合条件的文献,运用自编程序结合Excel和UCINET软件分析文献的年份、国家、类型及主题词分布与关系。结果检索到的1 618篇文献中共有2 668个主题词,文献数量从2002年60篇增加到2011年的377篇。主题词频数按大小排列呈典型双曲线,按MeSH词库排列则呈特有的“五丛”频谱;防治效果、心理行为、一般人群与病种、流行病与统计类主题词呈增加趋势而生理与生物、信息技术、具体病种及诊疗类主题词呈下降趋势。前200个高频词关系网络呈现5个大致的族群。结论网页在肿瘤防治研究中的运用日益广泛,文献计量和数据挖掘技术可为肿瘤和网络相关文献分析提供有益的线索。
Objective To explore the use of document metrology and data mining techniques to analyze the characteristics and trends of related research on cancer and web pages. Methods Publish the eligible documents in PubMed from 2002 to 2011 and analyze the year, country, genre and the distribution of subject words and their relationship using self-compiled program combined with Excel and UCINET software. Results A total of 2 668 keywords were found among the 1 618 articles retrieved. The number of articles increased from 60 in 2002 to 377 in 2011. The frequency of thematic words arranged by size was a typical hyperbolic curve, which was uniquely arranged according to the MeSH lexicon. The results of prevention and treatment, psychological behavior, general population and disease type, epidemiology and statistics showed an increasing tendency Physiological and biological, information technology, specific diseases and diagnosis and treatment of the main keywords showed a downward trend. The first 200 high-frequency words related network presents five general ethnic groups. Conclusion The use of web pages in cancer prevention and treatment is increasingly widespread. Bibliometrics and data mining techniques can provide useful clues for the literature analysis of oncology and the internet.