Automatic Semantic Analysis of Software Requirements Through Machine Learning and Ontology Approach

Source: Journal of Shanghai Jiaotong University (Science)
Software requirements are still analyzed mainly by hand, which has many drawbacks: it is labor-intensive, inefficient, and can produce inaccurate results. The problems are worse in domain analysis scenarios, where a large number of requirements from many users must be analyzed. Automatic analysis of software requirements can therefore benefit software companies. For this purpose, we propose an approach that automatically analyzes software requirement specifications (SRSs) and extracts their semantic information. The approach uses a semantic role labeling (SRL) method based on machine learning and ontology. First, common verbs were identified from SRS documents in the E-commerce domain, and semantic frames were designed for those verbs. Based on the frames, sentences from the SRSs were selected and labeled manually, and the labeled sentences were used as training examples in the machine learning stage. In addition to the training examples labeled with semantic roles, external ontology knowledge was used to relieve the data sparsity problem and obtain reliable results. Using the SemCor corpus and WordNet, the senses of nouns and verbs were identified sequentially with the K-nearest neighbor approach. The senses of the verbs were then used to identify the frame types. After that, we trained the SRL classifier with the maximum entropy method, adding new features based on word sense, such as the hypernyms and hyponyms of the word senses in the ontology. Experimental results show that this new approach to automatic functional requirements analysis is effective.
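Two steps of the pipeline described above, sense identification by K-nearest neighbor and word-sense features such as hypernyms for the role classifier, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the sense labels, the hypernym table, the labeled contexts, and the function names are hypothetical toy stand-ins for WordNet and SemCor, Jaccard overlap is an assumed similarity measure, and the maximum entropy classifier that would consume the features is omitted.

```python
from collections import Counter

# Hypothetical toy ontology: sense label -> hypernym labels.
# A real system would look these up in WordNet; the labels here are invented.
TOY_HYPERNYMS = {
    "customer#buyer": ["consumer", "person"],
    "admin#manager": ["manager", "person"],
    "order#purchase": ["commercial_document", "document"],
}

# Hypothetical sense-labeled contexts (a stand-in for a corpus such as SemCor).
LABELED = [
    (["customer", "places", "order", "cart"], "order#purchase"),
    (["order", "shipped", "payment"], "order#purchase"),
    (["items", "checkout", "order", "paid"], "order#purchase"),
    (["sorted", "order", "ascending", "list"], "order#sequence"),
    (["alphabetical", "order", "results"], "order#sequence"),
]


def jaccard(a, b):
    """Jaccard overlap between two bags of context words (assumed similarity)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def knn_sense(context, labeled, k=3):
    """Pick the majority sense among the k labeled contexts most similar to `context`."""
    ranked = sorted(labeled, key=lambda ex: jaccard(context, ex[0]), reverse=True)
    votes = Counter(sense for _, sense in ranked[:k])
    return votes.most_common(1)[0][0]


def srl_features(word, sense, predicate_sense):
    """Build a feature set for one argument word, including hypernym features.

    Hypernym features let rare words share evidence with related words,
    which is one way ontology knowledge can ease data sparsity.
    """
    feats = {f"word={word}", f"sense={sense}", f"pred_sense={predicate_sense}"}
    feats.update(f"hyper={h}" for h in TOY_HYPERNYMS.get(sense, []))
    return feats


if __name__ == "__main__":
    sense = knn_sense(["customer", "submits", "order", "payment"], LABELED)
    print(sense)
    print(sorted(srl_features("order", sense, "submit#send")))
```

In this sketch, two different words such as "customer" and "administrator" end up sharing a `hyper=person` feature, so a classifier trained on one can generalize to the other. The resulting feature sets would then be passed to a maximum entropy (multinomial logistic regression) learner.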