Introduction to Validity

来源 :校园英语·下旬 | 被引量 : 0次 | 上传用户:Tiger7
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  【Abstract】The designing of a test paper strictly follows the specification of the relevant Testing Syllabus, which covers various skills and abilities needed for real communicative situation. A test with high content validity is likely to have a positive impact upon the teaching. Therefore, a test with a high content validity will undoubtedly promote the college English education and it will certainly facilitate the implementation of the Syllabus.
  【Key words】test; validity; content validity
  The important quality of test interpretation or use is validity, or the extent to which the inferences or decisions we make on the basis of test scores are meaningful, appropriate and useful. High validity is the guarantee of a good test.
  I. Concept of validity
  The validity of a test is the extent to which it measures what it is supposed to measure and nothing else. Every test, whether it be a short, informal classroom test or a public examination, should be as valid as the constructor can make it. The test must aim to provide a true measure of the particular skill which it is intended to measure: to the extent that it measures external knowledge and other skills at the same time, it will not be a valid test (J. B. Heaton 2000: 159). For example, the following test item is invalid if we wish solely to measure writing ability: “Is photography an art or a science? Discuss.” It is likely to be invalid simply because it demands some knowledge of photography and will consequently favor certain students.
  II. Classification of validity
  Actually validity is a complex concept,different scholars have different versions of classifications. One expert in the field of testing in our country, Li Xiaoju (1997) classifies it into four categories. Her classification is illustrated in the following table.
  Although validity has traditionally been discussed in terms of different types, as pointed above, psychomericians have increasingly come to view it as a single, unitary concept.
  III. Content validity
  Content validity, in the conventional measurement sense, is the extent to which the content of the test constitutes a representative sample of the domain to be tested (Bachman 1990: 306). A test is said to be valid if its content constitutes a representative sample of the language skills, structures, etc. which it is meant to be concerned. The examination of content validity consists of the study of content relevance and content coverage. The investigation of content relevance requires “the specification of the behavioral domain in question and the attendant specification of the task or test domain’ (Messick 1980: 1017, cited from Bachman 1990: 244). The second aspect of examining test content is that of content coverage, or the extent to which the tasks required in the test adequately represent the behavioral domain in question. From the perspective of the test developer, if we had a well-defined domain that specified the entire set, or population, of possible test tasks, we could follow a standard procedure for random sampling to insure that the tasks required by the test were representative of that domain (Bachman 1990: 245).   In practice, content validity study needs a specification of the skills or structures, etc. that it is meant to cover. Such a specification should be made at a very early stage in test construction. A comparison of test specification and test content is the basis for judgments as to content validity. Ideally these judgments should be made by people who are familiar with language learning and testing but who are not directly concerned with the production of the test in question.
  The content validity is of great importance:
  Firstly, the greater a test’s content validity, the more likely it is to be an accurate measure of what it is supposed to measure. A test in which major areas identified in the specification are under-represented- or not represented at all – is unlikely to be accurate. Secondly, a test with no content validity is likely to have a harmful backwash effect. Areas which are not tested are likely to become areas ignored in teaching and learning. The best safeguard against this is to write full test specifications and to ensure that the test content is a fair reflection of these. (Arthur Hughes 2002: 23).
  Reference:
  [1]Bachman,L.F.(1990).Fundamental Considerations in Language Testing.Oxford:Oxford University Press.
  [2]Heaton,J.B.(2000).Writing English Language Tests.Foreign Langauge Teaching and Reserach Press.
  [3]李筱菊.语言测试科学与艺术[M].湖南:湖南教育出版社.1997.
其他文献
要使信念和其他命题态度语境的量化有意义,存在两个大的困难.一个是所指称的对象可能不存在;另一个是共指称词项的替换性原则可能会在这样的语境下被破坏.我将在本文中论证,
期刊
摘要:外资银行在中国的市场份额越做越大,不仅仅与政策的开放度有关,还归功于其独特的经营行为,文章主要从外资银行目标市场的定位、产品、服务策略、战略联盟这三个角度揭示其在中国的经营行为的深层原因,同时对外资银行未来动向作了预测。  关键词:外资银行;参股经营;经营策略    自WTO大门一开,蓄势已久的外资银行迅速入驻中国,一时间攻城掠地,硝烟弥漫。根据承诺,金融业是逐步开放的,诸多的政策壁垒束缚了
【摘要】大学英语教学,特别是独立学院的大学英语教学中,读和写是非常重要的两个部分。近年来,大英教师更加关注在英语课堂上如何有效地把阅读和写作加以融合,真正做到以读促写,读写结合。本文分析了读写结合在大学英语教学中的作用以及重要性,旨在探索读写结合的新的大学英语教学模式。  【关键词】读写结合 以读促写 大学英语教学  大学英语教学是高校特别是独立学院教学的重要组成部分。对于独立学院的学生来说,他们
在江苏省金坛市长荡湖畔的儒林镇,座落着一个美丽的花园式工厂———金坛市土木工程仪器厂。提起这个厂就不能不说说它的主人杨志余。杨志余从小家境贫寒,艰苦的生活环境磨砺了
摘要:中国经济的高速增长和整体经济实力的扩大,人民币的国际声望大大提高,周边国家和地区居民逐渐认可和接受人民币作为交易货币和国际清算手段,导致大量人民币在境外流通。而且,随着中国和周边国家和地区国际贸易和经济往来的扩大和加深,中国加入WTO后的资本账户放松管制,人民币的境外流通规模将扩大。文章对人民币在周边国家和地区的流通现状及其原因进行了系统的分析介绍。  关键词:金融;外汇;人民币;流通   
2012年,胜利油田“四化”工作开始标准化采购,并于2014年在全油田推开.实施标准化采购以来,油田物资采购模式发生积极转变,不仅质量管控更有效,采购流程更顺畅,而且采购资金
期刊
An empirical study on current situations of college students foreign language anxiety is conced with in this paper. The author adopted Horwitzs Foreign Language
7月9日至10日,十三届全国人大常委会举行第四次会议,审议了大气污染防治法执法检查报告并展开专题询问,同时审议通过了全国人大常委会关于全面加强生态环境保护依法推动打好污染防治攻坚战的决议。此次会议不仅集中聚焦于环保议题,而且打破了每隔两月开会的惯例,距离上一次常委会会议尚不足一个月。这在人大议政史上,极为罕见。  开会,是人大议事行权的基本方式,其时间节点、频率多寡、会期长短等等,决定了国家权力机
期刊
针对老油田管理存在的突出问题,从2014年底开始,采油二厂开展了以六大工程为主要内容的综合治理工程.经过3年的全面实施,取得了预期效果,推动了采油二厂可持续发展.rn1.推进
期刊
现阶段,农村土地整治被纳入城乡统筹发展的大格局中,成为改善农村公共基础设施,提高农民生活水平,缩小城乡发展差距,加快城乡发展—体化的重要抓手.rn农村土地综合整治的新要