Visual commonsense reasoning with directional visual connections

来源 :信息与电子工程前沿（英文版） | 被引量 : 0次 | 上传用户：lxz119110

【摘要】

：

To boost research into cognition-level visual understanding, i.e., making an accurate inference based on a thorough understanding of visual details, visual comm

【作者】

：

Yahong HAN Aming WU Linchao ZHU Yi YANG

【机构】

：

College of Intelligence and Computing,Tianjin University,Tianjin 300350,China;Tianjin Key Lab of Mac

【出处】

：

信息与电子工程前沿（英文版）

【发表日期】

：

2021年5期

【关键词】

：

Visual commonsense reasoning Directional connective network Visual neuron connec

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

To boost research into cognition-level visual understanding, i.e., making an accurate inference based on a thorough understanding of visual details, visual commonsense reasoning (VCR) has been proposed. Compared with traditional visual question answering which requires models to select correct answers, VCR requires models to select not only the correct answers, but also the correct rationales. Recent research into human cognition has indicated that brain function or cognition can be considered as a global and dynamic integration of local neuron connectivity, which is helpful in solving specific cognition tasks. Inspired by this idea, we propose a directional connective network to achieve VCR by dynamically reorganizing the visual neuron connectivity that is contextualized using the meaning of questions and answers and leveraging the directional information to enhance the reasoning ability. Specifically, we first develop a GraphVLAD module to capture visual neuron connectivity to fully model visual content correlations. Then, a contextualization process is proposed to fuse sentence representations with visual neuron representations. Finally, based on the output of contextualized connectivity, we propose directional connectivity to infer answers and rationales, which includes a ReasonVLAD module. Experimental results on the VCR dataset and visualization analysis demonstrate the effectiveness of our method.

其他文献

护生主导式查房在肾病风湿科护理教学中的应用价值分析

目的:分析护生主导式查房在肾病风湿科护理教学中的应用价值.方法:于2019年4月-2020年4月间在我院肾病风湿科进行实习的实习生中选取68例进行研究,并采用数字随机表法的方式

期刊

护生主导式查房肾病风湿科护理教学应用价值

NT-proBNP、cTnI、CK-MB联合检测对急性心肌梗死的诊断价值

目的:探讨NT-proBNP、cTnI、CK-MB三者联合检测对急性心肌梗死(AMI)患者的早期诊断价值.方法:随机选取2019年1月-2020年8月在宿州市第一人民医院心内科住院的急性心肌梗死患

期刊

NT-proBNPcTnICK-MB急性心肌梗死

每日目标化临床护理路径和健康教育在冠心病介入患者护理中的应用价值分析

目的:探讨在冠心病介入患者护理中实施每日目标化临床护理路径以及健康教育的具体效果.方法:选取我院2019年1月至～2020年12月的80例冠心病介入患者进行研究,以随机为基本原则

期刊

每日目标化临床护理路径健康教育冠心病介入治疗应用价值

后处理厂乏燃料储运吊篮设计与分析

在后处理厂乏燃料贮存水池,采用吊篮储存和转运乏燃料组件可以提升转运能力,同时具有较好的安全性和经济性.本文针对国内商业核电站服役使用的典型乏燃料组件结构参数,设计了

期刊

后处理乏燃料吊篮燃料组件

Visual knowledge: an attempt to explore machine creativity

1 Introduction—starting at noetic sciencernOne question that has long puzzled the artificial intelligence (AI) community is: Can AI be creative? Or, can the re

期刊

景观疗养在保健疗养中的应用价值

目的:探讨景观疗养在保健疗养中的应用价值.方法:以2020年4月-2021年3月在我疗养中心疗养的80例疗养员为研究对象,均分成甲组和乙组,分别行常规疗养和景观疗养,对比效果.结果

期刊

景观疗养保健疗养中心应用价值

铝锂合金纳米析出相结构与性能综述

纳米析出相种类、大小、形状、分布以及析出序列的调控是理解和设计第3、4代铝锂合金的基础。总结了铝锂合金中典型的Cu、Mg、Ag、Si合金元素作用下所产生的纳米析出相。重点

期刊

铝锂合金纳米沉淀相形核第一性原理计算高角环形暗场扫描透射电子显微镜像

老年护理实践教学改革对护生学习积极主动性的影响

目的:探讨护理实践教学改革应用于护生带教中的效果,分析其可行性.方法:将2019年3月至2020年3月作为研究时段,将该时段我院人力资源样本库内收入的40名,进入我院接受实习的护

期刊

护理实践教学改革我院护士带教分析

可视咽镜在气管内插管麻醉见习教学中的应用效果评价

目的:判断气管插管麻醉见习教学中可视咽镜的价值.方法:麻醉科内实习生中参与此研究的有42名,入组样本中,行普通咽喉镜见习教学者21例属一般组,行可视咽镜见习教学者21例属实

期刊

气管插管麻醉应用效果临床评价见习教学可视咽镜

小组合作教学法在糖尿病护理教学实践中的应用效果

目的分析糖尿病护理教学实践中小组合作教学法的应用效果.方法选取本院2019年1月-2020年10月糖尿病护理教学实践的护生60名作为研究对象,采取随机单盲法分组,每组30名,对照

期刊

糖尿病护理教学小组合作教学法考核评价

Visual commonsense reasoning with directional visual connections

与本文相关的学术论文