基于点关联测度矩阵分解的中英跨语言词嵌入

来源 :第十五届全国计算语言学学术会议(CCL2016)暨第四届基于自然标注大数据的自然语言处理国际学术研讨会(NLP-NABD | 被引量 : 0次 | 上传用户：rwsonny

【摘要】

：

　　研究基于矩阵分解的词嵌入方法，提出统一的描述模型，并应用于中英跨语言词嵌入问题.以双语对齐语料为知识源，提出跨语言关联词计算方法和两种点关联测度的计算方法：跨语言共

【作者】

：

于东荀恩东赵艳韦林煊

【机构】

：

北京语言大学大数据与教育技术研究所,北京100083

【出处】

：

第十五届全国计算语言学学术会议(CCL2016)暨第四届基于自然标注大数据的自然语言处理国际学术研讨会(NLP-NABD

【发表日期】

：

2016年期

【关键词】

：

关联词测度矩阵分解语言词义跨语言相似度计算嵌入问题目标函数

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

　　研究基于矩阵分解的词嵌入方法，提出统一的描述模型，并应用于中英跨语言词嵌入问题.以双语对齐语料为知识源，提出跨语言关联词计算方法和两种点关联测度的计算方法：跨语言共现计数和跨语言点互信息.分别设计目标函数学习中英跨语言词嵌入.从目标函数、语料数据、向量维数等角度进行实验，结果表明：在中英跨语言文档分类中以前者作为点关联测度最高得到87.04％的准确率；在中英跨语言词义相似度计算中，后者作为点关联测度得到更好的性能，同时在英-英词义相似度计算中的性能略高于主流的英语词嵌入.

其他文献

基于深度表示学习和高斯过程迁移学习的情感分析方法

　　情感分析是自然语言处理领域的重要研究问题。现有方法往往难以克服样本偏置与领域依赖问题，严重制约了情感分析的发展和应用。为此，本文提出了一种基于深度表示学习和高斯

会议

Event Extraction via Bidirectional Long Short-Term Memory Tensor Neural Networks

　　Traditional approaches to the task of ACE event extraction usually rely on complicated natural language processing(NLP)tools and elaborately designed featur

会议

A Novel Approach to Improve the Mongolian Language Model using Intermediate Characters

　　In Mongolian language,there is a phenomenon that many words have the same presentation form but represent different words with different codes.Since typists

会议

Recognizing Textual Entailment via Multi-task Knowledge Assisted LSTM

　　Recognizing Textual Entailment(RTE)plays an important role in NLP applications like question answering,information retrieval,etc.Most previous works either

会议

基于Spatial-DCTHash动态参数网络的视觉问答算法

随着深度学习和多模态融合的深入研究,问答系统从传统的纯文本方式扩展到结合图片的视觉问答,成为计算机视觉与自然语言理解的交叉研究热点.Hyeonwoo等最新提出的动态参数预测模型(DPPnet)能够简单、有效地实现问句和图片信息的融合.但是该模型在进行网络参数动态Hash分配时位置随机,缺乏图像内容空间分布的考虑.针对此不足,本文提出一种新的空间离散余弦Hash动态参数网络.算法采用全卷积方式提取保

会议

Using Collaborative Training Method to build Vietnamese Dependency Treebank

　　For the difficulty of marking Vietnamese dependency tree,this paper proposed the method which combined MST algorithm and improved Nivre algorithm to build V

会议

Recognizing Biomedical Named Entities Based on the Sentence Vector/Twin Word Embeddings Conditioned

　　As a fundamental step in biomedical information extraction tasks,biomedical named entity recognition remains challenging.In recent years,the neural network

会议

Transition-based Chinese Semantic Dependency Graph Parsing

　　Chinese semantic dependency graph is extended from semantic dependency tree,which uses directed acyclic graphs to capture richer latent semantics of sentenc

会议

Definition Extraction with LSTM Recurrent Neural Networks

　　Definition extraction is the task to identify definitional sentences automatically from unstructured text.The task can be used in the aspects of ontology ge

会议

Keeping the Meanings of the Source Text:An introduction to Yes Translate

　　The primary task of language translation is to faithfully pass the meaning(s)of the source text to the target language.Unfortunately,meanings often get lost

会议

基于点关联测度矩阵分解的中英跨语言词嵌入

与本文相关的学术论文