A Hierarchical LSTM Model for Joint Tasks

来源 :第十五届全国计算语言学学术会议(CCL2016)暨第四届基于自然标注大数据的自然语言处理国际学术研讨会(NLP-NABD | 被引量 : 0次 | 上传用户:neverneverland
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Previous work has shown that joint modeling of two Natural Language Processing(NLP)tasks are effective for achieving better performances for both tasks.Lots of task-specific joint models are proposed.This paper proposes a Hierarchical Long Short-Term Memory(HLSTM)model and some its variants for modeling two tasks jointly.The models are flexible for modeling different types of combinations of tasks.It avoids task-specific feature engineering.Besides the enabling of correlation information between tasks,our models take the hierarchical relations between two tasks into consideration,which is not discussed in previous work.Experimental results show that our models outperform strong baselines in three different types of task combination.While both correlation information and hierarchical relations between two tasks are helpful to improve performances for both tasks,the models especially boost performance of tasks on the top of the hierarchical structures.
其他文献
  Symptom entities are widely distributed in Chinese electronic medical records.Previous approaches on symptom entity extraction usually extract continuous st
会议
  The availability of labeled corpus is of great importance for emotion classification tasks.Because manual labeling is too time-consuming,hashtags have been
会议
  In this paper,we apply a bidirectional Long Short-Term Memory with a Conditional Random Field to the task of disfluency detection.Long-range dependencies is
会议
  The Chinese language is a character-based language,with no explicit separators between words like English.Traditionally,word segmentation is conducted to co
会议
  Large-scale annotated corpora are a prerequisite for developing high-performance age regression models.However,such annotated corpora are some-times very ex
会议
  This paper presents the state of art research progress on multilingual multi-document summarization.Our method utilizes hLDA(hierarchical Latent Dirichlet A
会议
  Previous researches on event relation classification primarily rely on lexical and syntactic features.In this paper,we use a Shallow Convolutional Neural Ne
会议
  The dialog manager is the most important component for a dialog system,in which the dialog state tracking is crucial to a real-world system.We claim that th
会议
  The algorithms for discovering global community structure require the knowledge about entire network structures,which are still difficult and unrealistic to
会议
  Finding similarity degree is one of the significant technologies used in the sample-based machine translation.It works in the following principle,first matc
会议