Investigation and use of methods for defining the extends of similarity of Kazakh language sentences

来源 :第十五届全国计算语言学学术会议(CCL2016)暨第四届基于自然标注大数据的自然语言处理国际学术研讨会(NLP-NABD | 被引量 : 0次 | 上传用户：zjqzc

【摘要】

：

　　Finding similarity degree is one of the significant technologies used in the sample-based machine translation.It works in the following principle,first matc

【作者】

：

UnzilaKamanur;AltynbekSharipbay;GulilaAltnbek;GulmiraBekmanova;LenaZhetkenbay;

【机构】

：

L.N.Gumilyov Eurasian National University,Astana

【出处】

：

第十五届全国计算语言学学术会议(CCL2016)暨第四届基于自然标注大数据的自然语言处理国际学术研讨会(NLP-NABD

【发表日期】

：

2016年期

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

　　Finding similarity degree is one of the significant technologies used in the sample-based machine translation.It works in the following principle,first matching the input sentences with a sentence in the sample database,after that it is necessary to pick up parts of the similar sentences for the sentence which is aimed to translate; it is finished by correcting the structure or paraphrasing it with a relevant meaning.For that reason,the degree of similarity of two samples highly affects on the results of translation.Thus,there are dependence between quality of the outputs and the similarity degree.

其他文献

Is Local Window Essential for Neural Network based Chinese Word Segmentation?

　　Neural network based Chinese Word Segmentation(CWS)approaches can bypass the burdensome feature engineering comparing with the conventional ones.All previou

会议

A Bootstrapping Approach to Symptom Entity Extraction on Chinese Electronic Medical Records

　　Symptom entities are widely distributed in Chinese electronic medical records.Previous approaches on symptom entity extraction usually extract continuous st

会议

Towards Scalable Emotion Classification in Microblog Based on Noisy Training Data

　　The availability of labeled corpus is of great importance for emotion classification tasks.Because manual labeling is too time-consuming,hashtags have been

会议

Enhancing Neural Disfluency Detection with Hand-crafted Features

　　In this paper,we apply a bidirectional Long Short-Term Memory with a Conditional Random Field to the task of disfluency detection.Long-range dependencies is

会议

Chinese Sentiment Analysis Exploiting Heterogeneous Segmentations

　　The Chinese language is a character-based language,with no explicit separators between words like English.Traditionally,word segmentation is conducted to co

会议

Active Learning for Age Regression in Social Media

　　Large-scale annotated corpora are a prerequisite for developing high-performance age regression models.However,such annotated corpora are some-times very ex

会议

Multilingual Multi-document Summarization with Enhanced hLDA Features

　　This paper presents the state of art research progress on multilingual multi-document summarization.Our method utilizes hLDA(hierarchical Latent Dirichlet A

会议

Combining Event-level and Cross-event Semantic Information for Event-Oriented Relation Classificatio

　　Previous researches on event relation classification primarily rely on lexical and syntactic features.In this paper,we use a Shallow Convolutional Neural Ne

会议

A New Focus Strategy for Efficient Dialog Management

　　The dialog manager is the most important component for a dialog system,in which the dialog state tracking is crucial to a real-world system.We claim that th

会议

A Novel Approach for Discovering Local Community Structure in Networks

　　The algorithms for discovering global community structure require the knowledge about entire network structures,which are still difficult and unrealistic to

会议

Investigation and use of methods for defining the extends of similarity of Kazakh language sentences

与本文相关的学术论文