Histogram equalization using a reduced feature set of background speakers’ utterances for speaker re

来源 :信息与电子工程前沿(英文版) | 被引量 : 0次 | 上传用户:fossi
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
We propose a method for histogram equalization using supplement sets to improve the performance of speaker recognition when the training and test utterances are very short. The supplement sets are derived using outputs of selection or clustering algorithms from the background speakers’ utterances. The proposed approach is used as a feature normalization method for building histograms when there are insufficient input utterance samples. In addition, the proposed method is used as an i-vector normalization method in an i-vector-based probabilistic linear discriminant analysis (PLDA) system, which is the current state-of-the-art for speaker verifi cation. The ranks of sample values for histogram equalization are estimated in ascending order from both the input utterances and the supplement set. New ranks are obtained by computing the sum of different kinds of ranks. Subsequently, the proposed method determines the cumulative distribution function of the test utterance using the newly defi ned ranks. The proposed method is compared with conventional feature normalization methods, such as cepstral mean normalization (CMN), cepstral mean and variance normalization (MVN), histogram equalization (HEQ), and the European Telecommunications Standards Institute (ETSI) advanced front-end methods. In addition, performance is compared for a case in which the greedy selection algorithm is used with fuzzy C-means and K-means algorithms. The YOHO and Electronics and Telecommunications Research Institute (ETRI) databases are used in an evaluation in the feature space. The test sets are simulated by the Opus VoIP codec. We also use the 2008 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) corpus for the i-vector system. The results of the experimental evaluation demonstrate that the average system performance is improved when the proposed method is used, compared to the conventional feature normalization methods.
其他文献
太谷核不育小麦在小麦育种上的应用研究正在成为一种小麦多途径育种的新方法。自1972□年以来在不育机理研究、不育基因定位、应用技术研究、特异种质创新与拓展、杂优利用等领域已取得很大的成就,尤其是把异花授粉作物上形之有效的“轮回选择”移植到自花授粉作物上之后,在不到30年的时间内,已经形成了一套有特色的育种体系——以轮回选择为主,多途径综合运用。本文针对近年来被广泛采用的或新近设计的几种轮回选择...
学位