An Auditory Feature for Text-independent Speaker Recognition System

Abstract

This paper proposes a novel feature, based on an auditory periphery model, for robust speaker recognition. The sub-band energies of the extracted auditory features are calculated using a Gammatone filter bank instead of the commonly used triangular filter bank. The center frequencies and bandwidths of the filters are determined according to the equivalent rectangular bandwidth (ERB) model. Two further steps are investigated: weighting the Gammatone filter bank according to the contribution of the short-time spectrum in different frequency sub-bands, and applying cepstral mean subtraction (CMS) to remove channel variability. Simulation results with a Gaussian mixture model (GMM) indicate that the auditory feature significantly improves text-independent speaker recognition accuracy in noisy environments, especially at low SNR.

Key words: auditory feature, Gammatone filter bank, sub-band weighting, speaker recognition

CLC Number: TP391.42

LU Xiao-chun, YIN Jun-xun, WANG Xiu-xin. An Auditory Feature for Text-independent Speaker Recognition System[J]. Journal of Guangxi Normal University(Natural Science Edition), 2010, 28(2): 22-26.
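
The processing chain named in the abstract (ERB-spaced Gammatone filter bank, optional sub-band weighting, mean subtraction over frames) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract gives no parameter values, so the Glasberg and Moore ERB constants, the fourth-order Gammatone magnitude approximation, the filter count, the frequency range, and all function names below are assumptions chosen for illustration.

```python
import numpy as np

def erb_bandwidth(f_hz):
    """ERB in Hz at frequency f_hz (Glasberg and Moore constants, assumed)."""
    return 24.7 * (4.37 * f_hz / 1000.0 + 1.0)

def hz_to_erb_rate(f_hz):
    """Map frequency in Hz onto the ERB-rate scale."""
    return 21.4 * np.log10(4.37 * f_hz / 1000.0 + 1.0)

def erb_rate_to_hz(erb_rate):
    """Inverse of hz_to_erb_rate."""
    return (10.0 ** (erb_rate / 21.4) - 1.0) * 1000.0 / 4.37

def erb_center_frequencies(f_low, f_high, n_filters):
    """Center frequencies spaced uniformly on the ERB-rate scale."""
    rates = np.linspace(hz_to_erb_rate(f_low), hz_to_erb_rate(f_high), n_filters)
    return erb_rate_to_hz(rates)

def gammatone_magnitude(freqs_hz, fc, order=4):
    """Approximate magnitude response of a Gammatone filter centered at fc.

    Uses bandwidth b = 1.019 * ERB(fc), a common choice (assumed here).
    """
    b = 1.019 * erb_bandwidth(fc)
    return (1.0 + ((freqs_hz - fc) / b) ** 2) ** (-order / 2.0)

def subband_log_energies(power_spectrum, sample_rate, centers, weights=None):
    """Log sub-band energies of shape (n_frames, n_filters).

    power_spectrum: (n_frames, n_bins) one-sided short-time power spectrum.
    weights: optional per-band weights (hypothetical values, not the paper's).
    """
    n_bins = power_spectrum.shape[-1]
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    bank = np.stack([gammatone_magnitude(freqs, fc) ** 2 for fc in centers])
    if weights is not None:
        bank = bank * np.asarray(weights)[:, None]
    return np.log(power_spectrum @ bank.T + 1e-10)

def mean_subtraction(features):
    """Subtract the per-dimension mean over frames (the CMS idea)."""
    return features - features.mean(axis=0, keepdims=True)
```

A fixed channel gain multiplies every frame's power spectrum by the same factor, which becomes a constant offset in the log domain and is removed by the mean subtraction. Because the cepstral transform is linear, subtracting the mean before or after it gives the same result, so the sketch omits the transform.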

