Krishnakumar, S and Kumari, Prasanna KR and Balakrishnan, N (2003) Pitch Maxima for Robust Speaker Recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, April 6-10, 2003, Hong Kong, II-201.
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-free synthesised spectrum is produced from a noisy spectrum. This synthesised spectrum is used for feature extraction. From noisy speech, the pitch is extracted using a robust pitch estimation algorithm. This also helps in identifying the voiced segments of speech which are the only ones considered in the synthesis. After estimating pitch, the noisy signal is sampled in the frequency domain at pitch harmonics. From the sampled data, a reconstruction procedure is suggested in this paper in order to generate a noise-free synthesised spectrum which retains the characteristics of the speaker but rejects the noisy contributions. We compare results with the original MFCC parameters and show that on a 100 speaker database, the MFCC parameters computed on the reconstructed spectrum consistently outperforms conventional MFCC parameters over a full range of noise levels under mismatched conditions, while maintaining comparable performance under matched conditions.
|Item Type:||Conference Paper|
|Additional Information:||©2003 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.|
|Keywords:||Pitch maxima;Spectrum analysis|
|Department/Centre:||Division of Information Sciences > Supercomputer Education & Research Centre
Division of Mechanical Sciences > Aerospace Engineering (Formerly, Aeronautical Engineering)
|Date Deposited:||25 Aug 2008|
|Last Modified:||19 Sep 2010 04:14|
Actions (login required)