The role of dynamic features in speaker verification
[摘要] The thesis presents study to explore the role of dynamic features in speaker verification. Based on the theory that dynamic information should contain important speaker information, modelling the dynamics should have the potential to improve the speaker verification performance. Experiments on TD-SV using segmental hidden Markov models (SHMMs) on the YOHO database show performance improvement. However there is no significant improvement for TI-SV from experiments on the Switchboard database, using segmental GMMs. Analysis of the TD-SV results confirms that the speech dynamics modeled by SHMMs contribute more to the SV accuracy. Analysis of the TI-SV results indicates that the lack of speech dynamic information is a feature of GMM systems. It seems that the priority of the maximum likelihood training algorithm is to model stationary regions, and the role of dynamic features in GMM system, is to ensure that the classification focuses on static regions rather than to model dynamics. Study on TI-SV was carried out using conventional GMMs. Without RASTA filtering, the `delta-only' system works best. However, after RASTA filtering, the `static-plus-delta' system performs best. The results suggest that the good performance of the `delta-only' system before RASTA is mainly due to the noise robustness of the delta parameters.
[发布日期] [发布机构] University:University of Birmingham;Department:School of Engineering, Department of Electronic, Electrical and Systems Engineering
[效力级别] [学科分类]
[关键词] T Technology;TK Electrical engineering. Electronics Nuclear engineering [时效性]