DSpace logo

Please use this identifier to cite or link to this item: http://142.54.178.187:9060/xmlui/handle/123456789/13752
Title: PITCH ESTIMATION FRAMEWORK FOR SPEECH SEGREGATION USING COCHLEAGRAM MORPHING
Authors: Khan, M.J
Habib, H.A
Keywords: Pitch range estimation
Source separation
Computer auditory scene analysis (CASA)
k-means; Spectral Peaks.
Issue Date: 10-Dec-2015
Publisher: Lahore:Pakistan Association for the Advancement of Science
Citation: Khan, M. J., & Habib, H. A. (2015). PITCH ESTIMATION FRAMEWORK FOR SPEECH SEGREGATION USING COCHLEAGRAM MORPHING. Pakistan Journal of Science, 67(4).
Abstract: Computational auditory scene analysis (CASA) has significant role in speech segregation from monaural audio mixtures and generally a measure for performance of speech recognition systems. Pitch estimation has a substantial role in performance of CASA systems. This study presents a novel pitch estimation framework for speech segregation from monaural audio mixtures using cochleagram morphing. The proposed framework takes the rough estimation of target pitch from given audio mixtures containing speech and background interferences. Discrete set consisting morphed versions of cochleagram is obtained using k-Means clustering. The estimated pitch values are improved by validating and smoothing them to morphed cochleagram. Measure of refined estimated pitch contours along with harmonicity and temporal continuity are used to segregate target speech. The proposed framework produced 83.13% accuracy for MIR-1k dataset which is considerably higher than the existing methods
URI: http://142.54.178.187:9060/xmlui/handle/123456789/13752
ISSN: 2411-0930
Appears in Collections:Issue 4

Files in This Item:
File Description SizeFormat 
PJS-297-5432.htm135 BHTMLView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.