Role of the Short-Time Phase Spectrum in Speech Processing

Loading...
Thumbnail Image
File version
Primary Supervisor

Paliwal, Kuldip

So, Stephen

Other Supervisors
Editor(s)
Date
2011
Size
File type(s)
Location
License
Abstract

Majority of speech processing algorithms that employ the short-time Fourier transform process the short-time magnitude spectrum, while either discarding the short-time phase spectrum or leaving it unchanged. This is in-part due to a long-standing belief among speech researchers that the short-time phase spectrum, computed over small analysis window durations of 20–40 ms, contains little useful information and is thus (mostly) unimportant for speech processing (though it is accepted that the phase spectrum does contribute to some extent to naturalness and quality aspects of speech). The above belief has been supported by numerous studies presented in the literature. Results of recent speech perception experiments suggest, however, that the phase spectrum (at small analysis window durations of 20–40 ms) does contain significant amount of useful information, provided that the analysis window function is carefully selected. It was reported that the use of non-tapered analysis windows functions (such as the rectangular window) significantly improves intelligibility of the phase spectrum. This improvement was attributed to the spectral characteristics of the non-tapered analysis windows and—in particular—to their low spectral dynamic range. The main aim of the research presented in this dissertation is to further examine the importance of the short-time phase spectrum for human speech perception. It is hoped that results of such an examination can provide an incentive for further research in this direction. Three studies that investigate the usefulness of the phase spectrum for human speech perception are presented in this thesis. These studies employ human listening tests to explore the importance of the phase spectrum for speech intelligibility, speaker dependent speech information and speech quality. In each of these studies the effect of the spectral dynamic range of an analysis window function is systematically examined.

Journal Title
Conference Title
Book Title
Edition
Volume
Issue
Thesis Type

Thesis (PhD Doctorate)

Degree Program

Doctor of Philosophy (PhD)

School

Griffith School of Engineering

Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

The author owns the copyright in this thesis, unless stated otherwise.

Item Access Status

Public

Note
Access the data
Related item(s)
Subject

Fourier transform process

Short-time magnitude spectrum

Short-time phase spectrum

Speech processing algorithms

Persistent link to this record
Citation