    Modulation Domain Based Processing for Speech Enhancement

    View/Open
    Schwerin_2013_02Thesis.pdf (34.84Mb)
    Author(s)
    Schwerin, Belinda M.
    Primary Supervisor
    Paliwal, Kuldip
    Other Supervisors
    So, Stephen
    Year published
    2013
    Abstract
    For a long time, the spectral envelope has been accepted as the principal carrier of information important to speech. Therefore, much of the work done for speech applications, such as automatic speech recognition and speech enhancement, has aimed to process this envelope. For speech enhancement, given the quasi-stationarity of speech, many approaches have been based on short-time processing of speech in a Fourier analysis-modification-synthesis (AMS) framework. Within this framework, the magnitude spectrum, the phase spectrum, or both can be modified by a noise suppression or signal estimation approach to achieve enhancement. Most commonly, it is the short-time (acoustic) magnitude spectrum that is modified in order to suppress noise. While there are many methods for enhancement in the literature, it is generally agreed that current methods only succeed in making noise less perceptually annoying while maintaining intelligibility, leaving much room for improvement. In more recent years, the low-frequency temporal modulations of the spectral envelope have received increasing attention. Findings of physiological and psychoacoustic experiments have indicated the importance of these modulations in the human auditory system. This has led to the view that these temporal modulations convey much of the information necessary for speech perception. Many of the efforts to apply modulation processing to the enhancement of speech originated from work in automatic speech recognition, and are based on filtering the trajectories of each acoustic band. However, these filters were typically designed to operate over the entire utterance, without accounting for the properties of speech and noise in the signal. Consequently, processed speech quality is quite poor where the corrupting noise types are dissimilar from those used to design the filters.
    Thesis Type
    Thesis (PhD Doctorate)
    Degree Program
    Doctor of Philosophy (PhD)
    School
    Griffith School of Engineering
    DOI
    https://doi.org/10.25904/1912/2608
    Copyright Statement
    The author owns the copyright in this thesis, unless stated otherwise.
    Note
    Figures 2.1, 2.4, 2.5, 2.6 and 2.10 have been removed from the digital copy to comply with copyright.
    Subject
    Spectral envelope
    Speech enhancement
    Automatic speech recognition
    Publication URI
    http://hdl.handle.net/10072/366414
    Collection
    • Theses - Higher Degree by Research
