Show simple item record

dc.contributor.authorPaliwal, Kuldipen_US
dc.contributor.authorWojcicki, Kamilen_US
dc.contributor.authorSchwerin, Belindaen_US
dc.date.accessioned2017-04-24T10:06:35Z
dc.date.available2017-04-24T10:06:35Z
dc.date.issued2010en_US
dc.date.modified2011-03-15T08:04:00Z
dc.identifier.issn01676393en_US
dc.identifier.doi10.1016/j.specom.2010.02.004en_AU
dc.identifier.urihttp://hdl.handle.net/10072/37260
dc.description.abstractIn this paper we investigate the modulation domain as an alternative to the acoustic domain for speech enhancement. More specifically, we wish to determine how competitive the modulation domain is for spectral subtraction as compared to the acoustic domain. For this purpose, we extend the traditional analysis-modification-synthesis framework to include modulation domain processing. We then compensate the noisy modulation spectrum for additive noise distortion by applying the spectral subtraction algorithm in the modulation domain. Using an objective speech quality measure as well as formal subjective listening tests, we show that the proposed method results in improved speech quality. Furthermore, the proposed method achieves better noise suppression than the MMSE method. In this study, the effect of modulation frame duration on speech quality of the proposed enhancement method is also investigated. The results indicate that modulation frame durations of 180-280 ms, provide a good compromise between different types of spectral distortions, namely musical noise and temporal slurring. Thus given a proper selection of modulation frame duration, the proposed modulation spectral subtraction does not suffer from musical noise artifacts typically associated with acoustic spectral subtraction. In order to achieve further improvements in speech quality, we also propose and investigate fusion of modulation spectral subtraction with the MMSE method. The fusion is performed in the short-time spectral domain by combining the magnitude spectra of the above speech enhancement algorithms. Subjective and objective evaluation of the speech enhancement fusion shows consistent speech quality improvements across input SNRs.en_US
dc.description.peerreviewedYesen_US
dc.description.publicationstatusYesen_AU
dc.languageEnglishen_US
dc.language.isoen_AU
dc.publisherElsevier BVen_US
dc.publisher.placeNetherlandsen_US
dc.relation.ispartofstudentpublicationNen_AU
dc.relation.ispartofpagefrom450en_US
dc.relation.ispartofpageto475en_US
dc.relation.ispartofissue5en_US
dc.relation.ispartofjournalSpeech Communicationen_US
dc.relation.ispartofvolume52en_US
dc.rights.retentionYen_AU
dc.subject.fieldofresearchSignal Processingen_US
dc.subject.fieldofresearchcode090609en_US
dc.titleSingle-channel speech enhancement using spectral subtraction in the short-time modulation domainen_US
dc.typeJournal articleen_US
dc.type.descriptionC1 - Peer Reviewed (HERDC)en_US
dc.type.codeC - Journal Articlesen_US
gro.facultyGriffith Sciences, Griffith School of Engineeringen_US
gro.date.issued2010
gro.hasfulltextNo Full Text


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Journal articles
    Contains articles published by Griffith authors in scholarly journals.

Show simple item record