dc.contributor.author: Stark, A
dc.contributor.author: Paliwal, K
dc.date.accessioned: 2017-05-03T12:28:10Z
dc.date.available: 2017-05-03T12:28:10Z
dc.date.issued: 2011
dc.date.modified: 2012-04-10T23:50:26Z
dc.identifier.issn: 0167-6393
dc.identifier.doi: 10.1016/j.specom.2010.11.004
dc.identifier.uri: http://hdl.handle.net/10072/44400
dc.description.abstract: In this paper, we derive a minimum mean square error log-filterbank energy estimator for environment-robust automatic speech recognition. While several such estimators exist within the literature, most involve trade-offs between simplifications of the log-filterbank noise distortion model and analytical tractability. To avoid this limitation, we extend a well-known spectral-domain noise distortion model for use in the log-filterbank energy domain. To do this, several mathematical transformations are developed to transform spectral-domain models into filterbank and log-filterbank energy models. As a result, a new estimator is developed that allows for robust estimation of both log-filterbank energies and subsequent Mel-frequency cepstral coefficients. The proposed estimator is evaluated on the Aurora2 and RM speech recognition tasks, with results showing a significant reduction in word recognition error over both baseline results and several competing estimators.
dc.description.peerreviewed: Yes
dc.description.publicationstatus: Yes
dc.language: English
dc.language.iso: eng
dc.publisher: Elsevier
dc.publisher.place: Netherlands
dc.relation.ispartofstudentpublication: N
dc.relation.ispartofpagefrom: 403
dc.relation.ispartofpageto: 416
dc.relation.ispartofissue: 3
dc.relation.ispartofjournal: Speech Communication
dc.relation.ispartofvolume: 53
dc.rights.retention: Y
dc.subject.fieldofresearch: Artificial Intelligence and Image Processing not elsewhere classified
dc.subject.fieldofresearch: Artificial Intelligence and Image Processing
dc.subject.fieldofresearch: Cognitive Sciences
dc.subject.fieldofresearch: Linguistics
dc.subject.fieldofresearchcode: 080199
dc.subject.fieldofresearchcode: 0801
dc.subject.fieldofresearchcode: 1702
dc.subject.fieldofresearchcode: 2004
dc.title: MMSE estimation of log-filter bank energies for robust speech recognition
dc.type: Journal article
dc.type.description: C1 - Articles
dc.type.code: C - Journal Articles
gro.faculty: Griffith Sciences, Griffith School of Engineering
gro.date.issued: 2011
gro.hasfulltext: No Full Text
gro.griffith.author: Paliwal, Kuldip K.
gro.griffith.author: Stark, Anthony P.


Files in this item

There are no files associated with this item.

This item appears in the following Collection(s)

  • Journal articles
    Contains articles published by Griffith authors in scholarly journals.
