Show simple item record

dc.contributor.authorPaliwal, Kuldipen_US
dc.contributor.authorShannon, Benen_US
dc.contributor.authorLyons, Jamesen_US
dc.contributor.authorWojcicki, Kamilen_US
dc.date.accessioned2017-04-24T14:56:08Z
dc.date.available2017-04-24T14:56:08Z
dc.date.issued2009en_US
dc.date.modified2009-10-05T05:56:41Z
dc.identifier.issn10709908en_US
dc.identifier.doi10.1109/LSP.2009.2014096en_AU
dc.identifier.urihttp://hdl.handle.net/10072/25946
dc.description.abstractThe speech signal is used for transmission of linguistic information. High energy portions of the speech spectrum have higher signal-to-noise ratios than the low energy portions. As a result, these regions are more robust to noise. Since the speech signal is known to be very robust to noise, it is expected that the high energy regions of the speech spectrum carry the majority of the linguistic information. This letter tries to derive a frequency warping function directly from the speech signal by sampling the frequency axis nonuniformly with the high energy regions sampled more densely than the low energy regions. To achieve this, an ensemble average short-time power spectrum is computed from a large speech corpus. The speech-signal-based frequency warping is obtained by considering equal area portions of the log spectrum. The proposed frequency warping is shown to be similar to the frequency scales obtained through psycho-acoustic experiments, namely the mel and bark scales. The warping is then used in filterbank design for automatic speech recognition experiments. The results of these experiments show that cepstral features based on the proposed warping achieve performance under clean conditions comparable to that of mel-frequency cepstral coefficients, while outperforming them under noisy conditions.en_US
dc.description.peerreviewedYesen_US
dc.description.publicationstatusYesen_AU
dc.format.extent286084 bytes
dc.format.mimetypeapplication/pdf
dc.languageEnglishen_US
dc.language.isoen_AU
dc.publisherIEEEen_US
dc.publisher.placeUnited Statesen_US
dc.publisher.urihttp://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=97en_AU
dc.relation.ispartofstudentpublicationNen_AU
dc.relation.ispartofpagefrom319en_US
dc.relation.ispartofpageto322en_US
dc.relation.ispartofissue4en_US
dc.relation.ispartofjournalIEEE Signal Processing Lettersen_US
dc.relation.ispartofvolume16en_US
dc.rights.retentionYen_AU
dc.subject.fieldofresearchcode280204en_US
dc.subject.fieldofresearchcode280206en_US
dc.titleSpeech-Signal-Based Frequency Warpingen_US
dc.typeJournal articleen_US
dc.type.descriptionC1 - Peer Reviewed (HERDC)en_US
dc.type.codeC - Journal Articlesen_US
gro.facultyGriffith Sciences, Griffith School of Engineeringen_US
gro.rights.copyrightCopyright 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.en_AU
gro.date.issued2009
gro.hasfulltextFull Text


Files in this item

This item appears in the following Collection(s)

  • Journal articles
    Contains articles published by Griffith authors in scholarly journals.

Show simple item record