Show simple item record

dc.contributor.authorMilacic, Mitar
dc.contributor.authorJames, Alex Pappachen
dc.contributor.authorDimitrijev, Sima
dc.date.accessioned2017-08-03T01:37:01Z
dc.date.available2017-08-03T01:37:01Z
dc.date.issued2017
dc.identifier.issn1064-1246
dc.identifier.doi10.3233/JIFS-169236
dc.identifier.urihttp://hdl.handle.net/10072/343061
dc.description.abstractAutomated processing and recognition of human speech commands under unconstrained and noisy recognition situations with a limited number of training samples is a challenging problem of interest to smart devices and systems. In practice, it is impossible to remove noise without losing class discriminative information in the speech signals. Also, any attempts to improve signal quality place an additional burden on the computational capacity in state-of-the-art speech command recognition systems. In this paper, we propose a low-level word processing system using mean-variance normalised frequency-time spectrograms and a new similarity measure that compensates for feature length mismatches such as those resulting from pronunciation variations in speech segments. We find that padding a local similarity matrix with zero similarity values to disregard the effects of a mismatch in length of speech spectrograms results in improved word recognition accuracies and reduction in between class non-discriminative signals. As opposed to the state-of-the-art approaches in spectrogram comparisons such as DTW, the proposed method, when tested using the TIMIT database, shows improved recognition accuracies, robustness to noise, lower computational requirements, and scalability to large word problems.
dc.description.peerreviewedYes
dc.languageEnglish
dc.language.isoeng
dc.publisherI O S Press
dc.relation.ispartofpagefrom2933
dc.relation.ispartofpageto2939
dc.relation.ispartofissue4
dc.relation.ispartofjournalJournal of Intelligent and Fuzzy Systems
dc.relation.ispartofvolume32
dc.subject.fieldofresearchPattern Recognition and Data Mining
dc.subject.fieldofresearchArtificial Intelligence and Image Processing
dc.subject.fieldofresearchCognitive Sciences
dc.subject.fieldofresearchcode080109
dc.subject.fieldofresearchcode0801
dc.subject.fieldofresearchcode1702
dc.titleRecognizing isolated words with minimum distance similarity metric padding
dc.typeJournal article
dc.type.descriptionC1 - Articles
dc.type.codeC - Journal Articles
gro.hasfulltextNo Full Text
gro.griffith.authorDimitrijev, Sima
gro.griffith.authorJames, Alex P.


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

  • Journal articles
    Contains articles published by Griffith authors in scholarly journals.

Show simple item record