A Cortically-Inspired Model for Bioacoustics Recognition
File version
Accepted Manuscript (AM)
Author(s)
Thornton, John
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
License
Abstract
Wavelet transforms have shown superior performance in auditory recognition tasks compared to the more commonly used Mel-Frequency Cepstral Coefficients, and offer the ability to more closely model the frequency response behaviour of the cochlear basilar membrane. In this paper we evaluate a gammatone wavelet as a preprocessor for the Hierarchical Temporal Memory (HTM) model of the neocortex as part of the broader development of a biologically motivated approach to sound recognition. Specifically, we apply for the first time, a gammatone/equivalent rectangular bandwidth wavelet transform in conjunction with the HTM’s Spatial Pooler to recognise frog calls, bird songs and insect sounds. Our audio feature detection results show that wavelets perform considerably better than MFCCs on our selected datasets but that combining wavelets with HTM does not produce further improvements. This outcome raises questions concerning the degree of match to the biology required for an effective HTM-based model of audition.
Journal Title
Lecture Notes in Computer Science
Conference Title
Book Title
Edition
Volume
9492
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2015 Springer Berlin / Heidelberg. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. The original publication is available at www.springerlink.com
Item Access Status
Note
Access the data
Related item(s)
Subject
Information and Computing Sciences not elsewhere classified