dc.contributor.convenor | Tadeusz A Wysocki | |
dc.contributor.author | Lyons, James G | |
dc.contributor.author | O'Connell, James G | |
dc.contributor.author | Paliwal, Kuldip K | |
dc.contributor.editor | Wysocki, BJ | |
dc.contributor.editor | Wysocki, TA | |
dc.date.accessioned | 2017-05-03T13:01:30Z | |
dc.date.available | 2017-05-03T13:01:30Z | |
dc.date.issued | 2010 | |
dc.date.modified | 2011-06-06T06:04:05Z | |
dc.identifier.isbn | 9781424479078 | |
dc.identifier.refuri | http://www.dspcs-witsp.com/icspcs_2010/ | |
dc.identifier.doi | 10.1109/ICSPCS.2010.5709772 | |
dc.identifier.uri | http://hdl.handle.net/10072/37259 | |
dc.description.abstract | In this paper we propose two new methods of improving the robustness of Automatic Speaker Identification systems. These methods rely on using long-term information in the speech signal to improve the robustness of the features. The first method involves averaging filterbank parameters from consecutive short-time frames over a longer window. The second method investigates the use of frame lengths longer than generally assumed stationary. We show that these two methods result in an improvement over standard Mel Frequency Cepstral Coefficients in the presence of additive white Gaussian noise in speaker identification applications. Furthermore, additional improvements are observed at mid-range SNR when the proposed methods are used in combination. | |
dc.description.peerreviewed | Yes | |
dc.description.publicationstatus | Yes | |
dc.format.extent | 130400 bytes | |
dc.format.mimetype | application/pdf | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | IEEE | |
dc.publisher.place | United States | |
dc.relation.ispartofstudentpublication | N | |
dc.relation.ispartofconferencename | 4th International Conference on Signal Processing and Communication Systems (ICSPCS)/12th International Symposium on DSP and Communication Systems (DSPCS)/9th Workshop on the Internet, Telecommunications and Signal Processing (WTSP) | |
dc.relation.ispartofconferencetitle | 2010 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS) | |
dc.relation.ispartofdatefrom | 2010-12-13 | |
dc.relation.ispartofdateto | 2010-12-15 | |
dc.relation.ispartoflocation | Gold Coast, AUSTRALIA | |
dc.relation.ispartofpagefrom | 4 pages | |
dc.relation.ispartofpageto | 4 pages | |
dc.rights.retention | Y | |
dc.subject.fieldofresearch | Signal processing | |
dc.subject.fieldofresearchcode | 400607 | |
dc.title | Using long-term information to improve robustness in speaker identification | |
dc.type | Conference output | |
dc.type.description | E1 - Conferences | |
dc.type.code | E - Conference Publications | |
gro.faculty | Griffith Sciences, Griffith School of Engineering | |
gro.rights.copyright | © 2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | |
gro.date.issued | 2010 | |
gro.hasfulltext | Full Text | |
gro.griffith.author | Paliwal, Kuldip K. | |