Show simple item record

dc.contributor.authorLai, Jun-Yao
dc.contributor.authorWang, Shi-Lin
dc.contributor.authorShi, Xing-Jian
dc.contributor.authorLiew, Alan Wee-Chung
dc.contributor.editorYong Ching Lim, Daniel P.K. Lun, A. N. Skodras, Danilo Mandic
dc.date.accessioned2017-05-03T15:20:17Z
dc.date.available2017-05-03T15:20:17Z
dc.date.issued2014
dc.identifier.isbn9781479946129
dc.identifier.issn1546-1874
dc.identifier.refurihttp://www.dsp2014.org/index.htm
dc.identifier.doi10.1109/ICDSP.2014.6900736
dc.identifier.urihttp://hdl.handle.net/10072/65346
dc.description.abstractRecent research has shown that the speaker's lip shape and movement contain rich identity-related information and can be adopted for speaker identification and authentication. Among all the static lip features, the lip texture (intensity variation inside the outer lip contour) is of high discriminative power to differentiate various speakers. However, the existing lip texture feature representations cannot describe the texture information adequately and provide unsatisfactory identification results. In this paper, a sparse representation of the lip texture is proposed and a corresponding visual speaker identification scheme is presented. In the training stage, a sparse dictionary is built based on the texture samples for each speaker. In the testing stage, for any lip image investigated, the lip texture information is extracted and the reconstruction errors using all the dictionaries for every speaker are calculated. The lip image is identified to the speaker with the minimum reconstruction error. The experimental results show that the proposed sparse coding based scheme can achieve much better identification accuracy (91.37% for isolate image and 98.21% for image sequence) compared with several state-of-the-art methods when considering the lip texture information only.
dc.description.peerreviewedYes
dc.description.publicationstatusYes
dc.format.extent338381 bytes
dc.format.mimetypeapplication/pdf
dc.languageEnglish
dc.language.isoeng
dc.publisherIEEE
dc.publisher.placeUnited States
dc.publisher.urihttp://www.dsp2014.org/index.htm
dc.relation.ispartofstudentpublicationN
dc.relation.ispartofconferencename19th International Conference on Digital Signal Processing (DSP)
dc.relation.ispartofconferencetitle2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP)
dc.relation.ispartofdatefrom2014-08-20
dc.relation.ispartofdateto2014-08-23
dc.relation.ispartoflocationHong Kong, PEOPLES R CHINA
dc.relation.ispartofpagefrom607
dc.relation.ispartofpageto610
dc.relation.ispartofvolume2014-January
dc.rights.retentionY
dc.subject.fieldofresearchComputer vision
dc.subject.fieldofresearchcode460304
dc.titleSparse Coding Based Lip Texture Representation For Visual Speaker Identification
dc.typeConference output
dc.type.descriptionE1 - Conferences
dc.type.codeE - Conference Publications
gro.rights.copyright© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
gro.date.issued2015-04-23T03:07:30Z
gro.hasfulltextFull Text
gro.griffith.authorLiew, Alan Wee-Chung


Files in this item

This item appears in the following Collection(s)

  • Conference outputs
    Contains papers delivered by Griffith authors at national and international conferences.

Show simple item record