Multi-frame GMM-based block quantisation of line spectral frequencies for wideband speech coding

Author(s)
So, S
Paliwal, KK
Editor(s)

Kenneth Barner

Date
2005
Size

207207 bytes

40 bytes

File type(s)

application/pdf

text/plain

Location

Philadelphia, PA

Abstract

In this paper, we explore the use of the multi-frame GMM-based block quantiser for quantising line spectral frequencies in wideband speech coding. Its main advantages over vector quantisers are bitrate scalability and bitrate-independent complexity. By concatenating multiple frames, interframe correlation can be exploited by the Karhunen-Loève transform (KLT), leading to better quantisation. A saving of up to 3 bits/frame can be achieved by switching the quantiser from memoryless mode to jointly quantising two frames, with only a moderate increase in complexity. This quantisation scheme achieves lower spectral distortion than the split-multistage vector quantiser in the AMR-WB speech codec, with transparent coding at 37 bits/frame.
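The benefit of concatenating frames before the KLT can be illustrated with a minimal sketch. This is not the authors' quantiser: it uses a single Gaussian rather than a GMM, synthetic first-order autoregressive frames instead of real line spectral frequencies, and the standard high-resolution transform coding gain (arithmetic mean over geometric mean of the KLT eigenvalues) as a stand-in for measured bit savings. The function names, the frame dimension of 16, and the interframe correlation of 0.8 are all assumptions made for illustration.

```python
import numpy as np


def make_synthetic_frames(num_frames, dim, rho, seed=0):
    """Generate frames whose components follow an AR(1) process over time.

    Hypothetical stand-in for LSF vectors: components are uncorrelated
    within a frame, but each is correlated (coefficient rho) with the
    same component in the previous frame.
    """
    rng = np.random.default_rng(seed)
    frames = np.empty((num_frames, dim))
    frames[0] = rng.standard_normal(dim)
    for t in range(1, num_frames):
        noise = rng.standard_normal(dim)
        frames[t] = rho * frames[t - 1] + np.sqrt(1.0 - rho**2) * noise
    return frames


def klt_coding_gain(vectors):
    """Arithmetic-to-geometric mean ratio of the covariance eigenvalues."""
    cov = np.cov(vectors, rowvar=False)
    eigvals = np.linalg.eigvalsh(cov)  # variances of the KLT coefficients
    return eigvals.mean() / np.exp(np.mean(np.log(eigvals)))


def bits_saved_per_frame(frames, frames_per_block):
    """High-resolution estimate of bits saved per frame by the KLT.

    Consecutive frames are concatenated into blocks of frames_per_block
    frames; the saving is relative to coding every component
    independently at the same distortion.
    """
    num_frames, dim = frames.shape
    usable = num_frames - num_frames % frames_per_block
    blocks = frames[:usable].reshape(-1, frames_per_block * dim)
    gain = klt_coding_gain(blocks)
    # (n/2) * log2(gain) bits per block of n components, divided by the
    # number of frames in the block, gives (dim/2) * log2(gain) per frame.
    return 0.5 * dim * np.log2(gain)


if __name__ == "__main__":
    frames = make_synthetic_frames(num_frames=4000, dim=16, rho=0.8)
    single = bits_saved_per_frame(frames, frames_per_block=1)
    joint = bits_saved_per_frame(frames, frames_per_block=2)
    print(f"memoryless KLT saving: {single:.2f} bits/frame")
    print(f"two-frame KLT saving:  {joint:.2f} bits/frame")
```

On this synthetic data the memoryless KLT has almost nothing to exploit, because all the correlation lies between frames, so the two-frame estimate is noticeably larger. The figures it prints depend entirely on the assumed correlation and should not be compared with the 3 bits/frame reported in the abstract.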

Conference Title

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5

Volume

I

Rights Statement

© 2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
