Deep residual-dense lattice network for speech enhancement
File version
Accepted Manuscript (AM)
Author(s)
Nicolson, Aaron
Gao, Yongsheng
Zhou, Jun
Paliwal, Kuldip K.
Shang, Fanhua
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
New york, USA
License
Abstract
Convolutional neural networks (CNNs) with residual links (ResNets) and causal dilated convolutional units have been the network of choice for deep learning approaches to speech enhancement. While residual links improve gradient flow during training, feature diminution of shallow layer outputs can occur due to repetitive summations with deeper layer outputs. One strategy to improve feature re-usage is to fuse both ResNets and densely connected CNNs (DenseNets). DenseNets, however, over-allocate parameters for feature re-usage. Motivated by this, we propose the residual-dense lattice network (RDL-Net), which is a new CNN for speech enhancement that employs both residual and dense aggregations without over-allocating parameters for feature re-usage. This is managed through the topology of the RDL blocks, which limit the number of outputs used for dense aggregations. Our extensive experimental investigation shows that RDL-Nets are able to achieve a higher speech enhancement performance than CNNs that employ residual and/or dense aggregations. RDL-Nets also use substantially fewer parameters and have a lower computational requirement. Furthermore, we demonstrate that RDL-Nets outperform many state-of-the-art deep learning approaches to speech enhancement. Availability: https://github.com/nick-nikzad/RDL-SE.
Journal Title
Conference Title
Proceedings of the AAAI Conference on Artificial Intelligence
Book Title
Edition
Volume
34
Issue
5
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2020 AAAI Press. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. Please refer to the conference's website for access to the definitive, published version.
Item Access Status
Note
Access the data
Related item(s)
Subject
Electrical engineering
Electronics, sensors and digital hardware
Persistent link to this record
Citation
Paliwal, K; Nikzad, M; Gao, Y; Zhou, J; Shang, F, Proceedings of the AAAI Conference on Artificial Intelligence, Deep residual-dense lattice network for speech enhancement, 2020