Deep residual-dense lattice network for speech enhancement

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Nikzad, M
Nicolson, Aaron
Gao, Yongsheng
Zhou, Jun
Paliwal, Kuldip K.
Shang, Fanhua
Primary Supervisor
Other Supervisors
Editor(s)
Date
2020
Size
File type(s)
Location

New york, USA

License
Abstract

Convolutional neural networks (CNNs) with residual links (ResNets) and causal dilated convolutional units have been the network of choice for deep learning approaches to speech enhancement. While residual links improve gradient flow during training, feature diminution of shallow layer outputs can occur due to repetitive summations with deeper layer outputs. One strategy to improve feature re-usage is to fuse both ResNets and densely connected CNNs (DenseNets). DenseNets, however, over-allocate parameters for feature re-usage. Motivated by this, we propose the residual-dense lattice network (RDL-Net), which is a new CNN for speech enhancement that employs both residual and dense aggregations without over-allocating parameters for feature re-usage. This is managed through the topology of the RDL blocks, which limit the number of outputs used for dense aggregations. Our extensive experimental investigation shows that RDL-Nets are able to achieve a higher speech enhancement performance than CNNs that employ residual and/or dense aggregations. RDL-Nets also use substantially fewer parameters and have a lower computational requirement. Furthermore, we demonstrate that RDL-Nets outperform many state-of-the-art deep learning approaches to speech enhancement. Availability: https://github.com/nick-nikzad/RDL-SE.

Journal Title
Conference Title

Proceedings of the AAAI Conference on Artificial Intelligence

Book Title
Edition
Volume

34

Issue

5

Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2020 AAAI Press. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. Please refer to the conference's website for access to the definitive, published version.

Item Access Status
Note
Access the data
Related item(s)
Subject

Electrical engineering

Electronics, sensors and digital hardware

Persistent link to this record
Citation

Paliwal, K; Nikzad, M; Gao, Y; Zhou, J; Shang, F, Proceedings of the AAAI Conference on Artificial Intelligence, Deep residual-dense lattice network for speech enhancement, 2020