Back to Search
Start Over
Gen-Res-Net: A Novel Generative Model for Singing Voice Separation
- Source :
- MultiMedia Modeling ISBN: 9783030377304, MMM (1)
- Publication Year :
- 2019
- Publisher :
- Springer International Publishing, 2019.
-
Abstract
- In most cases, modeling in the time-frequency domain is the most common method to solve the problem of singing voice separation since frequency characteristics differ between different sources. During the past few years, applying recurrent neural network (RNN) to series of split spectrograms has been mostly adopted by researchers to tackle this problem. Recently, however, the U-net’s success has drawn the focus to treating the spectrogram as a 2-dimensional image with an auto-encoder structure, which indicates that some useful methods in image analysis may help solve this problem. Under this scenario, we propose a novel spectrogram-generative model to separate the two sources in the time-frequency domain inspired by Residual blocks, Squeeze and Excitation blocks and WaveNet. We apply none-reduce-sized Residual blocks together with Squeeze and Excitation blocks in the main stream to extract features of the input spectrogram while gathering the output layers in a skip-connection structure used in WaveNet. Experimental results on two datasets (MUSDB18 and CCMixer) have shown that our proposed network performs better than the current state-of-the-art approach working on spectrograms of mixtures – the deep U-net structure.
- Subjects :
- Structure (mathematical logic)
Series (mathematics)
Computer science
business.industry
020206 networking & telecommunications
Pattern recognition
02 engineering and technology
Image (mathematics)
Domain (software engineering)
030507 speech-language pathology & audiology
03 medical and health sciences
Generative model
Recurrent neural network
0202 electrical engineering, electronic engineering, information engineering
Spectrogram
Artificial intelligence
0305 other medical science
Focus (optics)
business
Subjects
Details
- ISBN :
- 978-3-030-37730-4
- ISBNs :
- 9783030377304
- Database :
- OpenAIRE
- Journal :
- MultiMedia Modeling ISBN: 9783030377304, MMM (1)
- Accession number :
- edsair.doi...........98e5f0f090cce679e3d3c672842c0f90
- Full Text :
- https://doi.org/10.1007/978-3-030-37731-1_3