Author: "Andhavarapu, Sivaganesh" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Andhavarapu, Sivaganesh"' showing total 6 results

Start Over Author "Andhavarapu, Sivaganesh" Publication Year Range Last 10 years

6 results on '"Andhavarapu, Sivaganesh"'

1. Phase Aware Speech Enhancement using Realisation of Complex-valued LSTM

Author: Goswami, Raktim Gautam, Andhavarapu, Sivaganesh, and Murty, K Sri Rama
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Signal Processing
Abstract: Most of the deep learning based speech enhancement (SE) methods rely on estimating the magnitude spectrum of the clean speech signal from the observed noisy speech signal, either by magnitude spectral masking or regression. These methods reuse the noisy phase while synthesizing the time-domain waveform from the estimated magnitude spectrum. However, there have been recent works highlighting the importance of phase in SE. There was an attempt to estimate the complex ratio mask taking phase into account using complex-valued feed-forward neural network (FFNN). But FFNNs cannot capture the sequential information essential for phase estimation. In this work, we propose a realisation of complex-valued long short-term memory (RCLSTM) network to estimate the complex ratio mask (CRM) using sequential information along time. The proposed RCLSTM is designed to process the complex-valued sequences using complex arithmetic, and hence it preserves the dependencies between the real and imaginary parts of CRM and thereby the phase. The proposed method is evaluated on the noisy speech mixtures formed from the Voice-Bank corpus and DEMAND database. When compared to real value based masking methods, the proposed RCLSTM improves over them in several objective measures including perceptual evaluation of speech quality (PESQ), in which it improves by over 4.3%
Published: 2020

2. Neural Comb Filtering Using Sliding Window Attention Network for Speech Enhancement

Author: Parvathala, Venkatesh, Andhavarapu, Sivaganesh, Pamisetty, Giridhar, and Murty, K. Sri Rama
Published: 2023
Full Text: View/download PDF

3. Neural Comb Filtering Using Sliding Window Attention Network for Speech Enhancement

Author: Parvathala, Venkatesh, primary, Andhavarapu, Sivaganesh, additional, Pamisetty, Giridhar, additional, and Murty, K. Sri Rama, additional
Published: 2022
Full Text: View/download PDF

4. Neural Comb Filtering Using Sliding Window Attention Network for Speech Enhancement

Author: Parvathala, Venkatesh, Andhavarapu, Sivaganesh, Pamisetty, Giridhar, Kodukula, Sri Rama Murty, Parvathala, Venkatesh, Andhavarapu, Sivaganesh, Pamisetty, Giridhar, and Kodukula, Sri Rama Murty
Abstract: In this paper, we demonstrate the significance of restoring harmonics of the fundamental frequency (pitch) in the deep neural network (DNN)-based speech enhancement. The parameters of the DNN can be estimated by minimizing the mask loss, but it does not restore the pitch harmonics, especially at higher frequencies. In this paper, we propose to restore the pitch harmonics in the spectral domain by minimizing cepstral loss around the pitch peak. Restoring the cepstral pitch peak, in turn, helps in restoring the pitch harmonics in the enhanced spectrum. The proposed cepstral pitch-peak loss acts as an adaptive comb filter on voiced segments and emphasizes the pitch harmonics in the speech spectrum. The network parameters are estimated using a combination of mask loss and cepstral pitch-peak loss. We show that this combination offers the complementary advantages of enhancing both the voiced and unvoiced regions. The DNN-based methods primarily rely on the network architecture, and hence, the prediction accuracy improves with the increasing complexity of the architecture. The lower complex models are essential for real-time processing systems. In this work, we propose a compact model using a sliding-window attention network (SWAN). The SWAN is trained to regress the spectral magnitude mask (SMM) from the noisy speech signal. Our experimental results demonstrate that the proposed approach achieves comparable performance with the state-of-the-art noncausal and causal speech enhancement methods with much lesser computational complexity. Our three-layered noncausal SWAN achieves 2.99 PESQ on the Valentini database with only 10 9 floating-point operations (FLOPs). © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
Published: 2022

5. Significance of Phase in DNN based speech enhancement algorithms

Author: Rani, P. Swetha, Andhavarapu, Sivaganesh, Kodukula, Sri Rama Murty, Rani, P. Swetha, Andhavarapu, Sivaganesh, and Kodukula, Sri Rama Murty
Abstract: Most of the speech enhancement algorithms rely on estimating the magnitude spectrum of the clean speech signal from that of the noisy speech signal using either spectral regression or spectral masking. Because of difficulty in processing the phase of the short time Fourier transform (STFT), noisy phase is reused while synthesizing the waveform from the enhanced magnitude spectrum. In order to demonstrate the significance of phase in speech enhancement, we compare the phase obtained from different reconstruction methods, like Griffin and Lim, minimum phase, with that of the gold phase (clean phase). In this work, spectral magnitude mask (SMM) is estimated using deep neural networks to enhance the magnitude spectrum of the speech signal. The experimental results showed that gold phase outperforms the phase reconstruction methods in all the objective measures, illustrating the significance of enhancing the noisy phase in speech enhancement.
Published: 2020

6. Significance of Phase in DNN based speech enhancement algorithms

Author: Rani, P. Swetha, primary, Andhavarapu, Sivaganesh, additional, and Murty Kodukula, Sri Rama, additional
Published: 2020
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"Andhavarapu, Sivaganesh"'

1. Phase Aware Speech Enhancement using Realisation of Complex-valued LSTM

2. Neural Comb Filtering Using Sliding Window Attention Network for Speech Enhancement

3. Neural Comb Filtering Using Sliding Window Attention Network for Speech Enhancement

4. Neural Comb Filtering Using Sliding Window Attention Network for Speech Enhancement

5. Significance of Phase in DNN based speech enhancement algorithms

6. Significance of Phase in DNN based speech enhancement algorithms

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

6 results on '"Andhavarapu, Sivaganesh"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources