19 results on '"Rajesh, M."'
Search Results
2. Learning Speaker-specific Lip-to-Speech Generation
- Author
-
Varshney, Munender, Yadav, Ravindra, Namboodiri, Vinay P., and Hegde, Rajesh M
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Sound ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Understanding the lip movement and inferring the speech from it is notoriously difficult for the common person. The task of accurate lip-reading gets help from various cues of the speaker and its contextual or environmental setting. Every speaker has a different accent and speaking style, which can be inferred from their visual and speech features. This work aims to understand the correlation/mapping between speech and the sequence of lip movement of individual speakers in an unconstrained and large vocabulary. We model the frame sequence as a prior to the transformer in an auto-encoder setting and learned a joint embedding that exploits temporal properties of both audio and video. We learn temporal synchronization using deep metric learning, which guides the decoder to generate speech in sync with input lip movements. The predictive posterior thus gives us the generated speech in speaker speaking style. We have trained our model on the Grid and Lip2Wav Chemistry lecture dataset to evaluate single speaker natural speech generation tasks from lip movement in an unconstrained natural setting. Extensive evaluation using various qualitative and quantitative metrics with human evaluation also shows that our method outperforms the Lip2Wav Chemistry dataset(large vocabulary in an unconstrained setting) by a good margin across almost all evaluation metrics and marginally outperforms the state-of-the-art on GRID dataset., Comment: Accepted at ICPR 2022
- Published
- 2022
3. Exact Wirelength of Embedding 3-Ary n-Cubes into certain Cylinders and Trees
- Author
-
S, Rajeshwari and Rajesh, M
- Subjects
Computer Science - Discrete Mathematics ,68R01, 68R10 - Abstract
Graph embeddings play a significant role in the design and analysis of parallel algorithms. It is a mapping of the topological structure of a guest graph G into a host graph H, which is represented as a one-to-one mapping from the vertex set of the guest graph to the vertex set of the host graph. In multiprocessing systems the interconnection networks enhance the efficient communication between the components in the system. Obtaining minimum wirelength in embedding problems is significant in the designing of network and simulating one architecture by another. In this paper, we determine the wirelength of embedding 3-ary n-cubes into cylinders and certain trees., Comment: 13 pages, 5 figures
- Published
- 2022
4. Stochastic Talking Face Generation Using Latent Distribution Matching
- Author
-
Yadav, Ravindra, Sardana, Ashish, Namboodiri, Vinay P, and Hegde, Rajesh M
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The ability to envisage the visual of a talking face based just on hearing a voice is a unique human capability. There have been a number of works that have solved for this ability recently. We differ from these approaches by enabling a variety of talking face generations based on single audio input. Indeed, just having the ability to generate a single talking face would make a system almost robotic in nature. In contrast, our unsupervised stochastic audio-to-video generation model allows for diverse generations from a single audio input. Particularly, we present an unsupervised stochastic audio-to-video generation model that can capture multiple modes of the video distribution. We ensure that all the diverse generations are plausible. We do so through a principled multi-modal variational autoencoder framework. We demonstrate its efficacy on the challenging LRW and GRID datasets and demonstrate performance better than the baseline, while having the ability to generate multiple diverse lip synchronized videos., Comment: InterSpeech 2020
- Published
- 2020
5. Speech Prediction in Silent Videos using Variational Autoencoders
- Author
-
Yadav, Ravindra, Sardana, Ashish, Namboodiri, Vinay P, and Hegde, Rajesh M
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Understanding the relationship between the auditory and visual signals is crucial for many different applications ranging from computer-generated imagery (CGI) and video editing automation to assisting people with hearing or visual impairments. However, this is challenging since the distribution of both audio and visual modality is inherently multimodal. Therefore, most of the existing methods ignore the multimodal aspect and assume that there only exists a deterministic one-to-one mapping between the two modalities. It can lead to low-quality predictions as the model collapses to optimizing the average behavior rather than learning the full data distributions. In this paper, we present a stochastic model for generating speech in a silent video. The proposed model combines recurrent neural networks and variational deep generative models to learn the auditory signal's conditional distribution given the visual signal. We demonstrate the performance of our model on the GRID dataset based on standard benchmarks.
- Published
- 2020
6. Low frequency radio observations of the `quiet' corona during the descending phase of sunspot cycle 24
- Author
-
Ramesh, R., Kumari, A., Kathiravan, C., Ketaki, D., Rajesh, M., and Vrunda, M.
- Subjects
Astrophysics - Solar and Stellar Astrophysics - Abstract
We carried out a statistical study of the `quiet' solar corona during the descending phase of the sunspot cycle 24 (i.e. 2015 January - 2019 May) using data obtained with the Gauribidanur RAdioheliograPH (GRAPH) at 53 MHz and 80 MHz simultaneously. Our results show that the equatorial (east-west) diameters of the solar corona at the above two frequencies shrunk steadily. The decrease was found to be due to a gradual reduction in the coronal electron density ($N_{e}$). Independent estimates of $N_{e}$ in the equatorial region of the `background' corona using white-light coronagraph observations indicate a decline consistent with our findings., Comment: Accepted for publication in Geophysical Research Letters
- Published
- 2020
- Full Text
- View/download PDF
7. A Generalized Framework for Autonomous Calibration of Wheeled Mobile Robots
- Author
-
Nutalapati, Mohan Krishna, Arora, Lavish, Bose, Anway, Rajawat, Ketan, and Hegde, Rajesh M
- Subjects
Computer Science - Robotics - Abstract
Robotic calibration allows for the fusion of data from multiple sensors such as odometers, cameras, etc., by providing appropriate transformational relationships between the corresponding reference frames. For wheeled robots equipped with exteroceptive sensors, calibration entails learning the motion model of the sensor or the robot in terms of the odometric data, and must generally be performed prior to performing tasks such as simultaneous localization and mapping (SLAM). Within this context, the current trend is to carry out simultaneous calibration of odometry and sensor without the use of any additional hardware. Building upon the existing simultaneous calibration algorithms, we put forth a generalized calibration framework that can not only handle robots operating in 2D with arbitrary or unknown motion models but also handle outliers in an automated manner. We first propose an algorithm based on the alternating minimization framework applicable to two-wheel differential drive. Subsequently, for arbitrary but known drive configurations we put forth an iteratively re-weighted least squares methodology leveraging an intelligent weighing scheme. Different from the existing works, these proposed algorithms require no manual intervention and seamlessly handle outliers that arise due to both systematic and non-systematic errors. Finally, we put forward a novel Gaussian Process-based non-parametric approach for calibrating wheeled robots with arbitrary or unknown drive configurations. Detailed experiments are performed to demonstrate the accuracy, usefulness, and flexibility of the proposed algorithms., Comment: This manuscript has been submitted to 'Elsevier Journal of Robotics and Autonomous Systems' and is under review for possible publication. Based on IROS 2019 conference submission [arXiv:1910.11917]
- Published
- 2020
8. Model Free Calibration of Wheeled Robots Using Gaussian Process
- Author
-
Nutalapati, Mohan Krishna, Arora, Lavish, Bose, Anway, Rajawat, Ketan, and Hegde, Rajesh M
- Subjects
Computer Science - Robotics - Abstract
Robotic calibration allows for the fusion of data from multiple sensors such as odometers, cameras, etc., by providing appropriate relationships between the corresponding reference frames. For wheeled robots equipped with camera/lidar along with wheel encoders, calibration entails learning the motion model of the sensor or the robot in terms of the data from the encoders and generally carried out before performing tasks such as simultaneous localization and mapping (SLAM). This work puts forward a novel Gaussian Process-based non-parametric approach for calibrating wheeled robots with arbitrary or unknown drive configurations. The procedure is more general as it learns the entire sensor/robot motion model in terms of odometry measurements. Different from existing non-parametric approaches, our method relies on measurements from the onboard sensors and hence does not require the ground truth information from external motion capture systems. Alternatively, we propose a computationally efficient approach that relies on the linear approximation of the sensor motion model. Finally, we perform experiments to calibrate robots with un-modelled effects to demonstrate the accuracy, usefulness, and flexibility of the proposed approach., Comment: To be published in International Conference on Intelligent Robots and Systems (IROS), 2019
- Published
- 2019
9. Minimum-Phase HRTF Modeling of Pinna Spectral Notches using Group Delay Decomposition
- Author
-
C, Sandeep Reddy and Hegde, Rajesh M
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Sound - Abstract
Accurate reconstruction of HRTFs is important in the development of high quality binaural sound synthesis systems. Conventionally, minimum phase HRTF model development for reconstruction of HRTFs has been limited to minimum phase-pure delay models which ignore the all pass component of the HRTF. In this paper, a novel method for minimum phase HRTF modelling of Pinna Spectral Notches (PSNs) using group delay decomposition is proposed. The proposed model captures the PSNs contributed by both the minimum phase and all pass component of HRTF thus facilitating an accurate reconstruction of HRTFs. The purely minimum phase HRTF components and their corresponding spatial angles are first identified using Fourier Bessel Series method that ensures a continuous evolution of the PSNs. The minimum phase-pure delay model is used to reconstruct HRTF for these spatial angles. Subsequently, the spatial angles which require both the minimum phase and all pass components are modelled using an all-pass filter cascaded with minimum-phase pure-delay model. Performance of the proposed model is evaluated by conducting experiments on PSN extraction, cross coherence analysis, and binaural synthesis. Both objective and subjective evaluation results are used to indicate the significance of the proposed model in binaural sound synthesis., Comment: 11 pages; This paper is a preprint of a paper submitted to IET Signal Processing Journal. If accepted, the copy of record will be available at the IET Digital Library
- Published
- 2017
10. On the Conditioning of the Spherical Harmonic Matrix for Spatial Audio Applications
- Author
-
Reddy, C Sandeep and Hegde, Rajesh M
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
In this paper, we attempt to study the conditioning of the Spherical Harmonic Matrix (SHM), which is widely used in the discrete, limited order orthogonal representation of sound fields. SHM's has been widely used in the audio applications like spatial sound reproduction using loudspeakers, orthogonal representation of Head Related Transfer Functions (HRTFs) etc. The conditioning behaviour of the SHM depends on the sampling positions chosen in the 3D space. Identification of the optimal sampling points in the continuous 3D space that results in a well-conditioned SHM for any number of sampling points is a highly challenging task. In this work, an attempt has been made to solve a discrete version of the above problem using optimization based techniques. The discrete problem is, to identify the optimal sampling points from a discrete set of densely sampled positions of the 3D space, that minimizes the condition number of SHM. This method has been subsequently utilized for identifying the geometry of loudspeakers in the spatial sound reproduction, and in the selection of spatial sampling configurations for HRTF measurement. The application specific requirements have been formulated as additional constraints of the optimization problem. Recently developed mixed-integer optimization solvers have been used in solving the formulated problem. The performance of the obtained sampling position in each application is compared with the existing configurations. Objective measures like condition number, D-measure, and spectral distortion are used to study the performance of the sampling configurations resulting from the proposed and the existing methods. It is observed that the proposed solution is able to find the sampling points that results in a better conditioned SHM and also maintains all the application specific requirements., Comment: 12 pages; This paper is a preprint of a paper submitted to IET Signal Processing Journal. If accepted, the copy of record will be available at the IET Digital Library
- Published
- 2017
11. A Review of Localization and Tracking Algorithms in Wireless Sensor Networks
- Author
-
Kumar, Sudhir and Hegde, Rajesh M.
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
In this paper, a comprehensive survey of the pioneer as well as the state of-the-art localization and tracking methods in the wireless sensor networks is presented. Localization is mostly applicable for the static sensor nodes, whereas, tracking for the mobile sensor nodes. The localization algorithms are broadly classified as range-based and range-free methods. The estimated range (distance) between an anchor and an unknown node is highly erroneous in an indoor scenario. This limitation can be handled up to a large extent by employing a large number of existing access points (APs) in the range free localization method. Recent works emphasize on the use multi-sensor data like magnetic, inertial, compass, gyroscope, ultrasound, infrared, visual and/or odometer to improve the localization accuracy further. Additionally, tracking method does the future prediction of location based on the past location history. A smooth trajectory is noted even if some of the received measurements are erroneous. Real experimental set-ups such as National Instruments (NI) wireless sensor nodes, Crossbow motes and hand-held devices for carrying out the localization and tracking are also highlighted herein.
- Published
- 2017
12. A Bayesian Approach to Estimation of Speaker Normalization Parameters
- Author
-
Ram, Dhananjay, Kundu, Debasis, and Hegde, Rajesh M.
- Subjects
Computer Science - Sound ,Computer Science - Computation and Language ,Statistics - Applications - Abstract
In this work, a Bayesian approach to speaker normalization is proposed to compensate for the degradation in performance of a speaker independent speech recognition system. The speaker normalization method proposed herein uses the technique of vocal tract length normalization (VTLN). The VTLN parameters are estimated using a novel Bayesian approach which utilizes the Gibbs sampler, a special type of Markov Chain Monte Carlo method. Additionally the hyperparameters are estimated using maximum likelihood approach. This model is used assuming that human vocal tract can be modeled as a tube of uniform cross section. It captures the variation in length of the vocal tract of different speakers more effectively, than the linear model used in literature. The work has also investigated different methods like minimization of Mean Square Error (MSE) and Mean Absolute Error (MAE) for the estimation of VTLN parameters. Both single pass and two pass approaches are then used to build a VTLN based speech recognizer. Experimental results on recognition of vowels and Hindi phrases from a medium vocabulary indicate that the Bayesian method improves the performance by a considerable margin., Comment: 23 Pages, 9 Figures
- Published
- 2016
13. OSPF Bidirectional Forwarding Detection (BFD) Strict-Mode
- Author
-
Psenak, P., additional, Fu, A., additional, and Rajesh, M., additional
- Published
- 2023
- Full Text
- View/download PDF
14. Second Order Cone Programming for Sensor Node Localization in Mixed LOS/NLOS Conditions
- Author
-
Kumar, Sudhir, Dixit, Rishabh, and Hegde, Rajesh M.
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
In this paper, a novel method for sensor node localization under mixed line-of-sight/non-line-of-sight (LOS/NLOS) conditions based on second order cone programming (SOCP) is presented. SOCP methods have, hitherto, not been utilized in the node localization under mixed LOS/NLOS conditions. Unlike semidefinite programming (SDP) formulation, SOCP is computationally efficient for resource constrained ad-hoc sensor network. The proposed method can work seamlessly in mixed LOS/NLOS conditions. The robustness of the method is due to the fair utilization of all measurements obtained under LOS and NLOS conditions. The computational complexity of this method is quadratic in the number of nearest neighbours of the unknown node. Extensive simulations and real field deployments are used to evaluate the performance of the proposed method. The experimental results of the proposed method is reasonably better when compared to similar methods in literature.
- Published
- 2015
15. A Complex Matrix Factorization approach to Joint Modeling of Magnitude and Phase for Source Separation
- Author
-
Ahuja, Chaitanya, Nathwani, Karan, and Hegde, Rajesh M.
- Subjects
Computer Science - Sound - Abstract
Conventional NMF methods for source separation factorize the matrix of spectral magnitudes. Spectral Phase is not included in the decomposition process of these methods. However, phase of the speech mixture is generally used in reconstructing the target speech signal. This results in undesired traces of interfering sources in the target signal. In this paper the spectral phase is incorporated in the decomposition process itself. Additionally, the complex matrix factorization problem is reduced to an NMF problem using simple transformations. This results in effective separation of speech mixtures since both magnitude and phase are utilized jointly in the separation process. Improvement in source separation results are demonstrated using objective quality evaluations on the GRID corpus., Comment: 5 pages, 3 figures
- Published
- 2014
16. A Suffix Tree Approach to Email Filtering
- Author
-
Pampapathi, Rajesh M., Mirkin, Boris, and Levene, Mark
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
We present an approach to email filtering based on the suffix tree data structure. A method for the scoring of emails using the suffix tree is developed and a number of scoring and score normalisation functions are tested. Our results show that the character level representation of emails and classes facilitated by the suffix tree can significantly improve classification accuracy when compared with the currently popular methods, such as naive Bayes. We believe the method can be extended to the classification of documents in other domains., Comment: Revisions made in the light of reviewer comments. Main changes: (i) The extension and elaboration of section 4.4 which describes the scoring algorithm; (ii) Favouring the use of false positive and false negative performance measures over the use of precision and recall; (iii) The addition of ROC curves wherever possible; and (iv) Inclusion of performance statistics for algorithm. Re-submitted 5th August 2005
- Published
- 2005
17. CHAPTER 7: BOMBS IN SEARCH OF A MISSION: INDIA.S UNCERTAIN NUCLEAR FUTURE.
- Author
-
Basrur, Rajesh M. and Cohen, Stephen Philip
- Abstract
Chapter 7 of the book "South Asia in 2020: Future Strategic Balances & Alliances" is presented. It explores India's relations with other countries in view of its nuclearization and possession of nuclear weapons. It highlights the India-Pakistan relation considering their nuclear program and contributions to the peace in South Asia. It also discusses the need for global security against terrorism and international nuclear non-proliferation regime.
- Published
- 2002
18. Part III: NUCLEAR WEAPONS AND REGIONAL SECURITY.
- Author
-
Basrur, Rajesh M. and Cohen, Stephen Philip
- Abstract
The article discusses increasing threat to regional security in South Asia in view of nuclearization of Pakistan and India. It discusses Nuclear future of India and Pakistan and its impact on their foreign policy. The consequences of nuclear proliferation in South Asia with special reference to regional security are highlighted. The diplomatic role of the United States in the region is also considered.
- Published
- 2002
19. CELLO - CLiRpath® Excimer Laser System to Enlarge Lumen Openings (CELLO)
- Author
-
Rajesh M. Dave, M.D.
- Published
- 2011
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.