Author: "H B Barathi Ganesh" - Searchworks@Jio Institute Digital Library Search Results

1. Enhanced Object Detection in Floor Plan Through Super-Resolution

Author: Dev Khare, N. S. Kamal, H. B. Barathi Ganesh, V. Sowmya, and V. V. Sajith Variyar
Published: 2023

2. Ink Recognition Using TDNN and Bi-LSTM

Author: R. Sai Kesav, H. B. Barathi Ganesh, B. Premjith, and K. P. Soman
Published: 2022

3. Geometry-Based Machining Feature Retrieval with Inductive Transfer Learning

Author: N. S. Kamal, H. B. Barathi Ganesh, V. V. Sajith Variyar, V. Sowmya, and K. P. Soman
Published: 2022

4. Geometrical Feature Extraction of CAD Models with Fully Convolutional Networks

Author: Gokul S. Jain, H. B. Barathi Ganesh, N. S. Kamal, V. V. Sajith Variyar, V. Sowmya, and K. P. Soman
Published: 2022

5. Multilingual Speech Recognition for Indian Languages

Author: R. Priyamvada, S. Sachin Kumar, H. B. Barathi Ganesh, and K. P. Soman
Published: 2022

6. MedNLU: Natural Language Understander for Medical Texts

Author: K. P. Soman, Reshma U, H B Barathi Ganesh, and M. Anand Kumar
Subjects: Conditional random field, Word embedding, Computer science, business.industry, Natural language understanding, computer.software_genre, Convolutional neural network, Named-entity recognition, Chunking (psychology), Feature (machine learning), Artificial intelligence, business, computer, Natural language, Natural language processing
Abstract: Natural Language Understanding is one of the essential tasks for building clinical text-based applications. Understanding of these clinical texts can be achieved through Vector Space Models and Sequential Modelling tasks. This paper is focused on sequential modelling i.e. Named Entity Recognition and Part of Speech Tagging by attaining a state of the art performance of 93.8% as F1 score for i2b2 clinical corpus and achieves 97.29% as F1 score for GENIA corpus. This paper also states the performance of feature fusion by integrating word embedding, feature embedding and character embedding for sequential modelling tasks. We also propose a framework based on a sequential modelling architecture, named MedNLU, which has the capability of performing Part of Speech Tagging, Chunking, and Entity Recognition on clinical texts. The sequence modeler in MedNLU is an integrated framework of Convolutional Neural Network, Conditional Random Fields and Bi-directional Long-Short Term Memory network.
Published: 2019

7. Significance of Global Vectors Representation in Protein Sequences Analysis

Author: K. P. Soman, M. Anand Kumar, Anon George, and H B Barathi Ganesh
Subjects: ComputingMethodologies_PATTERNRECOGNITION, Protein family, business.industry, Computer science, Representation (systemics), Artificial intelligence, Semantics, computer.software_genre, business, computer, Natural language processing
Abstract: Understanding the meaning of protein sequences is tedious with human efforts alone. Through this work, we experiment an NLP technique to extract features and give appropriate representation for the protein sequences. In this paper, we have used GloVe representation for the same. A dataset named Swiss-Prot has been incorporated into this work. We were able to create a representation that has comparable ability to understand the semantics of protein sequences compared to the existing ones. We have analyzed the performance of representation by the classification of different protein families in the Swiss-Prot dataset using machine learning technique. The analysis done by us proved the significance of this representation.
Published: 2019

8. Distributed Representation of Healthcare Text Through Qualitative and Quantitative Analysis

Author: H B Barathi Ganesh, J. R. Naveen, M. Anand Kumar, and K. P. Soman
Subjects: Computer science, business.industry, Cosine similarity, computer.software_genre, Part of speech, Distributed representation, Domain (software engineering), Quantitative analysis (finance), Health care, Word representation, Embedding, Artificial intelligence, business, computer, Natural language processing
Abstract: Many healthcare-related applications use pretrained embeddings, but these are often trained over general corpus which is mostly downstreamed to certain particular application. One problem noticed among such embeddings is that these are not efficient across various health text applications and even less number of research describe evaluation of these embedding for health domain. In this paper, distributional embedding model is performed to acquire a word representation on data crawled from Journal of Medical Case Reports. This distributed embedding model is analyzed qualitatively and quantitatively over crawled corpus. Qualitative evaluation is employed by cosine similarity on different categories and is visually represented. Quantitative evaluation performed with parts of speech tagging and entity recognition. The embedding model attained a cross-validation accuracy of 91.70% in parts of speech tagging for GENIA corpus and ensured 83% accuracy in the entity recognition of i2b2 clinical data.
Published: 2019

9. Sentiment Analysis for Code-Mixed Indian Social Media Text With Distributed Representation

Author: K Shalini, M. Anand Kumar, K. P. Soman, and H B Barathi Ganesh
Subjects: Artificial neural network, business.industry, Computer science, Deep learning, InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, Sentiment analysis, 02 engineering and technology, Crawling, computer.software_genre, Code (semiotics), Task (project management), 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Task analysis, 020201 artificial intelligence & image processing, Social media, Artificial intelligence, business, computer, Natural language processing
Abstract: The enormous number of user activity on online social networks results in a considerable amount of data which expresses the opinion from millions of people with diversity in their social aspects. The freedom of language usage shared through social media paves the way for the existence of code-mixed data that turns out to be more complex for mining the information out of it. Considering this, we created Kannada-English code mixed corpus by crawling Facebook comments. As of now, there is no relevant corpus as well as literature available for code-mixed Kannada-English sentiment analysis. In addition to the crawled corpus, we also used sentiment analysis code-mixed corpus provided by Sentiment Analysis for Indian Languages (SAIL)-2017 which includes Bengali-English and Hindi-English languages. This paper also addresses the performance of distributed representation methods in sentiment analysis task. We have reported comparisons among different machine learning and deep learning techniques.
Published: 2018

10. CENNLP at SemEval-2018 Task 1: Constrained Vector Space Model in Affects in Tweets

Author: Anand Kumar M, K P Soman, Naveen J R, and H B Barathi Ganesh
Subjects: Measure (data warehouse), Computer science, business.industry, 05 social sciences, 02 engineering and technology, computer.software_genre, SemEval, Task (project management), Bag-of-words model, 0202 electrical engineering, electronic engineering, information engineering, Vector space model, 020201 artificial intelligence & image processing, Artificial intelligence, 0509 other social sciences, 050904 information & library sciences, business, Representation (mathematics), computer, Word (computer architecture), Natural language processing
Abstract: This paper discusses on task 1, “Affect in Tweets” sharedtask, conducted in SemEval-2018. This task comprises of various subtasks, which required participants to analyse over different emotions and sentiments based on the provided tweet data and also measure the intensity of these emotions for subsequent subtasks. Our approach in these task was to come up with a model on count based representation and use machine learning techniques for regression and classification related tasks. In this work, we use a simple bag of words technique for supervised text classification model as to compare, that even with some advance distributed representation models we can still achieve significant accuracy. Further, fine tuning on various parameters for the bag of word, representation model we acquired better scores over various other baseline models (Vinayan et al.) participated in the sharedtask.
Published: 2018

11. Amrita_student at SemEval-2018 Task 1: Distributed Representation of Social Media Text for Affects in Tweets

Author: H B Barathi Ganesh, M. Anand Kumar, K Shalini, K P Soman, and Nidhin A. Unnithan
Subjects: Computer science, business.industry, 02 engineering and technology, computer.software_genre, SemEval, Random forest, Task (project management), Tree (data structure), 020204 information systems, Linear regression, 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), 020201 artificial intelligence & image processing, Artificial intelligence, business, Representation (mathematics), computer, Natural language processing
Abstract: In this paper we did an analysis of “Affects in Tweets” which was one of the task conducted by semeval 2018. Task was to build a model which is able to do regression and classification of different emotions from the given tweets data set. We developed a base model for all the subtasks using distributed representation (Doc2Vec) and applied machine learning techniques for classification and regression. Distributed representation is an unsupervised algorithm which is capable of learning fixed length feature representation from variable length texts. Machine learning techniques used for regression is ’Linear Regression’ while ’Random Forest Tree’ is used for classification purpose. Empirical results obtained for all the subtasks by our model are shown in this paper.
Published: 2018

12. From Vector Space Models to Vector Space Models of Semantics

Author: K. P. Soman, H B Barathi Ganesh, and M. Anand Kumar
Subjects: business.industry, Computer science, Representation (systemics), 020207 software engineering, 02 engineering and technology, Semantics, computer.software_genre, Task (project management), Data set, Text mining, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, business, computer, Natural language processing, Vector space
Abstract: This paper assesses the performance of frequency and concept based text representation in Mixed Script Information Retrieval and Classification tasks. In text analytics, representation serves as an unresolved research problem to progress further towards different applications. In this paper observations from different text representation methods in text classification and information retrieval are presented. The data set from the Mixed Script Information Retrieval shared task is used in this experiment and the performance of final submitted model is evaluated by task organizers. It is observed that distributional representation performs better than the frequency based text representation methods. The final system attained first place in task 2 and was 3.89% lesser than the top scored system in task 1.
Published: 2018

13. CENNLP at SemEval-2018 Task 2: Enhanced Distributed Representation of Text using Target Classes for Emoji Prediction Representation

Author: K P Soman, H B Barathi Ganesh, Hariharan, Naveen J R, and Anand Kumar M
Subjects: Interpretation (logic), Computer science, Emoji, business.industry, 02 engineering and technology, 010501 environmental sciences, computer.software_genre, 01 natural sciences, SemEval, Task (project management), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Social media, Artificial intelligence, Representation (mathematics), business, computer, Natural language processing, 0105 earth and related environmental sciences, Meaning (linguistics)
Abstract: Emoji is one of the “fastest growing language ” in pop-culture, especially in social media and it is very unlikely for its usage to decrease. These are generally used to bring an extra level of meaning to the texts, posted on social media platforms. Providing such an added info, gives more insights to the plain text, arising to hidden interpretation within the text. This paper explains our analysis on Task 2, ” Multilingual Emoji Prediction” sharedtask conducted by Semeval-2018. In the task, a predicted emoji based on a piece of Twitter text are labelled under 20 different classes (most commonly used emojis) where these classes are learnt and further predicted are made for unseen Twitter text. In this work, we have experimented and analysed emojis predicted based on Twitter text, as a classification problem where the entailing emoji is considered as a label for every individual text data. We have implemented this using distributed representation of text through fastText. Also, we have made an effort to demonstrate how fastText framework can be useful in case of emoji prediction. This task is divide into two subtask, they are based on dataset presented in two different languages English and Spanish.
Published: 2018

14. TeamCEN at SemEval-2018 Task 1: Global Vectors Representation in Emotion Detection

Author: K P Soman, Anand Kumar M, H B Barathi Ganesh, and Anon George
Subjects: Computer science, business.industry, media_common.quotation_subject, 02 engineering and technology, Representation (arts), 010501 environmental sciences, Anger, computer.software_genre, 01 natural sciences, SemEval, Task (project management), Sadness, Identification (information), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Social media, Artificial intelligence, business, computer, Natural language processing, 0105 earth and related environmental sciences, media_common
Abstract: Emotions are a way of expressing human sentiments. In the modern era, social media is a platform where we convey our emotions. These emotions can be joy, anger, sadness and fear. Understanding the emotions from the written sentences is an interesting part in knowing about the writer. In the amount of digital language shared through social media, a considerable amount of data reflects the sentiment or emotion towards some product, person and organization. Since these texts are from users with diverse social aspects, these texts can be used to enrich the application related to the business intelligence. More than the sentiment, identification of intensity of the sentiment will enrich the performance of the end application. In this paper we experimented the intensity prediction as a text classification problem that evaluates the distributed representation text using aggregated sum and dimensionality reduction of the glove vectors of the words present in the respective texts .
Published: 2018

15. Wavelet Based RTL-SDR Real Time Signal Denoising in GNU Radio

Author: Reshma U, H B Barathi Ganesh, J. Jyothi, K. P. Soman, and R Gandhiraj
Subjects: Universal Software Radio Peripheral, Computer science, Noise reduction, Real-time computing, Wavelet transform, Time signal, 030507 speech-language pathology & audiology, 03 medical and health sciences, Noise, Wavelet, Electronic engineering, Radio frequency, 0305 other medical science, Continuous wavelet transform
Abstract: Noise removal is considered to be an efficacious step in processing any kind of data. Here the proposed model deals with removal of noise from aperiodic and piecewise constant signals by utilizing wavelet transform, which is being realized in GNU Radio platform. We have also dealt with the replacement of Universal Software Radio Peripheral with RTL-SDR for a low cost Radio Frequency Receiver system without any compromise in its efficiency. Wavelet analyzes noise level separately at each wavelet scale in time-scale domain and adapts the denoising algorithm especially for aperiodic and piecewise constant signals. GNU Radio companion serves well in analysis and synthesis of real time signals.
Published: 2015

16. Amrita-CEN at SemEval-2016 task 1: Semantic relation from word embeddings in higher dimension

Author: M. Anand Kumar, H B Barathi Ganesh, and K. P. Soman
Subjects: business.industry, Computer science, Context (language use), 02 engineering and technology, computer.software_genre, SemEval, Support vector machine, Semantic similarity, Similarity (network science), Dimension (vector space), 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Embedding, 020201 artificial intelligence & image processing, Word2vec, Artificial intelligence, business, computer, Natural language processing, Sentence
Abstract: Semantic Textual Similarity measures similarity between pair of texts, even though the similar context is projected using different words. This work attempted to incorporate the context space of the sentence from that sentence alone. It proposes combination of Word2Vec and Non-Negative Matrix Factorization to represent the sentence as context embedding vector in context space. Distance and correlation values between context embedding vector pairs used as a features for Support Vector Regression to built the domain independent similarity measuring model. The proposed model yielding performance 0.41 in terms of correlation.

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

16 results on '"H B Barathi Ganesh"'

1. Enhanced Object Detection in Floor Plan Through Super-Resolution

2. Ink Recognition Using TDNN and Bi-LSTM

3. Geometry-Based Machining Feature Retrieval with Inductive Transfer Learning

4. Geometrical Feature Extraction of CAD Models with Fully Convolutional Networks

5. Multilingual Speech Recognition for Indian Languages

6. MedNLU: Natural Language Understander for Medical Texts

7. Significance of Global Vectors Representation in Protein Sequences Analysis

8. Distributed Representation of Healthcare Text Through Qualitative and Quantitative Analysis

9. Sentiment Analysis for Code-Mixed Indian Social Media Text With Distributed Representation

10. CENNLP at SemEval-2018 Task 1: Constrained Vector Space Model in Affects in Tweets

11. Amrita_student at SemEval-2018 Task 1: Distributed Representation of Social Media Text for Affects in Tweets

12. From Vector Space Models to Vector Space Models of Semantics

13. CENNLP at SemEval-2018 Task 2: Enhanced Distributed Representation of Text using Target Classes for Emoji Prediction Representation

14. TeamCEN at SemEval-2018 Task 1: Global Vectors Representation in Emotion Detection

15. Wavelet Based RTL-SDR Real Time Signal Denoising in GNU Radio

16. Amrita-CEN at SemEval-2016 task 1: Semantic relation from word embeddings in higher dimension

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

16 results on '"H B Barathi Ganesh"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources