16 results on '"H B Barathi Ganesh"'
Search Results
2. Ink Recognition Using TDNN and Bi-LSTM
- Author
-
R. Sai Kesav, H. B. Barathi Ganesh, B. Premjith, and K. P. Soman
- Published
- 2022
3. Geometry-Based Machining Feature Retrieval with Inductive Transfer Learning
- Author
-
N. S. Kamal, H. B. Barathi Ganesh, V. V. Sajith Variyar, V. Sowmya, and K. P. Soman
- Published
- 2022
4. Geometrical Feature Extraction of CAD Models with Fully Convolutional Networks
- Author
-
Gokul S. Jain, H. B. Barathi Ganesh, N. S. Kamal, V. V. Sajith Variyar, V. Sowmya, and K. P. Soman
- Published
- 2022
5. Multilingual Speech Recognition for Indian Languages
- Author
-
R. Priyamvada, S. Sachin Kumar, H. B. Barathi Ganesh, and K. P. Soman
- Published
- 2022
6. MedNLU: Natural Language Understander for Medical Texts
- Author
-
K. P. Soman, Reshma U, H B Barathi Ganesh, and M. Anand Kumar
- Subjects
Conditional random field ,Word embedding ,Computer science ,business.industry ,Natural language understanding ,computer.software_genre ,Convolutional neural network ,Named-entity recognition ,Chunking (psychology) ,Feature (machine learning) ,Artificial intelligence ,business ,computer ,Natural language ,Natural language processing - Abstract
Natural Language Understanding is one of the essential tasks for building clinical text-based applications. Understanding of these clinical texts can be achieved through Vector Space Models and Sequential Modelling tasks. This paper is focused on sequential modelling i.e. Named Entity Recognition and Part of Speech Tagging by attaining a state of the art performance of 93.8% as F1 score for i2b2 clinical corpus and achieves 97.29% as F1 score for GENIA corpus. This paper also states the performance of feature fusion by integrating word embedding, feature embedding and character embedding for sequential modelling tasks. We also propose a framework based on a sequential modelling architecture, named MedNLU, which has the capability of performing Part of Speech Tagging, Chunking, and Entity Recognition on clinical texts. The sequence modeler in MedNLU is an integrated framework of Convolutional Neural Network, Conditional Random Fields and Bi-directional Long-Short Term Memory network.
- Published
- 2019
7. Significance of Global Vectors Representation in Protein Sequences Analysis
- Author
-
K. P. Soman, M. Anand Kumar, Anon George, and H B Barathi Ganesh
- Subjects
ComputingMethodologies_PATTERNRECOGNITION ,Protein family ,business.industry ,Computer science ,Representation (systemics) ,Artificial intelligence ,Semantics ,computer.software_genre ,business ,computer ,Natural language processing - Abstract
Understanding the meaning of protein sequences is tedious with human efforts alone. Through this work, we experiment an NLP technique to extract features and give appropriate representation for the protein sequences. In this paper, we have used GloVe representation for the same. A dataset named Swiss-Prot has been incorporated into this work. We were able to create a representation that has comparable ability to understand the semantics of protein sequences compared to the existing ones. We have analyzed the performance of representation by the classification of different protein families in the Swiss-Prot dataset using machine learning technique. The analysis done by us proved the significance of this representation.
- Published
- 2019
8. Distributed Representation of Healthcare Text Through Qualitative and Quantitative Analysis
- Author
-
H B Barathi Ganesh, J. R. Naveen, M. Anand Kumar, and K. P. Soman
- Subjects
Computer science ,business.industry ,Cosine similarity ,computer.software_genre ,Part of speech ,Distributed representation ,Domain (software engineering) ,Quantitative analysis (finance) ,Health care ,Word representation ,Embedding ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
Many healthcare-related applications use pretrained embeddings, but these are often trained over general corpus which is mostly downstreamed to certain particular application. One problem noticed among such embeddings is that these are not efficient across various health text applications and even less number of research describe evaluation of these embedding for health domain. In this paper, distributional embedding model is performed to acquire a word representation on data crawled from Journal of Medical Case Reports. This distributed embedding model is analyzed qualitatively and quantitatively over crawled corpus. Qualitative evaluation is employed by cosine similarity on different categories and is visually represented. Quantitative evaluation performed with parts of speech tagging and entity recognition. The embedding model attained a cross-validation accuracy of 91.70% in parts of speech tagging for GENIA corpus and ensured 83% accuracy in the entity recognition of i2b2 clinical data.
- Published
- 2019
9. Sentiment Analysis for Code-Mixed Indian Social Media Text With Distributed Representation
- Author
-
K Shalini, M. Anand Kumar, K. P. Soman, and H B Barathi Ganesh
- Subjects
Artificial neural network ,business.industry ,Computer science ,Deep learning ,InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL ,Sentiment analysis ,02 engineering and technology ,Crawling ,computer.software_genre ,Code (semiotics) ,Task (project management) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,020201 artificial intelligence & image processing ,Social media ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
The enormous number of user activity on online social networks results in a considerable amount of data which expresses the opinion from millions of people with diversity in their social aspects. The freedom of language usage shared through social media paves the way for the existence of code-mixed data that turns out to be more complex for mining the information out of it. Considering this, we created Kannada-English code mixed corpus by crawling Facebook comments. As of now, there is no relevant corpus as well as literature available for code-mixed Kannada-English sentiment analysis. In addition to the crawled corpus, we also used sentiment analysis code-mixed corpus provided by Sentiment Analysis for Indian Languages (SAIL)-2017 which includes Bengali-English and Hindi-English languages. This paper also addresses the performance of distributed representation methods in sentiment analysis task. We have reported comparisons among different machine learning and deep learning techniques.
- Published
- 2018
10. CENNLP at SemEval-2018 Task 1: Constrained Vector Space Model in Affects in Tweets
- Author
-
Anand Kumar M, K P Soman, Naveen J R, and H B Barathi Ganesh
- Subjects
Measure (data warehouse) ,Computer science ,business.industry ,05 social sciences ,02 engineering and technology ,computer.software_genre ,SemEval ,Task (project management) ,Bag-of-words model ,0202 electrical engineering, electronic engineering, information engineering ,Vector space model ,020201 artificial intelligence & image processing ,Artificial intelligence ,0509 other social sciences ,050904 information & library sciences ,business ,Representation (mathematics) ,computer ,Word (computer architecture) ,Natural language processing - Abstract
This paper discusses on task 1, “Affect in Tweets” sharedtask, conducted in SemEval-2018. This task comprises of various subtasks, which required participants to analyse over different emotions and sentiments based on the provided tweet data and also measure the intensity of these emotions for subsequent subtasks. Our approach in these task was to come up with a model on count based representation and use machine learning techniques for regression and classification related tasks. In this work, we use a simple bag of words technique for supervised text classification model as to compare, that even with some advance distributed representation models we can still achieve significant accuracy. Further, fine tuning on various parameters for the bag of word, representation model we acquired better scores over various other baseline models (Vinayan et al.) participated in the sharedtask.
- Published
- 2018
11. Amrita_student at SemEval-2018 Task 1: Distributed Representation of Social Media Text for Affects in Tweets
- Author
-
H B Barathi Ganesh, M. Anand Kumar, K Shalini, K P Soman, and Nidhin A. Unnithan
- Subjects
Computer science ,business.industry ,02 engineering and technology ,computer.software_genre ,SemEval ,Random forest ,Task (project management) ,Tree (data structure) ,020204 information systems ,Linear regression ,0202 electrical engineering, electronic engineering, information engineering ,Feature (machine learning) ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Representation (mathematics) ,computer ,Natural language processing - Abstract
In this paper we did an analysis of “Affects in Tweets” which was one of the task conducted by semeval 2018. Task was to build a model which is able to do regression and classification of different emotions from the given tweets data set. We developed a base model for all the subtasks using distributed representation (Doc2Vec) and applied machine learning techniques for classification and regression. Distributed representation is an unsupervised algorithm which is capable of learning fixed length feature representation from variable length texts. Machine learning techniques used for regression is ’Linear Regression’ while ’Random Forest Tree’ is used for classification purpose. Empirical results obtained for all the subtasks by our model are shown in this paper.
- Published
- 2018
12. From Vector Space Models to Vector Space Models of Semantics
- Author
-
K. P. Soman, H B Barathi Ganesh, and M. Anand Kumar
- Subjects
business.industry ,Computer science ,Representation (systemics) ,020207 software engineering ,02 engineering and technology ,Semantics ,computer.software_genre ,Task (project management) ,Data set ,Text mining ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Artificial intelligence ,business ,computer ,Natural language processing ,Vector space - Abstract
This paper assesses the performance of frequency and concept based text representation in Mixed Script Information Retrieval and Classification tasks. In text analytics, representation serves as an unresolved research problem to progress further towards different applications. In this paper observations from different text representation methods in text classification and information retrieval are presented. The data set from the Mixed Script Information Retrieval shared task is used in this experiment and the performance of final submitted model is evaluated by task organizers. It is observed that distributional representation performs better than the frequency based text representation methods. The final system attained first place in task 2 and was 3.89% lesser than the top scored system in task 1.
- Published
- 2018
13. CENNLP at SemEval-2018 Task 2: Enhanced Distributed Representation of Text using Target Classes for Emoji Prediction Representation
- Author
-
K P Soman, H B Barathi Ganesh, Hariharan, Naveen J R, and Anand Kumar M
- Subjects
Interpretation (logic) ,Computer science ,Emoji ,business.industry ,02 engineering and technology ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,SemEval ,Task (project management) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Social media ,Artificial intelligence ,Representation (mathematics) ,business ,computer ,Natural language processing ,0105 earth and related environmental sciences ,Meaning (linguistics) - Abstract
Emoji is one of the “fastest growing language ” in pop-culture, especially in social media and it is very unlikely for its usage to decrease. These are generally used to bring an extra level of meaning to the texts, posted on social media platforms. Providing such an added info, gives more insights to the plain text, arising to hidden interpretation within the text. This paper explains our analysis on Task 2, ” Multilingual Emoji Prediction” sharedtask conducted by Semeval-2018. In the task, a predicted emoji based on a piece of Twitter text are labelled under 20 different classes (most commonly used emojis) where these classes are learnt and further predicted are made for unseen Twitter text. In this work, we have experimented and analysed emojis predicted based on Twitter text, as a classification problem where the entailing emoji is considered as a label for every individual text data. We have implemented this using distributed representation of text through fastText. Also, we have made an effort to demonstrate how fastText framework can be useful in case of emoji prediction. This task is divide into two subtask, they are based on dataset presented in two different languages English and Spanish.
- Published
- 2018
14. TeamCEN at SemEval-2018 Task 1: Global Vectors Representation in Emotion Detection
- Author
-
K P Soman, Anand Kumar M, H B Barathi Ganesh, and Anon George
- Subjects
Computer science ,business.industry ,media_common.quotation_subject ,02 engineering and technology ,Representation (arts) ,010501 environmental sciences ,Anger ,computer.software_genre ,01 natural sciences ,SemEval ,Task (project management) ,Sadness ,Identification (information) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Social media ,Artificial intelligence ,business ,computer ,Natural language processing ,0105 earth and related environmental sciences ,media_common - Abstract
Emotions are a way of expressing human sentiments. In the modern era, social media is a platform where we convey our emotions. These emotions can be joy, anger, sadness and fear. Understanding the emotions from the written sentences is an interesting part in knowing about the writer. In the amount of digital language shared through social media, a considerable amount of data reflects the sentiment or emotion towards some product, person and organization. Since these texts are from users with diverse social aspects, these texts can be used to enrich the application related to the business intelligence. More than the sentiment, identification of intensity of the sentiment will enrich the performance of the end application. In this paper we experimented the intensity prediction as a text classification problem that evaluates the distributed representation text using aggregated sum and dimensionality reduction of the glove vectors of the words present in the respective texts .
- Published
- 2018
15. Wavelet Based RTL-SDR Real Time Signal Denoising in GNU Radio
- Author
-
Reshma U, H B Barathi Ganesh, J. Jyothi, K. P. Soman, and R Gandhiraj
- Subjects
Universal Software Radio Peripheral ,Computer science ,Noise reduction ,Real-time computing ,Wavelet transform ,Time signal ,030507 speech-language pathology & audiology ,03 medical and health sciences ,Noise ,Wavelet ,Electronic engineering ,Radio frequency ,0305 other medical science ,Continuous wavelet transform - Abstract
Noise removal is considered to be an efficacious step in processing any kind of data. Here the proposed model deals with removal of noise from aperiodic and piecewise constant signals by utilizing wavelet transform, which is being realized in GNU Radio platform. We have also dealt with the replacement of Universal Software Radio Peripheral with RTL-SDR for a low cost Radio Frequency Receiver system without any compromise in its efficiency. Wavelet analyzes noise level separately at each wavelet scale in time-scale domain and adapts the denoising algorithm especially for aperiodic and piecewise constant signals. GNU Radio companion serves well in analysis and synthesis of real time signals.
- Published
- 2015
16. Amrita-CEN at SemEval-2016 task 1: Semantic relation from word embeddings in higher dimension
- Author
-
M. Anand Kumar, H B Barathi Ganesh, and K. P. Soman
- Subjects
business.industry ,Computer science ,Context (language use) ,02 engineering and technology ,computer.software_genre ,SemEval ,Support vector machine ,Semantic similarity ,Similarity (network science) ,Dimension (vector space) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Embedding ,020201 artificial intelligence & image processing ,Word2vec ,Artificial intelligence ,business ,computer ,Natural language processing ,Sentence - Abstract
Semantic Textual Similarity measures similarity between pair of texts, even though the similar context is projected using different words. This work attempted to incorporate the context space of the sentence from that sentence alone. It proposes combination of Word2Vec and Non-Negative Matrix Factorization to represent the sentence as context embedding vector in context space. Distance and correlation values between context embedding vector pairs used as a features for Support Vector Regression to built the domain independent similarity measuring model. The proposed model yielding performance 0.41 in terms of correlation.
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.