Author: "Snehasis Mukhopadhyay" / Topic: machine learning - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Snehasis Mukhopadhyay"' showing total 17 results

Start Over Author "Snehasis Mukhopadhyay" Topic machine learning

17 results on '"Snehasis Mukhopadhyay"'

1. Exploration and Visualization of Patterns Underlying Multistakeholder Preferences in Watershed Conservation Decisions Generated by an Interactive Genetic Algorithm

Author: Adriana Debora Piemonti, Snehasis Mukhopadhyay, Meghna Babbar-Sebens, Eugene Zhang, and Mariam Guizani
Subjects: Watershed, Interactive optimization, Computer science, business.industry, 0207 environmental engineering, 020207 software engineering, 02 engineering and technology, Machine learning, computer.software_genre, Visualization, Information visualization, Genetic algorithm, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, 020701 environmental engineering, business, computer, Water Science and Technology
Published: 2021
Full Text: View/download PDF

2. Non-Stationary Reinforcement-Learning Based Dimensionality Reduction for Multi-objective Optimization of Wetland Design

Author: Meghna Babbar-Sebens, Andrew Hoblitzell, and Snehasis Mukhopadhyay
Subjects: Decision support system, Watershed, Artificial neural network, business.industry, Computer science, User modeling, Dimensionality reduction, Feature selection, 02 engineering and technology, 010501 environmental sciences, Machine learning, computer.software_genre, 01 natural sciences, Multi-objective optimization, 0202 electrical engineering, electronic engineering, information engineering, Reinforcement learning, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, 0105 earth and related environmental sciences
Abstract: This paper outlines a method of non-stationary reinforcement-based learning for feature selection. The method was developed for the Watershed REstoration using Spatio-Temporal Optimization of REsources (WRESTORE) system, which is a decision support system used for wetland design on the Eagle Creek Watershed, northwest of Indianapolis, Indiana. Our results show measurable impact for maximizing reward efficiently for the feature selection task. This work describes the existing WRESTORE system, provides an overview of related work in reinforcement learning and dimensionality reduction, and shows the impact of our work in the multi-objective optimization process of WRESTORE. The contribution of this work is the application of an RL-based feature selection technique in interactive optimization of watershed design.
Published: 2019
Full Text: View/download PDF

3. Interactive Machine Learning by Visualization: A Small Data Solution

Author: Huang Li, Snehasis Mukhopadhyay, Li Shen, Andrew J. Saykin, and Shiaofen Fang
Subjects: Visual analytics, Data collection, Small data, business.industry, Computer science, Big data, 020207 software engineering, 02 engineering and technology, Machine learning, computer.software_genre, Article, Visualization, 03 medical and health sciences, 0302 clinical medicine, Data visualization, Handwriting recognition, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, business, computer, 030217 neurology & neurosurgery
Abstract: Machine learning algorithms and traditional data mining process usually require a large volume of data to train the algorithm-specific models, with little or no user feedback during the model building process. Such a “big data” based automatic learning strategy is sometimes unrealistic for applications where data collection or processing is very expensive or difficult, such as in clinical trials. Furthermore, expert knowledge can be very valuable in the model building process in some fields such as biomedical sciences. In this paper, we propose a new visual analytics approach to interactive machine learning and visual data mining. In this approach, multi-dimensional data visualization techniques are employed to facilitate user interactions with the machine learning and mining process. This allows dynamic user feedback in different forms, such as data selection, data labeling, and data correction, to enhance the efficiency of model building. In particular, this approach can significantly reduce the amount of data required for training an accurate model, and therefore can be highly impactful for applications where large amount of data is hard to obtain. The proposed approach is tested on two application problems: the handwriting recognition (classification) problem and the human cognitive score prediction (regression) problem. Both experiments show that visualization supported interactive machine learning and data mining can achieve the same accuracy as an automatic process can with much smaller training data sets.
Published: 2018
Full Text: View/download PDF

4. Fuzzy and deep learning approaches for user modeling in wetland design

Author: Snehasis Mukhopadhyay, Meghna Babbar-Sebens, and Andrew Hoblitzell
Subjects: Decision support system, Artificial neural network, business.industry, Process (engineering), Computer science, Deep learning, User modeling, 0208 environmental biotechnology, 02 engineering and technology, Machine learning, computer.software_genre, 020801 environmental engineering, Component (UML), 0202 electrical engineering, electronic engineering, information engineering, Reinforcement learning, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer
Abstract: Determining the optimal design of a watershed is a highly subjective process which involves the consideration of many distinct factors by several different stakeholder groups. We describe additional functionality for our watershed planning system, called WRESTORE (Watershed REstoration Using Spatio-Temporal Optimization of REsources) (http://wrestore.iupui.edu), where stakeholders can collaboratively optimize best management practices on to the watershed. WRESTORE utilizes the USDA's public domain Soil and Water Assessment Tool hydrologic model for watershed simulations. Reinforcement learning and interactive genetic algorithms are applied for the search process. The new functionality described is a user modeling component that develops a computational model of a user's decision-making, based on real-time user-provided ratings for a subset of possible designs. The user modeling task utilizes neural network approaches, such as deep learning. We believe the originality of our approach centers on integrating user models in to the hydrological decision support process. This paper thus has three objectives: (i) outline current work in user modeling and watershed design, (ii) describe our system for interactive optimization of watershed design, and (iii) describe our work on implementing accurate and stable user predictive models to boost optimization performance.
Published: 2016
Full Text: View/download PDF

5. A learning approach to the database selection problem in the presence of dynamic user interests and database contents

Author: Snehasis Mukhopadhyay and Pooja Bajracharya
Subjects: Computer Science::Machine Learning, Database, Computer science, business.industry, Probabilistic logic, Library and Information Sciences, Machine learning, computer.software_genre, Ranking (information retrieval), Set (abstract data type), Action (philosophy), Vector space model, Reinforcement learning, Pruning (decision trees), Artificial intelligence, business, computer, Finite set, Computer Science::Databases, Information Systems
Abstract: Database Selection is the problem of choosing, from a finite number of databases, the one that contains the most relevant information pertaining to a query. Previous approaches to this problem consisted of deterministic search techniques in conjunction with efficient pruning of search spaces, based on vector space model and ranking. In this article, we propose a new probabilistic search method, based on a reinforcement learning algorithm, to solve the database selection problem, where an agent learns to map situations to action by means of receiving reward and penalties for the action taken and trying to maximize its rewards. We use reinforcement algorithm with user feedback to learn a policy, which maps a particular topic or particular interest to a set of databases. Reinforcement learning is an attractive approach to this problem due to its ability to generate optimal solutions for both stationary and non-stationary/dynamically changing environments(queries). Experiments with simple queries show that reinforcement learning has potential to be considered as an efficient approach for database selection.
Published: 2005
Full Text: View/download PDF

6. Genetic Sequence Classification and its Application to Cross-Species Homology Detection

Author: Mathew J. Palakal, Snehasis Mukhopadhyay, Changhong Tang, and Jeffery Huang
Subjects: Artificial neural network, Computer science, business.industry, Sequence alignment, Pattern recognition, Machine learning, computer.software_genre, Minimax, Backpropagation, ComputingMethodologies_PATTERNRECOGNITION, Signal Processing, Principal component analysis, Artificial intelligence, Electrical and Electronic Engineering, Cluster analysis, business, computer, Information Systems
Abstract: Although large-scale classification studies of genetic sequence data are in progress around the world, very few studies compare different classification approaches, e.g. unsupervised and supervised, in terms of objective criteria such as classification accuracy and computational complexity. In this paper, we study such criteria for both unsupervised and supervised classification of a relatively large sequence data set. The unsupervised approach involves use of different sequence alignment algorithms (e.g., Smith-Waterman, FASTA and BLAST) followed by clustering using the Maximin algorithm. The supervised approach uses a suitable numeric encoding (relative frequencies of tuples of nucleotides followed by principal component analysis) which is fed to a Multi-layer Backpropagation Neural Network. Classification experiments conducted on IBM-SP parallel computers show that FASTA with unsupervised Maximin leads to best trade-off between accuracy and speed among all methods, followed by supervised neural networks as the second best approach. Finally, the different classifiers are applied to the problem of cross-species homology detection.
Published: 2003
Full Text: View/download PDF

7. Automated web navigation using multiagent adaptive dynamic programming

Author: J. Varghese and Snehasis Mukhopadhyay
Subjects: Computer science, business.industry, Multi-agent system, Hyperlink, Machine learning, computer.software_genre, Computer Science Applications, Task (project management), Human-Computer Interaction, Dynamic programming, Control and Systems Engineering, Human–computer interaction, Web navigation, The Internet, Motion planning, Artificial intelligence, Electrical and Electronic Engineering, business, computer, Software
Abstract: Today a massive amount of information available on the WWW often makes searching for information of interest a long and tedious task. Chasing hyperlinks to find relevant information may be daunting. To overcome such a problem, a learning system, cognizant of a user's interests, can be employed to automatically search for and retrieve relevant information by following appropriate hyperlinks. In this paper, we describe the design of such a learning system for automated Web navigation using adaptive dynamic programming methods. To improve the performance of the learning system, we introduce the notion of multiple model-based learning agents operating in parallel, and describe methods for combining their models. Experimental results on the WWW navigation problem are presented to indicate that combining multiple learning agents, relying on user feedback, is a promising direction to improve learning speed in automated WWW navigation.
Published: 2003
Full Text: View/download PDF

8. Selecting an artificial neural network for efficient modeling and accurate simulation of the milling process

Author: Snehasis Mukhopadhyay, Jorge F. Briceno, and Hazim El-Mounayri
Subjects: Engineering, Artificial neural network, business.industry, Mechanical Engineering, Design of experiments, Supervised learning, Process (computing), Machine learning, computer.software_genre, Industrial and Manufacturing Engineering, Backpropagation, Machining, Radial basis function, Artificial intelligence, business, computer, Dimensioning
Abstract: In this paper, two supervised neural networks are used to estimate the forces developed during milling. These two Artificial Neural Networks (ANNs) are compared based on a cost function that relates the size of the training data to the accuracy of the model. Training experiments are screened based on design of experiments. Verification experiments are conducted to evaluate these two models. It is shown that the Radial Basis Network model is superior in this particular case. Orthogonal design and specifically equally spaced dimensioning showed to be a good way to select the training experiments.
Published: 2002
Full Text: View/download PDF

9. User Modelling for Interactive Optimization Using Neural Network

Author: Meghna Babbar-Sebens, Snehasis Mukhopadhyay, and Vidya Bhushan Singh
Subjects: Training set, Interactive optimization, Optimization problem, Artificial neural network, business.industry, Computer science, Artificial intelligence, Machine learning, computer.software_genre, business, computer, Fuzzy logic
Abstract: User modelling is one of the prominent research fields in information retrieval systems. In this paper, we model user's preferences and search criteria using an NN (Neural Network) to solve a multiobjective optimization problem specific to environmental planning systems. We argue that some NP hard problems cannot be solved alone either by a human or by a computer. Human participation in automated search is one way of combining human intuition with algorithmic search to solve such problems. However, even humans have some limitations for participation in that they cannot participate in search completely because of human fatigue. To overcome this, in our approach, an NN tries to model the user's rating criteria and preferences to help the user in rating large set of designs. Although training an NN with limited data is not always feasible, there are many situations where a simple modelling technique (e.g., linear/quadratic mapping) works better if the learning data set is small. In this paper we attempt to get more accuracy of the NN by generating data using other linear/non-linear techniques that fills the gap created by lack of sufficient training data. Also, we provided the architectural design of an HPC based framework we have proposed and compared the performance of the NN with fuzzy logic and other linear/non-linear user modelling techniques for the environmental resources optimization problem.
Published: 2013
Full Text: View/download PDF

10. Interactive pattern mining on hidden data

Author: Snehasis Mukhopadhyay, Mohammad Al Hasan, and Mansurul Bhuiyan
Subjects: Computer science, business.industry, Sampling (statistics), Markov chain Monte Carlo, computer.software_genre, Machine learning, Session (web analytics), Task (project management), symbols.namesake, Hidden data, symbols, Revenue, Data mining, Artificial intelligence, business, computer
Abstract: Mining frequent patterns from a hidden dataset is an important task with 43 various real-life applications. In this research, we propose a solution to this problem that is based on Markov Chain Monte Carlo (MCMC) sampling of frequent patterns. Instead of returning all the frequent patterns, the proposed paradigm returns a small set of randomly selected patterns so that the clandestinity of the dataset can be maintained. Our solution also allows interactive sampling, so that the sampled patterns can fulfill the user's requirement effectively. We show experimental results from several real life datasets to validate the capability and usefulness of our solution; in particular, we show examples that by using our proposed solution, an eCommerce marketplace can allow pattern mining on user session data without disclosing the data to the public; such a mining paradigm helps the sellers of the marketplace, which eventually boost the marketplace's own revenue.
Published: 2012
Full Text: View/download PDF

11. A novel reinforcement learning framework for sensor subset selection

Author: Omkar Tilak, Rajeev R. Raje, Mihran Tuceryan, and Snehasis Mukhopadhyay
Subjects: Learning automata, Selection (relational algebra), Computational complexity theory, Computer science, business.industry, Distributed object, Sensor fusion, Machine learning, computer.software_genre, Domain (software engineering), Scalability, Reinforcement learning, Artificial intelligence, business, computer
Abstract: The problem of selecting a subset of sensors in a distributed object tracking environment that optimizes an objective function consisting of a trade-off between data accuracy and energy consumption is known to be NP-hard. The problem is exacerbated because of the uncertainty and dynamic nature of either sensor characteristics or the environment or both. We propose, for the first time, a novel framework based on a reinforcement learning approach, to deal with the problems of computational complexity, dynamic nature and uncertainty for sensor subset selection. Our proposed sensor subset selection approach is completely decentralized and sensors do not need to know even the presence of other sensors in the system. This makes our approach extremely scalable and easy to implement in a distributed system. To the best of our knowledge, this is the first application of reinforcement learning to the domain of sensor subset selection.
Published: 2010
Full Text: View/download PDF

12. Multi-way association extraction and visualization from biological text documents using hyper-graphs: applications to genetic association studies for diseases

Author: Kalyan Maddu, Mathew J. Palakal, and Snehasis Mukhopadhyay
Subjects: Apriori algorithm, business.industry, Binary relation, Computer science, Medicine (miscellaneous), Binary number, Computational Biology, computer.software_genre, Machine learning, Visualization, Text mining, Knowledge extraction, Artificial Intelligence, Vector space model, Computer Graphics, A priori and a posteriori, Humans, Disease, Artificial intelligence, business, computer, Natural language processing, Genome-Wide Association Study
Abstract: Objectives: Biological research literature, as in many other domains of human endeavor, represents a rich, ever growing source of knowledge. An important form of such biological knowledge constitutes associations among biological entities such as genes, proteins, diseases, drugs and chemicals, etc. There has been a considerable amount of recent research in extraction of various kinds of binary associations (e.g., gene-gene, gene-protein, protein-protein, etc.) using different text mining approaches. However, an important aspect of such associations (e.g., ''gene A activates protein B'') is identifying the context in which such associations occur (e.g., ''gene A activates protein B in the context of disease C in organ D under the influence of chemical E''). Such contexts can be represented appropriately by a multi-way relationship involving more than two objects (e.g., objects A, B, C, D, E) rather than usual binary relationship (objects A and B). Methods: Such multi-way relations naturally lead to a hyper-graph representation of the knowledge rather than a binary graph. The hyper-graph based multi-way knowledge extraction from biological text literature represents a computationally difficult problem (due to its combinatorial nature) which has not received much attention from the Bioinformatics research community. In this paper, we describe and compare two different approaches to such multi-way hyper-graph extraction: one based on an exhaustive enumeration of all multi-way hyper-edges and the other based on an extension of the well-known A Priori algorithm for structured data to the case unstructured textual data. We also present a representative graph based approach towards visualizing these genetic association hyper-graphs. Results: Two case studies are conducted for two biomedical problems (related to the diseases of lung cancer and colorectal cancer respectively), illustrating that the latter approach (using the text-based A Priori method) identifies the same hyper-edges as the former approach (the exhaustive method), but at a much less computational cost. The extracted hyper-relations are presented in the paper as cognition-rich representative graphs, representing the corresponding hyper-graphs. Conclusions: The text-based A Priori algorithm is a practical, useful method to extract hyper-graphs representing multi-way associations among biological objects. These hyper-graphs and their visualization using representative graphs can provide important contextual information for understanding gene-gene associations relevant to specific diseases.
Published: 2010

13. Reinforcement learning for human-machine collaborative optimization: Application in ground water monitoring

Author: Snehasis Mukhopadhyay and Meghna Babbar-Sebens
Subjects: Computer Science::Machine Learning, Optimization problem, Learning automata, Computer science, business.industry, Active learning (machine learning), Machine learning, computer.software_genre, Generalization error, Genetic algorithm, Unsupervised learning, Reinforcement learning, Human–machine system, Artificial intelligence, business, computer
Abstract: In this paper, we introduce reinforcement learning as a methodology to solve complex multi-criteria optimization problems for ground water monitoring. Multiple analytical criteria are used to assess design decisions and human feedback is simulated by adding random noise. Different learning automata based reinforcement learning methods as well as a genetic algorithm based method are used in experimental studies, which demonstrate the efficiency of reinforcement learning approaches.
Published: 2009
Full Text: View/download PDF

14. Multi-way Association Extraction from Biological Text Documents Using Hyper-Graphs

Author: K. Maddu, Mathew J. Palakal, and Snehasis Mukhopadhyay
Subjects: Apriori algorithm, Computer science, business.industry, Association (object-oriented programming), Binary number, Context (language use), Graph theory, Extension (predicate logic), Machine learning, computer.software_genre, ComputingMethodologies_PATTERNRECOGNITION, Knowledge extraction, Artificial intelligence, Representation (mathematics), business, computer, Natural language processing
Abstract: There has been a considerable amount of recent research in extraction of various kinds of binary associations (e.g., gene-gene, gene-protein, protein-protein, etc) using different text mining approaches. However, an important aspect of such associations is identifying the context in which such associations occur (e.g., "gene A activates protein B in the context of disease C in organ D under the influence of chemical E"). Such contexts can be represented appropriately by a multi-way relationship involving more than two objects rather than usual binary relationships. Such multi-way relations naturally lead to a hyper-graph representation of the knowledge. The hyper-graph based knowledge extraction from biological literature represents a computationally difficult problem due to its combinatorial nature. In this paper, we compare two different approaches to such hyper-graph extraction: one based on an exhaustive enumeration of all hyper-edges and the other based on an extension of the well-known A Priori algorithm.
Published: 2008
Full Text: View/download PDF

15. A comparative study of genetic sequence classification algorithms

Author: Mulong Yu, Jeffery Huang, Snehasis Mukhopadhyay, Changhong Tang, and Mathew J. Palakal
Subjects: Measure (data warehouse), Sequence, Artificial neural network, Computer science, business.industry, media_common.quotation_subject, Linear classifier, computer.software_genre, Minimax, Machine learning, Statistical classification, ComputingMethodologies_PATTERNRECOGNITION, One-class classification, Quality (business), Data mining, Artificial intelligence, business, computer, media_common
Abstract: Classification of genetic sequence data available in public and private databases is an important problem in using, understanding, retrieving, filtering and correlating such large volumes of information. Although a significant amount of research effort is being spent internationally on this problem, very few studies exist that compare different classification approaches in terms of an objective and quantitative classification performance criterion. In this paper, we present experimental studies for classification of genetic sequences using both unsupervised and supervised approaches, focusing on both computational effort as well as a suitably defined classification performance measure. The results indicate that both unsupervised classification using the Maximin algorithm combined with FASTA sequence alignment algorithm and supervised classification using artificial neural network have good classification performance, with the unsupervised classification performs better and the supervised classification performs faster. A trade-off between the quality of classification and the computational efforts exists. The utilization of these classifiers for retrieval, filtering and correlation of genetic information as well as prediction of functions and structures will be logical future directions for further research.
Published: 2003
Full Text: View/download PDF

16. Feature decomposition architectures for neural networks: algorithms, error bounds, and applications

Author: Haiying Wang, Snehasis Mukhopadhyay, and Shiaofen Fang
Subjects: Artificial neural network, Computer Networks and Communications, Computer science, Time delay neural network, business.industry, General Medicine, Modular design, Machine learning, computer.software_genre, Feature (computer vision), Decomposition (computer science), Speech Perception, Artificial intelligence, Neural Networks, Computer, business, computer, Algorithms
Abstract: In recent years, systems consisting of multiple modular neural networks have attracted substantial interest in the neural networks community because of various advantages they offer over a single large monolithic network. In this paper, we propose two basic feature decomposition models (namely, parallel model and tandem model) in which each of the neural network modules processes a disjoint subset of the input features. A novel feature decomposition algorithm is introduced to partition the input space into disjoint subsets solely based on the available training data. Under certain assumptions, the approximation error due to decomposition can be proved to be bounded by any desired small value over a compact set. Finally, the performance of feature decomposition networks is compared with that of a monolithic network in real-world bench-mark pattern recognition and modeling problems.
Published: 2000

17. A two-level approach to learning in nonstationary environments

Author: Wai Lam and Snehasis Mukhopadhyay
Subjects: Learning classifier system, Wake-sleep algorithm, Computer science, business.industry, Active learning (machine learning), Stability (learning theory), Online machine learning, Machine learning, computer.software_genre, Computational learning theory, Unsupervised learning, Artificial intelligence, Instance-based learning, business, computer
Abstract: A nonstationary environment is one in which the suitability of the strategies available to a learning element changes with time. Since the optimal action in such a case is not fixed, the learning problem (i.e., the determination of the optimal strategy) becomes considerably difficult. In this paper, a two-level approach is presented for a learning automaton operating in a nonstationary environment. The lower level consists of a standard absolutely expedient learning algorithm for stationary environments. The higher level on the other hand is a tracking algorithm, based on Bayesian decision theory, for detecting changes in the environment and reinitializing the lower level algorithm in a suitable manner. Simulation studies empirically demonstrate the clear superiority of the two-level approach over the single-level learning in nonstationary environments.
Published: 1996
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

17 results on '"Snehasis Mukhopadhyay"'

1. Exploration and Visualization of Patterns Underlying Multistakeholder Preferences in Watershed Conservation Decisions Generated by an Interactive Genetic Algorithm

2. Non-Stationary Reinforcement-Learning Based Dimensionality Reduction for Multi-objective Optimization of Wetland Design

3. Interactive Machine Learning by Visualization: A Small Data Solution

4. Fuzzy and deep learning approaches for user modeling in wetland design

5. A learning approach to the database selection problem in the presence of dynamic user interests and database contents

6. Genetic Sequence Classification and its Application to Cross-Species Homology Detection

7. Automated web navigation using multiagent adaptive dynamic programming

8. Selecting an artificial neural network for efficient modeling and accurate simulation of the milling process

9. User Modelling for Interactive Optimization Using Neural Network

10. Interactive pattern mining on hidden data

11. A novel reinforcement learning framework for sensor subset selection

12. Multi-way association extraction and visualization from biological text documents using hyper-graphs: applications to genetic association studies for diseases

13. Reinforcement learning for human-machine collaborative optimization: Application in ground water monitoring

14. Multi-way Association Extraction from Biological Text Documents Using Hyper-Graphs

15. A comparative study of genetic sequence classification algorithms

16. Feature decomposition architectures for neural networks: algorithms, error bounds, and applications

17. A two-level approach to learning in nonstationary environments

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

17 results on '"Snehasis Mukhopadhyay"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources