15 results on '"Martínez-Trinidad, José Fco."'
Search Results
2. Extensions to AGraP Algorithm for Finding a Reduced Set of Inexact Graph Patterns.
- Author
-
Flores-Garrido, Marisol, Carrasco-Ochoa, J. Ariel, and Martínez-Trinidad, José Fco.
- Subjects
PATTERN recognition systems ,DATA mining ,ISOMORPHISM (Mathematics) ,CLASSIFICATION algorithms ,GRAPH algorithms - Abstract
Most algorithms to mine graph patterns, during the searching process, require a pattern to be identical to its occurrences, relying on the graph isomorphism problem. However, in recent years, there has been interest in the case in which it is acceptable to have some differences between a pattern and its occurrences, whether these differences are in labels or in structure. Allowing some differences and using inexact matching to measure the similarity between graphs lead to the discovery of new patterns, but some important challenges, such as the increment on the number of found patterns, make the post-mining analysis harder. In this work we focus on two extensions of the AGraP algorithm, which mines inexact patterns, addressing the issue of reducing the output pattern set while trying to retain the useful information gained through the use of inexact matching. First, exploring a traditional approach, we propose the CloseAFG algorithm that focuses on closed patterns. Then, we propose the IntAFG algorithm to find a subset of patterns covering the original pattern set, while lessening redundancy among selected patterns. We show the performance of our approaches through some experiments on synthetic databases; additionally, we also show the usefulness of the reduced pattern sets for image classification. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
3. Extension of Canonical Adjacency Matrices for Frequent Approximate Subgraph Mining on Multi-Graph Collections.
- Author
-
Acosta-Mendoza, Niusvel, Gago-Alonso, Andrés, Carrasco-Ochoa, Jesús Ariel, Martínez-Trinidad, José Fco., and Medina-Pagola, José E.
- Subjects
DATA mining ,MULTIGRAPH ,ALGORITHMS ,PATTERN perception ,LITERATURE reviews - Abstract
Into the data mining field, frequent approximate subgraph (FAS) mining has become an important technique with a broad spectrum of real-life applications. This fact is because several real-life phenomena can be modeled by graphs. In the literature, several algorithms have been reported for mining frequent approximate patterns on simple-graph collections; however, there are applications where more complex data structures, as multi-graphs, are needed for modeling the problem. But to the best of our knowledge, there is no FAS mining algorithm designed for dealing with multi-graphs. Therefore, in this paper, a canonical form (CF) for simple-graphs is extended to allow representing multi-graphs and a state-of-the-art algorithm for FAS mining is also extended for processing multi-graph collections by using the extended CF. Our experiments over different synthetic and real-world multi-graph collections show that the proposed algorithm has a good performance in terms of runtime and scalability. Additionally, we show the usefulness of the patterns computed by our algorithm in an image classification problem where images are represented as multi-graphs. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
4. Combining hybrid rule ordering strategies based on netconf and a novel satisfaction mechanism for CAR-based classifiers.
- Author
-
Hernández-León, R., Carrasco-Ochoa, Jesús A., Martínez-Trinidad, José Fco., and Hernández-Palancar, J.
- Subjects
IMAGE retrieval ,INFORMATION retrieval research ,IMAGE storage & retrieval systems ,BIG data ,DATA mining - Abstract
In Associative Classification, building a classifier based on Class Association Rules (CARs) consists in finding an ordered CAR list by applying a rule ordering strategy, and selecting a satisfaction mechanism to determine the class of unseen transactions. In this paper, we introduce four novel hybrid rule ordering strategies; the first three combine the Netconf measure with different Support-Confidence based rule ordering strategies. The fourth strategy combines the Netconf measure with a rule ordering strategy based on the CAR's size. Additionally, we combine the proposed strategies with a novel "Dynamic K" satisfaction mechanism. Experiments over several datasets show that the proposed rule ordering strategies jointly with the "Dynamic K" satisfaction mechanism allow improving the performance of CAR-based classifiers. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
5. An empirical comparison among quality measures for pattern based classifiers.
- Author
-
Loyola-González, Octavio, García-Borroto, Milton, Martínez-Trinidad, José Fco., and Carrasco-Ochoa, Jesús Ariel
- Subjects
PATTERN recognition systems ,DATA mining ,INFORMATION filtering ,DATABASE research ,DATA quality - Abstract
Measuring the quality of a contrast pattern is an active and relevant area of pattern recognition and data mining. Quality measures are important tools in very different scenarios like supervised classification, pattern based clustering, and association rule mining. Consequently, and due to the large collection of available measures, it is important to perform comparative studies for each particular context. Most published studies comparing quality measures are theoretical and in the context of association rule evaluation. In this paper, we present an empirical comparison of the behavior of 33 quality measures in the context of supervised classification and contrast pattern filtering. A comprehensive experimentation using several databases compares the behavior of these measures in three different contexts: as aggregation value, as pattern evaluation for classification, and as pattern evaluation for filtering. Experiments also show that top-accurate quality measures for classification have a deceptive performance for pattern filtering, because they cannot distinguish among patterns with zero support in the negative class. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
6. Mining frequent patterns and association rules using similarities.
- Author
-
Rodríguez-González, Ansel Y., Martínez-Trinidad, José Fco., Carrasco-Ochoa, Jesús A., and Ruiz-Shulcloper, José
- Subjects
- *
ALGORITHMS , *DATA mining , *SUBROUTINES (Computer programs) , *COMPUTER science , *PATTERN recognition systems , *EXPERIMENTAL design - Abstract
Abstract: Most of the current algorithms for mining association rules assume that two object subdescriptions are similar when they are exactly equal, but in many real world problems some other similarity functions are used. Commonly these algorithms are divided in two steps: Frequent pattern mining and generation of interesting association rules from frequent patterns. In this work, two algorithms for mining frequent similar patterns using similarity functions different from the equality are proposed. Additionally, the GenRules Algorithm is adapted to generate interesting association rules from frequent similar patterns. Experimental results show that our algorithms are more effective and obtain better quality patterns than the existing ones. [Copyright &y& Elsevier]
- Published
- 2013
- Full Text
- View/download PDF
7. An algorithm based on density and compactness for dynamic overlapping clustering.
- Author
-
Pérez-Suárez, Airel, Martínez-Trinidad, José Fco., Carrasco-Ochoa, Jesús A., and Medina-Pagola, José E.
- Subjects
- *
COMPUTER algorithms , *CLUSTER analysis (Statistics) , *INFORMATION retrieval , *BIOINFORMATICS , *GRAPH theory , *PERFORMANCE evaluation , *SOCIAL networks - Abstract
Abstract: Most clustering algorithms organize a collection of objects into a set of disjoint clusters. Although this approach has been successfully applied in unsupervised learning, there are several applications where objects could belong to more than one cluster. Overlapping clustering is an alternative in those contexts like social network analysis, information retrieval and bioinformatics, among other problems where non-disjoint clusters appear. In addition, there are environments where the collection changes frequently and the clustering must be updated; however, most of the existing overlapping clustering algorithms are not able to efficiently update the clustering. In this paper, we introduce a new overlapping clustering algorithm, called DClustR, which is based on the graph theory approach and it introduces a new strategy for building more accurate overlapping clusters than those built by state-of-the-art algorithms. Moreover, our algorithm introduces a new strategy for efficiently updating the clustering when the collection changes. The experimentation conducted over several standard collections shows the good performance of the proposed algorithm, wrt. accuracy and efficiency. [Copyright &y& Elsevier]
- Published
- 2013
- Full Text
- View/download PDF
8. A dynamic clustering algorithm for building overlapping clusters.
- Author
-
Pérez-Suárez, Airel, Martínez-Trinidad, José Fco., Carrasco-Ochoa, Jesús A., and Medina-Pagola, José E.
- Subjects
- *
DATA mining , *DIAGNOSIS , *ALGORITHMS , *DECISION support systems , *DATABASE searching - Abstract
Clustering is a Data Mining technique which has been widely used in many practical applications. In some of these applications like, medical diagnosis, categorization of digital libraries, topic detection and others, the objects could belong to more than one cluster. However, most of the clustering algorithms generate disjoint clusters. Moreover, processing additions, deletions and modifications of objects in the clustering built so far, without having to rebuild the clustering from the beginning is an issue that has been little studied. In this paper, we introduce DCS, a clustering algorithm which includes a new graph-cover strategy for building a set of clusters that could overlap, and a strategy for dynamically updating the clustering, managing multiple additions and/or deletions of objects. The experimental evaluation conducted over different collections demonstrates the good performance of the proposed algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
9. CAR-NF: A classifier based on specific rules with high netconf.
- Author
-
Hernández-León, R., Carrasco-Ochoa, Jesús A., Martínez-Trinidad, José Fco., and Hernández-Palancar, J.
- Subjects
DATA mining ,CLASSIFICATION ,ASSOCIATION rule mining ,COMPUTER systems ,DATA analysis - Abstract
In this paper, an accurate classifier based on Class Association Rules (CARs), called CAR-NF, is proposed. CAR-NF introduces a new strategy for computing CARs, using the Netconf as measure of interest, that allows to prune the CAR search space for building specific rules with high Netconf. Moreover, we propose and prove a proposition that supports the use of a Netconf threshold value equal to 0.5 for mining the CARs. Additionally, a new way for ordering the set of CARs based on their rule sizes and Netconf values is introduced in CAR-NF. The ordering strategy together with the "Best K rules" satisfaction mechanism allows CAR-NF to have better accuracy than CBA, CMAR, CPAR, TFPC and HARMONY classifiers, the best classifiers based on CARs reported in the literature. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
10. LCMine: An efficient algorithm for mining discriminative regularities and its application in supervised classification
- Author
-
García-Borroto, Milton, Martínez-Trinidad, José Fco., Carrasco-Ochoa, Jesús Ariel, Medina-Pérez, Miguel Angel, and Ruiz-Shulcloper, José
- Subjects
- *
ALGORITHMS , *DECISION trees , *DATA mining , *MATHEMATICAL models , *DATABASES , *INFORMATION storage & retrieval systems , *CLASSIFICATION , *DATA extraction - Abstract
Abstract: In this paper, we introduce an efficient algorithm for mining discriminative regularities on databases with mixed and incomplete data. Unlike previous methods, our algorithm does not apply an a priori discretization on numerical features; it extracts regularities from a set of diverse decision trees, induced with a special procedure. Experimental results show that a classifier based on the regularities obtained by our algorithm attains higher classification accuracy, using fewer discriminative regularities than those obtained by previous pattern-based classifiers. Additionally, we show that our classifier is competitive with traditional and state-of-the-art classifiers. [Copyright &y& Elsevier]
- Published
- 2010
- Full Text
- View/download PDF
11. Full duplicate candidate pruning for frequent connected subgraph mining.
- Author
-
Gago-Alonsoa, Andrés, Carrasco-Ochoa, Jesús A., Medina-Pagola, José E., and Martínez-Trinidad, José Fco.
- Subjects
DATA mining ,ALGORITHMS ,EMBEDDED computer systems ,COMPUTATIONAL complexity ,MACHINE theory - Abstract
Support calculation and duplicate detection are the most challenging and unavoidable subtasks in frequent connected subgraph (FCS) mining. The most successful FCS mining algorithms have focused on optimizing these subtasks since the existing solutions for both subtasks have high computational complexity. In this paper, we propose two novel properties that allow removing all duplicate candidates before support calculation. Besides, we introduce a fast support calculation strategy based on embedding structures. Both properties and the new embedding structure are used for designing two new algorithms: gdFil for mining all FCSs; and gdClosed for mining all closed FCSs. The experimental results show that our proposed algorithms get the best performance in comparison with other well known algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
12. A new algorithm for mining frequent connected subgraphs based on adjacency matrices.
- Author
-
Gago-Alonso, Andrés, Puentes-Luberta, Abel, Carrasco-Ochoa, Jesús A., Medina-Pagola, José E., and Martínez-Trinidad, José Fco.
- Subjects
DATA mining ,GRAPH theory ,ACQUISITION of data ,ALGORITHMS ,DATABASES - Abstract
Most of the Frequent Connected Subgraph Mining (FCSM) algorithms have been focused on detecting duplicate candidates using canonical form (CF) tests. CF tests have high computational complexity, which affects the efficiency of graph miners. In this paper, we introduce novel properties of the canonical adjacency matrices for reducing the number of CF tests in FCSM. Based on these properties, a new algorithm for frequent connected subgraph mining called grCAM is proposed. The experiments on real world datasets show the impact of the proposed properties in FCSM. Besides, the performance of our algorithm is compared against some other reported algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
13. Algorithms for mining frequent itemsets in static and dynamic datasets.
- Author
-
Hernández-León, R., Hernández-Palancar, J., Carrasco-Ochoa, Jesús A., and Martínez-Trinidad, José Fco.
- Subjects
DATA mining ,STATICS ,ALGORITHMS ,DYNAMICS ,MATRICES (Mathematics) - Abstract
In this paper, two algorithms for mining frequent itemsets in large sparse datasets are proposed. The first one, named Compressed Arrays (CA), allows to process datasets that do not change along the time (static datasets) while the second one, based on the ideas of the former and named Dynamic Compressed Arrays (DCA), processes datasets that change along the time by adding/deleting transactions (dynamic datasets). Both algorithms introduce a novel way to use equivalence classes of itemsets by performing a breadth first search through them and by storing the class prefix support in compressed arrays, which allows fast itemset support computing. On the other hand, unlike previous algorithms for dynamic datasets that store the full dataset in main memory without reusing the current frequent itemsets, DCA algorithm stores the current frequent itemsets in binary files, grouped in equivalence classes, and reuses them to calculate the new frequent itemsets. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
14. Classification based on specific rules and inexact coverage
- Author
-
Hernández-León, Raudel, Carrasco-Ochoa, Jesús A., Martínez-Trinidad, José Fco., and Hernández-Palancar, José
- Subjects
- *
ASSOCIATION rule mining , *AMBIGUITY , *CONFIDENCE intervals , *PRUNING , *DATA mining , *EXPERT systems - Abstract
Abstract: Association rule mining and classification are important tasks in data mining. Using association rules has proved to be a good approach for classification. In this paper, we propose an accurate classifier based on class association rules (CARs), called CAR-IC, which introduces a new pruning strategy for mining CARs, which allows building specific rules with high confidence. Moreover, we propose and prove three propositions that support the use of a confidence threshold for computing rules that avoids ambiguity at the classification stage. This paper also presents a new way for ordering the set of CARs based on rule size and confidence. Finally, we define a new coverage strategy, which reduces the number of non-covered unseen-transactions during the classification stage. Results over several datasets show that CAR-IC beats the best classifiers based on CARs reported in the literature. [Copyright &y& Elsevier]
- Published
- 2012
- Full Text
- View/download PDF
15. Closed frequent similar pattern mining: Reducing the number of frequent similar patterns without information loss.
- Author
-
Rodríguez-González, Ansel Y., Lezama, Fernando, Iglesias-Alvarez, Carlos A., Martínez-Trinidad, José Fco., Carrasco-Ochoa, Jesús A., and de Cote, Enrique Munoz
- Subjects
- *
DATA mining , *SCALABILITY , *SIMILARITY (Geometry) , *ALGORITHMS , *PATTERN recognition systems - Abstract
Frequent pattern mining is considered a key task to discover useful information. Despite the quality of solutions given by frequent pattern mining algorithms, most of them face the challenge of how to reduce the number of frequent patterns without information loss. Frequent itemset mining addresses this problem by discovering a reduced set of frequent itemsets, named closed frequent itemsets , from which the entire frequent pattern set can be recovered. However, for frequent similar pattern mining , where the number of patterns is even larger than for Frequent itemset mining, this problem has not been addressed yet. In this paper, we introduce the concept of closed frequent similar pattern mining to discover a reduced set of frequent similar patterns without information loss. Additionally, a novel closed frequent similar pattern mining algorithm, named CFSP-Miner , is proposed. The algorithm discovers frequent patterns by traversing a tree that contains all the closed frequent similar patterns. To do this efficiently, several lemmas to prune the search space are introduced and proven. The results show that CFSP-Miner is more efficient than the state-of-the-art frequent similar pattern mining algorithms, except in cases where the number of frequent similar patterns and closed frequent similar patterns are almost equal. However, CFSP-Miner is able to find the closed similar patterns, yielding a reduced size of the discovered frequent similar pattern set without information loss. Also, CFSP-Miner shows good scalability while maintaining an acceptable runtime performance. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.