1,266 results on '"Cluster analysis"'
Search Results
2. A new adaptive membership function with CUB uncertainty with application to cluster analysis of Likert-type data
- Author
-
Biasetton, Nicolò, Disegna, Marta, Barzizza, Elena, and Salmaso, Luigi
- Published
- 2023
- Full Text
- View/download PDF
3. Effect of dimensionality reduction on stock selection with cluster analysis in different market situations
- Author
-
Han, Jingti and Ge, Zhipeng
- Published
- 2020
- Full Text
- View/download PDF
4. A SOM prototype-based cluster analysis methodology
- Author
-
Delgado, Soledad, Higuera, Clara, Calle-Espinosa, Jorge, Morán, Federico, and Montero, Francisco
- Published
- 2017
- Full Text
- View/download PDF
5. Hybrid chemical reaction based metaheuristic with fuzzy c-means algorithm for optimal cluster analysis
- Author
-
Nayak, Janmenjoy, Naik, Bighnaraj, Behera, Himansu Sekhar, and Abraham, Ajith
- Published
- 2017
- Full Text
- View/download PDF
6. Cluster analysis application for understanding SME manufacturing strategies
- Author
-
Lorentz, Harri, Hilmola, Olli-Pekka, Malmsten, Jarmo, and Srai, Jagjit Singh
- Published
- 2016
- Full Text
- View/download PDF
7. Discovering users with similar internet access performance through cluster analysis
- Author
-
Cerquitelli, Tania, Servetti, Antonio, and Masala, Enrico
- Published
- 2016
- Full Text
- View/download PDF
8. (Dis)integration levels across global stock markets: A multidimensional scaling and cluster analysis
- Author
-
Esmalifalak, Hamidreza, Ajirlou, Ali Irannezhad, Behrouz, Sahar Pordeli, and Esmalifalak, Maryam
- Published
- 2015
- Full Text
- View/download PDF
9. Cluster analysis using optimization algorithms with newly designed objective functions
- Author
-
Binu, D.
- Published
- 2015
- Full Text
- View/download PDF
10. Integrated cluster analysis and artificial neural network modeling for steam-assisted gravity drainage performance prediction in heterogeneous reservoirs
- Author
-
Amirian, Ehsan, Leung, Juliana Y., Zanon, Stefan, and Dzurman, Peter
- Published
- 2015
- Full Text
- View/download PDF
11. An evolutionary computational model applied to cluster analysis of DNA microarray data
- Author
-
Castellanos-Garzón, José A. and Díaz, Fernando
- Published
- 2013
- Full Text
- View/download PDF
12. A visual analytics framework for cluster analysis of DNA microarray data
- Author
-
Castellanos-Garzón, José A., García, Carlos Armando, Novais, Paulo, and Díaz, Fernando
- Published
- 2013
- Full Text
- View/download PDF
13. Integration of fuzzy cluster analysis and kernel density estimation for tracking typhoon trajectories in the Taiwan region
- Author
-
Chu, Hone-Jay, Liau, Churn-Jung, Lin, Chao-Hung, and Su, Bo-Song
- Published
- 2012
- Full Text
- View/download PDF
14. Mining qualitative patterns in spatial cluster analysis
- Author
-
Lee, Ickjai, Qu, Yang, and Lee, Kyungmi
- Published
- 2012
- Full Text
- View/download PDF
15. Analyzing ECG for cardiac arrhythmia using cluster analysis
- Author
-
Yeh, Yun-Chi, Chiou, Che Wun, and Lin, Hong-Jhih
- Published
- 2012
- Full Text
- View/download PDF
16. Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2
- Author
-
Miguel Díaz-Lozano, David Guijo-Rubio, Pedro Antonio Gutiérrez, and César Hervás-Martínez
- Subjects
Artificial Intelligence ,General Engineering ,Computer Science Applications - Published
- 2023
17. A distributed sinkhole detection method using cluster analysis
- Author
-
Shim, Woochul, Kim, Gisung, and Kim, Sehun
- Published
- 2010
- Full Text
- View/download PDF
18. Self-organizing feature map for cluster analysis in multi-disease diagnosis
- Author
-
Zhang, Ke, Chai, Yi, and Yang, Simon X.
- Published
- 2010
- Full Text
- View/download PDF
19. Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty
- Author
-
Hosseini, Seyed Mohammad Seyed, Maleki, Anahita, and Gholamian, Mohammad Reza
- Published
- 2010
- Full Text
- View/download PDF
20. A SOM prototype-based cluster analysis methodology
- Author
-
Jorge Calle-Espinosa, Soledad Delgado, Francisco Montero, Clara Higuera, and Federico Morn
- Subjects
Self-organizing map ,Clustering high-dimensional data ,Fuzzy clustering ,Computer science ,Correlation clustering ,Conceptual clustering ,02 engineering and technology ,Machine learning ,computer.software_genre ,03 medical and health sciences ,0302 clinical medicine ,Artificial Intelligence ,CURE data clustering algorithm ,Consensus clustering ,0202 electrical engineering, electronic engineering, information engineering ,Cluster analysis ,Brown clustering ,business.industry ,General Engineering ,Constrained clustering ,Computer Science Applications ,Data set ,ComputingMethodologies_PATTERNRECOGNITION ,Data stream clustering ,Canopy clustering algorithm ,FLAME clustering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Data mining ,business ,computer ,030217 neurology & neurosurgery - Abstract
An original computational approach for cluster analysis is proposed.The method consists of two phases, which are based on Self-Organizing Map.Topology-preserving and connectivity functions are used in the clustering process.The method is proved using three benchmark datasets and a real biological dataset.Automation in parameterization results in a user-friendly methodology. Data clustering is aimed at finding groups of data that share common hidden properties. These kinds of techniques are especially critical at early stages of data analysis where no information about the dataset is available. One of the mayor shortcomings of the clustering algorithms is the difficulty for non-experts users to configure them and, in some cases, interpret the results. In this work a computational approach with a two-layer structure based on Self-Organizing Map (SOM) is presented for cluster analysis. In the first level, a quantization of the data samples using topology-preserving metrics to automatically determine the number of units in the SOM is proposed. In the second level the obtained SOM prototypes are clustered by means of a connectivity analysis to explore the quality of the partitioning with different number of clusters. The most important benefit of this two-layer procedure is that computational load decreases considerably in comparison with data based clustering methods, making it possible to cluster large data sets and to consider several different clustering alternatives in a limited time. This methodology produces a two-dimensional map representation of the, usually, high dimensional input space, along with quantitative information on viable clustering alternatives, which facilitates the exploration of the possible partitions in a dataset. The efficiency and interpretation of the methodology is illustrated by its application to artificial, benchmark and real complex biological datasets. The experimental results demonstrate the ability of the method to identify possible segmentations in a dataset, compared to algorithms that only yield a single clustering solution. The proposed algorithm tackles the intrinsic limitations of SOM and the parameter settings associated with the clustering methodology, without requiring the number of clusters or the SOM architecture as a prerequisite, among others. This way, it makes possible its application even by researchers with a limited expertise in machine learning.
- Published
- 2017
21. A fuzzy modeling method via Enhanced Objective Cluster Analysis for designing TSK model
- Author
-
Wang, Na and Yang, Yupu
- Published
- 2009
- Full Text
- View/download PDF
22. Comparing the performance of traditional cluster analysis, self-organizing maps and fuzzy C-means method for strategic grouping
- Author
-
Budayan, Cenk, Dikmen, Irem, and Birgonul, M. Talat
- Published
- 2009
- Full Text
- View/download PDF
23. Cluster analysis using multi-algorithm voting in cross-cultural studies
- Author
-
Gelbard, Roy, Carmeli, Abraham, Bittmann, Ran M., and Ronen, Simcha (Simi)
- Published
- 2009
- Full Text
- View/download PDF
24. Cluster analysis of genome-wide expression data for feature extraction
- Author
-
Lin, Kuo-Sheng and Chien, Chen-Fu
- Published
- 2009
- Full Text
- View/download PDF
25. DDoS attack detection method using cluster analysis
- Author
-
Lee, Keunsoo, Kim, Juhyun, Kwon, Ki Hoon, Han, Younggoo, and Kim, Sehun
- Published
- 2008
- Full Text
- View/download PDF
26. Increasing the discriminatory power of DEA in the presence of the sample heterogeneity with cluster analysis and decision trees
- Author
-
Samoilenko, Sergey and Osei-Bryson, Kweku-Muata
- Published
- 2008
- Full Text
- View/download PDF
27. Hybrid chemical reaction based metaheuristic with fuzzy c-means algorithm for optimal cluster analysis
- Author
-
Janmenjoy Nayak, Himansu Sekhar Behera, Ajith Abraham, and Bighnaraj Naik
- Subjects
0209 industrial biotechnology ,Computer science ,General Engineering ,k-means clustering ,Swarm behaviour ,Initialization ,02 engineering and technology ,computer.software_genre ,Hybrid algorithm ,Fuzzy logic ,Computer Science Applications ,ComputingMethodologies_PATTERNRECOGNITION ,020901 industrial engineering & automation ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Unsupervised learning ,020201 artificial intelligence & image processing ,Data mining ,Cluster analysis ,Metaheuristic ,Algorithm ,computer - Abstract
A new hybrid method of FCM and CRO algorithm is proposed for optimal cluster analysis.The performance of CRO-FCM is compared with evolutionary & swarm based methods.Performance metrics like rate of error, inter & intra cluster distance are considered. Hybridization of two or more algorithms has always been a keen interest of research due to the quality of improvement in searching capability. Taking the positive insights of both the algorithms, the developed hybrid algorithm tries to minimize the substantial limitations. Clustering is an unsupervised learning method, which groups the data according to their similar or dissimilar properties. Fuzzy c-means (FCM) is one of the popularly used clustering algorithms and performs better as compared to other clustering techniques such as k-means. However, FCM possesses certain limitations such as premature trapping at local minima and high sensitivity to the cluster center initialization. Taking these issues into consideration, this research proposes a novel hybrid approach of FCM with a recently developed chemical based metaheuristic for obtaining optimal cluster centers. The performance of the proposed approach is compared in terms of cluster fitness values, inter-cluster distance and intra-cluster distance with other evolutionary and swarm optimization based approaches. A rigorous experimentation is simulated and experimental result reveals that the proposed hybrid approach is performing better as compared to other approaches.
- Published
- 2017
28. Cluster analysis using optimization algorithms with newly designed objective functions
- Author
-
D. Binu
- Subjects
Clustering high-dimensional data ,Fuzzy clustering ,business.industry ,Correlation clustering ,General Engineering ,Machine learning ,computer.software_genre ,Computer Science Applications ,Artificial Intelligence ,Search algorithm ,CURE data clustering algorithm ,Canopy clustering algorithm ,Data mining ,Artificial intelligence ,Cluster analysis ,business ,computer ,Metaheuristic ,Mathematics - Abstract
Perform cluster analysis using three newly designed objective functions.Utilize three optimization algorithms like, genetic, cuckoo search and PSO.Present 21 different clustering algorithms and the validation with 16 datasets.Proved, objective function decides effectiveness & search algorithm decides efficiency.Presented a suggestion for a better algorithm based on input data characteristics. Clustering finds various applications in the field of medical and telecommunication for unsupervised learning which is much required in expert system and its application. Various algorithms have been developed to clustering for the past fifty years after the introduction of k-means clustering. Recently, optimization algorithms are applied for clustering to find optimal clusters with the help of different objective functions. Accordingly, in this research, clustering is performed using three newly designed objective functions along with four existing objective functions with the help of optimization algorithms like, genetic algorithm, cuckoo search and particle swarm optimization algorithm. Here, three different objective functions are designed including the cumulative summation of fuzzy membership and distance value with normal data space, kernel space as well as multiple kernel space. In addition to the existing seven objective functions, totally, 21 different clustering algorithms are discussed and the performance is validated with 16 different datasets which are synthetic, small and large scale real data. The comparison is made with five different evaluation metrics to validate the effectiveness and efficiency. From the research outcome, the suggestion is presented to select a suitable algorithm among 21 algorithms for a particular data and results proved that the effectiveness of cluster analysis is mainly dependent on objective function and the efficiency of cluster analysis is based on search algorithm.
- Published
- 2015
29. Integrated cluster analysis and artificial neural network modeling for steam-assisted gravity drainage performance prediction in heterogeneous reservoirs
- Author
-
Juliana Y. Leung, Stefan Zanon, Peter John Dzurman, and Ehsan Amirian
- Subjects
Artificial neural network ,business.industry ,Computer science ,Dimensionality reduction ,General Engineering ,Steam injection ,Machine learning ,computer.software_genre ,Computer Science Applications ,Steam-assisted gravity drainage ,Artificial Intelligence ,Robustness (computer science) ,Principal component analysis ,Performance prediction ,Artificial intelligence ,Data mining ,business ,Cluster analysis ,computer ,Oil shale ,Curse of dimensionality - Abstract
Data-driven modeling provides an attractive alternative to predict SAGD recovery.The modeling approach is applied successfully for heterogeneous reservoirs.Arps parameters are proposed to parameterize production time-series data.A normalized shale indicator is used as a pertinent input attribute.Accuracy of the prediction is greatly enhanced when cluster analyses are performed. Evaluation of steam-assisted gravity drainage (SAGD) performance that involves detailed compositional simulations is usually deterministic, cumbersome, expensive (manpower and time consuming), and not quite suitable for practical decision making and forecasting, particularly when dealing with high-dimensional data space consisting of large number of operational and geological parameters. Data-driven modeling techniques, which entail comprehensive data analysis and implementation of machine learning methods for system forecast, provide an attractive alternative.In this paper, artificial neural network (ANN) is employed to predict SAGD production in heterogeneous reservoirs, an important application that is lacking in existing literature. Numerical flow simulations are performed to construct a training data set consists of various attributes describing characteristics associated with reservoir heterogeneities and other relevant operating parameters. Empirical Arps decline parameters are tested successfully for parameterization of cumulative production profile and considered as outputs of the ANN models. Sensitivity studies on network configurations are also investigated. Principal components analysis (PCA) is performed to reduce the dimensionality of the input vector, improve prediction quality, and limit over-fitting. In a case study, reservoirs with distinct heterogeneity distributions are fed to the model. It is shown that robustness and accuracy of the prediction capability are greatly enhanced when cluster analysis are performed to identify internal data structures and groupings prior to ANN modeling. Both deterministic and fuzzy-based clustering techniques are compared, and separate ANN model is constructed for each cluster. The model is then tested using a validation data set (cases that have not been used during the training stage).The proposed approach can be integrated directly into most existing reservoir management routines. In addition, incorporating techniques for dimensionality reduction and clustering with ANN demonstrates the viability of this approach for analyzing large field data set. Given that quantitative ranking of operating areas, robust forecasting, and optimization of heavy oil recovery processes are major challenges faced by the industry, the proposed research highlights the significant potential of applying effective data-driven modeling approaches in analyzing other solvent-additive steam injection projects.
- Published
- 2015
30. Neutrosophic genetic algorithm and its application in clustering analysis of rock discontinuity sets.
- Author
-
Yong, Rui, Wang, Hanzhong, Ye, Jun, Du, Shigui, and Luo, Zhanyou
- Subjects
- *
ROCK analysis , *CLUSTER analysis (Statistics) , *K-means clustering , *EVIDENCE gaps , *COMPUTATIONAL complexity , *DISCONTINUOUS functions - Abstract
This paper presents the neutrosophic genetic algorithm (NGA) to address the research gap in the application of neutrosophic theory in conjunction with genetic algorithms. NGA introduces three distinct solution spaces—truth, falsity, and indeterminacy—enabling it to entirely encompass neutrosophic solution spaces in the operational process. Fine-tuning in the true solution space (TSS), adaptive regeneration in the false solution space (FSS), and modified crossover and mutation operations in the indeterminate solution space (ISS) enhance NGA ability to navigate away from local optima while reducing computational complexity. Evaluation against several prior algorithms based on the CEC2017 test suites demonstrates the superior performance of NGA, achieving the highest overall score of 92.11% in various problems and conditions. Sensitivity analysis of NGA parameters provides significant insights into algorithm performance variations, emphasizing the substantial impact of these parameters on the NGA's performance. The application of NGA to optimize the K-means method for clustering analysis of rock discontinuity sets showcases its efficiency and potential for practical applications in related fields, highlighting its advantages over other methods. This research establishes NGA as an innovative and efficient approach to address imprecision, incompleteness, and uncertainty in practical data scenarios, with significant implications for future development and applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. An evolutionary computational model applied to cluster analysis of DNA microarray data
- Author
-
Fernando Díaz and José A. Castellanos-Garzón
- Subjects
Clustering high-dimensional data ,Fuzzy clustering ,Brown clustering ,business.industry ,General Engineering ,computer.software_genre ,Machine learning ,Computer Science Applications ,Hierarchical clustering ,Artificial Intelligence ,CURE data clustering algorithm ,Consensus clustering ,Canopy clustering algorithm ,Artificial intelligence ,Data mining ,business ,Cluster analysis ,computer ,Mathematics - Abstract
This paper proposes a new hierarchical clustering method using genetic algorithms for the analysis of gene expression data. This method is based on the mathematical proof of several results, showing its effectiveness with regard to other clustering methods. Genetic algorithms applied to cluster analysis have disclosed good results on biological data and many studies have been carried out in this sense, although most of them are focused on partitional clustering methods. Even though there are few studies that attempt to use genetic algorithms for building hierarchical clustering, they do not include constraints that allow us to reduce the complexity of the problem. Therefore, these studies become intractable problems for large data sets. On the other hand, the deterministic hierarchical clustering methods generally face the problem of convergence towards local optimums due to their greedy strategy. The method introduced here is an alternative to solve some of the problems existing methods face. The results of the experiments have shown that our approach can be very effective in cluster analysis of DNA microarray data.
- Published
- 2013
32. A visual analytics framework for cluster analysis of DNA microarray data
- Author
-
Fernando Díaz, Carlos Armando García, José A. Castellanos-Garzón, Paulo Novais, and Universidade do Minho
- Subjects
Clustering high-dimensional data ,Metric spaces ,Visual analytics ,Computer science ,02 engineering and technology ,computer.software_genre ,Cluster analysis ,Knowledge extraction ,Artificial Intelligence ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Data mining ,Science & Technology ,DNA-microarrays ,Dendrogram ,General Engineering ,Computer Science Applications ,Visualization ,ComputingMethodologies_PATTERNRECOGNITION ,Gene chip analysis ,Data analysis ,Boundary points ,020201 artificial intelligence & image processing ,Surface reconstruction ,computer - Abstract
Prova tipográfica, Cluster analysis of DNA microarray data is an important but difficult task in knowledge discovery processes. Many clustering methods are applied to analysis of data for gene expression, but none of them is able to deal with an absolute way with the challenges that this technology raises. Due to this, many applications have been developed for visually representing clustering algorithm results on DNA microarray data, usually providing dendrogram and heat map visualizations. Most of these applications focus only on the above visualizations, and do not offer further visualization components to the validate the clustering methods or to validate one another. This paper proposes using a visual analytics framework in cluster analysis of gene expression data. Additionally, it presents a new method for finding cluster boundaries based on properties of metric spaces. Our approach presents a set of visualization components able to interact with each other; namely, parallel coordinates, cluster boundary genes, 3D cluster surfaces and DNA microarray visualizations as heat maps. Experimental results have shown that our framework can be very useful in the process of more fully understanding DNA microarray data. The software has been implemented in Java, and the framework is publicly available at http://www. analiticavisual.com/jcastellanos/3DVisualCluster/3D-VisualCluster., This work has been partially funded by the Spanish Ministry of Science and Innovation, the Plan E from the Spanish Government, the European Union from the ERDF (TIN2009-14057-C03-02).
- Published
- 2013
33. Mining qualitative patterns in spatial cluster analysis
- Author
-
Ickjai Lee, Yang Qu, and Kyungmi Lee
- Subjects
Computer science ,Process (engineering) ,General Engineering ,Spatial cluster analysis ,computer.software_genre ,Computer Science Applications ,Set (abstract data type) ,ComputingMethodologies_PATTERNRECOGNITION ,Artificial Intelligence ,Concept learning ,Cluster (physics) ,Data mining ,Cluster analysis ,computer - Abstract
Clustering is an important concept formation process within AI. It detects a set of objects with similar characteristics. These similar aggregated objects represent interesting concepts and categories. As clustering becomes more mature, post-clustering activities that reason about clusters need a great attention. Numerical quantitative information about clusters is not as intuitive as qualitative one for human analysis, and there is a great demand for an intelligent qualitative cluster reasoning technique in data-rich environments. This article introduces a qualitative cluster reasoning framework that reasons about clusters. Experimental results demonstrate that our proposed qualitative cluster reasoning reveals interesting cluster structures and rich cluster relations.
- Published
- 2012
34. A fuzzy modeling method via Enhanced Objective Cluster Analysis for designing TSK model
- Author
-
Yupu Yang and Na Wang
- Subjects
Mathematical optimization ,Fuzzy clustering ,Fuzzy classification ,Fuzzy rule ,Artificial Intelligence ,General Engineering ,Fuzzy number ,Fuzzy set operations ,FLAME clustering ,Cluster analysis ,Defuzzification ,Computer Science Applications ,Mathematics - Abstract
This paper proposes a fuzzy modeling method via Enhanced Objective Cluster Analysis to obtain the compact and robust approximate TSK fuzzy model. In our approach, the Objective Cluster Analysis algorithm is introduced. In order to obtain more compact and more robust fuzzy rule prototypes, this algorithm is enhanced by introducing the Relative Dissimilarity Measure and the new consistency criterion to represent the similarity degree between the clusters. By these additional criteria, the redundant clusters caused by iterations are avoided; the subjective influence from human judgment for clustering is weakened. Moreover the clustering results including the number of clusters and the cluster centers are considered as the initial condition of the premise parameters identification. Thus the traditional iteration modeling procedure for determining the number of rules and identifying parameters is changed into one-off modeling, which significantly reduces the burden of computation. Furthermore the decomposition errors and the approximation errors resulted from premise parameters identification by Fuzzy c-Means clustering are decreased. For the consequence parameters identification, the Stable Kalman Filter algorithm is adopted. The performance of the proposed modeling method is evaluated by the example of Box-Jenkins gas furnace. The simulation results demonstrate the power of our model.
- Published
- 2009
35. Comparing the performance of traditional cluster analysis, self-organizing maps and fuzzy C-means method for strategic grouping
- Author
-
Cenk Budayan, Irem Dikmen, and M. Talat Birgonul
- Subjects
Self-organizing map ,Structure (mathematical logic) ,Computer science ,General Engineering ,Strategic group ,computer.software_genre ,Fuzzy logic ,Computer Science Applications ,Set (abstract data type) ,Artificial Intelligence ,Selection (linguistics) ,Data mining ,Cluster analysis ,computer - Abstract
Strategic group analysis comprises of clustering of firms within an industry according to their similarities with respect to a set of strategic dimensions and investigating the performance implications of strategic group membership. One of the challenges of strategic group analysis is the selection of the clustering method. In this study, the results of the strategic group analysis of Turkish contractors are presented to compare the performances of traditional cluster analysis techniques, self-organizing maps (SOM) and fuzzy C-means method (FCM) for strategic grouping. Findings reveal that traditional cluster analysis methods cannot disclose the overlapping strategic group structure and position of companies within the same strategic group. It is concluded that SOM and FCM can reveal the typology of the strategic groups better than traditional cluster analysis and they are more likely to provide useful information about the real strategic group structure.
- Published
- 2009
36. Cluster analysis using multi-algorithm voting in cross-cultural studies
- Author
-
Roy Gelbard, Abraham Carmeli, Simcha Ronen, and Ran M. Bittmann
- Subjects
Point (typography) ,Computer science ,media_common.quotation_subject ,General Engineering ,Computer Science Applications ,Ranking ,Artificial Intelligence ,Voting ,Cross-cultural ,Hofstede's cultural dimensions theory ,World Values Survey ,Cluster analysis ,Algorithm ,media_common - Abstract
The goal of this study was to overcome three main shortcomings in using a single algorithm to determine a particular clustering of a phenomenon. We addressed this issue by considering cross-cultural research as a case in point and applied Multi-Algorithm Voting (MAV) methodology to cluster analysis. Specifically, this study was designed to provide more systematic supportive decision tools for researchers and managers alike when attempting to cluster analyzing phenomena. To assess the merits of the methodology of MAV for cluster analysis, we analytically examined cross-cultural data from Merritt [Merritt, A. (2000). Culture in the cockpit Do Hofstede's dimensions replicate? Journal of Cross-Cultural Psychology, 31, 283-301] study as well as data scored and ranked by Hofstede [Hofstede, G. (1980). Culture's consequences: International differences in work-related values. Beverly Hills, CA: Sage; Hofstede, G. (1982). Values survey module (Tech. Paper). Maastricht, The Netherlands: Institute for Research on Intercultural Cooperation]. Our study contributes to the literature in several ways. From a methodological point of view, we show how researchers can avoid arbitrary decisions in determining the number of clusters. We provide the researcher with more compelling and robust methodologies not only for analyzing the results of cluster analysis, but also for more better-grounded decision-making through which theoretical insights and implications can be drawn.
- Published
- 2009
37. Clustering of multi-view relational data based on particle swarm optimization.
- Author
-
de Gusmão, Renê Pereira and de Carvalho, Francisco de A.T.
- Subjects
- *
PARTICLE swarm optimization , *CLUSTER analysis (Statistics) , *BIG data , *MACHINE learning , *MATHEMATICAL functions - Abstract
Highlights • The paper provides multi-view clustering algorithms for relational data. • The algorithms are based on Particle Swarm Optimization. • They are able to select the relevant views for the clustering task. • The algorithms consider eleven different fitness functions. • Experiments with real multi-view data sets shows their usefulness. Abstract Clustering of multi-view data has received increasing attention since it explores multiple views of data sets aiming at improving clustering accuracy. Particle Swarm Optimization (PSO) is a well-known population-based meta-heuristic successfully used in cluster analysis. This paper introduces two hybrid clustering methods for multi-view relational data. These hybrid methods combine PSO and hard clustering algorithms based on multiple dissimilarity matrices. These methods take advantage of the global convergence ability of PSO and the local exploitation of hard clustering algorithms in the position update step, aiming to improve the balance between exploitation and exploration processes. Moreover, the paper provides adapted versions of 11 fitness functions suitable for vector data aiming at dealing with multi-view relational data. Two performance criteria were used to evaluate the clustering quality using the two proposed methods over eleven real-world data sets including image and document data sets. Among new findings, it was observed that the top three fitness functions are Silhouette index, Xu index and Intra-cluster homogeneity. The performance of the proposed algorithms was compared with previous single and multi-view relational clustering algorithms. The results show that the proposed methods significantly outperformed the other algorithms in the majority of cases. The results reinforce the importance of the application of techniques such as PSO-based clustering algorithms in the field of expert systems and machine learning. Such application enhances classification accuracy and cluster compactness. Besides, the proposed algorithms can be useful tools in content-based image retrieval systems, providing good categorizations and automatically learning relevance weights for each cluster of images and sets of views. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
38. Integration of fuzzy cluster analysis and kernel density estimation for tracking typhoon trajectories in the Taiwan region
- Author
-
Bo-Song Su, Churn-Jung Liau, Chao-Hung Lin, and Hone Jay Chu
- Subjects
Normalization (statistics) ,Computer science ,Kernel density estimation ,General Engineering ,k-means clustering ,computer.software_genre ,Fuzzy logic ,Computer Science Applications ,Hierarchical clustering ,Goodness of fit ,Artificial Intelligence ,Typhoon ,Cluster (physics) ,Data mining ,Cluster analysis ,computer - Abstract
Highlights? Offer an alternative way to explore the spatial patterns of typhoon tracks. ? Cluster methodologies show that typhoon centers pass through Taiwan from the south-east to the north-west. ? Provide planners to understand the hotspot areas of typhoon tracks and adjust disaster management efforts. Increasing our understanding of typhoon movements remains a priority in the western North Pacific. In this study, the trajectories of typhoons that affected Taiwan between 1986 and 2010 are used for clustering, where each trajectory consists of 6-hourly latitude-longitude positions over two days. We compare the performance of four statistical clustering methods, namely, k-means clustering, fuzzy c-means (FCM) clustering, hierarchical clustering, and normalized cut techniques. The results show that the FCM technique provides sufficient cluster efficiency with a relatively high degree of goodness of fit. FCM identifies six clusters according to the minimum coefficients of variation (CV). The hotspots of the typhoon centers in each cluster are determined by kernel density estimation (KDE). Moreover, the typhoon track belongs to six clusters with different membership degrees in FCM. The typhoon track density map is estimated by combining the KDE hotspot maps associated with the FCM weights. The information could be used in planning for disaster management.
- Published
- 2012
39. Increasing the discriminatory power of DEA in the presence of the sample heterogeneity with cluster analysis and decision trees
- Author
-
Sergey Samoilenko and Kweku-Muata Osei-Bryson
- Subjects
Discriminatory power ,Efficiency ,Artificial Intelligence ,Homogeneity (statistics) ,Statistics ,General Engineering ,Decision tree ,Econometrics ,Data envelopment analysis ,Cluster analysis ,Computer Science Applications ,Mathematics - Abstract
Data envelopment analysis (DEA) is a widely used non-parametric data analytic tool discriminatory power of which is dependent on the homogeneity of the domain of the sample. In many real-life cases, however, the sample of the decision making units (DMU) could consist of two or more naturally occurring subsets, thus exhibiting clear signs of heterogeneity. In such situations, the discriminatory power of DEA is limited, for the nature of the relative efficiency of a DMU is likely to be influenced by its membership in a particular subset of the sample. In this study, we propose a three-step methodology allowing for increasing the discriminatory power of DEA in the presence of the heterogeneity of the sample. In the first phase, we use cluster analysis (CA) in order to test for the presence of the naturally occurring subsets in the sample. In the second phase DEA is used to calculate the relative efficiencies of the DMUs, as well as averaged relative efficiencies of each subset identified in the previous phase. Finally, we utilize decision tree (DT) induction in order to inquire into the subset-specific nature of the relative efficiencies of the DMUs in the sample. Illustrative example is provided.
- Published
- 2008
40. Self-organizing feature map for cluster analysis in multi-disease diagnosis
- Author
-
Simon X. Yang, Ke Zhang, and Yi Chai
- Subjects
Self-organizing map ,Euclidean distance ,ComputingMethodologies_PATTERNRECOGNITION ,Artificial Intelligence ,Computer science ,Feature (computer vision) ,General Engineering ,Cluster (physics) ,Data mining ,computer.software_genre ,Cluster analysis ,computer ,Computer Science Applications - Abstract
Aiming at the multi-disease diagnosis, a self-organizing map (SOM) is developed. In this paper the tomato disease features are extracted and a mapping relationship between the diseases and the features is created. The inaccurate clustering of traditional SOM algorithm is analyzed. According to the analysis, Euclidean distance is taken as the main discrimination, and the adjacent-searching algorithm is optimized. Using the optimized algorithm, the cluster results of input samples are obtained, features of diseases are mapped, and a multi-disease diagnosis model is developed. The proposed SOM-based model has two layers. The feature array of diseases can be accurately and rapidly sorted and clustered using this model. This model can achieve an accurate diagnosis of multi-diseases. The simulation results show that the proposed model performs well and the proposed multi-disease diagnosis is effective.
- Published
- 2010
41. Effect of dimensionality reduction on stock selection with cluster analysis in different market situations
- Author
-
Jingti Han and Zhipeng Ge
- Subjects
0209 industrial biotechnology ,Computer science ,Dimensionality reduction ,General Engineering ,02 engineering and technology ,Stock market index ,Autoencoder ,Computer Science Applications ,020901 industrial engineering & automation ,Component analysis ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Econometrics ,020201 artificial intelligence & image processing ,Stock (geology) - Abstract
Dimensionality reduction is inevitable in stock selection with cluster analysis. Considering relations among dimensionality reduction, noise trading, and market situations, we empirically investigate the effect of dimensionality-reduction methods–principal component analysis, stacked autoencoder, and stacked restricted Boltzmann machine–on stock selection with cluster analysis in different market situations. Based on the index fluctuation, the market is divided into sideways and trend situations. For the CSI 100 and Nikkei 225 constituent stocks, experimental results show that: (1) In sideways situations, dimensionality reduction hardly improves the performance of stock selection with cluster analysis; (2) the advantage of dimensionality reduction is mainly reflected in trend situations, but whether it is in an up or down trend depends on the market analyzed. More importantly, according to the above findings and assuming that the dimensionality-reduction effect will continue, we propose a rotation strategy with and without dimensionality reduction. The results of experiments show that the proposed rotation strategy outperforms the stock market indices as well as the stock-selection strategies based on dimensionality reduction and cluster analysis. These findings offer practical insights into how dimensionality reduction can be efficiently used for stock selection.
- Published
- 2020
42. (Dis)integration levels across global stock markets: A multidimensional scaling and cluster analysis
- Author
-
Sahar Pordeli Behrouz, Hamidreza Esmalifalak, Ali Irannezhad Ajirlou, and Maryam Esmalifalak
- Subjects
Computer science ,General Engineering ,Diversification (finance) ,computer.software_genre ,Stock market index ,Computer Science Applications ,Globalization ,Artificial Intelligence ,Financial crisis ,Econometrics ,Stock market ,Data mining ,Explanatory power ,computer ,Stock (geology) - Abstract
A visual inspection over 41 global stock indexes is conducted.We use MDS and AHC methods to detect (dis)integration levels and clusters.For each level and cluster, the abnormal indexes are detected.Using most integrated indexes, we report their linkage in 12 different time periods. Following the globalization of the economy, interests in examining the financial linkages of economies have been increased, especially in the wake of the global financial crisis. In light of the benefits of international portfolio diversification, it is not surprising that there has been a strong interest among researchers to capture presence and extent of global stock market GSM integration. This study addresses to the feasibility and benefits of two visual data interpretation methods, based on multidimensional scaling and cluster analysis, in GSM integration context. The explanatory power of applied methods has become stronger by putting both integration and disintegration (anomaly) detection strategies. Empirically, in the first phase, we detect (dis)integration levels and clusters among 41 indexes from 40 countries. In the second phase, we choose most integrated indexes, detected in the first phase, to see whether their linkages are progressing, stable or regressing during different time series (12 seasons). In phase one and two, the topological properties of global stock networks are expressed in two-dimensional plots (dendrogram, heat map, frequency map). Based on the results, we found proposed methods capable of capturing (1) hierarchy of interrelated clusters embedded in a complex network of objects (2) different (dis)integration levels among selected global stock indexes and (3) abnormal seasons and indexes considering time and regional horizons.
- Published
- 2015
43. Cluster evolution analysis: Identification and detection of similar clusters and migration patterns.
- Author
-
Ramon-Gonen, Roni and Gelbard, Roy
- Subjects
- *
CLUSTER analysis (Statistics) , *HUMAN migration patterns , *LOGICAL prediction , *DATA analysis , *GRAPH theory - Abstract
Cluster analysis often addresses a specific point in time, ignoring previous cluster analysis products. The present study proposes a model entitled Cluster Evolution Analysis (CEA) that addresses three phenomena likely to occur over time: (1) changes in the number of clusters; (2) changes in cluster characteristics; (3) between-cluster migration of objects. To achieve this goal, two new techniques are implemented: to find similarities between clusters at different points in time, we used the moving average of cluster centroid technique, and to detect prominent migration patterns we used the clustering of clusters technique. The research introduces two new visual tools displaying all the clusters over the entire time period under study in a single graph. The model was tested on five-year trade data of corporate bonds (2010–2014). The results obtained by the CEA model were checked and validated against the bond rating report issued periodically by the local bond rating company. The results proved the model capable of identifying repeated clusters at various points in time, and detecting patterns that predict prospective loss of value, as well as patterns that indicate stability and preservation of value over time. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
44. Analyzing ECG for cardiac arrhythmia using cluster analysis
- Author
-
Che Wun Chiou, Hong-Jhih Lin, and Yun-Chi Yeh
- Subjects
Heartbeat ,Left bundle branch block ,business.industry ,General Engineering ,Cardiac arrhythmia ,Pattern recognition ,Right bundle branch block ,medicine.disease ,Computer Science Applications ,Ventricular premature contractions ,QRS complex ,Artificial Intelligence ,Atrial premature contractions ,cardiovascular system ,medicine ,cardiovascular diseases ,Artificial intelligence ,Ecg signal ,business ,circulatory and respiratory physiology ,Mathematics - Abstract
This work proposes a method of analyzing ECG signal to diagnose cardiac arrhythmias utilizing the cluster analysis (CA) method. The proposed method can accurately classify and distinguish the difference between normal heartbeats (NORM) and abnormal heartbeats. Abnormal heartbeats may include the following: left bundle branch block (LBBB), right bundle branch block (RBBB), ventricular premature contractions (VPC), and atrial premature contractions (APC). Analysis of ECG signal consists of three major stages: (i) detecting the QRS waveform; (ii) selecting qualitative features; and (iii) determining heartbeat case. The ECG signals in the MIT-BIH arrhythmia database are adopted as reference data for accomplishing the first two stages, and cluster analysis is used to determine patient heartbeat case. In the experiments, the sensitivity is 95.59%, 91.32%, 90.50%, 94.51%, and 93.77% for heartbeat case NORM, LBBB, RBBB, VPC, and APC, respectively. The total classification accuracy (TCA) was about 94.30%.
- Published
- 2012
45. Optimized Multi-Algorithm Voting: Increasing objectivity in clustering.
- Author
-
Kempen, Regina, Meier, Alexander, Hasche, Jens, and Mueller, Karsten
- Subjects
- *
CLUSTER analysis (Statistics) , *OBJECTIVITY , *VOTING , *ALGORITHMS , *MATHEMATICAL optimization - Abstract
Highlights • Depending on clustering algorithm used, the results may change. • Based on Multi-Algorithm Voting, robustness of cluster solutions can be improved. • OMAV is developed that uses an optimization algorithm as integrative method. • OMAV is applied to the example of country clustering using GLOBE data. • Increased robustness and reduced subjectivity are demonstrated. Abstract Currently, the influence of a single statistical cluster algorithm on the results of clustering procedures represents a major threat to the objectivity in clustering. To exemplify this question, this paper refers to country clustering in cross-cultural research. In this field, previous research has determined differing numbers of clusters, depending on choices available for the clustering procedure, leading to a high number of inconsistent results. Hence, it is argued that the variety in cluster solutions induced by the choice of different statistical cluster algorithms should be reduced. To this end, this study builds on Multi-Algorithm Voting (MAV) procedure introduced by Bittmann and Gelbard (2007) and presents an advancement to the MAV method. Specifically, MAV procedure is refined for the analysis of larger data sets using the simulated annealing algorithm for optimization. The use of this Optimized MAV (OMAV) is then demonstrated for country clustering in cross-cultural research. Specifically, a set of 57 countries is divided into 12 clusters based on work-related values obtained from GLOBE database reported in House et al. (2004). Thus, results clearly show that the objectivity of clustering results can be significantly improved based on OMAV. Implications for expert and intelligent systems on the use of OMAV are discussed. Namely, OMAV represents a powerful tool supporting the decision-making process in cluster analysis reducing the number of subjective and arbitrary decisions. Taken together, this study contributes to existing literature by providing an integrative and robust method of country clustering using OMAV and by presenting country clusters applicable to various settings. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
46. Cluster analysis of genome-wide expression data for feature extraction
- Author
-
Chen-Fu Chien and Kuo-Sheng Lin
- Subjects
Microarray ,Covariance matrix ,Computer science ,business.industry ,Microarray analysis techniques ,Feature extraction ,General Engineering ,Pattern recognition ,Feature selection ,medicine.disease ,computer.software_genre ,Computer Science Applications ,ComputingMethodologies_PATTERNRECOGNITION ,Breast cancer ,Artificial Intelligence ,Gene expression ,Gene chip analysis ,medicine ,Microarray databases ,Table (database) ,Artificial intelligence ,Data mining ,business ,computer ,Curse of dimensionality - Abstract
Bio-chip data that consists of high-dimensional attributes have more attributes than specimens. Thus, it is difficult to obtain covariance matrix from tens thousands of genes within a number of samples. Feature selection and extraction is critical to remove noisy features and reduce the dimensionality in microarray analysis. This study aims to fill the gap by developing a data mining framework with a proposed algorithm for cluster analysis of gene expression data, in which coefficient correlation is employed to arrange genes. Indeed, cluster analysis of microarray data can find coherent patterns of gene expression. The output is displayed as table list for convenient survey. We adopt the breast cancer microarray dataset to demonstrate practical viability of this approach.
- Published
- 2009
47. A density connection weight-based clustering approach for dataset with density-sparse region.
- Author
-
Zhang, Min, Ma, Yang, Li, Junli, and Zhang, Jifu
- Subjects
- *
CLUSTER analysis (Statistics) , *DENSITY - Abstract
Density-Based clustering, an important clustering analysis approach, has several advantages of discovering clusters with arbitrary shapes and identifying noise samples etc. However, existing density-based clustering approaches cannot effectively identify clusters in density-sparse region. In this paper, we propose a density-based clustering approach for dataset with density-sparse region by using density connection weight. Firstly, we define a series of new concepts, such as density connection weight and average weight. And data objects are divided into core objects, boundary objects and noise objects according to average weight, which can effectively distinguish core objects and boundary objects in density-sparse region. Secondly, a density clustering assignment strategy is constructed based on density connection weight. A density clustering algorithm, together with the dimensionality reduction using t-SNE, judiciously handles the low-density clusters in multi-density datasets and the minimal clusters in extremely imbalanced datasets. In the end, the experimental results on the UCI and synthetic datasets validate that our algorithm exhibits prominent clustering performance, especially while being adapted in diverse clustering tasks, including the multi-density, imbalanced and uniform distribution datasets. • New concepts of density connection weight etc., are defined for selecting core objects. • A filtering method of dividing data objects is proposed using average weight. • A cluster allocation strategy is constructed using density connection weight. • A density connection weight-based clustering algorithm is proposed. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
48. DDoS attack detection method using cluster analysis
- Author
-
Juhyun Kim, Sehun Kim, Younggoo Han, Keunsoo Lee, and Ki Hoon Kwon
- Subjects
business.industry ,Computer science ,Network packet ,ComputerSystemsOrganization_COMPUTER-COMMUNICATIONNETWORKS ,Application layer DDoS attack ,General Engineering ,Denial-of-service attack ,Intrusion detection system ,Cluster (spacecraft) ,Computer Science Applications ,Artificial Intelligence ,Packet drop attack ,business ,Trinoo ,Computer network - Abstract
Distributed Denial of Service (DDoS) attacks generate enormous packets by a large number of agents and can easily exhaust the computing and communication resources of a victim within a short period of time. In this paper, we propose a method for proactive detection of DDoS attack by exploiting its architecture which consists of the selection of handlers and agents, the communication and compromise, and attack. We look into the procedures of DDoS attack and then select variables based on these features. After that, we perform cluster analysis for proactive detection of the attack. We experiment with 2000 DARPA Intrusion Detection Scenario Specific Data Set in order to evaluate our method. The results show that each phase of the attack scenario is partitioned well and we can detect precursors of DDoS attack as well as the attack itself.
- Published
- 2008
49. Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty
- Author
-
Mohammad Reza Gholamian, Seyed Mohammad Seyed Hosseini, and Anahita Maleki
- Subjects
Index (economics) ,business.industry ,Computer science ,media_common.quotation_subject ,General Engineering ,Customer relationship management ,computer.software_genre ,Marketing strategy ,Computer Science Applications ,Loyalty business model ,Artificial Intelligence ,Loyalty ,Product (category theory) ,Data mining ,Customer intelligence ,business ,computer ,media_common - Abstract
Data mining (DM) methodology has a tremendous contribution for researchers to extract the hidden knowledge and information which have been inherited in the data used by researchers. This study has proposed a new procedure, based on expanded RFM model by including one additional parameter, joining WRFM-based method to K-means algorithm applied in DM with K-optimum according to Davies-Bouldin Index, and then classifying customer product loyalty in under B2B concept. The developed methodology has been implemented for SAPCO Co. in Iran. The result shows a tremendous capability to the firm to assess his customer loyalty in marketing strategy designed by this company in comparing with random selection commonly used by most companies in Iran.
- Published
- 2010
50. Hybrid K-means, fuzzy C-means, and hierarchical clustering for DNA hepatitis C virus trend mutation analysis.
- Author
-
Al Kindhi, Berlian, Sardjono, Tri Arief, Purnomo, Mauridhi Hery, and Verkerke, Gijbertus Jacob
- Subjects
- *
CLUSTER analysis (Statistics) , *K-means clustering , *FUZZY clustering technique , *HIERARCHICAL clustering (Cluster analysis) , *HEPATITIS C virus , *GENETIC mutation - Abstract
Highlights • Analysis of virus trend using hybrid clustering. • Combining the K-means, Fuzzy C-means, and hierarchical clustering. • Processing DNA data into unsupervised learning methods. • DNA Micro Array data processing. Abstract Every single strand of DNA consists of 10 sequences of nucleotides. These sequences cannot be separated or randomly arranged because each sequence of DNA contains a certain genomic encoding. When a virus mutates, a drug or vaccine for that virus that has been given to a patient will become useless. Therefore, there is a need for a method of analysing the likely direction of DNA mutation so that preventative measures can be adapted more quickly. RNA-type viruses are able to alter the patterns of infected DNA, which is one way for such a virus to defend itself. In this paper, we propose a new hybrid clustering method that combines K-means, fuzzy C-means, and hierarchical clustering to predict the direction of DNA mutation trends. We have combined these three different approaches in a hybrid clustering method and tested it on two data sets of 1000 isolated positive hepatitis C virus (HCV)-infected and non-infected DNA strands with 37 HCV primers. We compare the results with those of eight other clustering methods, and the comparison shows that our method achieves sensitivity and specificity values of 0.998. The level of precision of cluster division is also 0.004 higher than that of the next highest among the eight methods considered for comparison. From this study, the primer trends that most often appear in isolated DNA can be found, and the origins of these trends in isolated DNA can be inferred. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.