25 results on '"Clusterings"'
Search Results
2. A Novel Two-Level Clustering-Based Differential Evolution Algorithm for Training Neural Networks
- Author
-
Mousavirad, S. J., Oliva, D., Schaefer, G., Helali Moghadam, Mahshid, El-Abd, M., Mousavirad, S. J., Oliva, D., Schaefer, G., Helali Moghadam, Mahshid, and El-Abd, M.
- Abstract
Determining appropriate weights and biases for feed-forward neural networks is a critical task. Despite the prevalence of gradient-based methods for training, these approaches suffer from sensitivity to initial values and susceptibility to local optima. To address these challenges, we introduce a novel two-level clustering-based differential evolution approach, C2L-DE, to identify the initial seed for a gradient-based algorithm. In the initial phase, clustering is employed to detect some regions in the search space. Population updates are then executed based on the information available within each region. A new central point is proposed in the subsequent phase, leveraging cluster centres for incorporation into the population. Our C2L-DE algorithm is compared against several recent DE-based neural network training algorithms, and is shown to yield favourable performance.
- Published
- 2024
- Full Text
- View/download PDF
3. Ultrametricity of Dissimilarity Spaces and Its Significance for Data Mining
- Author
-
Simovici, Dan A., Vetro, Rosanne, Hua, Kaixun, Kacprzyk, Janusz, Series editor, Guillet, Fabrice, editor, Pinaud, Bruno, editor, and Venturini, Gilles, editor
- Published
- 2017
- Full Text
- View/download PDF
4. Data-Driven Analyses of Electronic Text Books
- Author
-
Boubekki, Ahcène, Kröhne, Ulf, Goldhammer, Frank, Schreiber, Waltraud, Brefeld, Ulf, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Michaelis, Stefan, editor, Piatkowski, Nico, editor, and Stolpe, Marco, editor
- Published
- 2016
- Full Text
- View/download PDF
5. Real-Time Large-Scale 6G Satellite-UAV Networks
- Author
-
Nguyen, Minh-Hien T., Bui, Tinh T., Nguyen, Long D., Garcia-Palacios, Emiliano, Zepernick, Hans-Juergen, Duong, Trung Q., Nguyen, Minh-Hien T., Bui, Tinh T., Nguyen, Long D., Garcia-Palacios, Emiliano, Zepernick, Hans-Juergen, and Duong, Trung Q.
- Abstract
In this paper, we consider an Internet-of-Things network supported by several satellites and multiple cache-assisted unmanned aerial vehicles (UAVs). We propose an optimisation problem with the aim of minimising the total network latency. To reduce the complexity of the original problem, it is divided into three sub-problems, namely, clustering ground users associated with UAVs, cache placement in UAVs (to support the network in avoiding backhaul congestion), and power allocation for satellites and UAVs. A non-cooperative game is designed to obtain the solution to the clustering problem; a genetic algorithm, which is powerful in the scenario of many variables, is employed to obtain the optimal solution to the high-complexity caching problem; and a quick estimation technique is used for power allocation. The total network latency is then minimised by using alternating optimisation technique. Numerical results prove the efficiency of our methods compared to other traditional ones. © 2023 IEEE.
- Published
- 2023
- Full Text
- View/download PDF
6. Group-Personalized Federated Learning for Human Activity Recognition Through Cluster Eccentricity Analysis
- Author
-
Al-Saedi, Ahmed Abbas Mohsin, Boeva, Veselka, Al-Saedi, Ahmed Abbas Mohsin, and Boeva, Veselka
- Abstract
Human Activity Recognition (HAR) plays a significant role in recent years due to its applications in various fields including health care and well-being. Traditional centralized methods reach very high recognition rates, but they incur privacy and scalability issues. Federated learning (FL) is a leading distributed machine learning (ML) paradigm, to train a global model collaboratively on distributed data in a privacy-preserving manner. However, for HAR scenarios, the existing action recognition system mainly focuses on a unified model, i.e. it does not provide users with personalized recognition of activities. Furthermore, the heterogeneity of data across user devices can lead to degraded performance of traditional FL models in the smart applications such as personalized health care. To this end, we propose a novel federated learning model that tries to cope with a statistically heterogeneous federated learning environment by introducing a group-personalized FL (GP-FL) solution. The proposed GP-FL algorithm builds several global ML models, each one trained iteratively on a dynamic group of clients with homogeneous class probability estimations. The performance of the proposed FL scheme is studied and evaluated on real-world HAR data. The evaluation results demonstrate that our approach has advantages in terms of model performance and convergence speed with respect to two baseline FL algorithms used for comparison. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
- Published
- 2023
- Full Text
- View/download PDF
7. Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management
- Author
-
Homod, Raad Z., Yaseen, Zaher M., Hussein, Ahmed K., Almusaed, Amjad, Alawi, Omer A., Falah, Mayadah W., Abdelrazek, Ali H., Ahmed, Waqar, Eltaweel, Mahmoud, Homod, Raad Z., Yaseen, Zaher M., Hussein, Ahmed K., Almusaed, Amjad, Alawi, Omer A., Falah, Mayadah W., Abdelrazek, Ali H., Ahmed, Waqar, and Eltaweel, Mahmoud
- Abstract
Chillers are responsible for almost half of the total energy demand in buildings. Hence, the obligation of control systems of multi-chiller due to changes indoor environments is one of the most significant parts of a smart building. Such a controller is described as a nonlinear and multi-objective algorithm, and its fabrication is crucial to achieving the optimal balance between indoor thermal comfort and running a minimum number of chillers. This work proposes deep clustering of cooperative multi-agent reinforcement learning (DCCMARL) as well-suited to such system control, which supports centralized control by learning of agents. In MARL, since the learning of agents is based on discrete sets of actions and stats, this drawback significantly affects the model of agents for representing their actions with efficient performance. This drawback becomes considerably worse when increasing the number of agents, due to the increased complexity of solving MARL, which makes modeling policy very challenging. Therefore, the DCCMARL of multi-objective reinforcement learning is leveraging powerful frameworks of a hybrid clustering algorithm to deal with complexity and uncertainty, which is a critical factor that influences to the achievement of high levels of a performance action. The results showed that the ability of agents to manipulate the behavior of the smart building could improve indoor thermal conditions, as well as save energy up to 44.5% compared to conventional methods. It seems reasonable to conclude that agents' performance is influenced by what type of model structure.
- Published
- 2023
- Full Text
- View/download PDF
8. An Overall Index for Comparing Hierarchical Clusterings
- Author
-
Morlini, I., Zani, S., Gaul, Wolfgang A., editor, Geyer-Schulz, Andreas, editor, Schmidt-Thieme, Lars, editor, and Kunze, Jonas, editor
- Published
- 2012
- Full Text
- View/download PDF
9. Explain your clusters with words : The role of metadata in interactive clustering
- Author
-
Mozolewski, Maciej, Jamshidi, Samaneh, Bobek, Szymon, Nalepa, Grzegorz J., Mozolewski, Maciej, Jamshidi, Samaneh, Bobek, Szymon, and Nalepa, Grzegorz J.
- Abstract
In this preliminary work, we present an approach for the augmentation of clustering with natural language explanations. In clustering there are 2 main challenges: a) choice of a proper, reasonable number of clusters, and b) cluster analysis and profiling. There is a plethora of technics for a) but not so much for b), which is in general a laborious task of explaining obtained clusters. We propose a method that aids experts in cluster analysis by providing an iterative, human-in-the-loop methodology of generating cluster explanations. In an illustrative example, we show how the process of clustering on a set of objective variables could be facilitated with textual metadata. In our case, images of products from online fashion store are used for clustering. Then, product descriptions are used for profiling clusters. © 2022 Copyright for this paper by its authors., This paper is funded from the XPM (ExplainablePredictive Maintenance) project funded by the National Science Center, Poland under CHIST-ERAprogramme Grant Agreement No. 857925 (NCNUMO-2020/02/Y/ST6/00070).The work of Szymon Bobek has been additionallysupported by a HuLCKA grant from the PriorityResearch Area (Digiworld) under the Strategic Programme Excellence Initiative at the JagiellonianUniversity (U1U/P06/NO/02.16).The work of Samaneh Jamshidi was supportedby CHIST-ERA grant CHIST-ERA-19-XAI-012funded by Swedish Research Council.
- Published
- 2022
10. FedCO : Communication-Efficient Federated Learning via Clustering Optimization †
- Author
-
Al-Saedi, Ahmed Abbas Mohsin, Boeva, Veselka, Casalicchio, Emiliano, Al-Saedi, Ahmed Abbas Mohsin, Boeva, Veselka, and Casalicchio, Emiliano
- Abstract
Federated Learning (FL) provides a promising solution for preserving privacy in learning shared models on distributed devices without sharing local data on a central server. However, most existing work shows that FL incurs high communication costs. To address this challenge, we propose a clustering-based federated solution, entitled Federated Learning via Clustering Optimization (FedCO), which optimizes model aggregation and reduces communication costs. In order to reduce the communication costs, we first divide the participating workers into groups based on the similarity of their model parameters and then select only one representative, the best performing worker, from each group to communicate with the central server. Then, in each successive round, we apply the Silhouette validation technique to check whether each representative is still made tight with its current cluster. If not, the representative is either moved into a more appropriate cluster or forms a cluster singleton. Finally, we use split optimization to update and improve the whole clustering solution. The updated clustering is used to select new cluster representatives. In that way, the proposed FedCO approach updates clusters by repeatedly evaluating and splitting clusters if doing so is necessary to improve the workers’ partitioning. The potential of the proposed method is demonstrated on publicly available datasets and LEAF datasets under the IID and Non-IID data distribution settings. The experimental results indicate that our proposed FedCO approach is superior to the state-of-the-art FL approaches, i.e., FedAvg, FedProx, and CMFL, in reducing communication costs and achieving a better accuracy in both the IID and Non-IID cases. © 2022 by the authors., open access
- Published
- 2022
- Full Text
- View/download PDF
11. ERCP : Energy-Efficient and Reliable-Aware Clustering Protocol for Wireless Sensor Networks
- Author
-
El-Fouly, Fatma H., Khedr, Ahmed Y., Sharif, Md. Haidar, Alreshidi, Eissa Jaber, Yadav, Kusum, Kusetogullari, Hüseyin, Ramadan, Rabie A., El-Fouly, Fatma H., Khedr, Ahmed Y., Sharif, Md. Haidar, Alreshidi, Eissa Jaber, Yadav, Kusum, Kusetogullari, Hüseyin, and Ramadan, Rabie A.
- Abstract
Wireless Sensor Networks (WSNs) have been around for over a decade and have been used in many important applications. Energy and reliability are two of the major problems with these kinds of applications. Reliable data delivery is an important issue in WSNs because it is a key part of how well data are sent. At the same time, energy consumption in battery-based sensors is another challenge. Therefore, efficient clustering and routing are techniques that can be used to save sensors energy and guarantee reliable message delivery. With this in mind, this paper develops an energy-efficient and reliable clustering protocol (ERCP) for WSNs. First, an efficient clustering technique is proposed for sensor nodes’ energy savings considering different clustering parameters, including the link quality metric, the energy, the distance to neighbors, the distance to the sink node, and the cluster load metric. The proposed routing protocol works based on the concept of a reliable inter-cluster routing technique that saves energy. The routing decisions are made based on different parameters, such as the energy balance metric, the distance to the sink node, and the wireless link quality. Many experiments and analyses are examined to determine how well the ERCP performs. The experiment results showed that the ECRP protocol performs much better than some of the recent algorithms in both homogeneous and heterogeneous networks. © 2022 by the authors., open access
- Published
- 2022
- Full Text
- View/download PDF
12. Cluster Assignment in Multi-Agent Systems
- Author
-
Sharf, Miel, Zelazo, D., Sharf, Miel, and Zelazo, D.
- Abstract
We study cluster assignment in multi-agent networks. We consider homogeneous diffusive networks, and focus on design of the graph that ensures the system will converge to a prescribed cluster configuration, i.e., specifying the number of clusters and agents within each cluster. Leveraging recent results from cluster synthesis, we show that it is possible to design an oriented graph such that the action of the automorphism group of the graph has orbits of predetermined sizes, guaranteeing that the network will converge to the prescribed cluster configuration. We provide upper and lower bounds on the number of edges that are needed to construct these graphs along with a constructive approach for generating these graphs. We support our analysis with some numerical examples., QC 20230510
- Published
- 2022
- Full Text
- View/download PDF
13. Optical Aerial Images Change Detection Based on a Color Local Dissimilarity Map and k-Means Clustering
- Author
-
Diaw, Moustapha, Landré, Jérôme, Delahaies, Agnès, Morain-Nicolier, Frédéric, Retraint, Florent, Diaw, Moustapha, Landré, Jérôme, Delahaies, Agnès, Morain-Nicolier, Frédéric, and Retraint, Florent
- Abstract
Considering the unavailability of labeled data sets in remote sensing change detection, this letter presents a novel and low complexity unsupervised change detection method based on the combination of similarity and dissimilarity measures: mutual information (MI), disjoint information (DI), and local dissimilarity map (LDM). MI and DI are calculated on sliding windows with a step of 1 pixel for each pair of channels of both images. The resulting scalar values, weighted by q and m coefficients, are multiplied by the values of the center pixels of the windows weighted by p to remove the textures on images. The changes are detected using, respectively, the grayscale LDM and color LDM. A sliding window is then used on the color LDM and each pixel is characterized by a two-parameter Weibull distribution. Binarized change maps can be obtained by using a k-means clustering on the model parameters. Experiments on optical aerial image data set show that the proposed method produces comparable, even better results, to the state-of-the-art methods in terms of recall, precision, and F-measure.
- Published
- 2022
- Full Text
- View/download PDF
14. Impact of Clustering Methods on Machine Learning-based Solar Power Prediction Models
- Author
-
Aupke, Phil, Kassler, Andreas, Theocharis, Andreas, Nilsson, Magnus, Andersson, Isac Myren, Aupke, Phil, Kassler, Andreas, Theocharis, Andreas, Nilsson, Magnus, and Andersson, Isac Myren
- Abstract
Prediction of solar power generation is important in order to optimize energy exchanges in future micro-grids that integrate a large amount of photovoltaics. However, an accurate prediction is difficult due to the uncertainty of weather phenomena that impact produced power. In this paper, we evaluate the impact of different clustering methods on the forecast accuracy for predicting hourly ahead solar power when using machine learning based prediction approaches trained on weather and generated power features. In particular, we compare clustering methods using clearness index and K-means clustering, where we use both euclidian distance and dynamic time-warping. For evaluating prediction accuracy, we develop and compare different prediction models for each of the clusters using production data from a swedish SmartGrid. We demonstrate that proper tuning of thresholds for the clearness index improves prediction accuracy by 20.19% but results in worse performance than using K-means with all weather features as input to the clustering.
- Published
- 2022
- Full Text
- View/download PDF
15. Analysis of traffic accidents with fuzzy and crisp data mining techniques to identify factors affecting injury severity
- Author
-
Emrah Bilgiç, Tutku TuncalıYaman, M. Fevzi Esen, Mühendislik ve Doğa Bilimleri Fakültesi -- Lojistik Yönetimi Bölümü, and Bilgiç, Emrah
- Subjects
Statistics and Probability ,Classification trees ,Clustering algorithms ,Computer science ,Injury severity ,Regression trees ,030508 substance abuse ,02 engineering and technology ,Data-mining techniques ,computer.software_genre ,Fuzzy logic ,Clustering ,03 medical and health sciences ,Insurance ,Motor vehicle ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Crisp data ,Clusterings ,Data mining ,Child Restraint Systems ,Accident severity ,Fuzzy data ,Fuzzy clustering ,Traffic accidents ,General Engineering ,Vehicles ,Seat Belts ,Automobile manufacture ,Vehicle traffic ,Computer Science ,CRT ,020201 artificial intelligence & image processing ,Electrical Engineering, Electronics & Computer Science - Transportation - Road Safety ,Highway accidents ,0305 other medical science ,human activities ,computer - Abstract
Injury severity in motor vehicle traffic accidents is determined by a number of factors including driver, vehicle, and environment. Airbag deployment, vehicle speed, manner of collusion, atmospheric and light conditions, degree of ejection of occupant’s body from the crash, the use of equipment or other forces to re-move occupants from the vehicle, model and type of vehicle have been considered as important risk factors affecting accident severity as well as driver-related conditions such as age, gender, seatbelt use, alcohol and drug involvement. In this study, we aim to identify important variables that contribute to injury severity in the traffic crashes. A contemporary dataset is obtained from National Highway Traffic Safety Administration’s (NHTSA) Fatality Analysis Reporting System (FARS). To identify accident severity groups, we performed different clustering algorithms including fuzzy clustering. We then assessed the important factors affecting injury severity by using classification and regression trees (CRT). The results which would guide car manufacturers, policy makers and insurance companies indicate that the most important factor in defining injury severity is deployment of air-bag, followed by extrication, ejection occurrences, and travel speed and alcohol involvement.
- Published
- 2022
16. FedCO : Communication-Efficient Federated Learning via Clustering Optimization †
- Author
-
Ahmed A. Al-Saedi, Veselka Boeva, and Emiliano Casalicchio
- Subjects
Central servers ,Shared model ,federated learning ,Learning systems ,Computer Networks and Communications ,Computer Sciences ,Internet of Things ,convolutional neural network ,Optimization approach ,communication efficiency ,Cost reduction ,Clustering optimizations ,Datavetenskap (datalogi) ,Privacy-preserving techniques ,Convolutional neural networks ,Communication cost ,Clusterings ,Workers' ,clustering - Abstract
Federated Learning (FL) provides a promising solution for preserving privacy in learning shared models on distributed devices without sharing local data on a central server. However, most existing work shows that FL incurs high communication costs. To address this challenge, we propose a clustering-based federated solution, entitled Federated Learning via Clustering Optimization (FedCO), which optimizes model aggregation and reduces communication costs. In order to reduce the communication costs, we first divide the participating workers into groups based on the similarity of their model parameters and then select only one representative, the best performing worker, from each group to communicate with the central server. Then, in each successive round, we apply the Silhouette validation technique to check whether each representative is still made tight with its current cluster. If not, the representative is either moved into a more appropriate cluster or forms a cluster singleton. Finally, we use split optimization to update and improve the whole clustering solution. The updated clustering is used to select new cluster representatives. In that way, the proposed FedCO approach updates clusters by repeatedly evaluating and splitting clusters if doing so is necessary to improve the workers’ partitioning. The potential of the proposed method is demonstrated on publicly available datasets and LEAF datasets under the IID and Non-IID data distribution settings. The experimental results indicate that our proposed FedCO approach is superior to the state-of-the-art FL approaches, i.e., FedAvg, FedProx, and CMFL, in reducing communication costs and achieving a better accuracy in both the IID and Non-IID cases. © 2022 by the authors. open access
- Published
- 2022
17. Anatomy of continuous Mars SEIS and pressure data from unsupervised learning
- Author
-
S. Barkaoui, Grégory Sainton, Maarten V. de Hoop, Eléonore Stutzmann, Rudolf Widmer-Schnidrig, Philippe Lognonné, W. Bruce Banerdt, Aymeric Spiga, Taichi Kawamura, Randall Balestriero, Matthieu Plasman, John Clinton, Léonard Seydoux, Savas Ceylan, Francesco Civilini, John-Robert Scholz, Institut de Physique du Globe de Paris (IPGP (UMR_7154)), Institut national des sciences de l'Univers (INSU - CNRS)-Université de La Réunion (UR)-Institut de Physique du Globe de Paris (IPG Paris)-Centre National de la Recherche Scientifique (CNRS)-Université Paris Cité (UPCité), Institut des Sciences de la Terre (ISTerre), Institut national des sciences de l'Univers (INSU - CNRS)-Institut de recherche pour le développement [IRD] : UR219-Université Savoie Mont Blanc (USMB [Université de Savoie] [Université de Chambéry])-Centre National de la Recherche Scientifique (CNRS)-Université Gustave Eiffel-Université Grenoble Alpes (UGA), Rice University [Houston], Max-Planck-Institut für Sonnensystemforschung = Max Planck Institute for Solar System Research (MPS), Max-Planck-Gesellschaft, Laboratoire de Météorologie Dynamique (UMR 8539) (LMD), Institut national des sciences de l'Univers (INSU - CNRS)-École polytechnique (X)-École des Ponts ParisTech (ENPC)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Département des Géosciences - ENS Paris, École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL), Institut Pierre-Simon-Laplace (IPSL (FR_636)), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Université de Versailles Saint-Quentin-en-Yvelines (UVSQ)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Institut national des sciences de l'Univers (INSU - CNRS)-École polytechnique (X)-Centre National d'Études Spatiales [Toulouse] (CNES)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Université Paris Cité (UPCité), California Institute of Technology (CALTECH), and ANR-19-CE31-0008,MAGIS,MArs Geophysical InSight(2019)
- Subjects
Seismic investigations ,Seismic noise ,Computer science ,unsupervised classification ,[SDU.STU]Sciences of the Universe [physics]/Earth Sciences ,Interior structure ,Machine learning ,computer.software_genre ,Continuous data ,Geochemistry and Petrology ,geodesy ,Clusterings ,Pressure drop ,Seismology ,Pressure data ,business.industry ,Heat transport ,Deep learning ,pressure gradient ,Mars Exploration Program ,Stable time ,Geophysics ,machine learning ,Unsupervised learning ,Artificial intelligence ,Micro-events ,business ,computer ,Quasi-periodicities - Abstract
The seismic noise recorded by the Interior Exploration using Seismic Investigations, Geodesy, and Heat Transport (InSight) seismometer (Seismic Experiment for Interior Structure [SEIS]) has a strong daily quasi-periodicity and numerous transient microevents, associated mostly with an active Martian environment with wind bursts, pressure drops, in addition to thermally induced lander and instrument cracks. That noise is far from the Earth’s microseismic noise. Quantifying the importance of nonstochasticity and identifying these microevents is mandatory for improving continuous data quality and noise analysis techniques, including autocorrelation. Cataloging these events has so far been made with specific algorithms and operator’s visual inspection. We investigate here the continuous data with an unsupervised deep-learning approach built on a deep scattering network. This leads to the successful detection and clustering of these microevents as well as better determination of daily cycles associated with changes in the intensity and color of the background noise. We first provide a description of our approach, and then present the learned clusters followed by a study of their origin and associated physical phenomena. We show that the clustering is robust over several Martian days, showing distinct types of glitches that repeat at a rate of several tens per sol with stable time differences. We show that the clustering and detection efficiency for pressure drops and glitches is comparable to or better than manual or targeted detection techniques proposed to date, noticeably with an unsupervised approach. Finally, we discuss the origin of other clusters found, especially glitch sequences with stable time offsets that might generate artifacts in autocorrelation analyses. We conclude with presenting the potential of unsupervised learning for long-term space mission operations, in particular, for geophysical and environmental observatories.
- Published
- 2021
- Full Text
- View/download PDF
18. Studying expert initial set and hard to map cases in automated code-to-architecture mappings
- Author
-
Olsson, Tobias, Ericsson, Morgan, Wingkvist, Anna, Olsson, Tobias, Ericsson, Morgan, and Wingkvist, Anna
- Abstract
We study the mapping of software source code to architectural modules. Background: To evaluate techniques for performing automatic mapping of code-to-architecture, a ground truth mapping, often provided by an expert, is needed. From this ground truth, techniques use an initial set of mapped source code as a starting point. The size and composition of this set affect the techniques’ performance, and to make comparisons, random sizes and compositions are used. However, while randomness will give a baseline for comparison, it is not likely that a human expert would compose an initial set on random to map source code. We are interested in letting an expert create an initial set based on their experience with the system and study how this affects how a technique performs. Also, previous research has shown that when comparing an automatic mapping with the ground truth mappings, human experts often accept the automated mappings and, if not, point to the need for refactoring the source code. We want to study this phenomenon further. Audience: Researchers and developers of tools in the area of architecture conformance. The system expert can gain valuable insights into where the source code needs to be refactored. Aim: We hypothesize that an initial set assigned by an expert performs better than a random initial set of similar size and that an expert will agree upon or find opportunities for refactoring in a majority of cases where the automatic mapping and expert mapping disagrees. Method: The initial set will be extracted from an interview with the expert. Then the performance (precision and recall f1 score) will be compared to mappings starting from random initial sets and using an automatic technique. We will also use our tool to find the cases where the automatic and human mapping disagrees and then let the expert review these cases. Results: We expect to find a difference when performance is compared. We expect the expert review to reveal source code that should be rema
- Published
- 2021
19. Cell-Free Massive MIMO with Large-Scale Fading Decoding and Dynamic Cooperation Clustering
- Author
-
Demir, Ozlem Tugfe, Björnson, Emil, Sanguinetti, Luca, Demir, Ozlem Tugfe, Björnson, Emil, and Sanguinetti, Luca
- Abstract
This paper considers the uplink of user-centric cellfree massive MIMO (multiple-input multiple-output) systems. We utilize the user-centric dynamic cooperation clustering (DCC) framework and derive the achievable spectral efficiency with two-layer decoding that is divided between the access points and the central processing unit (CPU). This decoding method is also known as large-scale fading decoding (LSFD). The fronthaul signaling load is analyzed and a nearly optimal second-layer decoding scheme at the CPU is proposed to reduce the fronthaul requirements compared to the optimal scheme.We also revisit the joint optimization of LSFD weights and uplink power control and show that the corresponding max-min fair optimization problem can be solved optimally via an efficient fixed-point algorithm. We provide simulations that bring new insights into the cell-free massive MIMO implementation., Part of proceedings: ISBN 978-3-8007-5688-9QC 20220615
- Published
- 2021
20. HMS-OS : Improving the Human Mental Search Optimisation Algorithm by Grouping in both Search and Objective Space
- Author
-
Mousavirad, S. J., Schaefer, G., Korovin, I., Oliva, D., Helali Moghadam, Mahshid, Saadatmand, Mehrdad, Mousavirad, S. J., Schaefer, G., Korovin, I., Oliva, D., Helali Moghadam, Mahshid, and Saadatmand, Mehrdad
- Abstract
The human mental search (HMS) algorithm is a relatively recent population-based metaheuristic algorithm, which has shown competitive performance in solving complex optimisation problems. It is based on three main operators: mental search, grouping, and movement. In the original HMS algorithm, a clustering algorithm is used to group the current population in order to identify a promising region in search space, while candidate solutions then move towards the best candidate solution in the promising region. In this paper, we propose a novel HMS algorithm, HMS-OS, which is based on clustering in both objective and search space, where clustering in objective space finds a set of best candidate solutions whose centroid is then also used in updating the population. For further improvement, HMS-OS benefits from an adaptive selection of the number of mental processes in the mental search operator. Experimental results on CEC-2017 benchmark functions with dimensionalities of 50 and 100, and in comparison to other optimisation algorithms, indicate that HMS-OS yields excellent performance, superior to those of other methods., Conference code: 176593; Cited By :1; Export Date: 8 June 2022; Conference Paper; Funding text 1: This research is funded within the SFEDU development program (PRIORITY 2030).
- Published
- 2021
- Full Text
- View/download PDF
21. A Framework for Localization and Clustering in IoT Enabled Wireless Sensor and Actor Networks
- Author
-
Zuleyha Akusta Dagdeviren
- Subjects
Energy utilization ,Internet of things ,Decentralized control ,Computer science ,Energy Efficiency ,Message complexity ,Wireless sensor and actor networks ,Communication layers ,Decentralised control ,Sensor nodes ,Clustering ,Total energy ,Cluster (physics) ,Wireless ,Cluster analysis ,Clusterings ,Energy-consumption ,Structure (mathematical logic) ,Crucial technology ,business.industry ,Node (networking) ,Localisation ,Decentralised system ,Sensors network ,Localization ,business ,Wireless sensor network ,Efficient energy use ,Computer network - Abstract
5h International Symposium on Multidisciplinary Studies and Innovative Technologies, ISMSIT 2021 -- 21 October 2021 through 23 October 2021 -- -- 174473, Wireless sensor and actor networks (WSANs) compose of ordinary sensor networks and actor devices equipped with various capabilities. These networks are crucial technologies which are located at the communication layer of Internet of things. Actors (agents) are employed to provide decentralized control of the sensor networks. We propose an actor controlled framework for localization and clustering in sensor networks. To the best of our knowledge, our framework is the first which offers both localization and clustering within constant message complexity per node. In this structure, a mobile actor device traverses the sensor network to localize and cluster the sensor nodes. Total energy consumption of the sensor nodes using this method are reduced compared to the distributed approaches. Moreover, cluster quality of our algorithms is better than their counterparts. We show that our framework provides a suitable infrastructure for cluster-based applications in an energy efficient manner. © 2021 IEEE.
- Published
- 2021
22. Building a gold standard for perceptual sketch similarity
- Author
-
Sezgin, Tevfik Metin (ORCID 0000-0002-1524-1646 & YÖK ID 18632), Çakmak, Serike, College of Engineering, Graduate School of Sciences and Engineering, and Department of Computer Engineering
- Subjects
Basic concepts ,Clusterings ,Designed experiments ,Efficient construction ,Experiment set-up ,Gold standards ,Large datasets ,Measure of similarities ,Scalings ,Sketch recognition ,Crowdsourcing ,Personnel ,Task assignment - Abstract
Similarity is among the most basic concepts studied in psychology. Yet, there is no unique way of assessing similarity of two objects. In the sketch recognition domain, many tasks such as classification, detection or clustering require measuring the level of similarity between sketches. In this paper, we propose a carefully designed experiment setup to construct a gold standard for measuring the similarity of sketches. Our setup is based on table scaling, and allows efficient construction of a measure of similarity for large datasets containing hundreds of sketches in reasonable time scales. We report the results of an experiment involving a total of 9 unique assessors, and 8 groups of sketches, each containing 300 drawings. The results show high interrater agreement between the assessors, which makes the constructed gold standard trustworthy., Scientific and Technological Research Council of Turkey (TÜBİTAK)
- Published
- 2016
23. Studies on special types of copolymers (ABA block copolymers)
- Author
-
Shit, Subhas Chandra
- Subjects
Trade ,Clusterings ,Industries ,Land Availability ,Environmental - Published
- 2015
24. Outlier Ranking via Subspace Analysis in Multiple Views of the Data
- Author
-
Ira Assent, Patricia Iglesias, Emmanuel Müller, Klemens Böhm, and Yvonne Mulle
- Subjects
business.industry ,media_common.quotation_subject ,InformationSystems_DATABASEMANAGEMENT ,Pattern recognition ,Space (commercial competition) ,Object (computer science) ,computer.software_genre ,Linear subspace ,Measure (mathematics) ,ComputingMethodologies_PATTERNRECOGNITION ,multiple subspaces ,Ranking ,Outlier ,outlier ranking ,clusterings ,Quality (business) ,Artificial intelligence ,Data mining ,business ,computer ,Subspace topology ,Mathematics ,media_common - Abstract
Outlier mining is an important task for finding anomalous objects. In practice, however, there is not always a clear distinction between outliers and regular objects as objects have different roles w.r.t. different attribute sets. An object may deviate in one subspace, i.e. a subset of attributes. And the same object might appear perfectly regular in other subspaces. One can think of subspaces as multiple views on one database. Traditional methods consider only one view (the full attribute space). Thus, they miss complex outliers that are hidden in multiple subspaces. In this work, we propose Outrank, a novel outlier ranking concept. Outrank exploits subspace analysis to determine the degree of outlierness. It considers different subsets of the attributes as individual outlier properties. It compares clustered regions in arbitrary subspaces and derives an outlierness score for each object. Its principled integration of multiple views into an outlierness measure uncovers outliers that are not detectable in the full attribute space. Our experimental evaluation demonstrates that Outrank successfully determines a high quality outlier ranking, and outperforms state-of-the-art outlierness measures.
- Published
- 2012
- Full Text
- View/download PDF
25. Multiple clustering solutions analysis through least-squares consensus algorithms
- Author
-
Murino, L.a, Angelini, C.b, Bifulco, I.a, De Feis, I.b, Raiconi, G.a, Tagliaferri, and R.a
- Subjects
Consensus algorithms ,Least Square ,Artificial intelligence ,Clustering algorithms ,Graphical visualization ,Bioinformatics ,Multiple solutions ,Clustering solutions ,Multiple clusterings ,Cluster analysis ,Synthetic and real data ,Least square errors ,Consensus clustering ,Data sets ,Clusterings ,Unlabeled data ,Numerical experiments ,Visualization - Abstract
Clustering is one of the most important unsupervised learning problems and it deals with finding a structure in a collection of unlabeled data; however, different clustering algorithms applied to the same data-set produce different solutions. In many applications the problem of multiple solutions becomes crucial and providing a limited group of good clusterings is often more desirable than a single solution. In this work we propose the Least Square Consensus clustering that allows a user to extrapolate a small number of different clustering solutions from an initial (large) set of solutions obtained by applying any clustering algorithm to a given data-set. Two different implementations are presented. In both cases, each consensus is accomplished with a measure of quality defined in terms of Least Square error and a graphical visualization is provided in order to make immediately interpretable the result. Numerical experiments are carried out on both synthetic and real data-sets. © 2010 Springer-Verlag.
- Published
- 2010
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.