13,263 results
Search Results
2. Unsupervised Multimodal Supervoxel Merging Towards Brain Tumor Segmentation
- Author
-
Pelluet, Guillaume, Rizkallah, Mira, Acosta, Oscar, Mateus, Diana, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Crimi, Alessandro, editor, and Bakas, Spyridon, editor
- Published
- 2022
- Full Text
- View/download PDF
3. Online Prediction of Aggregated Retailer Consumer Behaviour
- Author
-
Spenrath, Yorick, Hassani, Marwan, van Dongen, Boudewijn F., van der Aalst, Wil, Series Editor, Mylopoulos, John, Series Editor, Ram, Sudha, Series Editor, Rosemann, Michael, Series Editor, Szyperski, Clemens, Series Editor, Munoz-Gama, Jorge, editor, and Lu, Xixi, editor
- Published
- 2022
- Full Text
- View/download PDF
4. Automatic Unsupervised Clustering of Videos of the Intracytoplasmic Sperm Injection (ICSI) Procedure
- Author
-
Storås, Andrea M., Riegler, Michael A., Haugen, Trine B., Thambawita, Vajira, Hicks, Steven A., Hammer, Hugo L., Kakulavarapu, Radhika, Halvorsen, Pål, Stensen, Mette H., Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Zouganeli, Evi, editor, Yazidi, Anis, editor, Mello, Gustavo, editor, and Lind, Pedro, editor
- Published
- 2022
- Full Text
- View/download PDF
5. A Comprehensive Tensor Framework for the Clustering of Hyperspectral Paper Data With an Application to Forensic Document Analysis
- Author
-
Jobin Francis, Baburaj Madathil, Sudhish N. George, and Sony George
- Subjects
Forensic document analysis ,hyperspectral imaging (HSI) ,clustering ,self-expressiveness property ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
In forensic document analysis, the authenticity of a document must be properly checked in the context of suspected forgery. Hyperspectral Imaging (HSI) is a non-invasive way of detecting fraudulent papers in a multipage document. The occurrence of a forged paper in a multi-page document may have a substantial difference from rest of the papers in its age, type, color, texture, and so on. Each pixel in an HSI data can be used as the material fingerprint for the spatial point it corresponds to. Hence, hyperspectral data of paper samples made of the same substance have similar characteristics and can be grouped into a single cluster. Similarly, paper samples made of different substances have different spectral properties. This paper relies on this heuristic and proposes a tensor based clustering framework for hyperspectral paper data, with an application to detect the forged papers in multi-page documents. Information embedded in the hyperspectral patches of the papers to be clustered is arranged into individual lateral slices of a third-order tensor in this framework. Further, this work employs the self-expressiveness property of submodules and an objective function is formulated to extract self-expressive representation tensor with low multirank and f-diagonal structure. Objective function of the proposed method incorporates $l_{\frac {1}{2}}$ -induced Tensor Nuclear Norm (TNN) and $l_{\frac {1}{2}}$ regularization to impart better low rankness and f-diagonal structure to the representation tensor. Experimental results of the proposed method were compared to the state-of-the-art subspace clustering approaches. The results demonstrate improved performance of the proposed method over the existing clustering algorithms.
- Published
- 2022
- Full Text
- View/download PDF
6. Deduction of time-dependent machine tool characteristics by fuzzy-clustering
- Author
-
Frieß, Uwe, Kolouch, Martin, Putz, Matthias, inIT - Institut für industrielle Informa, Beyerer, Jürgen, editor, Kühnert, Christian, editor, and Niggemann, Oliver, editor
- Published
- 2019
- Full Text
- View/download PDF
7. Temporal dynamics of research field integration on slow-cited papers and the awakeners
- Author
-
Miura, Takahiro, Asatani, Kimitaka, and Sakata, Ichiro
- Subjects
Sleeping beauty ,Awakener ,Slow-cited paper ,Research field formation ,Clustering - Abstract
Understanding the long-term impact of scientific findings requires understanding the dynamic process of new research fields' formation. In new research fields, slow-cited papers (SCP) and the awakeners (AW) are more likely to exist, indicating explorers revisited underrated but significant past papers relocating the findings in the new paradigm. This study acquired SCP-AW pairs located in the integrated point of two different research fields using the inheritance of clusters. We found that research field integration, including SCP-AW pairs, was diverse but followed a similar pattern throughout history, generating an equal mix of SCP and AW fields. The recent trend toward more AW-centric disciplinary combinations supports the belief that field integration will become increasingly technology-driven in the coming years.
- Published
- 2022
- Full Text
- View/download PDF
8. The Thematic Structure of Papers on Depression Treatment in PubMed from 2005 to 2014
- Author
-
Seyed Hossein Hosseininasab, Fatemeh Makkizadeh, Ebrahim Zalzadeh, and Afsaneh Hazeri
- Subjects
Co-words ,Clustering ,PubMed ,Thematic Structure ,Depression ,Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Introduction: Due to the high prevalence of depression and the considerable pressure it puts on individuals, society and healthcare system, it is essential to conduct sufficient research to help with decision making in prevention, treatment and control of this condition. To assist with research planning and setting priorities, this paper aimed to identify the thematic structure of papers related to depression treatment. Methods: This was an applied research study which used Scientometrics approach. To obtain data, the keyword “Depression” was searched as a mesh descriptor with the subdivision “Therapy” in PubMed database for the period of 2005 to 2014. The data were analyzed using co-word and clustering methods with the help of Excel and SPSS software packages. Results: The growth of scientific production in this field appeared to have been balanced throughout the ten-year period examined in the study. The most active research areas for the two five year periods (2005-2009 and 2010-2014) were drug therapy, psychology, and medication adherence. Two hierarchical graphs of descriptors for each five-year period were prepared both of which were composed of 12 clusters with 34 common descriptors. Conclusion: The findings based on the inclusion index showed that only 20 percent of topics in the second five-year period of the study were novel. Therefore, it could be concluded that research areas that were related with each other in previous years will probably co-occur with other topics in the future might disappear. In every discipline, some topics are considered to be fundamental and research works are being carried out on them almost every year.
- Published
- 2016
9. (Position paper) Characterizing the Behavior of Small Producers in Smart Grids A Data Sanity Analysis.
- Author
-
Stefan, Maria, Gutierrez, Jose, Barlet, Pere, Prieto, Eduardo, Gomis, Oriol, and Olsen, Rasmus L.
- Subjects
ELECTRIC power distribution grids ,DATA analysis ,ENERGY consumption ,FORECASTING ,CONSUMER profiling ,LOAD forecasting (Electric power systems) - Abstract
Renewable energy production throughout low-voltage grids has gradually increased in electrical distribution systems, therefore introducing small energy producers - prosumers. This paradigm challenges the traditional unidirectional energy distribution flow to include disperse power production from renewables. To understand how energy usage can be optimized in the dynamic electrical grid, it is important to understand the behavior of prosumers and their impact on the grid's operational procedures. The main focus of this study is to investigate how grid operators can obtain an automatic data-driven system for the low-voltage electrical grid management, by analyzing the available grid topology and time-series consumption data from a real-life test area. The aim is to argue for how different consumer profiles, clustering and prediction methods contribute to the grid-related operations. Ultimately, this work is intended for future research directions that can contribute to improving the trade-off between systematic and scalable data models and software computational challenges. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
10. Gene expression responses of paper birch (Betula papyrifera) to elevated CO2 and O3 during leaf maturation and senescence
- Author
-
Kontunen-Soppela, Sari, Parviainen, Juha, Ruhanen, Hanna, Brosché, Mikael, Keinänen, Markku, Thakur, Ramesh C., Kolehmainen, Mikko, Kangasjärvi, Jaakko, Oksanen, Elina, Karnosky, David F., and Vapaavuori, Elina
- Subjects
GENE expression ,PAPER birch ,EFFECT of carbon dioxide on plants ,EFFECT of atmospheric carbon dioxide on plants ,ATMOSPHERIC ozone & the environment ,OZONE & the environment ,EFFECT of ozone on plants ,EFFECT of atmospheric ozone on plants - Abstract
Gene expression responses of paper birch (Betula papyrifera) leaves to elevated concentrations of CO
2 and O3 were studied with microarray analyses from three time points during the summer of 2004 at Aspen FACE. Microarray data were analyzed with clustering techniques, self-organizing maps, K-means clustering and Sammon''s mappings, to detect similar gene expression patterns within sampling times and treatments. Most of the alterations in gene expression were caused by O3 , alone or in combination with CO2 . O3 induced defensive reactions to oxidative stress and earlier leaf senescence, seen as decreased expression of photosynthesis- and carbon fixation-related genes, and increased expression of senescence-associated genes. The effects of elevated CO2 reflected surplus of carbon that was directed to synthesis of secondary compounds. The combined CO2 +O3 treatment resulted in differential gene expression than with individual gas treatments or in changes similar to O3 treatment, indicating that CO2 cannot totally alleviate the harmful effects of O3 . [Copyright &y& Elsevier]- Published
- 2010
- Full Text
- View/download PDF
11. Multi-View Data approaches in Recommender Systems: an Overview: (Invited Paper).
- Author
-
Palomares, Iván and Kovalchuk, Sergey V.
- Subjects
RECOMMENDER systems ,DATA science ,DECISION support systems ,LEARNING ,AGGREGATION operators - Abstract
This paper overviews an assortment of recent research work undertaken on recommender system models based on using multiple views of user and item-related data across the recommendation process. A summary of representative literature on multi-view recommender approaches is provided, describing their main characteristics, such as: their potential to overcome most common shortcomings in conventional recommender systems, as well as the use of data science, learning techniques and aggregation processes to combine information stemming from multiple views. A tabular summary is provided to facilitate the comparison of the similarities and differences among the surveyed works, along with commonly identified directions for future research in the topic. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
12. Adaptive weights clustering of research papers
- Author
-
Kirill Efimov, Larisa Adamyan, Wolfgang Karl Härdle, and Cathy Yi-Hsuan Chen
- Subjects
JEL system ,Adaptive algorithm ,Point (typography) ,Computer science ,330 Wirtschaft ,05 social sciences ,Nonparametric statistics ,020206 networking & telecommunications ,02 engineering and technology ,computer.software_genre ,Clustering ,Weighting ,0502 economics and business ,ddc:330 ,0202 electrical engineering, electronic engineering, information engineering ,Key (cryptography) ,Economic articles ,Nonparametric ,Data mining ,050207 economics ,Cluster analysis ,computer ,Research center - Abstract
The JEL classification system is a standard way of assigning key topics to economic articles to make them more easily retrievable in the bulk of nowadays massive literature. Usually the JEL (Journal of Economic Literature) is picked by the author(s) bearing the risk of suboptimal assignment. Using the database of the Collaborative Research Center from Humboldt-Universität zu Berlin we employ a new adaptive clustering technique to identify interpretable JEL (sub)clusters. The proposed Adaptive Weights Clustering (AWC) is available on http://www.quantlet.de/ and is based on the idea of locally weighting each point (document, abstract) in terms of cluster membership. Comparison with $$k$$ k -means or CLUTO reveals excellent performance of AWC.
- Published
- 2020
13. Establishing the original order of the poems in Harward’s Almanac using paleography, codicology, X-ray fluorescence spectroscopy, and statistical analysis
- Author
-
Biolcati, Veronica, Woolley, James, Lévêque, Élodie, Rossi, Andrea, Hoffmann, Anna Grace, Visentin, Andrea, Macháin, Pádraig Ó, and Iacopino, Daniela
- Published
- 2023
- Full Text
- View/download PDF
14. Recuperación de información para artículos científicos soportada en el agrupamiento de documentos XML.
- Author
-
Magdaleno, Damny, Fuentes, Ivett E., Cabezas, Michel, and García, María M.
- Published
- 2016
15. Cluster-based supplier segmentation: a sustainable data-driven approach
- Author
-
Rahiminia, Mohammad, Razmi, Jafar, Shahrabi Farahani, Sareh, and Sabbaghnia, Ali
- Published
- 2023
- Full Text
- View/download PDF
16. (Position paper) Characterizing the behavior of small producers in smart grids: A data sanity analysis
- Author
-
Rasmus Løvenstein Olsen, Pere Barlet, Eduardo Prieto, Jose M. Gutierrez, Oriol Gomis, Maria Stefan, Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya. Departament d'Enginyeria Elèctrica, Universitat Politècnica de Catalunya. CBA - Sistemes de Comunicacions i Arquitectures de Banda Ampla, and Universitat Politècnica de Catalunya. CITCEA - Centre d'Innovació Tecnològica en Convertidors Estàtics i Accionaments
- Subjects
Computer science ,Distributed computing ,02 engineering and technology ,Prediction methods ,Renewable energy sources ,Clustering ,Data modeling ,Energy optimization ,Data sanity ,0202 electrical engineering, electronic engineering, information engineering ,Cluster analysis ,General Environmental Science ,Energies::Energia elèctrica::Automatització i control de l'energia elèctrica [Àrees temàtiques de la UPC] ,business.industry ,Electric power distribution ,020206 networking & telecommunications ,Grid ,Electrical grid ,Renewable energy ,Smart grid ,Scalability ,User modeling and applications ,General Earth and Planetary Sciences ,Energia elèctrica -- Distribució ,020201 artificial intelligence & image processing ,Energies renovables ,business - Abstract
Renewable energy production throughout low-voltage grids has gradually increased in electrical distribution systems, therefore introducing small energy producers - prosumers. This paradigm challenges the traditional unidirectional energy distribution flow to include disperse power production from renewables. To understand how energy usage can be optimized in the dynamic electrical grid, it is important to understand the behavior of prosumers and their impact on the grid’s operational procedures. The main focus of this study is to investigate how grid operators can obtain an automatic data-driven system for the low-voltage electrical grid management, by analyzing the available grid topology and time-series consumption data from a real-life test area. The aim is to argue for how different consumer profiles, clustering and prediction methods contribute to the grid-related operations. Ultimately, this work is intended for future research directions that can contribute to improving the trade-off between systematic and scalable data models and software computational challenges. This work is financially supported by the Danish project RemoteGRID, which is a ForskEL program under Energinet.dk with grant agreement no. 2016-1-12399.
- Published
- 2020
17. A Clustering Expert System using Particle Swarm Optimization and K-means++ for Journal Recommendation to Publish the Papers
- Author
-
Mansoureh Maadi, Seyedeh Malihe Khatami, and Rohollah Ramezani
- Subjects
Control and Optimization ,Information retrieval ,Journal recommendation system ,Computer Networks and Communications ,Computer science ,business.industry ,k-means clustering ,Particle swarm optimization ,computer.software_genre ,Fuzzy logic ,Clustering ,Expert system ,Particle Swarm Optimization ,Hardware and Architecture ,Signal Processing ,Electrical and Electronic Engineering ,Cluster analysis ,business ,GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries) ,computer ,Publication ,Fuzzy ,Information Systems - Abstract
In this paper, an android expert system for recommending the suitable journal for publishing the researchers' papers has been presented. In so doing, the expectations of different journals for accepting an article and also the requests of papers' writers for choosing the journals have been examined. Language, quality, waiting time for judgment, waiting time for publication after acceptance, field, length of paper and cost are the system inputs and its output is the degree of suitability of journals for publishing a certain paper. This system includes a database of different journals and their parameters. It uses particle swarm optimization method and K-means++ algorithm for assessing and clustering the journals database and determines an index for every cluster of journals. The process for matching the paper with a cluster's index is done through fuzzy induction system. After choosing the most similar cluster, the paper is compared with all the journals of the cluster in the same way and the result including the most similar journals is presented. This system has been tested for journals and papers in the computer field indexed in Elsevier.
- Published
- 2018
18. Archives of Data Science, Series A. Vol. 1,1: Special Issue: Selected Papers of the 3rd German-Polish Symposium on Data Analysis and Applications
- Author
-
Geyer-Schulz, Andreas and Pociecha, Józef
- Subjects
Data Analysis ,Economics ,statistische Simulation ,Educational Economics ,Bildungsökonomie ,Clustering ,Machine Learning ,Datenanalyse ,Conjoint Analyse ,ddc:330 ,Conjoint Analysis ,Statistical Simulation ,Maschinelles Lernen - Abstract
The first volume of Archives of Data Science, Series A is a special issue of a selection of contributions which have been originally presented at the {\em 3rd Bilateral German-Polish Symposium on Data Analysis and Its Applications} (GPSDAA 2013). All selected papers fit into the emerging field of data science consisting of the mathematical sciences (computer science, mathematics, operations research, and statistics) and an application domain (e.g. marketing, biology, economics, engineering).
- Published
- 2017
- Full Text
- View/download PDF
19. Оцінка конкурентоспроможності обробної промисловості лісопромислового комплексу України та країн ЄС
- Author
-
Губарєва, І. О. and Ярошенко, І. В.
- Subjects
WOOD products manufacturing ,PAPER products ,MANUFACTURED products ,FURNITURE manufacturing ,ENERGY consumption - Abstract
Copyright of Problems of Economy is the property of Research Centre for Industrial Developmen Problems of Nas and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2020
- Full Text
- View/download PDF
20. Mining publication papers via text miningEvaluation and Results.
- Author
-
Ibrahim, Ahmed S., Saad, Sally, and MostafaAref
- Subjects
MACHINE learning ,TEXT mining ,NATURAL language processing ,DATA mining ,AUTOMATION - Abstract
Data nowadays is the language of technologies as every process needs a data to be processed the input is data and the output also is data. Analyzing the data is a significant task especially with the increasing production of the data particularly data as a text, it would be difficult to manually analyze the data, extract information and detect the hidden patterns from unstructured text. Datamining is automated technique for gathering or deriving a new high-quality information and uncover the relations among the data. Text mining is one of main branches of the data mining however data mining this paper, an is more comprehensive overview for mining the publication papers via text mining techniques and their results and evaluation would be presentedas following: the first approachis keywords extraction using natural language processing (NLP) approach, the second approach named entity recognition and the last approach is document clustering where machine learning techniques are applied to the both of them. [ABSTRACT FROM AUTHOR]
- Published
- 2021
21. Investigating critical failure drivers of construction project at planning stage in Saudi Arabia
- Author
-
Alsulamy, Saleh
- Published
- 2022
- Full Text
- View/download PDF
22. Adaptive weights clustering of research papers
- Author
-
Adamyan, Larisa, Efimov, Kirill, Chen, Cathy Yi-Hsuan, and Härdle, Wolfgang K.
- Subjects
G17 ,JEL system ,Adaptive algorithm ,330 Wirtschaft ,ddc:330 ,C58 ,Economic articles ,Nonparametric ,G11 ,C32 ,C55 ,Clustering - Abstract
The JEL classification system is a standard way of assigning key topics to economic articles in order to make them more easily retrievable in the bulk of nowadays massive literature. Usually the JEL (Journal of Economic Literature) is picked by the author(s) bearing the risk of suboptimal assignment. Using the database of a Collaborative Research Center from Humboldt-Universit¨at zu Berlin and Xiamen University, China we employ a new adaptive clustering technique to identify interpretable JEL (sub)clusters. The proposed Adaptive Weights Clustering (AWC) is available on www.quantlet.de and is based on the idea of locally weighting each point (document, abstract) in terms of cluster membership. Comparison with k-means or CLUTO reveals excellent performance of AWC.
- Published
- 2017
23. The influence of clustering on HR practices and intrapreneurial behavior
- Author
-
Canet-Giner, María Teresa, Redondo-Cano, Ana, Balbastre-Benavent, Francisco, Escriba-Carda, Naiara, Revuelto-Taboada, Lorenzo, and Saorin-Iborra, María del Carmen
- Published
- 2022
- Full Text
- View/download PDF
24. Innovation within networks – patent strategies for blockchain technology
- Author
-
Dehghani, Milad, Mashatan, Atefeh, and Kennedy, Ryan William
- Published
- 2021
- Full Text
- View/download PDF
25. Reverse logistics risk management: identification, clustering and risk mitigation strategies
- Author
-
Panjehfouladgaran, Hamidreza and Lim, Stanley Frederick W.T.
- Published
- 2020
- Full Text
- View/download PDF
26. A methodology for classification and validation of customer datasets
- Author
-
Nie, Dongyun, Cappellari, Paolo, and Roantree, Mark
- Published
- 2021
- Full Text
- View/download PDF
27. An anomaly detection method to improve the intelligent level of smart articles based on multiple group correlation probability models
- Author
-
Lu, Xudong, Wang, Shipeng, Kang, Fengjian, Liu, Shijun, Li, Hui, Xu, Xiangzhen, and Cui, Lizhen
- Published
- 2019
- Full Text
- View/download PDF
28. A Prompt Example Construction Method Based on Clustering and Semantic Similarity.
- Author
-
Chen, Ding and Wang, Jun
- Abstract
With the launch of OpenAI's ChatGPT, large language models have garnered significant attention, and applications based on these models have proliferated. A critical challenge has emerged: how to rapidly enhance the capabilities of general LLMs in specialized domains. Compared to fine-tuning and other methods, prompt engineering has proven to be a cost-effective approach for improving the performance of LLMs on specific tasks, yielding remarkable results. However, current prompt example construction methods are numerous and lack a universally applicable approach that spans different models and tasks. Furthermore, existing research is predominantly tested and evaluated on a limited range of specific datasets, failing to explore the broader impact of these methods on a wider array of tasks. This paper proposes a prompt example construction method based on clustering and semantic similarity, which combines clustering algorithms with semantic similarity techniques to significantly improve the quality of prompt examples. In comparative tests conducted on six LLMs and seven datasets, the overall accuracy and stability of the proposed method significantly outperforms five other common methods, demonstrating broad applicability and the potential to enhance the output performance of all LLMs. Through comparative experiments, this paper also identifies that as the parameter scale of LLMs increases, the improvement effect of the prompt example construction method on LLM output performance tends to diminish. Additionally, diversified prompt example sets provide a more pronounced enhancement in LLM output performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. The role of pension knowledge in voluntary pension and banking savings in Chile
- Author
-
Diaz, David, Ruiz, José L., and Tapia, Pablo
- Published
- 2021
- Full Text
- View/download PDF
30. Mining efficient training patterns of non-professional cyclists (Discussion Paper)
- Author
-
Cintia P., Pappalardo L., and Pedreschi D.
- Subjects
Sport mining ,Science of success ,Clustering - Abstract
The recent emergence of the so called online social fitness open up new scenarios for fascinating challenges in the field of data sci- ence. Through these platforms, users can collect, monitor and share with friends their sport performance, with interesting details about heartrate, watt consumption and calories burned. The availability of this data, col- lected among a large number of users, gives us the possibility to explore new data mining applications. In the current work, we present the results of a study conducted on a sample of 29; 284 cyclists downloaded via APIs from the social fitness platform Strava.com. We defined two basic metrics: A measure of train- ing effort, that is how much a cyclist struggled during the workout; and a measure of training performance indicating the results achieved during the training. Although the average effort is weakly correlated with the average performance, by deeply investigating workouts time evolution and cyclists' training characteristics interesting findings came out. We found that athletes that better improve their performance follow precise training patterns usually referred as overcompensation theory, with alter- nation of stress peaks and rest periods. Studies and experiments related to such theory, up to now, have always been conducted by sports doctors on a few dozen professionals athletes. To the best of our knowledge, our study is the first corroboration on large scale of this theory.
- Published
- 2014
31. Innovative propensity with a fuzzy multicriteria approach : Analysis of the Spanish industrial sector with data mining techniques
- Author
-
Cobo, Angel, Rocha, Eliana Rocio, and Villamizar, Marco Antonio
- Published
- 2019
- Full Text
- View/download PDF
32. A comparative data analytic approach to construct a risk trade-off for cardiac patients’ re-admissions
- Author
-
Nasir, Murtaza, South-Winter, Carole, Ragothaman, Srini, and Dag, Ali
- Published
- 2019
- Full Text
- View/download PDF
33. MINIMUM POWER CONSUMPTION ROUTING USING HIERARCHICAL FUZZY LOGIC CLUSTERING FOR INTERNET OF THINGS.
- Author
-
PRADEEPA, K. and PRAVEEN, M.
- Subjects
ROUTING algorithms ,HIERARCHICAL clustering (Cluster analysis) ,FUZZY logic ,ENERGY consumption ,INTERNET of things ,MULTICASTING (Computer networks) - Abstract
The Internet of Things (IoT) is a very famous network because of its many applications. IoT network has an integration of large-scale IoT devices that generate data. These IoT devices are very low power computing devices due to which they have a low level of communication. These devices construct data and transmit the data to the base station via intermediate IoT devices. The base station gathers and integrates the data and sends it to the administrator for further processing. The data attains the base station using various routing algorithms with the goal of low power consumption. When discussing low power IoT devices, power efficiency is an important performance measurement when creating a routing algorithm. This paper proposes a Minimum Power Consumption Routing (MPCR) algorithm using Hierarchical Fuzzy Logic Clustering (HFLC) algorithm for IoT networks. The MPCR with HFLC algorithm is an energy-efficient algorithm because of its lower power consumption for the cluster by aggregating the data within the cluster head and decreasing the number of data transmissions to the base station. In this paper, the cluster formation and cluster-head selection are explained, and a simulation has been conducted. In addition, the proposed algorithm is compared with the existing algorithms based on different metrics such as throughput, packet delivery ratio, and energy consumption of the network. The experimental results show that the proposed MPCR with the HFLC algorithm provides high throughput and packet delivery ratio and reduces energy consumption more efficiently than other existing algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
34. Context-aware knowledge selection and reliable model recommendation with ACCORDION.
- Author
-
Ahmed, Yasmine, Telmer, Cheryl A., Gaoxiang Zhou, and Miskov-Zivanov, Natasa
- Subjects
EXPERT systems ,LITERARY sources ,GRAPH algorithms ,BIOLOGICAL systems ,NATURAL language processing - Abstract
New discoveries and knowledge are summarized in thousands of published papers per year per scientific domain, making it incomprehensible for scientists to account for all available knowledge relevant for their studies. In this paper, we present ACCORDION (ACCelerating and Optimizing model RecommenDatIONs), a novel methodology and an expert system that retrieves and selects relevant knowledge from literature and databases to recommend models with correct structure and accurate behavior, enabling mechanistic explanations and predictions, and advancing understanding. ACCORDION introduces an approach that integrates knowledge retrieval, graph algorithms, clustering, simulation, and formal analysis. Here, we focus on biological systems, although the proposed methodology is applicable in other domains. We used ACCORDION in nine benchmark case studies and compared its performance with other previously published tools. We show that ACCORDION is: comprehensive, retrieving relevant knowledge from a range of literature sources through machine reading engines; very effective, reducing the error of the initial baseline model by more than 80%, recommending models that closely recapitulate desired behavior, and outperforming previously published tools; selective, recommending only the most relevant, contextspecific, and useful subset (15%-20%) of candidate knowledge in literature; diverse, accounting for several distinct criteria to recommend more than one solution, thus enabling alternative explanations or intervention directions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. What do we want to know about MOOCs? Results from a machine learning approach to a systematic literature mapping review.
- Author
-
Despujol, Ignacio, Castañeda, Linda, Marín, Victoria I., and Turró, Carlos
- Subjects
MASSIVE open online courses ,MACHINE learning ,EDUCATIONAL resources ,LARGE-scale brain networks ,CONTENT analysis ,EDUCATORS - Abstract
By the end of 2020, over 16,300 Massive Open Online Courses (MOOCs) from 950 universities worldwide had enrolled over 180 million students. Interest in MOOCs has been matched by significant research on the topic, including a considerable number of reviews. This study uses Machine Learning techniques and human expert supervision to generate a comprehensive systematic literature mapping review that overcomes some limitations of the traditional ones and provides a broader overview of the content and main topics studied in the specialized literature devoted to MOOCs. The sample consisted of 6320 publications automatically classified within six research topics, denominated by human experts: institutional approach, pedagogical approach, evaluation, analytics, participation, and educational resources. The content analysis of the topics identified was conducted using visual network analysis, which supported the identification of different thematic sub-clusters and endorsed the classification. Results from the review show that the lowest production of MOOC papers is within the topics of the pedagogical approach and educational resources. In contrast, participation and evaluation are the most frequent ones. In addition, the most cited papers are on the topics of analytics and resources, being the pedagogical approach and the institutional approach the less cited. This highlights the need for more MOOC research from a pedagogical perspective and calls upon the presence of educators. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
36. A STATISTICAL STUDY ANALYSIS ON EXPLORING CONSUMPTION PATTERNS REGARDING FOOD LOSS AND WASTE.
- Author
-
NIJLOVEANU, Daniel, TIȚA, Victor, BOLD, Nicolae, SMEDESCU, Dragoș, FÎNTÎNERU, Alexandru, TUDOR, Valentina, SMEDESCU, Cosmina, and JERCA, Emanuela
- Subjects
DIETARY patterns ,FOOD waste ,FOOD consumption ,MULTIVARIATE analysis ,STATISTICS ,UNIVARIATE analysis - Abstract
Food loss ans waste, referred next as FLW, has a great extent on the economic and behavioural patterns of consumption. In this paper, we aim to present statistical univariate and multivariate analyses based on a statistical study run during a period of several months in the Romanian territory. In this matter, the main purpose of the analysis developed in this paper is to determine several patterns of FLW phenomenon. This is made based on a methodology which comprises two main sections: the univariate analysis, consisting in statistical determinations of indicators and direct observation, and the multivariate analysis, related to the clustering analysis. The clustering analysis is based on the usage of hierarchical method, which has as final result the determination of clusters describing patterns of consumption related to food loss and waste. After the analysis was run, five main clusters related to food waste behaviour patterns were determined, with slight differences between food consumption habits, but with various combinations between the parameters taken into consideration. The obtained clusters offer important information for policymakers and stakeholders aiming to customize interventions and programs to target specific demographic groups or segments of the population. [ABSTRACT FROM AUTHOR]
- Published
- 2024
37. Echo State Network-Based Content Prediction for Mobile Edge Caching Networks.
- Author
-
Cai, Zengyu, Chen, Xi, Zhang, Jianwei, Zhu, Liang, and Hu, Xinhua
- Subjects
CACHE memory ,INTELLIGENT computer terminals ,DOCUMENT clustering ,INTERNET ,COMPUTER networks - Abstract
With the rapid development of internet communication and the wide application of intelligent terminal, moving the cache to the edge of the network is an effective solution to shorten the delay of users accessing content. However, the existing cache work lacks the comprehensive consideration of users and content, resulting in low cache hit ratio and low accuracy of the whole system. In this paper, the authors propose a collaborative caching model that considers both user request content and content prediction, so as to improve the caching performance of the whole network. Firstly, the model uses the clustering algorithm based on Akike information criterion to cluster users. Then, combined with the clustering results, echo state network is used as the machine learning framework to predict the content. Finally, the cache contents are selected according to the prediction results and cached in the cache unit of the small base station. Simulation results show that compared with the existing cache algorithms, the proposed method has obvious improvement in cache hit ratio, accuracy, and recall rate. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
38. Latent Space Perspicacity and Interpretation Enhancement (LS-PIE) Framework.
- Author
-
Stevens, Jesse, Wilke, Daniel N., and Setshedi, Isaac I.
- Abstract
Linear latent variable models such as principal component analysis (PCA), independent component analysis (ICA), canonical correlation analysis (CCA), and factor analysis (FA) identify latent directions (or loadings) either ordered or unordered. These data are then projected onto the latent directions to obtain their projected representations (or scores). For example, PCA solvers usually rank principal directions by explaining the most variance to the least variance. In contrast, ICA solvers usually return independent directions unordered and often with single sources spread across multiple directions as multiple sub-sources, severely diminishing their usability and interpretability. This paper proposes a general framework to enhance latent space representations to improve the interpretability of linear latent spaces. Although the concepts in this paper are programming language agnostic, the framework is written in Python. This framework simplifies the process of clustering and ranking of latent vectors to enhance latent information per latent vector and the interpretation of latent vectors. Several innovative enhancements are incorporated, including latent ranking (LR), latent scaling (LS), latent clustering (LC), and latent condensing (LCON). LR ranks latent directions according to a specified scalar metric. LS scales latent directions according to a specified metric. LC automatically clusters latent directions into a specified number of clusters. Lastly, LCON automatically determines the appropriate number of clusters to condense the latent directions for a given metric to enable optimal latent discovery. Additional functionality of the framework includes single-channel and multi-channel data sources and data pre-processing strategies such as Hankelisation to seamlessly expand the applicability of linear latent variable models (LLVMs) to a wider variety of data. The effectiveness of LR, LS, LC, and LCON is shown in two foundational problems crafted with two applied latent variable models, namely, PCA and ICA. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Exploring and Visualizing Multilingual Cultural Heritage Data Using Multi-Layer Semantic Graphs and Transformers.
- Author
-
Gagliardi, Isabella and Artese, Maria Teresa
- Subjects
LANGUAGE models ,TRANSFORMER models ,CULTURAL property ,DATA visualization ,ALGORITHMS - Abstract
The effectiveness of archives, particularly those related to cultural heritage, depends on their accessibility and navigability. An intuitive interface is essential for improving accessibility and inclusivity, enabling users with diverse backgrounds and expertise to interact with archival content effortlessly. This paper introduces a new method for visualizing and navigating dataset information through the creation of semantic graphs. By leveraging pre-trained large language models, this approach groups data and generates semantic graphs. The development of multi-layer maps facilitates deep exploration of datasets, and the capability to handle multilingual datasets makes it ideal for archives containing documents in various languages. These features combine to create a user-friendly tool adaptable to various contexts, offering even non-expert users a new way to interact with and navigate the data. This enhances their overall experience, promoting a greater understanding and appreciation of the content. The paper presents experiments conducted on diverse datasets across different languages and topics employing various algorithms and methods. It provides a thorough discussion of the results obtained from these experiments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Optimal Knowledge Distillation through Non-Heuristic Control of Dark Knowledge †.
- Author
-
Onchis, Darian, Istin, Codruta, and Samuila, Ioan
- Subjects
MACHINE learning ,DECISION trees ,KNOWLEDGE transfer ,DISTILLATION ,VALUES (Ethics) - Abstract
In this paper, a method is introduced to control the dark knowledge values also known as soft targets, with the purpose of improving the training by knowledge distillation for multi-class classification tasks. Knowledge distillation effectively transfers knowledge from a larger model to a smaller model to achieve efficient, fast, and generalizable performance while retaining much of the original accuracy. The majority of deep neural models used for classification tasks append a SoftMax layer to generate output probabilities and it is usual to take the highest score and consider it the inference of the model, while the rest of the probability values are generally ignored. The focus is on those probabilities as carriers of dark knowledge and our aim is to quantify the relevance of dark knowledge, not heuristically as provided in the literature so far, but with an inductive proof on the SoftMax operational limits. These limits are further pushed by using an incremental decision tree with information gain split. The user can set a desired precision and an accuracy level to obtain a maximal temperature setting for a continual classification process. Moreover, by fitting both the hard targets and the soft targets, one obtains an optimal knowledge distillation effect that mitigates better catastrophic forgetting. The strengths of our method come from the possibility of controlling the amount of distillation transferred non-heuristically and the agnostic application of this model-independent study. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Identification of Similar Weather Scenes in Terminal Areas Based on Multiresolution Spatiotemporal Windows.
- Author
-
Yuan, Ligang, Zhu, Jianan, Zeng, Yang, Chen, Wenlu, and Liu, Li
- Subjects
AIR travel ,TRAFFIC flow ,FEATURE extraction ,WEATHER ,TIME management - Abstract
To accurately analyze the influence of similar weather scenes in the terminal area, a framework is proposed for identifying such scenarios based on the Multiresolution Spatiotemporal Window (MRSTW). The goal is to analyze the impact of similar weather patterns. This paper introduces a simple and effective method called the rasterized weather severity index (WSI) to reduce the dimensionality of data used for extracting air transport weather features, which can cause the loss of spatial information in an image. Additionally, the paper uses Dynamic Time Warping (DTW) and the Fuzzy C-mean (FCM) clustering algorithm to cluster time-series scenes influenced by convective weather in an unsupervised manner on a daily basis. The most similar weather scenes are then identified by searching for the same cluster within a multiresolution spatiotemporal window, using 4 h weather scenes as typical examples. Finally, the framework analyzes the impact of weather scenes on the operation of terminal area approach traffic flow by combining trajectory data. The findings demonstrate that this framework can effectively identify similar weather scenes and provide a more accurate reflection of their impact on the operation of terminal area approach traffic flow. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. RDSC: Range-Based Device Spatial Clustering for IoT Networks.
- Author
-
Achkouty, Fouad, Gallon, Laurent, and Chbeir, Richard
- Subjects
DATA privacy ,NETWORK performance ,DATA warehousing ,INTERNET of things ,DATA quality - Abstract
The growth of the Internet of Things (IoT) has become a crucial area of modern research. While the increasing number of IoT devices has driven significant advancements, it has also introduced several challenges, such as data storage, data privacy, communication protocols, complex network topologies, and IoT device management. In essence, the management of IoT devices is becoming more and more challenging, especially with the limited capacity and power of the IoT devices. The devices, having limited capacities, cannot store the information of the entire environment at once. In addition, device power consumption can affect network performance and stability. The devices' sensing areas with device grouping and management can simplify further networking tasks and improve response quality with data aggregation and correction techniques. In fact, most research papers are looking forward to expanding network lifetimes by relying on devices with high power capabilities. This paper proposes a device spatial clustering technique that covers crucial challenges in IoT. Our approach groups the dispersed devices to create clusters of connected devices while considering their coverage, their storage capacities, and their power. A new clustering protocol alongside a new clustering algorithm is introduced, resolving the aforementioned challenges. Moreover, a technique for non-sensed area extraction is presented. The efficiency of the proposed approach has been evaluated with extensive experiments that gave notable results. Our technique was also compared with other clustering algorithms, showing the different results of these algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. From Customer's Voice to Decision-Maker Insights: Textual Analysis Framework for Arabic Reviews of Saudi Arabia's Super App.
- Author
-
Alrayani, Bodoor, Kalkatawi, Manal, Abulkhair, Maysoon, and Abukhodair, Felwa
- Subjects
USER experience ,SENTIMENT analysis ,PRIVATE sector ,K-means clustering ,DATA mining - Abstract
Recently, business sectors have focused on offering a wide variety of services through utilizing different modern technologies such as super apps in order to fulfill customers' needs and create a satisfactory user experience. Accordingly, studying the user experience has become one of the most popular trends in the research field due to its essential role in business prosperity and continuity. Thus, many researchers have dedicated their efforts to exploring and analyzing the user experience across social media, blogs, and websites, employing a variety of research methods such as machine learning to mine users' reviews. However, there are limited studies concentrated on analyzing super app users' experiences and specifically mining Arabic users' reviews. Therefore, this paper aims to analyze and discover the most important topics that affect the user experience in the super app environment by mining Arabic business sector users' reviews in Saudi Arabia using biterm topic modeling, CAMeL sentiment analyzer, and doc2vec with k-means clustering. We explore users' feelings regarding the extracted topics in order to identify the weak aspects to improve and the strong aspects to enhance, which will promote a satisfactory user experience. Hence, this paper proposes an Arabic text annotation framework to help the business sector in Saudi Arabia to determine the important topics with negative and positive impacts on users' experience. The proposed framework uses two approaches: topic modeling with sentiment analysis and topic modeling with clustering. As a result, the proposed framework reveals four important topics: delivery and payment, customer service and updates, prices, and application. The retrieved topics are thoroughly studied, and the findings show that, in most topics, negative comments outweigh positive comments. These results are provided with general analysis and recommendations to help the business sector to improve its level of services. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. MFC-RMA (Matrix Factorization and Constraints- Role Mining Algorithm): An Optimized Role Mining Algorithm.
- Author
-
Zhu, Fubao, Yang, Chenguang, Zhu, Liang, Zuo, Hongqiang, and Gu, Jingzhong
- Subjects
BOOLEAN matrices ,MATRIX decomposition ,SECURITY management ,ALGORITHMS ,SYMMETRY - Abstract
Role-based access control (RBAC) is a widely adopted access control model in various domains for defining security management. Role mining is closely related to role-based access control, as the latter employs role assignments to offer a flexible and scalable approach to managing permissions within an organization. The edge role mining problem (Edge RMP), a variant of the role mining problem (RMP), has long been recognized as an effective strategy for role assignment. Role mining, which groups users with similar access permissions into the same role, bears some resemblance to symmetry. Symmetry categorizes objects or graphics with identical characteristics into one group. Both involve a certain form of "classification" or "induction". Edge-RMP reduces the associations between users and permissions, thereby lowering the security risks faced by the system. While an algorithm based on Boolean matrix factorization exists for this problem, it fails to further refine the resulting user–role assignment (UA) and role–permission assignment (PA) relationships. Additionally, this algorithm does not address constraint-related issues, such as cardinality constraints, user exclusion constraints, and user capabilities. Furthermore, it demonstrates significant redundancy of roles when handling large datasets, leaving room for further optimization of Edge-RMP results. To address these concerns, this paper proposes the MFC-RMA algorithm based on Boolean matrix factorization. The method achieves significant optimization of Edge-RMP results by handling relationships between roles possessing various permissions. Furthermore, this paper clusters, compresses, modifies, and optimizes the original data based on the similarity between users, ensuring its usability for role mining. Both theoretical and practical considerations are taken into account for different types of constraints, and algorithms are devised to reallocate roles incorporating these constraints, thereby generating UA and PA matrices. The proposed approach yields optimal numbers of generated roles and the sum of the minimum number of generated edges to address the aforementioned issues. Experimental results demonstrate that the algorithm reduces management overhead, provides efficient execution results, and ensures the accuracy of generated roles. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. Applying Hybrid Clustering with Evaluation by AUC Classification Metrics.
- Author
-
Dakhil, Ali Fattah, Ali, Waffaa M., and Hasan, Mustafa Asaad
- Subjects
MACHINE learning ,RESEARCH questions ,FALSE alarms ,STATISTICAL learning ,DISTANCE education - Abstract
Traditional metrics may not adequately assess performance in certain situations, whereas the Area Under Curve (AUC) offers a comprehensive perspective by considering both sensitivity and specificity. This method enhances interpretability, addresses limitations, and promotes the development of robust clustering algorithms. In unsupervised learning, utilizing AUC is a significant method for improving the precision and accuracy of machine learning models. Our work is inspired by several recent related works that implement approaches to manage the challenges of developing new metrics that can effectively assess and evaluate the performance of clustering algorithms. The research question relies on the concept of using an optimal metric for model evaluation of classification and clustering. Therefore, the paper investigates the use of the classification metric AUC for clustering validation purposes. The methodology we adopt is a hybrid clustering model because such a technique offers a robust model by combining the strengths of each model. The linkage approach directly impacts the clustering results, so we give significant attention to this feature in our implementation. Among the various linkage methods, we utilized single and average linkages. The Manhattan and Euclidean metrics are the distance measures used in this work. Thus, our contribution is to explore the benefit of using linkages and distance measurement in clustering with the help of the AUC metric. In addition, the entire proposed work and the contributions of this paper are evaluated and applied to the NSL-KDD dataset. Based on the proposed approach of using AUC with clustering, the Detection Rate (DR), False Alarm Rate (FAR), and other criteria are chosen to examine the model's results and capabilities. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. VisualRPI: Visualizing Research Productivity and Impact.
- Author
-
Hung, Chihli and Lin, Wei-Chao
- Abstract
Research productivity and impact (RPI) is commonly measured through citation analysis, such as the h-index. Despite the popularity and objectivity of this type of method, it is still difficult to effectively compare a number of related researchers in terms of various citation-related statistics at the same time, such as average cites per year/paper, the number of papers/citations, h-index, etc. In this work, we develop a method that employs information visualization technology, and examine its applicability for the assessment of researchers' RPI. Specifically, our prototype, a visualizing research productivity and impact (VisualRPI) system, is introduced, which is composed of clustering and visualization components. The clustering component hierarchically clusters similar research statistics into the same groups, and the visualization component is used to display the RPI in a clear manner. A case example using information for 85 information systems researchers is used to demonstrate the usefulness of VisualRPI. The results show that this method easily measures the RPI for various performance indicators, such as cites/paper and h-index. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
47. Optimization of Linear Quantization for General and Effective Low Bit-Width Network Compression.
- Author
-
Yang, Wenxin, Zhi, Xiaoli, and Tong, Weiqin
- Subjects
PARTICLE swarm optimization ,K-means clustering ,ENERGY consumption - Abstract
Current edge devices for neural networks such as FPGA, CPLD, and ASIC can support low bit-width computing to improve the execution latency and energy efficiency, but traditional linear quantization can only maintain the inference accuracy of neural networks at a bit-width above 6 bits. Different from previous studies that address this problem by clipping the outliers, this paper proposes a two-stage quantization method. Before converting the weights into fixed-point numbers, this paper first prunes the network by unstructured pruning and then uses the K-means algorithm to cluster the weights in advance to protect the distribution of the weights. To solve the instability problem of the K-means results, the PSO (particle swarm optimization) algorithm is exploited to obtain the initial cluster centroids. The experimental results on baseline deep networks such as ResNet-50, Inception-v3, and DenseNet-121 show the proposed optimized quantization method can generate a 5-bit network with an accuracy loss of less than 5% and a 4-bit network with only 10% accuracy loss as compared to 8-bit quantization. By quantization and pruning, this method reduces the model bit-width from 32 to 4 and the number of neurons by 80%. Additionally, it can be easily integrated into frameworks such as TensorRt and TensorFlow-Lite for low bit-width network quantization. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
48. ON WHICH SOCIOECONOMIC GROUPS DO REVERSE MORTGAGES HAVE THE GREATEST IMPACT? EVIDENCE FROM SPAIN.
- Author
-
BOJ, Eva, CLARAMUNT, M. Mercè, and VAREA, Xavier
- Subjects
REVERSE mortgage loans ,HOUSEHOLD surveys ,STOCHASTIC models ,HOUSEHOLDS ,MARKOV processes - Abstract
Reverse mortgage is one of the products (perhaps the main one) that is good to obtain additional income by using the habitual residence as collateral. The main objective of this paper is to analyse the effects that reverse mortgage contracting has on household finances over the lifetime of a family according to the socioeconomic group to which it belongs in Spain. Four indicators are employed to measure the immediate and long-term effects. We use a stochastic model with a double source of randomness, survival and entry into dependency, and apply it to the three socioeconomic groups obtained with cluster methodology from the 2017 Spanish Household Financial Survey data. We conclude that the effects are very different depending on the group: regarding only the effects of hiring a reverse mortgage on the income of the family, widowed women aged between 81 and 85 years, with low income and expenses as well as little net wealth, and a habitual residence that represents half of her net wealth (Cluster 1) are the most benefited; considering that the highest impact indicators are on the probability of illiquidity and on the value of lack of liquidity, the use of reverse mortgages benefits more the families in Cluster 3 (high income and expenses and really high net wealth, head of household aged between 76 and 80 years) and less the families in Cluster 2 (medium income, net wealth and expenses, head of household aged between 65 and 75 years). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. A Review of Predictive Analytics Models in the Oil and Gas Industries.
- Author
-
R Azmi, Putri Azmira, Yusoff, Marina, and Mohd Sallehud-din, Mohamad Taufik
- Subjects
PREDICTION models ,GAS industry ,PETROLEUM industry ,MACHINE learning - Abstract
Enhancing the management and monitoring of oil and gas processes demands the development of precise predictive analytic techniques. Over the past two years, oil and its prediction have advanced significantly using conventional and modern machine learning techniques. Several review articles detail the developments in predictive maintenance and the technical and non-technical aspects of influencing the uptake of big data. The absence of references for machine learning techniques impacts the effective optimization of predictive analytics in the oil and gas sectors. This review paper offers readers thorough information on the latest machine learning methods utilized in this industry's predictive analytical modeling. This review covers different forms of machine learning techniques used in predictive analytical modeling from 2021 to 2023 (91 articles). It provides an overview of the details of the papers that were reviewed, describing the model's categories, the data's temporality, field, and name, the dataset's type, predictive analytics (classification, clustering, or prediction), the models' input and output parameters, the performance metrics, the optimal model, and the model's benefits and drawbacks. In addition, suggestions for future research directions to provide insights into the potential applications of the associated knowledge. This review can serve as a guide to enhance the effectiveness of predictive analytics models in the oil and gas industries. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. MEASURING THE EFFECT OF ENTRY INTO THE EUROZONE ON ECONOMIC GROWTH -- DATA STORYTELLING USING CLUSTERING AND ANFIS.
- Author
-
BOSNA, Jurica, BRLEČIĆ VALČIĆ, Sonja, and PEŠA, Anita
- Subjects
RATINGS & rankings of public debts ,EUROZONE ,ECONOMIC expansion ,MONETARY unions ,ECONOMIC forecasting ,INTERNATIONAL economic integration - Abstract
The aim of this paper is to examine the impact of a country's entry into the monetary union on its economic variables growth rate of real GDP as well as on GDP per capita growth for the period from 2010 to 2020. The clustering method and the ANFIS method were used in the data analysis. A total of two cluster analyses were performed. The first cluster includes countries that joined the EU in 2004 and became EZ members by 2010. The second cluster refers to those countries that joined the EU in 2004 but are not yet members of the EZ. For the first individual cluster analysis two models were analysed and for the second individual cluster three models were analysed using the ANFIS method. As expected, the results showed that GDP growth is connected with trade, inflation and gross investments in fixed capital in the observed countries, while GDP per capita is connected with unemployment, interest rates and public debt. With regard to GDP growth, the difference between countries that are in the eurozone and those that are not is not significant, which is in line with other studies. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.