6,589 results
Search Results
2. Cloud Computing and firm performance: a SEM microdata analysis for Israeli firms
- Author
-
Katz, Raúl, Jung, Juan, and Goldman, Matan
- Published
- 2024
- Full Text
- View/download PDF
3. Short papers of the 10th Conference on Cloud Computing, Big Data & Emerging Topics
- Author
-
Naiouf, Marcelo, De Giusti, Armando Eduardo, Chichizola, Franco, De Giusti, Laura Cristina, and Rucci, Enzo
- Subjects
Informática ,big data ,cloud computing ,high-performance computing ,emerging tech ,Inteligencia artificial - Abstract
Compilación de los short papers presentados en las 10mas Jornadas de Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET2022), llevadas a cabo en modalidad híbrida durante junio de 2021 y organizadas por el Instituto de Investigación en Informática LIDI (III-LIDI) y la Secretaría de Posgrado de la Facultad de Informática de la UNLP, en colaboración con universidades de Argentina y del exterior., Facultad de Informática
- Published
- 2022
4. A Review Paper On Big Data Analytics in Cloud
- Author
-
Shaheen Mohsin Ansari
- Subjects
Computer science ,business.industry ,Big data ,Cloud computing ,business ,Data science - Abstract
The amount of data produced in the enterprise is increasing. Any industry will have to cope with exploding data volumes in the future, which will accelerate exponential data growth. It is critical to use a cost-effective, flexible approach for storing and analyzing this data. As a service to big data, the cloud will offer storage, platform, and software capabilities. Big data and cloud technologies are combining to make big data analytics in the cloud a viable choice. Data Analytics as a Service is another name for Cloud for Big Data Analytics. In this review paper we will get to know how big data analytics used cloud computing services for better performance or experience with their benefits, challenges and so on.
- Published
- 2021
5. Cloud-based big data framework towards strengthening disaster risk reduction: systematic mapping
- Author
-
Mahrin, Mohd Naz’ri, Subbarao, Anusuyah, Chuprat, Suriayati, and Abu Bakar, Nur Azaliah
- Published
- 2023
- Full Text
- View/download PDF
6. A Study Paper on Vision of IOT
- Author
-
Hima Mohan and L. C. Manikandan
- Subjects
Pharmacology ,business.industry ,Computer science ,020208 electrical & electronic engineering ,Big data ,Cloud computing ,02 engineering and technology ,WiMAX ,GSM ,0202 electrical engineering, electronic engineering, information engineering ,General Packet Radio Service ,business ,Internet of Things ,Computer network - Abstract
The Internet of things (IoT) refers to a type of network to connect anything with the Internet based on stipulated protocols through information sensing equipment’s to conduct information exchange and communications in order to achieve smart recognitions, positioning, tracing, monitoring, and administration. In this paper we briefly discussed about what IOT is, how IOT enables different technologies, about its architecture, characteristics & applications. The purpose of this paper is to provide the basic information about IoT for young readers.
- Published
- 2020
7. Research Directions on Big IoT Data Processing using Distributed Ledger Technology: A Position Paper
- Author
-
Richard Hill, Benjamin Agbo, and Yongrui Qin
- Subjects
Data processing ,Blockchain ,Computer science ,business.industry ,Scale (chemistry) ,Big data ,Distributed ledger ,Volume (computing) ,Position paper ,Cloud computing ,business ,Data science - Abstract
The significant growth and adoption of Internet of Things (IoT) solutions has led to tremendous increase in the generation of data. The need for high speed data processing has become very important to meet with the ever increasing volume and velocity of IoT data, due to the large scale and distributed nature of IoT infrastructure and networks. Present cloud based technologies are struggling to meet up with these needs for real time data processing in the midst of enormous amounts of data. The success of bitcoin has inspired more research in the application of Distributed ledger technologies in various domains. The decentralized nature of these platforms have enabled security and privacy of data in previous research and their architecture has a potential for enabling large scale decentralized data processing. In this paper, we identify some open areas of research in the use of distributed ledger technology and propose a framework for storing, analyzing and ensuring the security of large volumes of IoT data.
- Published
- 2019
8. Short papers of the 9th Conference on Cloud Computing, Big Data & Emerging Topics
- Author
-
Laura Cristina De Giusti, Enzo Rucci, Marcelo Naiouf, Armando Eduardo De Giusti, and Franco Chichizola
- Subjects
Big data ,Computer science ,Cloud computing big data ,Ciencias Informáticas ,Cloud computing ,Data science - Abstract
Compilación de los short papers presentados en las 9nas Jornadas de Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET2021), llevadas a cabo en modalidad virtual durante junio de 2021 y organizadas por el Instituto de Investigación en Informática LIDI (III-LIDI) y la Secretaría de Posgrado de la Facultad de Informática de la UNLP, en colaboración con universidades de Argentina y del exterior., Facultad de Informática
- Published
- 2021
9. Brief Industry Paper: AutoToolCSU: CAN Signal Unpacking Tool for Automotive Software
- Author
-
Pingfu Xie, Bo He, Fengnan Huang, Renfa Li, and Guoqi Xie
- Subjects
Unpacking ,Software ,business.industry ,Computer science ,Big data ,Byte ,ComputerApplications_COMPUTERSINOTHERSYSTEMS ,Cloud computing ,business ,Computer hardware ,Graphical user interface ,Automotive software ,CAN bus - Abstract
The CAN (Controller Area Network) signals transmitted in vehicles have great analytical value with the quick development of complex automotive software. The boom in big data creates an opportunity to transmit CAN signals from the in-vehicle network to the big data cloud platform, through which the signal analysis can be conducted. The signals are transmitted from the in-vehicle network to TelematicsBOX via CAN bus and then sent to the big data cloud platform. When using the CAN bus for signal transmission of the in-vehicle network, signals larger than 1 byte need to be unpacked into several 1-byte signals. The general solution of automotive software manufacturers usually uses the model-based development method to unpack the CAN signals, but such method is inefficient. To solve this problem, we develop a CAN signal unpacking tool called AutoToolCSU, which is based on a configured template through a GUI (Graphical User Interface). Compared to the model-based development method, AutoToolCSU not only greatly improves the development efficiency of CAN signal unpacking but also interfaces with the standard development processes of automotive software manufacturers.
- Published
- 2021
10. Big Data Privacy Management: A Vision Paper
- Author
-
Xiaofeng Meng and Xiaojian Zhang
- Subjects
Conceptual framework ,business.industry ,Computer science ,Server ,Big data ,Scalability ,Key (cryptography) ,Differential privacy ,Cloud computing ,business ,Encryption ,Data science - Abstract
The growing trend towards big data and cloud computing provides enormous data-driven applications such as location-based services. However, it also creates many potential risks for some individuals in big data scenarios. These risks are further complicated by the security and privacy constraints on the individuals’ data that are inherently sensitive. To handle these risks and challenges, this paper proposes our vision of an active, adaptive, and scalable framework for sharing and processing potentially sensitive data. We identify the main privacy risks and technical challenges and present some preliminary solutions. The key idea of this framework is that it integrates risk active monitoring, risk active assessment, privacy active management, accountable systems, and law and regulation to ensure big data security and privacy in the whole system. We believe that this conceptual framework may provide a helpful and useful foundation for big data privacy management and will open up many existing research challenges.
- Published
- 2020
11. Short Papers of the 8th Conference on Cloud Computing Conference, Big Data & Emerging Topics (JCC-BD&ET 2020)
- Author
-
De Giusti, Armando Eduardo, Naiouf, Marcelo, Chichizola, Franco, Rucci, Enzo, and De Giusti, Laura Cristina
- Subjects
Big data ,Ciencias Informáticas ,Cloud computing - Abstract
Compilación de los short papers presentados en las 8vas Jornadas de Cloud Computing, Big Data & Emerging Topics (JCC-BD&ET2020), llevadas a cabo en modalidad virtual durante septiembre de 2020 y organizadas por el Instituto de Investigación en Informática LIDI (III-LIDI) y la Secretaría de Posgrado de la Facultad de Informática de la UNLP en colaboración con universidades de Argentina y del exterior., Facultad de Informática
- Published
- 2020
12. Online optimization in the Non-Stationary Cloud: Change Point Detection for Resource Provisioning (Invited Paper)
- Author
-
Zhenhua Liu, Joshua Comden, and Jessica Maghakian
- Subjects
Computer science ,business.industry ,Distributed computing ,Big data ,Control (management) ,020206 networking & telecommunications ,Provisioning ,Cloud computing ,02 engineering and technology ,Resource (project management) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Spike (software development) ,Online algorithm ,business ,Change detection - Abstract
The rapid mainstream adoption of cloud computing and the corresponding spike in the energy usage of big data systems make the efficient management of cloud computing resources a more pressing issue than ever before. To this end, numerous online algorithms such as Receding Horizon Control and Online Balanced Descent have been designed. However it is difficult for cloud service providers to select the best control algorithm dynamically for resource provisioning when confronted with consumer resource demands that are notoriously unpredictable and volatile. Furthermore, it highly possible that it might not be the case for any one algorithm to consistently perform well over the months-long contract period. In this paper, we first exemplify the need to address non-stationarity in cloud computing by showcasing traces from MS Azure. We then develop a novel meta-algorithm that combines change point detection and online optimization. The new algorithm is shown to outperform existing solutions in real-world trace-driven simulations.
- Published
- 2019
13. How does cloud computing help businesses to manage big data issues
- Author
-
Latifian, Ahmad
- Published
- 2022
- Full Text
- View/download PDF
14. 从信息化赋能到综合赋能:智慧国土空间规划思路探索.
- Author
-
甄峰, 张姗琪, 秦萧, and 席广亮
- Subjects
- *
ELECTRONIC paper , *URBAN life , *SELF-efficacy , *BIG data , *COMMUNIST parties , *INTERNET in education , *CLOUD computing - Abstract
The Communist Party of China's 19th National Congress Report has clearly set the goal of "smart society" and put forward new requirements for the current development of territorial spatial planning. Nowadays, the empowerment of information technology based on the Internet, big data, cloud computing, etc. is the main driving force and development focus of the current practices of smart territorial spatial planning. How to understand and promote the development of smart society is the foundation of the compilation and implementation of smart territorial spatial planning. This paper emphasizes the importance of human-land relationship and the theory of urban life organism to the planning and governance of territory in a smart society. It points out that the overall conceptualization of smart territorial spatial planning should be transformed from informational empowerment to comprehensive empowerment, which includes technological empowerment and innovative empowerment. The paper constructs a smart territorial spatial planning framework-EPTI-based on the ideas of ecological civilization, people- oriented, technology integration application and institutional innovation, and discusses the paths toward smart compilation and implementation of territorial spatial planning. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
15. Classification of Scientific Papers With Big Data Technologies
- Author
-
Galip Aydin and Selen Gurbuz
- Subjects
FOS: Computer and information sciences ,Information retrieval ,Computer science ,business.industry ,Document classification ,Big data ,Computer Science - Digital Libraries ,Cloud computing ,computer.software_genre ,Field (computer science) ,Data set ,Naive Bayes classifier ,Computer Science - Distributed, Parallel, and Cluster Computing ,Server ,Digital Libraries (cs.DL) ,Distributed, Parallel, and Cluster Computing (cs.DC) ,Cluster analysis ,business ,computer - Abstract
Data sizes that cannot be processed by conventional data storage and analysis systems are named as Big Data.It also refers to nex technologies developed to store, process and analyze large amounts of data. Automatic information retrieval about the contents of a large number of documents produced by different sources, identifying research fields and topics, extraction of the document abstracts, or discovering patterns are some of the topics that have been studied in the field of big data.In this study, Naive Bayes classification algorithm, which is run on a data set consisting of scientific articles, has been tried to automatically determine the classes to which these documents belong. We have developed an efficient system that can analyze the Turkish scientific documents with the distributed document classification algorithm run on the Cloud Computing infrastructure. The Apache Mahout library is used in the study. The servers required for classifying and clustering distributed documents are, Comment: in Turkish
- Published
- 2018
- Full Text
- View/download PDF
16. Augmenting Security of Internet-of-Things Using Programmable Network-Centric Approaches: A Position Paper
- Author
-
Gabby Raymond, Qing Mu, Daniel Vivanco, Hammad Iqbal, Venkatesh Ramaswamy, John Zuena, and Jamie Ma
- Subjects
Cloud computing security ,Situation awareness ,business.industry ,Computer science ,05 social sciences ,Big data ,020206 networking & telecommunications ,Cloud computing ,02 engineering and technology ,Computer security ,computer.software_genre ,Automation ,Protocol stack ,0502 economics and business ,0202 electrical engineering, electronic engineering, information engineering ,The Internet ,business ,Internet of Things ,Software-defined networking ,computer ,050203 business & management - Abstract
Advances in nanotechnology, large scale computing and communications infrastructure, coupled with recent progress in big data analytics, have enabled linking several billion devices to the Internet. These devices provide unprecedented automation, cognitive capabilities, and situational awareness. This new ecosystem--termed as the Internet-of-Things (IoT)--also provides many entry points into the network through the gadgets that connect to the Internet, making security of IoT systems a complex problem. In this position paper, we argue that in order to build a safer IoT system, we need a radically new approach to security. We propose a new security framework that draws ideas from software defined networks (SDN), and data analytics techniques; this framework provides dynamic policy enforcements on every layer of the protocol stack and can adapt quickly to a diverse set of industry use-cases that IoT deployments cater to. Our proposal does not make any assumptions on the capabilities of the devices - it can work with already deployed as well as new types of devices, while also conforming to a service-centric architecture. Even though our focus is on industrial IoT systems, the ideas presented here are applicable to IoT used in a wide array of applications. The goal of this position paper is to initiate a dialogue among standardization bodies and security experts to help raise awareness about network-centric approaches to IoT security.
- Published
- 2017
17. Perceptions on adopting artificial intelligence and related technologies in libraries: public and academic librarians in North America
- Author
-
Yoon, JungWon, Andrews, James E., and Ward, Heather L.
- Published
- 2022
- Full Text
- View/download PDF
18. The impact of digital technologies on business models. Insights from the space industry
- Author
-
Aloini, Davide, Latronico, Loretta, and Pellegrini, Luisa
- Published
- 2022
- Full Text
- View/download PDF
19. An Emerging Decentralized Services Computing Paradigm for Big Data Governance: A Position Paper
- Author
-
Gang Huang, Xuanzhe Liu, and Sam Xun Sun
- Subjects
Service (systems architecture) ,Information Systems and Management ,Computer Networks and Communications ,Computer science ,business.industry ,Interoperability ,Big data ,Services computing ,020206 networking & telecommunications ,Cloud computing ,02 engineering and technology ,Data science ,Computer Science Applications ,Data governance ,Hardware and Architecture ,Analytics ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Applications of artificial intelligence ,business - Abstract
With the explosion of “big data” in the past decade, exploring and mining the value hidden in the data has already generated a lot of innovative applications, especially the recent advances of AI applications. The data governance, including activities of data creation, sharing, exchange, management, analytics, tracing, and accounting, has drawn a lot of attentions. Services computing establishes the foundation of current data governance, typically in a centralized fashion, e.g., the cloud-based storage services and analytic services. However, the potential values of big data distributed on the Internet are far away from being adequately explored. Considering the infrastructure revolution made by the blockchain, in this position article, we try to rethink a new data governance fashion that is built upon the blockchain-based decentralized services computing paradigm. The core principle is that data owners are able to publish their data as a set of services that can be deployed independently from the application systems where the data were born. Meanwhile, data owners can define service rules/policies where their data should be stored and how the data can be shared, and keep governing the whole lifecycle record of how their data are actually used. Similar to existing services computing paradigm, data users can search, discover, integrate, and analyze the data in a decentralized fashion. With this perspective, we try to discuss some key insights and enumerate several related new technologies and open challenges, in terms of programmability, interoperability, and intelligence.
- Published
- 2019
20. An Approach for Value as a Service Discovery on Scientific Papers Big Data
- Author
-
He Keqing, Chen Jingliang, Ma Yutao, and Zhang Neng
- Subjects
Information retrieval ,business.industry ,Computer science ,Ontology-based data integration ,Big data ,Service discovery ,Services computing ,Cloud computing ,Data as a service ,Ontology (information science) ,business ,Cluster analysis ,Data science - Abstract
With the integration of cloud computing and big data, it is difficult for the masses to discover valuable service from big data. The understanding of historical data and streaming data is fundamental to the value discovery, and the construction of topic knowledge is essential to the understanding of big data. This paper proposes an approach for the construction of topic knowledge based on ontology meta-modeling, and the approach follows three stages: classification, clustering and integration. Furthermore, the realization of the three stages is based on support vector machine, probability computing, and ontology meta-modeling. Finally, experiments on scientific papers of service computing were conducted in order to get the recommended reviewers. The results of the experiments demonstrate the effectiveness of the approach. In conclusion, the approach provides a solution for the value discovery from big data.
- Published
- 2014
21. A cloud-based approach to library management solution for college libraries
- Author
-
Shaw, Jitendra Nath and De Sarkar, Tanmay
- Published
- 2021
- Full Text
- View/download PDF
22. A Demo Paper: An Analytic Workflow Framework for Green Campus
- Author
-
Changbing Chen, Bong Zoebir, Chonho Lee, Sivadon Chaisiri, Bu-Sung Lee, School of Computer Engineering, and IEEE International Conference on Parallel and Distributed Systems (18th : 2012 : Singapore)
- Subjects
Engineering::Computer science and engineering [DRNTU] ,Database ,Computer science ,Windows Workflow Foundation ,Event (computing) ,business.industry ,Big data ,Cloud computing ,computer.software_genre ,Workflow engine ,Workflow technology ,Workflow ,Utility computing ,business ,computer ,Workflow management system - Abstract
This paper proposes a multi-tenant workflow framework that allows users to create data analytic workflows whose tasks are efficiently scheduled and distributed in cloud computing environment. We provide a demo of an event room assignment (ERA) as a test application of the framework. The ERA dynamically and automatically assigns registered events (e.g., meetings, classes, conferences, etc.) to available rooms meeting the user requirements such as the event size, purpose, reservation period, etc. The assignment will lead to the energy efficiency with respect to the power usage (e.g., lighting, ventilation, devices, etc.), and the energy savings can be achieved without affecting people's comfort. We run the ERA with power consumption data (whose size is approximately 50GB) collected from each of over 200 rooms in a building at Dept. of Engineering, Tokyo University. Through the demonstration, we will show that the proposed framework accelerates the speed of data analysis by providing user-friendly workflow composition and parallel processing features utilizing cloud computing technologies.
- Published
- 2012
23. Exploring the Literature of Data Analytics Services on Cloud Computing: A Comprehensive Summary.
- Author
-
Abdul-Jabbar, Safa S. and Farhan, Alaa k.
- Subjects
CLOUD computing ,DATA analytics ,COLLECTION agencies ,DIGITAL technology - Abstract
Copyright of Iraqi Journal of Science is the property of Republic of Iraq Ministry of Higher Education & Scientific Research (MOHESR) and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2024
- Full Text
- View/download PDF
24. 造纸工业智能化设计探讨.
- Author
-
崔焕新, 许海蓝, 牛帅, 司成林, 杨乐, and 张涛
- Subjects
FAULT diagnosis ,BIG data ,DATA extraction ,INTELLIGENT buildings ,CLOUD computing - Abstract
Copyright of China Pulp & Paper is the property of China Pulp & Paper Magazines Publisher and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2020
- Full Text
- View/download PDF
25. Data pipeline approaches in serverless computing: a taxonomy, review, and research trends.
- Author
-
Shojaee Rad, Zahra and Ghobaei-Arani, Mostafa
- Subjects
PIPELINE inspection ,REAL-time computing ,FAULT-tolerant computing ,ELECTRONIC data processing ,TAXONOMY ,FAULT tolerance (Engineering) ,COST control - Abstract
Serverless computing has gained significant popularity due to its scalability, cost-effectiveness, and ease of deployment. With the exponential growth of data, organizations face the challenge of efficiently processing and analyzing vast amounts of data in a serverless environment. Data pipelines play a crucial role in managing and transforming data within serverless architectures. This paper provides a taxonomy of data pipeline approaches in serverless computing. Classification is based on architectural features, data processing techniques, and workflow orchestration mechanisms, these approaches are categorized into three primary methods: heuristic-based approach, Machine learning-based approach, and framework-based approach. Furthermore, a systematic review of existing data pipeline frameworks and tools is provided, encompassing their strengths, limitations, and real-world use cases. The advantages and disadvantages of each approach, also the challenges and performance metrics that influence their effectuality have been examined. Every data pipeline approach has certain advantages and disadvantages, whether it is framework-based, heuristic-based, or machine learning-based. Each approach is suitable for specific use cases. Hence, it is crucial assess the trade-offs between complexity, performance, cost, and scalability, while selecting a data pipeline approach. In the end, the paper highlights a number of open issues and future investigations directions for data pipeline in the serverless computing, which involve scalability, fault tolerance, data real time processing, data workflow orchestration, function state management with performance and cost in the serverless computing environments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Proposal guidelines to implement the concepts of industry 4.0 into information technology companies
- Author
-
Cunha, Tairine Pravadelli, Méxas, Mirian Picinini, Cantareli da Silva, André, and Gonçalves Quelhas, Osvaldo Luiz
- Published
- 2020
- Full Text
- View/download PDF
27. Into the Future with Cloud: A Comparison with Onpremises Data Warehouse.
- Author
-
Noor, Iman, Bin Tariq, Saad, Shabbir, Aisha, and Aksa, Mary
- Subjects
DATA warehousing ,CLOUD computing ,BIG data ,COST analysis ,DATA security - Abstract
The need for data is growing at an extremely steep rate in the ever-digital realm, where terms like "big data" are becoming a thing of the past. All this development requires the use of modern and advanced data handling techniques, where users and researchers can analyze and predict vast amounts of data efficiently. Data warehouses are centralized repositories of data used for business intelligence activities such as analysis and reporting. In this paper, a comparative emphasis has been laid down on two different types of data warehouses, on-premises, and cloud data warehouses. The on-premises are known to be physically housed inside an organization's infrastructure. Cloud data warehouses are online-accessible repositories for data that is stored on cloud platforms. This paper provides a comparative analysis of both types in the context of deployment, scalability, flexibility, query management, cost analysis, access and integration, data security, data storage, data recovery, self-service capabilities and nonetheless, speed and performance. This article further highlights the evolution of data warehouses onto cloud and accentuates the growing demand for an efficient data warehouse, due to the amplification of volume, velocity, variety, value, and veracity of the incoming data in all realms. Furthermore, it provides an in-depth analysis of the advantages of the most suitable data warehouse and discusses the limitations of both. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Into the Future with Cloud: A Comparison with On-premises Data Warehouse.
- Author
-
Noor, Iman, Tariq, Saad Bin, Shabbir, Aisha, and Aksa, Mary
- Subjects
CLOUD computing ,WAREHOUSE management ,DATA analysis ,COST analysis ,DATA recovery - Abstract
The need for data is growing at an extremely steep rate in the ever-digital realm, where terms like "big data" are becoming a thing of the past. All this development requires the use of modern and advanced data handling techniques, where users and researchers can analyze and predict vast amounts of data efficiently. Data warehouses are centralized repositories of data used for business intelligence activities such as analysis and reporting. In this paper, a comparative emphasis has been laid down on two different types of data warehouses, on-premises, and cloud data warehouses. The on-premises are known to be physically housed inside an organization's infrastructure. Cloud data warehouses are online-accessible repositories for data that is stored on cloud platforms. This paper provides a comparative analysis of both types in the context of deployment, scalability, flexibility, query management, cost analysis, access and integration, data security, data storage, data recovery, self-service capabilities and nonetheless, speed and performance. This article further highlights the evolution of data warehouses onto cloud and accentuates the growing demand for an efficient data warehouse, due to the amplification of volume, velocity, variety, value, and veracity of the incoming data in all realms. Furthermore, it provides an in-depth analysis of the advantages of the most suitable data warehouse and discusses the limitations of both. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. An Insight into the State of Big Data Research: A Bibliometric Study of Scientific Publications.
- Author
-
Islam, Md Nurul and Hu, Guangwei
- Subjects
BIG data ,DATABASES ,BIBLIOMETRICS ,CITATION indexes ,MACHINE learning ,CLOUD computing - Abstract
In the past few years, the field of big data has multiplied, with more academic papers written about it. This bibliometric research was done to look at and understand the trends regarding countries, organizations, authors, and keywords that are creating the most publications and citations in big data. This study was done to understand the current state of scientific publications in the field. The research used Web of Science (WoS) database information from 1993 to 2021. The study of 32,085 papers showed that, on average, each document has 14.7 citations and 3.46 citations per year. According to the results, the United States, China, and the United Kingdom have the most scholarly publications about big data. The Chinese Academy of Science, Harvard University, and Stanford University were the three most productive groups. When researching big data, most writers work together, and most terms are related to big data analytics, machine learning, and cloud computing. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Towards big services: a synergy between service computing and parallel programming.
- Author
-
Mezni, Haithem, Sellami, Mokhtar, Aridhi, Sabeur, and Charrada, Faouzi Ben
- Subjects
CLOUD computing ,PARALLEL programming ,BIG data ,ELECTRONIC data processing ,MOBILE apps ,OPERATIONS management - Abstract
Over the last years, cloud computing has emerged as a natural choice to host, manage, and provide various kinds of virtualized resources (e.g., software, business processes, databases, platforms, mobile and social applications, etc.) as on-demand services. This "servicelization" across various domains has produced a huge volume of data, leading to the emergence of a new service model, called big service. This latter consists of the encapsulation, abstraction and the processing of big data, allowing then to hide their complexity. However, this promising approach still lacks management facilities and tools. Indeed, due to the highly dynamic and uncertain nature of their hosting cloud environments, big services together with their accessed data need continuous management operations, so that to maintain a moderate state and high quality of their execution. In this context, frameworks for designing, composing, executing and managing big services become a major need. The purpose of this paper is to provide an understanding of the new emerging big service model from the lifecycle management phases' point of view. We also study the role of big data frameworks and multi-cloud strategies in the provisioning of big services. A research road map on this topic will be summarized at the end of this paper. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
31. EXPLORING TRADITIONAL AND ADVANCED CLOUD COMPUTING ARCHITECTURE.
- Author
-
VEDAIYAN, RAMCHAND
- Subjects
MODERN architecture ,CLOUD computing ,REMOTE sensing ,DATA management ,BIG data - Abstract
This paper explores the fundamental elements of cloud computing architecture, discussing the integration of cloud computing with Big Data, the use of homomorphic encryption, and remote sensing data in cloud architecture. It also examines advanced cloud architectures, such as Hypervisor Clustering architecture, Load Balanced Virtual Server Instances architecture, Zero Downtime architecture, and Resource Reservation architecture, emphasizing their role in enhancing availability and scalability. The paper concludes by showcasing the significance of cloud computing architecture in addressing modern computational and data management challenges. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
32. Call for papers.
- Subjects
- *
BIG data , *COMMUNICATION , *CLOUD computing , *MACHINE learning , *DATA mining - Abstract
Prospective authors are requested to submit new, unpublished manuscripts for inclusion in the upcoming event described in this call for papers. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
33. Attribute-Based Searchable Encryption: A Survey.
- Author
-
Yan, Li, Wang, Gaozhou, Yin, Tian, Liu, Peishun, Feng, Hongxin, Zhang, Wenbin, Hu, Hailin, and Pan, Fading
- Subjects
CLOUD computing ,INFORMATION sharing ,RESEARCH personnel ,BIG data ,PROBLEM solving ,PRIVACY - Abstract
With the advent of the big data era, the size and complexity of data continue to increase, which makes the requirement for data privacy and security increasingly urgent. However, traditional encryption methods cannot meet the demand for efficient searching in large-scale datasets. To solve this problem and enable users to search within encryped data and without decrypting the entire dataset, trapdoor functions and other cryptograhic techniques are introduced in searchable encryption. However, searchable encryption still cannot meet the needs in the real world. Therefore, researchers have introduced the concept of attribute-based encryption into searchable encryption, resulting in attribute-based searchable encryption (ABSE). This approach aims to achieve efficient search by attributes in encrypted datasets. ABSE has a wide range of applications in the fields of privacy protection, data sharing, and cloud computing. In this paper, we describe the trends in development, focusing on enhancing security, improving computational efficiency, and increasing flexibility. We also present the related schemes. In addition, several common application areas are introduced and the relevant schemes proposed by researchers are summarized. Moreover, the challenges and future directions of ABSE are discussed in this paper. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Data-Oriented Operating System for Big Data and Cloud.
- Author
-
Kessler, Selwyn Darryl, Ng, Kok-Why, and Haw, Su-Cheng
- Subjects
BIG data ,COMPUTERS ,DATA management ,ENGINEERING design ,SYSTEMS design ,HARD disks - Abstract
Operating System (OS) is a critical piece of software that manages a computer's hardware and resources, acting as the intermediary between the computer and the user. The existing OS is not designed for Big Data and Cloud Computing, resulting in data processing and management inefficiency. This paper proposes a simplified and improved kernel on an x86 system designed for Big Data and Cloud Computing purposes. The proposed algorithm utilizes the performance benefits from the improved Input/Output (I/O) performance. The performance engineering runs the data-oriented design on traditional data management to improve data processing speed by reducing memory access overheads in conventional data management. The OS incorporates a data-oriented design to "modernize" various Data Science and management aspects. The resulting OS contains a basic input/output system (BIOS) bootloader that boots into Intel 32-bit protected mode, a text display terminal, 4 GB paging memory, 4096 heap block size, a Hard Disk Drive (HDD) I/O Advanced Technology Attachment (ATA) driver and more. There are also I/O scheduling algorithm prototypes that demonstrate how a simple Sweeping algorithm is superior to more conventionally known I/O scheduling algorithms. A MapReduce prototype is implemented using Message Passing Interface (MPI) for big data purposes. An attempt was made to optimize binary search using modern performance engineering and data-oriented design. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Time Series Prediction in Industry 4.0: A Comprehensive Review and Prospects for Future Advancements.
- Author
-
Kashpruk, Nataliia, Piskor-Ignatowicz, Cezary, and Baranowski, Jerzy
- Subjects
BIG data ,TIME series analysis ,INDUSTRY 4.0 ,LITERATURE reviews ,ARTIFICIAL intelligence ,MANUFACTURING processes - Abstract
Time series prediction stands at the forefront of the fourth industrial revolution (Industry 4.0), offering a crucial analytical tool for the vast data streams generated by modern industrial processes. This literature review systematically consolidates existing research on the predictive analysis of time series within the framework of Industry 4.0, illustrating its critical role in enhancing operational foresight and strategic planning. Tracing the evolution from the first to the fourth industrial revolution, the paper delineates how each phase has incrementally set the stage for today's data-centric manufacturing paradigms. It critically examines how emergent technologies such as the Internet of things (IoT), artificial intelligence (AI), cloud computing, and big data analytics converge in the context of Industry 4.0 to transform time series data into actionable insights. Specifically, the review explores applications in predictive maintenance, production optimization, sales forecasting, and anomaly detection, underscoring the transformative impact of accurate time series forecasting on industrial operations. The paper culminates in a call to action for the strategic dissemination and management of these technologies, proposing a pathway for leveraging time series prediction to drive societal and economic advancement. Serving as a foundational compendium, this article aims to inform and guide ongoing research and practice at the intersection of time series prediction and Industry 4.0. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
36. Thematic Analysis of Big Data in Financial Institutions Using NLP Techniques with a Cloud Computing Perspective: A Systematic Literature Review.
- Author
-
Sharma, Ratnesh Kumar, Bharathy, Gnana, Karimi, Faezeh, Mishra, Anil V., and Prasad, Mukesh
- Subjects
LITERATURE reviews ,BIG data ,THEMATIC analysis ,DATA analysis ,BIBLIOMETRICS ,NATURAL language processing ,FINANCIAL institutions ,CLOUD computing - Abstract
This literature review explores the existing work and practices in applying thematic analysis natural language processing techniques to financial data in cloud environments. This work aims to improve two of the five Vs of the big data system. We used the PRISMA approach (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) for the review. We analyzed the research papers published over the last 10 years about the topic in question using a keyword-based search and bibliometric analysis. The systematic literature review was conducted in multiple phases, and filters were applied to exclude papers based on the title and abstract initially, then based on the methodology/conclusion, and, finally, after reading the full text. The remaining papers were then considered and are discussed here. We found that automated data discovery methods can be augmented by applying an NLP-based thematic analysis on the financial data in cloud environments. This can help identify the correct classification/categorization and measure data quality for a sentiment analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
37. Perspectives on Big Data, Cloud-Based Data Analysis and Machine Learning Systems.
- Author
-
Marozzo, Fabrizio and Talia, Domenico
- Subjects
BIG data ,MACHINE learning ,CLOUD computing ,DATA analysis ,INSTRUCTIONAL systems ,NEWS websites - Published
- 2023
- Full Text
- View/download PDF
38. Industry 4.0 and healthcare: Context, applications, benefits and challenges.
- Author
-
Kotzias, Konstantinos, Bukhsh, Faiza A., Arachchige, Jeewanie Jayasinghe, Daneva, Maya, and Abhishta, Abhishta
- Subjects
INDUSTRY 4.0 ,AUGMENTED reality ,TECHNOLOGICAL innovations ,DIGITAL transformation ,AUTOMATION ,DIGITAL technology ,MANUFACTURING processes ,SYSTEM integration - Abstract
Industry 4.0 refers to the digital transformation in the manufacturing domain through new technology. Currently, it expands well beyond manufacturing, affecting many areas of life and posing implications for all types of business. This paper focuses on the relationships between Industry 4.0 and Healthcare which transitions to increased interconnectivity, automation and smart decision making. The integration context of Industry 4.0 into Healthcare is only partly understood. Little was done until now to consolidate what is known on the integration benefits and the challenges. This article reports results of a systematic mapping study that analysed 69 papers to extract knowledge about the concepts of Industry 4.0 and the emerging Healthcare 4.0., and the relationships between them. We found 10 different perspectives of Healthcare 4.0, ranging from strategic to tactical and operational levels. Next, our results show: (i) nine applications of Industry 4.0 in the Healthcare domain: Augmented Reality and Simulation, Autonomous Robotics, Cybersecurity, Big Data Analytics, Internet of Things, Cloud Computing, Additive Manufacturing and Systems Integration; and (ii) 10 benefits and nine challenges in Healthcare 4.0. The most frequently mentioned benefits are patients' diagnosis, monitoring, treatment, and financial benefits. The most researched challenges are data fragmentation, heterogeneity, complexity, and privacy. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
39. Hybridized Wrapper Filter Using Deep Neural Network for Intrusion Detection.
- Author
-
Venkateswaran, N. and Umadevi, K.
- Subjects
CLOUD computing ,BIG data ,DIFFERENTIAL equations ,INTRUSION detection systems (Computer security) ,DATA extraction - Abstract
Huge data over the cloud computing and big data are processed over the network. The data may be stored, send, altered and communicated over the network between the source and destination. Once data send by source to destination, before reaching the destination data may be attacked by any intruders over the network. The network has numerous routers and devices to connect to internet. Intruders may attack any were in the network and breaks the original data, secrets. Detection of attack in the network became interesting task for many researchers. There are many intrusion detection feature selection algorithm has been suggested which lags on performance and accuracy. In our article we propose new IDS feature selection algorithm with higher accuracy and performance in detecting the intruders. The combination of wrapper filtering method using Pearson correlation with recursion function is used to eliminate the unwanted features. This feature extraction process clearly extracts the attacked data. Then the deep neural network is used for detecting intruders attack over the data in the network. This hybrid machine learning algorithm in feature extraction process helps to find attacked information using recursive function. Performance of proposed method is compared with existing solution. The traditional feature selection in IDS such as differential equation (DE), Gain ratio (GR), symmetrical uncertainty (SU) and artificial bee colony (ABC) has less accuracy than proposed PCRFE. The experimented results are shown that our proposed PCRFE-CDNN gives 99% of accuracy in IDS feature selection process and 98% in sensitivity. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
40. Multimedia big data computing mechanisms: a bibliometric analysis.
- Author
-
Rivai, Faradillah Amalia, Navimipour, Nima Jafari, and Yalcın, Senay
- Subjects
USER-generated content ,BIBLIOMETRICS ,BIG data ,INTERNET content ,ELECTRONIC publications ,SOCIAL networks ,CELL phones - Abstract
Massive multimedia data are being created due to the rising amount of the Internet and user-generated content, low-cost commodity devices with cameras (like cellphones, surveillance systems, and so on), and the proliferation of social networks, forming a unique type of big data. Several studies have been conducted in this research area using a survey and event analysis approach; however, none has been conducted to investigate the status of knowledge, its features, evolution, and emerging trend of multimedia big data. Therefore, in this paper, a bibliometric study using VOSviewer software is carried out with 1,865 documents from 2008 to 2020. Based on the result, 2013 is the starting year where the total publication excess of 100 articles and the configuration of leading countries, productive organizations, and authors are investigated. The most cited journals, popular publications venues, and hot research topics are also included in the investigations. Our investigation uncovered useful information, such as annual publishing patterns, the hottest research topic, the top 10 important authors and articles, and the most helpful funding organizations and venues. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
41. Big Data Analytics and Cloud Computing on Industrial Management, Information Engineering and Data Science.
- Author
-
SOEBANDRIJA, Khristian Edi Nugroho
- Subjects
BIG data ,CLOUD computing ,INDUSTRIAL management ,DIGITAL transformation ,DATA science - Abstract
This paper elaborates big data analytics and cloud computing vis-à-vis the trilogy of industrial management, information engineering and data science. Thus, this paper provides balanced perspectives on both management and engineering. Both data analytics and cloud computing constitute solid combination vis-à-vis the mentioned trilogy. Handling, storing and analyzing data in the digital transformation era and disruptive innovation competition are the significant distinctive competitive advantages in the corporate world. Artificial Intelligence (AI) and Machine Learning (ML) applications in industrial perspectives and industrial management are deemed behind advancements in the empirical aspects of computer and network. Thus, to fill the gap, information engineering and data science in this paper, bridging the mentioned gap. The purpose of this paper is to ensure the full swing application of data analytics and cloud computing in both engineering and management perspectives in order to generate sustainable competitive advantages. Ultimately, this paper proceeds quantitative methods, and if it is deemed indispensable proceed to qualitative methods. The trilogy of this paper includes industrial management and information engineering and data science. The two latter aspects which are information engineering and data science constitutes highly demanded disciplines, as it happens to apply statistics, data mining, database management science and management information system within information engineering. [ABSTRACT FROM AUTHOR]
- Published
- 2023
42. Dynamic Power Provisioning System for Fog Computing in IoT Environments.
- Author
-
Al Masarweh, Mohammed and Alwada'n, Tariq
- Subjects
COMPUTER systems ,INTERNET of things ,CLOUD computing ,SERVER farms (Computer network management) ,MULTIAGENT systems ,FOG ,ELECTRICITY - Abstract
Large amounts of data are created from sensors in Internet of Things (IoT) services and applications. These data create a challenge in directing these data to the cloud, which needs extreme network bandwidth. Fog computing appears as a modern solution to overcome these challenges, where it can expand the cloud computing model to the boundary of the network, consequently adding a new class of services and applications with high-speed responses compared to the cloud. Cloud and fog computing propose huge amounts of resources for their clients and devices, especially in IoT environments. However, inactive resources and large number of applications and servers in cloud and fog computing data centers waste a huge amount of electricity. This paper will propose a Dynamic Power Provisioning (DPP) system in fog data centers, which consists of a multi-agent system that manages the power consumption for the fog resources in local data centers. The suggested DPP system will be tested by using the CloudSim and iFogsim tools. The outputs show that employing the DPP system in local fog data centers reduced the power consumption for fog resource providers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Research on Abnormal Behavior Prediction by Integrating Multiple Indexes of Student Behavior and Text Information in Big Data Environment.
- Author
-
Wang, Yubiao, Wen, Junhao, Zhou, Wei, Wu, Quanwang, Wei, Yingchun, Li, Heng, and Tao, Bamei
- Subjects
PSYCHOLOGY of students ,BIG data ,INFORMATION-seeking behavior ,BEHAVIORAL research ,K-means clustering ,SHORT-term memory ,CLOUD computing ,MULTIDIMENSIONAL databases - Abstract
With the wide application of information technologies such as big data, the Internet of Things, and cloud computing, college students have accumulated a large amount of personal information and daily behavior data in their daily studies and life. How to dynamically integrate multidimensional information of students to build accurate student portraits, using multi-indicator data of student behavior and comment texts, and finding out students with abnormal behavior from among many students has become an important problem to be solved. This paper proposes an abnormal behavior prediction method integrating multiple indicators of student behavior and text information (ABPM-IMISBTI) for the problem of abnormal behavior prediction of college students in the big data environment. First, given the problems of multidimensionality, timeliness, and dynamics of student behavior information fusion in the construction of student behavior portraits, by integrating students' objective tags and subjective tags, an optimized K-means algorithm based on a cloud platform environment is proposed. Second, aiming at the problem of insufficient text information analysis in the analysis of students' abnormal behavior, the ABPM-IMISBTI method is proposed to solve the prediction of students' abnormal behavior through long and short-term memory networks (LSTM) combined with student behavior multi-index data and text information. Finally, this paper takes student achievement prediction as an example for verification. The experimental results show that, compared with other prediction methods, the ABPM-IMISBTI method proposed in this paper can improve the accuracy of student behavior prediction, and then quickly determine the abnormal behavior of students, to improve the level of education management in universities and promote the development of safe campuses, smart campuses, and smart education. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
44. Data management within new product development and collaborative engineering: a bibliometric and systemic analysis.
- Author
-
Larocca, Arthur, Borsato, Milton, Kubo, Pablo, and Estorilio, Carla
- Subjects
BIBLIOMETRICS ,NEW product development ,LITERATURE reviews ,EVIDENCE gaps ,BIG data ,DIGITAL twins ,DATA management ,CLOUD computing - Abstract
Purpose: Although organizations have more data than ever at their disposal, actually deriving meaningful insights and actions from them is easier said than done. In this concern, the main objective of this study is to identify trends and research opportunities regarding data management within new product development (NPD) and collaborative engineering. Design/methodology/approach: Bibliometric and systemic analyses have been carried out using the methodological procedure ProKnow-C, which provides a structured framework for the literature review. A bibliographic portfolio (BP) was consolidated with 33 papers that represent the state of art in the subject. Findings: Most recent researches within the BP indicate new trends and paradigm shifts in this area of research, tackling subjects such as the internet of things, cloud computing, big data analytics and digital twin. Research gaps include the lack of data automation and the absence of a common architecture for systems integration. However, from a general perspective of the BP, the management of experimental data is suggested as a research opportunity for future works. Although many studies have tackled data and collaboration based on computer-aided technologies environments, no study examined the management of the measured data collected during the verification and validation stages of a product. Originality/value: This work provides a fresh and relevant source of authors, journals and studies for researchers and practitioners interested in the domain of data management applied to NPD and collaborative engineering. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
45. Using big data to make better decisions in the digital economy.
- Author
-
Tan, Kim Hua, Ji, Guojun, Lim, Chee Peng, and Tseng, Ming-Lang
- Subjects
BIG data ,TECHNOLOGY & economics ,CLOUD computing - Abstract
The question this special issue would like to address is how to harvest big data to help decision-makers to deliver better fact-based decisions aimed at improving performance or to create better strategy? This special issue focuses on the big data applications in supporting operations decisions, including advanced research on decision models and tools for the digital economy. Responds to this special issue was great and we have included many high-quality papers. We are pleased to present 13 of the best papers. The techniques presented include data mining, simulation and expert system with applications span across online reviews, food retail chain to e-health. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
46. Internet of Things and Artificial Intelligence applied to predictive maintenance in Industry 4.0: A systematic literature review.
- Author
-
Mendes Caldana, Vitor, Garrido da Silva, Francisco Diego, Araujo de Oliveira, Rafael, and Freitag Borin, Juliana
- Subjects
INTERNET of things ,ARTIFICIAL intelligence ,INDUSTRY 4.0 ,PRODUCTION management (Manufacturing) ,BIG data ,CLOUD computing - Abstract
The technological advancements in Industry 4.0, specifically in the areas of Industrial Internet of Things (IIoT) and Artificial Intelligence (AI) enables a series of enhancements in production management. The development of Big Data, Fog & Cloud Computing and Neural Networks have made Predictive Maintenance (PdM) an area of interest as it has been able to effectively transform and adapt to machine conditions. This paper presents a systemic literature review of the state of the art in AI and IIoT regarding PdM to serve as a basis for future work in the area. The relevance of this subject is still high, as seen by the number of publications in the last two years, however there are still several relevant research challenges to be addressed, in particular to achieve an adaptable and homogeneous PdM model. [ABSTRACT FROM AUTHOR]
- Published
- 2021
47. Enhancing Resilience via Exponential Technologies: Analysing Trends, Focus and Contributions.
- Author
-
Arora, Manpreet, Kumar, Jeetesh, Dhiman, Vaishali, Rathore, Sunaina, Singh, Swati, and Chandel, Monika
- Subjects
ARTIFICIAL intelligence ,BIBLIOMETRICS ,BIG data ,CLOUD computing ,BLOCKCHAINS - Abstract
This article seeks to conduct a bibliometric analysis focusing on exponential technologies such as big data, internet of thing (IoT), artificial intelligence (AI), blockchain and cloud computing. It aims to outline research trends in this domain and explore their correlation with resilience. The study aims to track the evolution of research trends in this field over time and identify less explored dimensions of exponential technologies. Leveraging performance analysis and science mapping techniques, the paper highlights the significant growth and potential in these areas, considering them as pivotal agendas of the twenty-first century. By examining scientific productivity metrics such as publications, authors, institutions, countries and keywords, the article offers insights into emerging areas within exponential technologies. As the first comprehensive study of its kind, it provides a broad overview of the main trends and patterns in resilience research encompassing big data, IoT, AI, blockchain and cloud computing, consolidating them into a single cohesive narrative. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. On the Optimization of Kubernetes toward the Enhancement of Cloud Computing.
- Author
-
Mondal, Subrota Kumar, Zheng, Zhen, and Cheng, Yuning
- Subjects
DISASTER resilience ,CLOUD computing ,DATA distribution ,QUALITY of service ,BIG data - Abstract
With the vigorous development of big data and cloud computing, containers are becoming the main platform for running applications due to their flexible and lightweight features. Using a container cluster management system can more effectively manage multiocean containers on multiple machine nodes, and Kubernetes has become a leader in container cluster management systems, with its powerful container orchestration capabilities. However, the current default Kubernetes components and settings have appeared to have a performance bottleneck and are not adaptable to complex usage environments. In particular, the issues are data distribution latency, inefficient cluster backup and restore leading to poor disaster recovery, poor rolling update leading to downtime, inefficiency in load balancing and handling requests, poor autoscaling and scheduling strategy leading to quality of service (QoS) violations and insufficient resource usage, and many others. Aiming at the insufficient performance of the default Kubernetes platform, this paper focuses on reducing the data distribution latency, improving the cluster backup and restore strategies toward better disaster recovery, optimizing zero-downtime rolling updates, incorporating better strategies for load balancing and handling requests, optimizing autoscaling, introducing better scheduling strategy, and so on. At the same time, the relevant experimental analysis is carried out. The experiment results show that compared with the default settings, the optimized Kubernetes platform can handle more than 2000 concurrent requests, reduce the CPU overhead by more than 1.5%, reduce the memory by more than 0.6%, reduce the average request time by an average of 7.6%, and reduce the number of request failures by at least 32.4%, achieving the expected effect. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Big-IDS: a decentralized multi agent reinforcement learning approach for distributed intrusion detection in big data networks.
- Author
-
Louati, Faten, Ktata, Farah Barika, and Amous, Ikram
- Subjects
CLOUD computing security measures ,REINFORCEMENT learning ,BIG data ,ANOMALY detection (Computer security) ,MACHINE learning ,INTRUSION detection systems (Computer security) - Abstract
The growing complexity of security threats and the pervasive prevalence of cyberattacks have become more apparent in the present era, and the advent of big data, characterized by its distinctive features, has introduced layers of complexity to security tasks. Intrusion Detection Systems (IDSs) constitute a crucial line of defense, but their adaptation to the realm of big data is imperative. While traditional Machine Learning (ML)-based IDSs have been pivotal in detecting malicious patterns, they are often incapable to keep pace with the demands of expansive big data networks. This paper proposes a novel decentralized Multi-Agent Reinforcement Learning (MARL)-based IDS designed to address the specific challenges posed by big data. Our solution employs decentralized cooperative MARL, securing communicative channels throughout the detection process and concurrent data preprocessing which significantly reduces the overall processing time. Furthermore, the integration of Cloud computing and Big Data streaming techniques further facilitates real-time intrusion detection as cloud's resources allow rapid pre-process and analyse of massive data streams using powerful clusters. Likewise, Big Data streaming techniques ensure that potential intrusions are identified and addressed as they occur. Experimental results, conducted on the widely recognized NSLKDD benchmark dataset, demonstrate the superiority of our solution over other state-of-the-art approaches for big data networks, achieving an accuracy rate of 97.44%. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. Cloud Computing-aided Multi-type Data Fusion with Correlation for Education.
- Author
-
Tai, Baoqing, Li, Xindong, Yang, Lifang, Miao, Ying, Lin, Wenmin, and Yan, Chao
- Subjects
MULTISENSOR data fusion ,BIG data ,EDUCATIONAL quality - Abstract
As one of the major constitutes of human society, education has been continuously producing a huge amount of data and become an important of sources of big data. Deeply mining and analyzing these big education data are of practical significance for optimizing education resource deployment and improving education quality. However, the big education data are often of diverse types and from multiple parties, which raises a big challenge for accurate and reasonable educational data fusion especially when the educational data are correlated with each other. In view of this challenge, we put forward a novel cloud computing-aided multi-type data fusion approach considering data correlation in education, to accommodate the big volume, diverse types and correlation of educational data. In concrete, the data fusion operation is mainly based on the Mahalanobis distances which can overcome the data diversity in multiple-dimensional data fusion for education. Afterwards, we provide a case study to show the concrete steps of our proposal. At last, a set of experiments are deployed to validate the feasibility of our proposal in this paper. Experimental results prove the effectiveness and efficiency of our approach in dealing with multi-type data fusion with correlation in education. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.