Descriptor: "Executable" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Executable"' showing total 10,049 results

Start Over Descriptor "Executable"

10,049 results on '"Executable"'

1. A Survey of Binary Code Similarity.

Author: HAQ, IRFAN UL and CABALLERO, JUAN
Subjects: *BINARY codes, *MALWARE
Abstract: Binary code similarityapproaches compare two or more pieces of binary code to identify their similarities and differences. The ability to compare binary code enables many real-world applications on scenarios where source code may not be available such as patch analysis, bug search, and malware detection and analysis. Over the past 22 years numerous binary code similarity approaches have been proposed, but the research area has not yet been systematically analyzed. This article presents the first survey of binary code similarity. It analyzes 70 binary code similarity approaches, which are systematized on four aspects: (1) the applications they enable, (2) their approach characteristics, (3) how the approaches are implemented, and (4) the benchmarks and methodologies used to evaluate them. In addition, the survey discusses the scope and origins of the area, its evolution over the past two decades, and the challenges that lie ahead. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

2. Comparative analysis of segmentation techniques based on chest X-ray images.

Author: Kiran, Mahreen, Ahmed, Imran, Khan, Nazish, ur Rehman, Hamood, Din, Sadia, Paul, Anand, and Reddy, Alavalapati Goutham
Subjects: COMPARATIVE studies, IMAGE processing, DIAGNOSTIC imaging, LUNGS
Abstract: The image segmentation is the basic step in the image processing involved in the processing of medical images. Over the past two decades, medical image segmentation has remained a challenge for researchers while the use of this imaging modality is rapidly growing in research studies. This article surveys the techniques and their effect on chest X-ray images. The objective of this work is to study the key similarities and differences among the different published methods while highlighting their strengths and weaknesses on chest X-ray images. The reason is to assist the researchers in the choice of an appropriate lung segmentation methodology. We additionally give a complete portrayal of the existing few basic methods when combined with preprocessing method that can be utilized as a part of the segmentation. A discussion and fair analysis justified with experimental results along with quantitative correlation of the outcomes on 247 images of JSRT through Dice coefficient exhibited. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

3. Uncovering Secrets of the Maven Repository: Maven packaging

Author: Rungta, Priyam (author) and Rungta, Priyam (author)
Abstract: Maven, a widely adopted software ecosystem for Java libraries, plays a critical role in the development and deployment of software applications. However, there exists a limited understanding of the composition and characteristics of the Maven repository, leaving users and contributors unaware of the contents they interact with. This research aims to address this knowledge gap by conducting a comprehensive analysis of Maven packaging and informing developers, library maintainers, security analysts, and the open-source community about Maven library practices. The research investigates the secrets of the Maven repository, focusing on Maven packaging. Using data from the POM file, Maven index file, and Maven repository, we analyze the distribution of packaging types, checksums, qualifiers, and file types within Maven libraries. The experiment involves examining 479,915 packages from the Maven repository, utilizing the POM file, the Maven index, the Maven repository and manual requests to the Maven repository. The results reveal that JAR is the packaging type in more than 75% packages across all sources, and inconsistencies are found among different data sources, highlighting the need for improved data consistency and reliability within the Maven ecosystem. Furthermore, the adoption of the sha256 and sha512 checksum algorithms remains limited, with only 1.4% of packages utilizing these secure hash functions. In terms of qualifiers, sources and Javadoc exhibit the highest prevalence, with adoption rates of 82% and 76% respectively. Moreover, class files and XML are identified as the most frequently packaged file types, encompassing 71% and 61% of the packages, respectively among a very diverse classification. These findings provide insights into Maven library characteristics and inform optimization of library usage., CSE3000 Research Project, Computer Science and Engineering
Published: 2023

4. eUF: A framework for detecting over-the-air malicious updates in autonomous vehicles

Author: Anam Qureshi, Jawwad Ahmed Shamsi, Adnan Aijaz, and Murk Marvi
Subjects: General Computer Science, business.industry, Computer science, Distributed computing, 020206 networking & telecommunications, 02 engineering and technology, computer.file_format, computer.software_genre, Convolutional neural network, Software, Channel (programming), Scalability, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), Wireless, Malware, 020201 artificial intelligence & image processing, Executable, business, computer
Abstract: Software updates are highly significant in autonomous vehicles. These updates are utilized to provide enhanced features and updated security mechanisms. In order to ensure scalability and smooth roll-out Over-the-air (OTA) mechanism is a preferred option to propagate a software update. However, this approach is vulnerable to security attacks because of existence of wireless communication channel between the vehicle and the manufacturer. In that, an attacker can replace the legitimate software with a malicious software with an intent to get control over the vehicle. In this work, we are motivated to address this problem. We develop an enhanced uptane framework for detection of malicious OTA software updates in autonomous vehicles. For enhancing security, we incorporate convolutional neural network (CNN) in the uptane framework. The proposed framework is able to distinguish between malicious and benign software executables with high accuracy. For training and testing, we create two datasets by collecting executables of Windows and Linux operating system. We encourage the use of transfer learning by exploiting the developed CNN models in order to detect malicious executable designed for autonomous vehicles. We also benchmark the CNN models against state-of-the art models. Our work is highly beneficial for the community in providing a secure mechanism for software updates.
Published: 2022

5. A Framework for Dynamic Composition and Management of Emergency Response Processes

Author: Nabil R. Adam, Abeer Elahraf, Basit Shafiq, Ahmed Akhtar, Jaideep Vaidya, Ayesha Afzal, and Shafay Shamail
Subjects: Service (systems architecture), Information Systems and Management, Computer Networks and Communications, Process (engineering), Computer science, business.industry, Information sharing, Interoperability, computer.file_format, Ontology (information science), Article, Computer Science Applications, Workflow, Resource (project management), Hardware and Architecture, Executable, Software engineering, business, computer
Abstract: An emergency response process outlines the workflow of different activities that need to be performed in response to an emergency. Effective emergency response requires communication and coordination with the operational systems belonging to different collaborating organizations. Therefore, it is necessary to establish information sharing and system-level interoperability among the diverse operational systems. Unlike typical e-government processes that are well structured and have a well-defined outcome, emergency response processes are knowledge-centric and their workflow structure and execution may evolve as the incident unfolds. It is impractical to define static plans and response process workflows for every possible situation. Instead, a dynamic response should be adaptable to the changing situation. We present an integrated approach that facilitates the dynamic composition of an executable response process. The proposed approach employs ontology-based reasoning to determine the default actions and resource requirements for the given incident and to identify relevant response organizations based on their jurisdictional and mutual aid agreement rules. The Web service APIs of the identified response organizations are then used to generate an executable response process that evolves dynamically. The proposed approach is implemented and experimentally validated using an example scenario derived from the FEMA Hazardous Materials Tabletop Exercises Manual.
Published: 2023

6. Composing Web Services Using a Multi-Agent Framework

Author: Daniel Alencar da Costa, Ying Zou, and Yu Zhao
Subjects: Information Systems and Management, Syntax (programming languages), Computer Networks and Communications, Computer science, business.industry, media_common.quotation_subject, Control (management), computer.file_format, computer.software_genre, Payment, Computer Science Applications, Hardware and Architecture, Order (business), Code (cryptography), Business logic, Executable, Web service, Software engineering, business, computer, media_common
Abstract: Different web services can be composed to perform increasingly complex tasks (e.g., making an on-line payment). However, existing approaches compose web services with hard-coded control and data flows. To proactively and autonomously compose web services, developers can develop agents. However, the development of agents for service composition is complex, due to the reasons that: 1) developers may not have the knowledge from various domains to identify the necessary tasks to carry out the required web services; and 2) a deep understanding of the agent specific code is required in order to implement agents. To alleviate the required efforts to develop agents, we propose an approach to separate the development of agent specific code from the business logic code in the service composition. More specifically, we provide an easy-to-understand syntax that abstracts agent specific code and automatically generates executable agent code. Our experimental results show that our approach can accurately identify tasks for service composition with an Area Under the Curve (AUC) of 0.88. Our experiments also demonstrate that our approach can correctly generate agent code from seven agent specifications. Finally, our user studies reveal that developers are satisfied with our approach to develop agents for service composition.
Published: 2022

7. On a Consistency Testing Model and Strategy for Revealing RISC Processor’s Dark Instructions and Vulnerabilities

Author: Yuze Wang, Yingtao Jiang, Xiaohang Wang, Peng Liu, and Weidong Wang
Subjects: Reduced instruction set computing, Programming language, Computer science, Code coverage, computer.file_format, Space (commercial competition), computer.software_genre, Theoretical Computer Science, Test (assessment), Instruction set, Consistency (database systems), Computational Theory and Mathematics, Hardware and Architecture, Encoding (memory), Executable, Hardware_CONTROLSTRUCTURESANDMICROPROGRAMMING, computer, Software
Abstract: As the reduced instruction set computing (RISC) processors are widely used nowadays, to meet the requirement that no secret instructions be included in the processor ISA or implemented in the processor micro-architecture, a consistency testing approach capable of revealing any possible dark instructions (i.e., executable instructions without clear definitions) in RISC processors has been proposed and comes in three phases. During the generation phase, based on the instruction set encoding rules, all the undefined instructions are generated. Even with a smaller test space, this step guarantees the test coverage needed to reveal all possible dark instructions that exist. In the next phase, all the undefined instructions obtained from the previous phase are executed on the processor under test, following some persistence strategies; any instruction exhibiting usual execution result will be deemed suspicious and recorded so. During the last analysis phase, each of those recorded suspicious instructions will be checked and analyzed to decide whether it truly constitutes a dark instruction. We have applied the proposed testing model and strategy to several RISC processors and found that all of them have a few dark instructions previously unknown. The potential vulnerabilities introduced by these dark instructions have thus been evaluated and exposed.
Published: 2022

8. Generating Effective Software Obfuscation Sequences With Reinforcement Learning

Author: Dongpeng Xu, Huaijin Wang, Shuai Wang, Xiangyu Zhang, and Xiao Liu
Subjects: Reverse engineering, Source code, Programming language, Computer science, media_common.quotation_subject, ComputingMilieux_LEGALASPECTSOFCOMPUTING, computer.file_format, computer.software_genre, Identifier, Control flow, Obfuscation, Reinforcement learning, Instrumentation (computer programming), Executable, Electrical and Electronic Engineering, computer, media_common
Abstract: Obfuscation is a prevalent security technique which transforms syntactic representation of a program to a complicated form, but still keeps program semantics unchanged. So far, developers heavily rely on obfuscation to harden their products and reduce the risk of adversarial reverse engineering. However, despite its spectacular progress, one crucial hurdle is that each of existing obfuscation method is designed specifically for obfuscating one program feature (e.g., identifier name, control flow), so an effective obfuscation scheme usually composes a considerable amount of different obfuscation methods. Therefore, one primary challenge lies in identifying effective combinations of obfuscation methods. In this research, we propose a principled technique for generating an optimal program obfuscation scheme by adopting a reinforcement learning approach. Given a program and a set of obfuscation transformations, a reinforcement learning model is progressively trained to select a sequence of obfuscation transformations, such that applying each transformation in order towards the program yields the optimal obfuscation result, making programs dissimilar while retaining reasonable instrumentation overhead. Our implementation can directly work on raw binary executables without source code, and our evaluation demonstrates that the trained models can effectively obfuscate executable files with low cost.
Published: 2022

9. Through the Looking Glass: Automated Design Understanding of SystemC-Based VPs at the ESL

Author: Rolf Drechsler and Mehran Goli
Subjects: Electronic system-level design and verification, business.industry, Computer science, computer.file_format, Computer Graphics and Computer-Aided Design, Software, SystemC, Scalability, Transaction-level modeling, Design process, Executable, Electrical and Electronic Engineering, Software engineering, business, computer, computer.programming_language, TRACE (psycholinguistics)
Abstract: The emergence of Virtual Prototypes (VPs) at the Electronic System Level (ESL) has played a major role in modernizing the System-on-Chips (SoCs) design process to raise design productivity and reduce time-to-market. A VP is an abstract and executable software model implemented typically using SystemC and its Transaction Level Modeling (TLM) framework. However, this modern VP-based design process still has weaknesses, in particular, due to the significant manual effort involved for design understanding, analysis and modeling tasks which is both time consuming and error-prone. This paper introduces an automated and fast design understanding approach that enables designers to trace detailed information of the VPs’ structure and behavior. Experimental results including a real-world VP-based SoC show the advantages of our approach such as its accuracy, applicability, and scalability.
Published: 2022

10. Multi-stage complex task assignment in spatial crowdsourcing

Author: Ningbo Zhu, Xu Zhou, Liu Zhao, Kenli Li, Yunjun Gao, and Keqin Li
Subjects: Information Systems and Management, Theoretical computer science, Dependency (UML), Matching (graph theory), Computer science, business.industry, computer.file_format, Crowdsourcing, Computer Science Applications, Theoretical Computer Science, Task (project management), Set (abstract data type), symbols.namesake, Artificial Intelligence, Control and Systems Engineering, Nash equilibrium, symbols, Executable, Greedy algorithm, business, computer, Software
Abstract: With the widespread application of smart devices, spatial crowdsourcing (SC) has been extensively integrated into daily life. Task assignment is a crucial issue in SC and has attracted much attention. Most prior studies on task assignment ignore the importance of dependency among tasks, resulting in some ineffective matching pairs and wasting workers’ time. To this end, we formulate a new problem in SC, abbreviated as multi-stage complex task assignment (MSCTA), which aims to assign workers to multi-stage complex tasks to maximize the total profit. Compared with existing studies, MSCTA can obtain more effective assignments by considering the dependency constraints among tasks. We prove that the MSCTA problem is NP-hard and propose a greedy algorithm and a game algorithm. Specifically, both algorithms iteratively utilize a filtering module to obtain a set of executable tasks (ET) for assignment. The greedy algorithm can quickly assign the most profitable workers to the subtasks in each round of ET, and obtain a provable approximate result. The game algorithm is proved to be convergent and can win a Nash equilibrium when processing the subtasks in each round of ET. Extensive experimental results demonstrate the efficiency of our algorithm.
Published: 2022

11. DeepUMQA: ultrafast shape recognition-based protein model quality assessment using deep learning

Author: Sai-Sai Guo, Jun Liu, Xiao-Gen Zhou, and Gui-Jun Zhang
Subjects: Statistics and Probability, Source code, Computer science, business.industry, Deep learning, media_common.quotation_subject, computer.file_format, Protein structure prediction, computer.software_genre, Biochemistry, Computer Science Applications, Set (abstract data type), Computational Mathematics, Computational Theory and Mathematics, Component (UML), Feature (machine learning), Executable, Data mining, Artificial intelligence, business, computer, Molecular Biology, media_common, Complement (set theory)
Abstract: Motivation Protein model quality assessment is a key component of protein structure prediction. In recent research, the voxelization feature was used to characterize the local structural information of residues, but it may be insufficient for describing residue-level topological information. Design features that can further reflect residue-level topology when combined with deep learning methods are therefore crucial to improve the performance of model quality assessment. Results We developed a deep-learning method, DeepUMQA, based on Ultrafast Shape Recognition (USR) for the residue-level single-model quality assessment. In the framework of the deep residual neural network, the residue-level USR feature was introduced to describe the topological relationship between the residue and overall structure by calculating the first moment of a set of residue distance sets and then combined with 1D, 2D and voxelization features to assess the quality of the model. Experimental results on the CASP13, CASP14 test datasets and CAMEO blind test show that USR could supplement the voxelization features to comprehensively characterize residue structure information and significantly improve model assessment accuracy. The performance of DeepUMQA ranks among the top during the state-of-the-art single-model quality assessment methods, including ProQ2, ProQ3, ProQ3D, Ornate, VoroMQA, ProteinGCN, ResNetQA, QDeep, GraphQA, ModFOLD6, ModFOLD7, ModFOLD8, QMEAN3, QMEANDisCo3 and DeepAccNet. Availability and implementation The DeepUMQA server is freely available at http://zhanglab-bioinf.com/DeepUMQA/. Supplementary information Supplementary data are available at Bioinformatics online.
Published: 2022

12. Creating a Foundation for Next-Generation Autonomous Systems

Author: Assaf Marron, Joseph Sifakis, and David Harel
Subjects: Focus (computing), business.industry, Computer science, computer.internet_protocol, Foundation (evidence), Autonomous system (Internet), computer.file_format, Term (time), Trustworthiness, Deliverable, Hardware and Architecture, Executable, Electrical and Electronic Engineering, Software engineering, business, computer, Software
Abstract: The potential benefits of autonomous systems are obvious. However, there are still major issues to be dealt with before developing such systems becomes a commonplace engineering practice, with accepted and trustworthy deliverables. We argue that a solid, evolving, publicly available, community-controlled foundation for developing next-generation autonomous systems is a must, and term the desired foundation Autonomics. We focus on three main challenges: (i) how to specify autonomous system behavior in the face of unpredictability; (ii) how to carry out faithful analysis of system behavior with respect to rich environments that include humans, physical artifacts, and other systems; and (iii) how to build such systems by combining executable modeling techniques from software engineering with artificial intelligence and machine learning.
Published: 2022

13. A Model of Extraction of Rail’s Vertical Corrugation Based on Flexible Virtual Ruler

Author: Mianxiong Dong, Xun Shao, Yaonan Wang, Yun Teng, Ziji Ma, and Kehuang Xu
Subjects: business.product_category, Computer science, Mechanical Engineering, media_common.quotation_subject, Flatness (systems theory), Process (computing), computer.file_format, Degrees of freedom (mechanics), Computer Science Applications, Ruler, Sampling (signal processing), Automotive Engineering, Shaping, Quality (business), Executable, business, computer, Simulation, media_common
Abstract: Rail corrugation (RC) is one of the most important indicator to evaluate the quality of rail, which is used to describe the irregularity of rail surface, also known as rail's vertical flatness. However, the definition of RC is still an empirical description for low-speed measurement devices. In this paper, the process of RC measurement is divided into two steps, sampling and extraction, which helps the users to understand more clearly. Then a new mathematic model is proposed to make the process of RC's extraction be an executable operation for machine calculation. The model adopts a new concept of flexible virtual ruler to perform sliding filtering on an overall rail and extract the instantaneous RC with a successive approximation algorithm according to user's requirements and national standards. The proposed FVR model can not only be fully compatible with the traditional extraction method of RC, but also provide a new idea for evaluating RC's quality. Comparing with the current popular methods, the proposed model gives a meaningful strategy for rail maintenance with a complete mathematic description having more degrees of freedom. Experiment results demonstrate its validity and reliability for both indoor simulation and actual outdoor experiments.
Published: 2022

14. Code Synthesis for Dataflow-Based Embedded Software Design

Author: Wanli Chang, Yixiao Yang, Jiaguang Sun, Yu Jiang, Zhuo Su, Wen Li, Liming Fang, and Dongyan Wang
Subjects: Schedule, Source lines of code, Generator (computer programming), Java, Dataflow, Programming language, Computer science, computer.file_format, computer.software_genre, Computer Graphics and Computer-Aided Design, Code (cryptography), Code generation, Executable, Electrical and Electronic Engineering, computer, Software, computer.programming_language
Abstract: Model-driven methodology has been widely adopted in embedded software design, and Dataflow is a widely used computation model, with strong modeling and simulation ability supported in tools such as Ptolemy. However, its code synthesis support is quite limited, which restricts its applications in real industrial practice. In this paper, we focus on the automatic code synthesis of Dataflow, and implement , a code generator that could support most of the widely used modeling features such as expression type and boolean switch, more efficiently. First, we disassemble the Dataflow model into actors embedded in if-else or switch-case statements based on schedule analysis, which bridges the semantic gap between the code and the original Dataflow model. Then, we design well-designed templates for each actor, and synthesize well-structured executable C and Java codes with sequential code assembly. Compared to the existing C and Java code generators of Dataflow model in Ptolemy-II, and the C code generator in Simulink, the lines of code synthesized by are decreased by an average of , and , and the execution time of the synthesized code by is also decreased by an average of , and respectively.
Published: 2022

15. I♥LA

Author: Yong Li, Yotam Gingold, Shoaib Kamil, and Alec Jacobson
Subjects: Syntax (programming languages), Programming language, Semantics (computer science), Computer science, NumPy, 020207 software engineering, 02 engineering and technology, computer.file_format, Python (programming language), computer.software_genre, 01 natural sciences, Computer Graphics and Computer-Aided Design, 010101 applied mathematics, Linear algebra, 0202 electrical engineering, electronic engineering, information engineering, Compiler, Executable, 0101 mathematics, computer, Markdown, computer.programming_language
Abstract: Communicating linear algebra in written form is challenging: mathematicians must choose between writing in languages that produce well-formatted but semantically-underdefined representations such as LaTeX; or languages with well-defined semantics but notation unlike conventional math, such as C++/Eigen. In both cases, the underlying linear algebra is obfuscated by the requirements of esoteric language syntax (as in LaTeX) or awkward APIs due to language semantics (as in C++). The gap between representations results in communication challenges, including underspecified and irreproducible research results, difficulty teaching math concepts underlying complex numerical code, as well as repeated, redundant, and error-prone translations from communicated linear algebra to executable code. We introduce I$\heartsuit$LA, a language with syntax designed to closely mimic conventionally-written linear algebra, while still ensuring an unambiguous, compilable interpretation. Inspired by Markdown, a language for writing naturally-structured plain text files that translate into valid HTML, I$\heartsuit$LA allows users to write linear algebra in text form and compile the same source into LaTeX, C++/Eigen, Python/NumPy/SciPy, and MATLAB, with easy extension to further math programming environments. We outline the principles of our language design and highlight design decisions that balance between readability and precise semantics, and demonstrate through case studies the ability for I$\heartsuit$LA to bridge the semantic gap between conventionally-written linear algebra and unambiguous interpretation in math programming environments.
Published: 2021

16. Formal Verification of a Trusted Execution Environment-Based Architecture for IoT Applications

Author: Angelo Perkusich, Dalton Cézane Gomes Valadares, Kyller Costa Gorgônio, and Alvaro Sobrinho
Subjects: Authentication, Computer Networks and Communications, business.industry, Computer science, Cloud computing, computer.file_format, Petri net, Computer security, computer.software_genre, Encryption, Computer Science Applications, Hardware and Architecture, Server, Signal Processing, Key (cryptography), Executable, business, Formal verification, computer, Information Systems
Abstract: The Internet-of-Things (IoT) scenarios commonly present security and privacy concerns, either due to the processing constraints of devices or the employment of external servers to process and store data, for instance, in cloud-based IoT applications. In this sense, to protect data and decrease user distrust in external entities, security technologies are of utmost importance. Trusted execution environments (TEEs), which process data in an isolated and protected region of memory, are among these technologies. We focus on a trusted architecture solution based on the use of TEEs and the application of authentication, authorization, and encryption mechanisms to protect data in IoT applications. We specified the trusted IoT architecture (TIoTA) using hierarchical colored Petri nets, and performed simulations and model checking of key security properties related to desired and prohibited behaviors, enabling model-based testing. This article enhances the state of the art by providing project artifacts (e.g., executable and parametric models) for correctly implementing the TIoTA and by presenting evidence that the usage of the architecture can improve security and privacy.
Published: 2021

17. Using Bayesian optimization algorithm for model-based integration testing

Author: Erik Cuevas, Somayeh Mohammady, and Vahid Rafe
Subjects: Model checking, Graph rewriting, Integration testing, Computer science, computer.file_format, Theoretical Computer Science, Test case, Test suite, Redundancy (engineering), State space, Geometry and Topology, Executable, computer, Algorithm, Software
Abstract: Model-based testing is an automated process in which executable tests are derived from behavioral models of a system. Model checking is a verification technique to reveal errors in which all reachable states of a system can be generated as state space. In the literature, different approaches suggest using model checkers for model-based testing. Model checker explores all possible system states, so utilizing the various paths in the state-space as test cases seems a promising solution. However, these approaches suffer from two main challenges. The first challenge is state space explosion, which prevents generating all reachable states by the model checker. The second one is generating redundant test cases. Recently, several methods using meta-heuristic and evolutionary approaches have been proposed to cope with these problems. Therefore, exploring a portion of state space using an optimization approach to detect the test objectives can be a proper way to manage the state space explosion and generate an optimal test suite with the least redundancy. In this paper, a method is proposed using a Bayesian optimization algorithm (BOA), and a model checker is as a bed to generate test cases for the service-oriented systems. In the proposed approach, the test suite is a set of paths on the state space starting from an initial state and leading to the state in which all the test objectives are satisfied. In this research, we have implemented BOA with three different structures in GROOVE toolset, an open-source toolset for designing and model checking graph transformation. Experimental results show that our solution generates better results in terms of coverage and speed in different case studies than the existing approaches.
Published: 2021

18. Emulating complex simulations by machine learning methods

Author: Paola Stolfi and Filippo Castiglione
Subjects: Workstation, Mean squared error, Computer science, QH301-705.5, Computer applications to medicine. Medical informatics, R858-859.7, Machine learning, computer.software_genre, simulatione, Biochemistry, law.invention, Structural Biology, law, Humans, Emulation, Biology (General), Molecular Biology, Emulazione, Type-2 diabetes, business.industry, Applied Mathematics, Research, Work (physics), modello matematico, Construct (python library), computer.file_format, Risk prediction, Computer Science Applications, machine learning, Diabetes Mellitus, Type 2, Computational modelling, Self-assessment, Ordinary differential equation, Artificial intelligence, Executable, business, computer, Mobile device, Algorithms
Abstract: Background The aim of the present paper is to construct an emulator of a complex biological system simulator using a machine learning approach. More specifically, the simulator is a patient-specific model that integrates metabolic, nutritional, and lifestyle data to predict the metabolic and inflammatory processes underlying the development of type-2 diabetes in absence of familiarity. Given the very high incidence of type-2 diabetes, the implementation of this predictive model on mobile devices could provide a useful instrument to assess the risk of the disease for aware individuals. The high computational cost of the developed model, being a mixture of agent-based and ordinary differential equations and providing a dynamic multivariate output, makes the simulator executable only on powerful workstations but not on mobile devices. Hence the need to implement an emulator with a reduced computational cost that can be executed on mobile devices to provide real-time self-monitoring. Results Similarly to our previous work, we propose an emulator based on a machine learning algorithm but here we consider a different approach which turn out to have better performances, indeed in terms of root mean square error we have an improvement of two order magnitude. We tested the proposed emulator on samples containing different number of simulated trajectories, and it turned out that the fitted trajectories are able to predict with high accuracy the entire dynamics of the simulator output variables. We apply the emulator to control the level of inflammation while leveraging on the nutritional input. Conclusion The proposed emulator can be implemented and executed on mobile health devices to perform quick-and-easy self-monitoring assessments.
Published: 2021

19. Python Programming in PyPI for Translational Medicine

Author: Yoshiyasu Takefuji
Subjects: PyPI package, Python program, business.industry, Computer science, Big data, Computer applications to medicine. Medical informatics, R858-859.7, computer.file_format, Python (programming language), Upload, Software, translational medicine, Index (publishing), Code (cryptography), dataset, The Internet, Executable, Software engineering, business, computer, computer.programming_language
Abstract: This is the world’s first tutorial article on Python Packaging for beginners and practitioners for translational medicine or medicine in general. This tutorial will allow researchers to demonstrate and showcase their tools on PyPI packages around the world. Nowadays, for translational medicine, researchers need to deal with big data. This paper describes how to build an executable Python Package Index (PyPI) code and package. PyPI is a repository of software for the Python programming language with 5,019,737 files and 544,359 users (programmers) as of 19 October 2021. First, programmers must understand how to scrape a dataset over the Internet; second, they must read the dataset file in csv format; third, build a program to compute the target values; fourth, convert the Python program to the PyPI package.; and fifth, upload the PyPI package. This paper depicts a covidlag executable package as an example for calculating the accurate case fatality rate (CFR) and the lag time from infection to death. You can install the covidlag by pip terminal command and test it. This paper also introduces deathdaily and scorecovid packages on PyPI Stats, which can inform how many users have downloaded the specified PyPI package. The usefulness and applicability of a developed tool can be verified by PyPI Stats with the number of downloaded users.
Published: 2021

20. Prediction of antimicrobial peptides toxicity based on their physico-chemical properties using machine learning techniques

Author: Bagher BabaAli, Mohammad Hossein Karimi-Jafari, Hossein Khabbaz, and Ali Akbar Saboury
Subjects: Pore Forming Cytotoxic Proteins, Computer science, QH301-705.5, Antimicrobial peptides, Computer applications to medicine. Medical informatics, R858-859.7, Feature selection, Machine learning, computer.software_genre, Biochemistry, Physico-chemical properties, Structural Biology, Animals, Peptide toxicity, Biology (General), Molecular Biology, Low toxicity, business.industry, Applied Mathematics, Research, Drug Resistance, Microbial, computer.file_format, Computer Science Applications, Toxicity, Artificial intelligence, Executable, F1 score, business, Peptides, computer, Hybrid model
Abstract: Background Antimicrobial peptides are promising tools to fight against ever-growing antibiotic resistance. However, despite many advantages, their toxicity to mammalian cells is a critical obstacle in clinical application and needs to be addressed. Results In this study, by using an up-to-date dataset, a machine learning model has been trained successfully to predict the toxicity of antimicrobial peptides. The comprehensive set of features of both physico-chemical and linguistic-based with local and global essences have undergone feature selection to identify key properties behind toxicity of antimicrobial peptides. After feature selection, the hybrid model showed the best performance with a recall of 0. 876 and a F1 score of 0. 849. Conclusions The obtained model can be useful in extracting AMPs with low toxicity from AMP libraries in clinical applications. On the other hand, several properties with local nature including positions of strand forming and hydrophobic residues in final selected features show that these properties are critical definer of peptide properties and should be considered in developing models for activity prediction of peptides. The executable code is available at https://git.io/JRZaT.
Published: 2021

21. VESPA: static profiling for binary optimization

Author: Fernando Magno Quintão Pereira, Guilherme Ottoni, and Angélica Aparecida Moreira
Subjects: Profiling (computer programming), Binary optimization, Computer science, business.industry, Propeller, Binary number, Context (language use), computer.file_format, computer.software_genre, Software, Computer engineering, Compiler, Executable, Safety, Risk, Reliability and Quality, business, computer
Abstract: Over the past few years, there has been a surge in the popularity of binary optimizers such as BOLT, Propeller, Janus and HALO. These tools use dynamic profiling information to make optimization decisions. Although effective, gathering runtime data presents developers with inconveniences such as unrepresentative inputs, the need to accommodate software modifications, and longer build times. In this paper, we revisit the static profiling technique proposed by Calder et al. in the late 90’s, and investigate its application to drive binary optimizations, in the context of the BOLT binary optimizer, as a replacement for dynamic profiling. A few core modifications to Calder et al.’s original proposal, consisting of new program features and a new regression model, are sufficient to enable some of the gains obtained through runtime profiling. An evaluation of BOLT powered by our static profiler on four large benchmarks (clang, GCC, MySQL and PostgreSQL) yields binaries that are 5.47 % faster than the executables produced by clang -O3.
Published: 2021

22. Enabling Collaborative Data Science Development with the Ballet Framework

Author: Kalyan Veeramachaneni, Micah J. Smith, Kelvin Lu, and Jürgen Cito
Subjects: FOS: Computer and information sciences, Feature engineering, Computer Science - Machine Learning, Computer Networks and Communications, Computer science, Computer Science - Human-Computer Interaction, Cloud computing, 02 engineering and technology, Machine Learning (cs.LG), Human-Computer Interaction (cs.HC), Computer Science - Software Engineering, Software, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Software system, business.industry, Software development, 020207 software engineering, computer.file_format, Data science, Software Engineering (cs.SE), Human-Computer Interaction, Conceptual framework, Programming paradigm, Executable, business, computer, Social Sciences (miscellaneous)
Abstract: While the open-source software development model has led to successful large-scale collaborations in building software systems, data science projects are frequently developed by individuals or small teams. We describe challenges to scaling data science collaborations and present a conceptual framework and ML programming model to address them. We instantiate these ideas in Ballet, the first lightweight framework for collaborative, open-source data science through a focus on feature engineering, and an accompanying cloud-based development environment. Using our framework, collaborators incrementally propose feature definitions to a repository which are each subjected to software and ML performance validation and can be automatically merged into an executable feature engineering pipeline. We leverage Ballet to conduct a case study analysis of an income prediction problem with 27 collaborators, and discuss implications for future designers of collaborative projects.
Published: 2021

23. A Statistically Based Methodology to Estimate the Probability of Encountering Rock Blocks When Tunneling in Heterogeneous Ground

Author: Maria Lia Napoli, Roberto Fontana, and Monica Barbero
Subjects: Mining engineering. Metallurgy, Computer science, TN1-997, Excavation, computer.file_format, cutterhead design, executable code, Matrix (geology), heterogeneous ground, tunneling, Intersection, block-in-matrix, statistical simulation, Block (programming), Face (geometry), Code (cryptography), Executable, MATLAB, Algorithm, computer, computer.programming_language
Abstract: Strong rock blocks embedded in a weaker soil matrix are found in many geological units. When tunneling in ground containing cobbles and boulders, extremely challenging conditions can be encountered. Such inconveniences may be avoided by means of appropriate tunneling methods and cutterhead designs, which require the content, frequency, and size of rock blocks to be predicted as accurately as possible. Several approaches have been developed to estimate the block fraction of heterogeneous geomaterials for excavation. However, the estimation of cobble–boulder quantities both all along the tunnel and only partially embedded within the tunnel face remains a critical issue. This study develops a methodology for the estimation of the probability of encountering blocks partially or totally contained within the tunnel excavation area, wherein the area of intersection with the tunnel face is greater than the given critical values. For this purpose, a statistical approach has been implemented in a Matlab routine. The potential of this code is that it provides extremely useful and statistically based information that can be used for making a more rational choice regarding tunneling technique and in terms of designing a suitable cutterhead in order to avoid technical problems during tunnel excavations in heterogeneous ground. The executable code is provided.
Published: 2021

24. A program for the fitting of up to three Havriliak-Negami dispersions to dielectric data

Author: Constantino Grosse
Subjects: Computer science, Interface (computing), 02 engineering and technology, Dielectric, 010402 general chemistry, 01 natural sciences, Biomaterials, symbols.namesake, Superposition principle, Colloid and Surface Chemistry, Graphical user interface, Debye, business.industry, computer.file_format, 021001 nanoscience & nanotechnology, 0104 chemical sciences, Surfaces, Coatings and Films, Electronic, Optical and Magnetic Materials, Microsoft Windows, symbols, Executable, 0210 nano-technology, business, Algorithm, computer, Cole–Cole equation
Abstract: A new version of the DielParamFit program [J. Colloid Interface Sci. 419 (2014) 102-106] for the fitting of a superposition of dispersion terms to measured dielectric data is presented. It extends its applicability to a wide range of new systems by allowing the use of up to three Havriliak-Negami dispersions, each of which can be readily transformed into Debye, Cole-Cole, or Cole-Davidson terms. Moreover, it greatly enhances its usability by means of a graphic interface that displays the measured data together with the model spectra allowing to iteratively adjust the chosen terms and guess parameter values guided by a live view of the obtained results. The DielParamFit_2 program executable file for Microsoft Windows is available upon request from the author.
Published: 2021

25. Enseñanza y Aprendizaje de Robótica Industrial desde la Virtualidad

Author: Wilmer Sanz-Fernández
Subjects: business.industry, Computer science, General Medicine, computer.file_format, Mechatronics, computer.software_genre, Unimate, Toolbox, Software, Scripting language, Robot, GNU Octave, Executable, Software engineering, business, computer, computer.programming_language
Abstract: Junto al primer robot manipulador, el UNIMATE de Devol y Engelberger, nació la preocupación de investigadores e ingenieros por demostrar el potencial de autómatas similares en aplicaciones industriales. En consecuencia muchas prestigiosas universidades percibieron cuán importante era incluir asignaturas del área en planes de estudios de carreras como Ingeniería Eléctrica, Electrónica, Mecánica o Mecatrónica. La presente investigación fue bajo el método constructivista y el análisis de resultados se apoya en la respuesta observada en un simulador mediante el uso combinado de una Toolbox para modelación cinemática de robots (basada en scripts ejecutables en un entorno como GNU Octave o similar), además de un software para simulación 3D. La ejemplificación del uso de las herramientas y el reto al estudiante de programar RI virtuales en escenarios típicos de manufactura, empaquetado o paletizado, ha sido probado a lo largo de cuatro períodos lectivos, obteniéndose una realimentación positiva de los aprobados y egresados. Los resultados alcanzados al simular demuestran la exactitud de los modelos cinemáticos obtenidos mediante formulaciones matemáticas, lo cual evidencia la utilidad de las herramientas descritas para el aprendizaje a distancia basado en la práctica.
Published: 2021

26. RAP: A Software Framework of Developing Convolutional Neural Networks for Resource-constrained Devices Using Environmental Monitoring as a Case Study

Author: Chia-Heng Tu, Hsiao-Hsuan Chang, and Qihui Sun
Subjects: Control and Optimization, Artificial neural network, Computer Networks and Communications, Computer science, business.industry, Deep learning, media_common.quotation_subject, Cyber-physical system, computer.file_format, Python (programming language), computer.software_genre, Convolutional neural network, Human-Computer Interaction, Software framework, Computer architecture, Debugging, Artificial Intelligence, Hardware and Architecture, Artificial intelligence, Executable, business, computer, media_common, computer.programming_language
Abstract: Monitoring environmental conditions is an important application of cyber-physical systems. Typically, the monitoring is to perceive surrounding environments with battery-powered, tiny devices deployed in the field. While deep learning-based methods, especially the convolutional neural networks (CNNs), are promising approaches to enriching the functionalities offered by the tiny devices, they demand more computation and memory resources, which makes these methods difficult to be adopted on such devices. In this article, we develop a software framework, RAP , that permits the construction of the CNN designs by aggregating the existing, lightweight CNN layers, which are able to fit in the limited memory (e.g., several KBs of SRAM) on the resource-constrained devices satisfying application-specific timing constrains. RAP leverages the Python-based neural network framework Chainer to build the CNNs by mounting the C/C++ implementations of the lightweight layers, trains the built CNN models as the ordinary model-training procedure in Chainer, and generates the C version codes of the trained models. The generated programs are compiled into target machine executables for the on-device inferences. With the vigorous development of lightweight CNNs, such as binarized neural networks with binary weights and activations, RAP facilitates the model building process for the resource-constrained devices by allowing them to alter, debug, and evaluate the CNN designs over the C/C++ implementation of the lightweight CNN layers. We have prototyped the RAP framework and built two environmental monitoring applications for protecting endangered species using image- and acoustic-based monitoring methods. Our results show that the built model consumes less than 0.5 KB of SRAM for buffering the runtime data required by the model inference while achieving up to 93% of accuracy for the acoustic monitoring with less than one second of inference time on the TI 16-bit microcontroller platform.
Published: 2021

27. Data Pattern Aware Reliability Enhancement Scheme for 3D Solid-State Drives

Author: Weiguo Wu, Chi Zhang, and Shiqiang Nie
Subjects: Computer science, Reliability (computer networking), Latency (audio), NAND gate, Parallel computing, computer.file_format, Program optimization, Hardware and Architecture, Overhead (computing), Executable, State (computer science), Performance improvement, computer, Software
Abstract: 3D charge-trap (CT) NAND flash-based SSD has been used widely for its large capacity, low cost per bit, and high endurance. One-shot program (OSP) scheme, as a variation of incremental step pulse programming (ISPP) scheme, has been employed to program data for CT flash, whose program unit is the Word-Line (WL) instead of the page. The existing program optimization schemes either make trade-offs among program latency and reliability by adjusting the program step voltage on demand; or remap the most error-prone cell states to others by re-encoding programmed data. However, the data pattern, which represents the ratio of 1s in data values, has not been thoroughly studied. In this paper, we observe that most small files do not contain uniform 1s and 0s among these common file types (i.e., image, audio, text, executable file), leading to programming WL cells in different states unevenly. Some cell states dominate over the WL, while others are not. Based on this observation, we propose a flexible reliability enhancement scheme based on the OSP scheme. This scheme programs the cells into different states with varied , i.e., these cells in one state, whose number is the largest in one WL, are programmed with a fine-grained (namely slow write). In contrast, the minority are programmed with a coarse-grained (namely fast write). So the reliability is improved due to averaging the major enhanced cells with the minor degraded cells without program latency overhead. A series of experiments have been conducted, and the results indicate that the proposed scheme achieves 34% read performance improvement and 16% lifetime elongation on average.
Published: 2021

28. Selective Sharing of Outsourced Encrypted Data in Cloud Environments

Author: Jianbin Gao, Emmanuel Boateng Sifah, Kwame Opuni-Boachie Obour Agyekum, Christian Nii Aflah Cobblah, Hu Xia, Qi Xia, and Kingsley Nketia Acheampong
Subjects: Computer Networks and Communications, Computer science, business.industry, Cryptography, Cloud computing, Access control, computer.file_format, Computer security, computer.software_genre, Encryption, Computer Science Applications, Data access, Hardware and Architecture, Server, Signal Processing, Data Protection Act 1998, Executable, business, computer, Information Systems
Abstract: Owing to the vast volume of information gathered by computers, data protection and security has become a problem for organizations with the enormous rise in data transmission. Due to many advantages that cloud service providers provide, mainly economic benefits, several data owners outsource their data to cloud repositories. However, data owners do not have full ownership of the data after their data are outsourced. Thus, external data management systems are implemented to manage the data. Several kinds of research refer to the use of encryption techniques to prevent unauthorized access to data. Selective encryption aims at supporting selective and private access to outsourced data. However, the combination of this approach and indexing techniques cause confidentiality violations. In this article, a blockchain-based approach to data access is presented by implementing smart contracts over data access. These executable scripts bind users by stating access policies on the data. Furthermore, we provide a system, where users can offload their computational capabilities due to limitations in their computations. Our systems’ computational capabilities outperform that of when users do the computation on their own. The results show a practical approach to data access management using blockchain technology.
Published: 2021

29. MINAD: Multi-inputs Neural Network based on Application Structure for Android Malware Detection

Author: Thang T. Nguyen, Duc Van Nguyen, Anh H. Ngo, Giang L. Nguyen, and Giang T. Pham
Subjects: Artificial neural network, Computer Networks and Communications, business.industry, Computer science, Payload (computing), Stability (learning theory), computer.file_format, computer.software_genre, Machine learning, Encryption, Feature (computer vision), Malware, Artificial intelligence, Executable, Android (operating system), business, computer, Software
Abstract: With the proliferation of smartphone demand, the number of malicious applications has increased exponentially with about tens of thousands per month. Among smartphone platforms, the Android operating system with high popularity has become the most target by malware. By some techniques such as employing polymorphic or encrypting payload, signature-based scanning is easily bypassed. With the support from some useful tools and sandboxes recently, the Android applications could be easy to decoded and tracked the executable behavior. It leads machine learning methods to have potential benefits to classify the malware. However, how to define the suitable model with competent features and avoid over-fitting in learning models become other challenges for researchers. In this paper, we propose MINAD (Multi-Inputs Neural network based on application structure for Android malware Detection) method. First, we collect the features of an Android application based on many aspects, and then those features are grouped into three categories: System-based, Library-based, and User-based corresponding the parts of Android application structure which are related with Android system definition, library, users’ definitions. Second, each group is reconstructed to have effective feature sets. At last, a multi-input deep neural network is designed with two phases to learn the abstract of each feature group before making the final decision for malware detection. Our performances are evaluated in various samples which are collected from Google Play Store, the Drebin, and AMD Datasets with more than 155,000 samples. The results show that the MINAD method does not only improve Android malware detection’s accuracy in comparison with other methods but also improves the stability of the model and reduces the computation costs.
Published: 2021

30. Virtual Prototyping a Production Line Using Assume–Guarantee Contracts

Author: Roberta Chirico, Stefano Spellini, Franco Fummi, Michele Lora, and Marco Panato
Subjects: Production line, Computer science, Simulation and Validation, 02 engineering and technology, Design Automation, computer.software_genre, 0202 electrical engineering, electronic engineering, information engineering, Production (economics), Advanced manufacturing, Advanced Manufacturing, Electrical and Electronic Engineering, Virtual Prototyping, business.industry, 020208 electrical & electronic engineering, computer.file_format, Computer Science Applications, Simulation software, Control and Systems Engineering, Advanced Manufacturing, Design Automation, Simulation and Validation, Virtual Prototyping, Robot, Electronic design automation, Executable, Software engineering, business, computer, Information Systems, Virtual prototyping
Abstract: This article presents a methodology to formalize the behavior of the machines composing a production line, and to automatically generate their virtual prototypes for efficient and correct plant simulation. The approach exploits assume-guarantee reasoning through contracts to model the interaction between the different components of a production line. The approach is guided by a well-known taxonomy of industrial machines and associated manufacturing processes to identify each elementary action related to a specific machine. Contracts enable to build executable models of all the machines available in the production line by using automatic synthesis. The generated models can be integrated into a state-of-the-practice industrial plant simulation software to estimate and validate the production line's behavior. The presentation of the methodology is supported by a running example based on a real production line, showing the step-by-step application of the approach to a concrete scenario.
Published: 2021

31. Custom workflows to improve joint variant calling from multiple related tumour samples: FreeBayesSomatic and Strelka2Pass

Author: Sarah-Jane Dawson, Benjamin Solomon, Dineika Chandrananda, Sebastian Hollizeck, and Stephen Q. Wong
Subjects: Statistics and Probability, Supplementary data, Source code, Computer science, Programming language, media_common.quotation_subject, Sequencing data, Sample (statistics), computer.file_format, computer.software_genre, Biochemistry, Computer Science Applications, Computational Mathematics, Workflow, Computational Theory and Mathematics, Executable, Joint (audio engineering), Precision and recall, Molecular Biology, computer, media_common
Abstract: Summary This work describes two novel workflows for variant calling that extend the widely used algorithms of Strelka2 and FreeBayes to call somatic mutations from multiple related tumour samples and one matched normal sample. We show that these workflows offer higher precision and recall than their single tumour-normal pair equivalents in both simulated and clinical sequencing data. Availability and implementation Source code freely available at the following link: https://atlassian.petermac.org.au/bitbucket/projects/DAW/repos/multisamplevariantcalling and executable through Janis (https://github.com/PMCC-BioinformaticsCore/janis) under the GPLv3 licence. Supplementary information Supplementary data are available at Bioinformatics online.
Published: 2021

32. TA-SPESC: Toward Asset-Driven Smart Contract Language Supporting Ownership Transaction and Rule-Based Generation on Blockchain

Author: William C. Chu, Weijing Song, Di Wang, Di Ma, and Yan Zhu
Subjects: 021103 operations research, Smart contract, Computer science, business.industry, 0211 other engineering and technologies, Rule-based system, 02 engineering and technology, computer.file_format, Term (time), Renting, Risk analysis (engineering), Solidity, Executable, Asset (economics), Electrical and Electronic Engineering, Safety, Risk, Reliability and Quality, business, computer, Database transaction
Abstract: Aiming at insufficient situation to express and operate assets in smart contracts, in this article we attempt to add a new asset model into smart contract language (such as SPESC) through combing method of asset's expressions and transactions in real-world contracts. Moreover, a translation mechanism can be set up to accomplish a conversion from the asset model to an executable contract program. On this basis, we propose a new language design toward asset-driven specific smart contracts, called TA-SPESC. This language complies with the structure of real-world contracts and supports a formal definition composed of four modules: Party, asset, term, and contract attribute. This asset model on it can be used to define various types of rights (including the right of ownership, use, possession, usufruct, and disposition of assets), as well as five asset operations (including asset registration, deposit, withdrawal, transfer, and cancellation) to effectively support asset transaction. More important, a series of generation rules are proposed to translate the TA-SPESC contract to an executable contract program. Moreover, taking house rental contract as an example, we provide a TA-SPESC instance and its specific description of translation process according to the generation rules, which supports a semiautomatic generation to executable programs. Finally, the Solidity codes derived from TA-SPESC contracts are run and tested, and the experiment and comparison results indicate that TA-SPESC contracts have high abstraction and low complexity, as well as versatility and convenience of asset transaction, which lead to more reliable software with less errors and fewer misunderstanding.
Published: 2021

33. Modular, compositional, and executable formal semantics for LLVM IR

Author: Vadim Zaliva, Calvin Beck, Irene Yoon, Yannick Zakowski, Steve Zdancewic, Ilia Zaichuk, CASH - Compilation and Analysis, Software and Hardware (CASH), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire de l'Informatique du Parallélisme (LIP), École normale supérieure de Lyon (ENS de Lyon)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Université de Lyon-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure de Lyon (ENS de Lyon)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Université de Lyon-Centre National de la Recherche Scientifique (CNRS), University of Pennsylvania, and Department of Computer and Information Science [Pennsylvania] (CIS)
Subjects: Monads, Verified Compilation, Correctness, Semantics (computer science), Computer science, Formal semantics (linguistics), 0102 computer and information sciences, 02 engineering and technology, computer.software_genre, Semantic data model, 01 natural sciences, Operational semantics, Software and its engineering, 0202 electrical engineering, electronic engineering, information engineering, Coq, Safety, Risk, Reliability and Quality, Theory of computation, Denotational semantics, Bisimulation, [INFO.INFO-PL]Computer Science [cs]/Programming Languages [cs.PL], Interpretation (logic), Programming language, 020207 software engineering, computer.file_format, Semantics, Compilers, 010201 computation theory & mathematics, Program verification, LLVM, TheoryofComputation_LOGICSANDMEANINGSOFPROGRAMS, Executable, computer, Software
Abstract: International audience; This paper presents a novel formal semantics, mechanized in Coq, for a large, sequential subset of the LLVM IR. In contrast to previous approaches, which use relationally-specified operational semantics, this new semantics is based on monadic interpretation of interaction trees, a structure that provides a more compositional approach to defining language semantics while retaining the ability to extract an executable interpreter. Our semantics handles many of the LLVM IR's non-trivial language features and is constructed modularly in terms of event handlers, including those that deal with nondeterminism in the specification. We show how this semantics admits compositional reasoning principles derived from the interaction trees equational theory of weak bisimulation, which we extend here to better deal with nondeterminism, and we use them to prove that the extracted reference interpreter faithfully refines the semantic model. We validate the correctness of the semantics by evaluating it on unit tests and LLVM IR programs generated by HELIX.
Published: 2021

34. Knowledge-driven framework for industrial robotic systems

Author: Munir Merdan, Markus Vincze, Timon Hoebert, and Wilfried Lepuschitz
Subjects: Flexibility (engineering), business.industry, Computer science, Context (language use), Robotics, computer.file_format, Ontology (information science), Industrial and Manufacturing Engineering, Domain (software engineering), Artificial Intelligence, Robot, Executable, Artificial intelligence, Software engineering, business, Batch production, computer, Software
Abstract: Due to their advantages, there is an increase of applying robotic systems for small batch production as well as for complex manufacturing processes. However, programming and configuring robots is time and resource consuming while being also accompanied by high costs that are especially challenging for small- and medium-sized enterprises. The current way of programming industrial robots by using teach-in control devices and/or using vendor-specific programming languages is in general a complex activity that requires extensive knowledge in the robotics domain. It is therefore important to offer new practical methods for the programming of industrial robots that provide flexibility and versatility in order to achieve feasible robotics solutions for small lot size productions. This paper focuses on the development of a knowledge-driven framework, which should overcome the limitations of state-of-the-art robotics solutions and enhance the agility and autonomy of industrial robotics systems using ontologies as a knowledge-source. The framework includes reasoning and perception abilities as well as the ability to generate plans, select appropriate actions, and finally execute these actions. In this context, a challenge is the fusion of vision system information with the decision-making component, which can use this information for generating the assembly tasks and executable programs. The introduced product model in the form of an ontology enables that the framework can semantically link perception data to product models to consequently derive handling operations and required tools. Besides, the framework enables an easier adaption of robot-based production systems for individualized production, which requires swift configuration and efficient planning. The presented approach is demonstrated in a laboratory environment with an industrial pilot test case. Our application shows the potential to reduce the efforts needed to program robots in an automated production environment. In this context, the benefits as well as shortcomings of the approach are also discussed in the paper.
Published: 2021

35. EasyNanopore: A Ready-to-Use Processing Software for Translocation Events in Nanopore Translocation Experiments

Author: Jing Tu, Guohao Xi, Linlin Wu, Zuhong Lu, Hao Meng, and Jiye Fu
Subjects: Computer science, business.industry, Event (computing), Real-time computing, Process (computing), Surfaces and Interfaces, computer.file_format, Processing, Condensed Matter Physics, Nanopores, Nanopore, Software, Installation, Electrochemistry, General Materials Science, Executable, business, computer, Spectroscopy, computer.programming_language, Graphical user interface
Abstract: We developed EasyNanopore which is a ready-to-use software to select the events of a nanopore molecular translocation experiment. The software is released as an executable file with a graphical user interface and provides several versions suitable for different operating systems without installing any running environment to execute it. We use the adaptive threshold which adapts to the low-frequency variation of the baseline to detect events and uses a multiprocess method to accelerate the process of event detection. After the event is identified, its duration and amplitude information will be extracted and a resulting txt file will be generated for further analysis. Our software runs fast and can effectively extract the data from data of large-scale nanopore molecular translocation experiments.
Published: 2021

36. Efficient Motion Planning Based on Kinodynamic Model for Quadruped Robots Following Persons in Confined Spaces

Author: Yong Liu, Zhen Zhang, Xin Kong, Guangyao Zhai, and Jiaqing Yan
Subjects: Robot kinematics, Computer science, Terrain, computer.file_format, Collision, Computer Science Applications, Computer Science::Robotics, Control and Systems Engineering, Robustness (computer science), Control theory, Trajectory, Robot, Motion planning, Executable, Electrical and Electronic Engineering, computer
Abstract: Quadruped robots have superior terrain adaptability and flexible movement capabilities than traditional robots. In this article, we innovatively apply it in person-following tasks, and propose an efficient motion planning scheme for quadruped robots to generate a flexible and effective trajectory in confined spaces. The method builds a real-time local costmap via onboard sensors, which involves both static and dynamic obstacles. And we exploit a simplified kinodynamic model and formulate the friction pyramids formed by ground reaction forces’ inequality constraints to ensure the executable of the optimized trajectory. In addition, we obtain the optimal following trajectory in the costmap completely based on the robot's rectangular footprint description, which ensures that it can walk through the narrow spaces avoiding collision. Finally, a receding horizon control strategy is employed to improve the robustness of motion in complex environments. The proposed motion planning framework is integrated on the quadruped robot JueYing and tested in simulation as well as real scenarios. It shows that the execution success rates in various scenes are all over 90%.
Published: 2021

37. An executable framework for modeling and validating cooperative capability requirements in emergency response system

Author: Yu Minggang, He Hongyue, Chai Lei, Wang Zhixue, and He Ming
Subjects: Emergency response, Computer science, business.industry, Executable, computer.file_format, Software engineering, business, computer
Published: 2021

38. Large-scale and Robust Code Authorship Identification with Deep Feature Learning

Author: Tamer AbuHmed, David Mohaisen, DaeHun Nyang, and Mohammed Abuhamad
Subjects: Source code, General Computer Science, Java, business.industry, Programming language, Computer science, media_common.quotation_subject, Deep learning, 020207 software engineering, 02 engineering and technology, computer.file_format, Python (programming language), computer.software_genre, Toolchain, Software, Obfuscation, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, Executable, Safety, Risk, Reliability and Quality, business, computer, media_common, computer.programming_language
Abstract: Successful software authorship de-anonymization has both software forensics applications and privacy implications. However, the process requires an efficient extraction of authorship attributes. The extraction of such attributes is very challenging, due to various software code formats from executable binaries with different toolchain provenance to source code with different programming languages. Moreover, the quality of attributes is bounded by the availability of software samples to a certain number of samples per author and a specific size for software samples. To this end, this work proposes a deep Learning-based approach for software authorship attribution, that facilitates large-scale, format-independent, language-oblivious, and obfuscation-resilient software authorship identification. This proposed approach incorporates the process of learning deep authorship attribution using a recurrent neural network, and ensemble random forest classifier for scalability to de-anonymize programmers. Comprehensive experiments are conducted to evaluate the proposed approach over the entire Google Code Jam (GCJ) dataset across all years (from 2008 to 2016) and over real-world code samples from 1,987 public repositories on GitHub. The results of our work show high accuracy despite requiring a smaller number of samples per author. Experimenting with source-code, our approach allows us to identify 8,903 GCJ authors, the largest-scale dataset used by far, with an accuracy of 92.3%. Using the real-world dataset, we achieved an identification accuracy of 94.38% for 745 C programmers on GitHub. Moreover, the proposed approach is resilient to language-specifics, and thus it can identify authors of four programming languages (e.g., C, C++, Java, and Python), and authors writing in mixed languages (e.g., Java/C++, Python/C++). Finally, our system is resistant to sophisticated obfuscation (e.g., using C Tigress) with an accuracy of 93.42% for a set of 120 authors. Experimenting with executable binaries, our approach achieves 95.74% for identifying 1,500 programmers of software binaries. Similar results were obtained when software binaries are generated with different compilation options, optimization levels, and removing of symbol information. Moreover, our approach achieves 93.86% for identifying 1,500 programmers of obfuscated binaries using all features adopted in Obfuscator-LLVM tool.
Published: 2021

39. Formal model-driven executable DSLs

Author: Akram Idani
Subjects: Markup language, Correctness, Computer science, Semantics (computer science), Programming language, B-Method, computer.file_format, computer.software_genre, Dependability, Automated reasoning, Executable, Kermeta, computer, Software, computer.programming_language
Abstract: One of the promising techniques to address the dependability of a system is to apply, at early design stages, domain-specific languages (DSLs) with execution semantics. Indeed, an executable DSL would not only represent the expected system’s structure, but it is intended to itself behave as the system should run. In order to make executable DSLs a powerful asset in the development of safety-critical systems, not only a rigorous development process is required but the domain expert should also have confidence in the execution semantics provided by the DSL developer. To this aim, we recently developed the Meeduse tool and showed how to bridge the gap between MDE and a proof-based formal approach. In this work, we apply our approach to the Petri-net DSL and we present MeeNET, a proved Petri-net designer and animator powered by Meeduse. MeeNET is built on top of PNML (Petri-Net Markup Language), the international standard ISO/IEC 15909 for Petri-nets, and provides underlying formal static and dynamic semantics that are verified by automated reasoning tools. This paper first presents simplified MDE implementations of Petri-nets applying Java, QVT, Kermeta and fUML that we experimented in order to debug a safety-critical system and summarises the lessons learned from this study. Then, it provides formal alternatives, based on the B method and process algebra, which are well-established techniques allowing interactive animation on the one hand and reasoning about the behaviour correctness, on the other hand.
Published: 2021

40. Survey of Methods for Automated Code-Reuse Exploit Generation

Author: A.R. Nurmukhametov and A. V. Vishnyakov
Subjects: FOS: Computer and information sciences, Computer Science - Cryptography and Security, Exploit, Computer science, Programming language, Code reuse, computer.file_format, computer.software_genre, Instruction set, Virtual machine, Gadget, Code generation, Compiler, Executable, Cryptography and Security (cs.CR), computer, ComputingMilieux_MISCELLANEOUS, Software
Abstract: This paper provides a survey of methods and tools for automated code-reuse exploit generation. Such exploits use code that is already contained in a vulnerable program. The code-reuse approach allows one to exploit vulnerabilities in the presence of operating system protection that prohibits data memory execution. This paper contains a description of various code-reuse methods: return-to-libc attack, return-oriented programming, jump-oriented programming, and others. We define fundamental terms: gadget, gadget frame, gadget catalog. Moreover, we show that, in fact, a gadget is an instruction, and a set of gadgets defines a virtual machine. We can reduce an exploit creation problem to code generation for this virtual machine. Each particular executable file defines a virtual machine instruction set. We provide a survey of methods for gadgets searching and determining their semantics (creating a gadget catalog). These methods allow one to get the virtual machine instruction set. If a set of gadgets is Turing-complete, then a compiler can use a gadget catalog as a target architecture. However, some instructions can be absent. Hence we discuss several approaches to replace missing instructions with multiple gadgets. An exploit generation tool can chain gadgets by pattern searching (regular expressions) or considering gadgets semantics. Furthermore, some chaining methods use genetic algorithms, while others use SMT-solvers. We compare existing open-source tools and propose a testing system rop-benchmark that can be used to verify whether a generated chain successfully opens a shell.
Published: 2021

41. Evaluation of Requirements Management Processes Utilizing System Modeling Language (SysML) Executable Models

Author: Tami Katz
Subjects: Requirements management, business.industry, Computer science, Systems Modeling Language, Executable, computer.file_format, Systems modeling, Software engineering, business, computer
Published: 2021

42. Bin2vec: learning representations of binary executable programs for security tasks

Author: Sima Arasteh, Shushan Arakelyan, Christophe Hauser, Erik Kline, and Aram Galstyan
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer engineering. Computer hardware, Computer Science - Cryptography and Security, Computer Networks and Communications, Computer science, Binary number, Machine Learning (stat.ML), 02 engineering and technology, Machine learning, computer.software_genre, Machine Learning (cs.LG), Task (project management), Set (abstract data type), TK7885-7895, Artificial Intelligence, Statistics - Machine Learning, Computer security, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Vulnerability (computing), business.industry, 020207 software engineering, computer.file_format, QA75.5-76.95, Electronic computers. Computer science, Scalability, Vulnerability discovery, Binary code, Artificial intelligence, Executable, business, Heuristics, Binary program analysis, Cryptography and Security (cs.CR), computer, Software, Neural networks, Information Systems
Abstract: Tackling binary program analysis problems has traditionally implied manually defining rules and heuristics, a tedious and time consuming task for human analysts. In order to improve automation and scalability, we propose an alternative direction based on distributed representations of binary programs with applicability to a number of downstream tasks. We introduce Bin2vec, a new approach leveraging Graph Convolutional Networks (GCN) along with computational program graphs in order to learn a high dimensional representation of binary executable programs. We demonstrate the versatility of this approach by using our representations to solve two semantically different binary analysis tasks – functional algorithm classification and vulnerability discovery. We compare the proposed approach to our own strong baseline as well as published results, and demonstrate improvement over state-of-the-art methods for both tasks. We evaluated Bin2vec on 49191 binaries for the functional algorithm classification task, and on 30 different CWE-IDs including at least 100 CVE entries each for the vulnerability discovery task. We set a new state-of-the-art result by reducing the classification error by 40% compared to the source-code based inst2vec approach, while working on binary code. For almost every vulnerability class in our dataset, our prediction accuracy is over 80% (and over 90% in multiple classes).
Published: 2021

43. Automatic Generation of Object-Oriented Code from the ReLEL Requirements Model

Author: Andrianjaka Miary Rapatsalahy, Mihaela Ilie, Sorin Ilie, Raft Nicolas Razafindrakoto, Hajarisena Razafimahatratra, and Thomas Mahatody
Subjects: Programming language, Computer science, business.industry, Model transformation, 05 social sciences, Software development, 020207 software engineering, 02 engineering and technology, computer.file_format, computer.software_genre, Software development process, Unified Modeling Language, Component (UML), 0502 economics and business, 0202 electrical engineering, electronic engineering, information engineering, Code generation, Class diagram, Executable, business, computer, 050203 business & management, computer.programming_language
Abstract: The final executable code should no longer be considered as a central element in a software development process but rather a naturally important component that results from a model transformation. The objective of the MDA (Model Driven Architecture) approach is to lift the lock of software development automation from the CIM (Computation Independent Model) requirements until the code of an application is obtained. Therefore, we have proposed in the framework of MDA an approach that consists of automatically generating object-oriented code from the CIM model represented by ReLEL (Restructuring extended Lexical Elaborate Language). ReLEL is a natural language-oriented model that represents both the client requirements and the conceptual level of a system. However, the MDA framework does not recommend the type of UML model that corresponds to each business activity. Consequently, automating the software development process from the CIM model specified by ReLEL becomes a complex task. Our strategy in this paper includes the instantiation of the ReLEL model in the Praxeme methodology, which models each of the company's concerns, grouped in a homogeneous whole, using the UML (Unified Modeling Language) and which considers the articulation of these aspects by adopting the MDA principle. To do this, we propose to automate the articulation that covers the intentional, semantic, logical, and software aspects of Praxeme. To validate our approach, we measure the coupling and cohesion of the UML class diagram obtained from the Java code generated from this article using the slicing technique. The results show that the coupling is weak, and the cohesion is strong. It can be deduced that the method proposed in this paper can produce a more reliable and efficient system.
Published: 2021

44. EASIER (EXECUTABLE ACCESS TO STATISTICS FOR INTERACTIVE AND EFFICIENT RESEARCH)

Author: Merilyn D. Juacalla and Ferdie S. Ching
Subjects: Computer science, Programming language, Executable, computer.file_format, computer.software_genre, computer
Abstract: The purpose of the study is to develop a reliable computer-aided statistical instrument for data processing. The researcher come up to the idea to formulate an executable program running in Microsoft Excel platform. The platform is chosen based on the fact that it is widely used office application and known to be user-friendly. EASIER or Executable Access to Statistics for Interactive and Efficient Research was born. Executable because the program can be run by a computer, it is accessible in terms that most teachers use MS Excel as an office application, it can solve and analyze most statistics problems, interactive because there is a two-way flow of information between a computer and the user which respond to a certain input, the system promise to achieve a maximum productivity with a minimum wasted effort or expense, and to establish facts and reach new conclusions. The statistical instrument was evaluated by twenty-four (24) Senior High School Teachers from Nagcarlan, Liliw, Majayjay, Magdalena, Pila, Victoria, and Sta. Cruz district and six (6) College Teachers from Laguna State Polytechnic University Sta. Cruz Main Campus, and from Philippine Women’s University Sta. Cruz, Laguna. It sought to answer the following questions: (1). What is the mean level of basic requirements of using computer-aided statistical instrument in terms of: 1.1 knowledge, 1.2 software and 1.3 hardware.? (2). What is the mean level of capability of EASIER as a computer-aided statistical instrument in computing statistical problems in terms of: 2.1 accepting input and data parameters, 2.2 organizing data, and 2.3 generating result, figures, charts, and drawing conclusion? (3). What is the mean level of acceptability of EASIER as a computer-aided statistical instrument in statistical analysis in terms of; 3.1 tool interface, and 3.2 operation and function? (4). Is there a significant difference between the level of responses of teachers from Senior High School and College instructors in terms of capability and acceptability of EASIER as a computer-aided statistical instrument?
Published: 2021

45. FTFL: A Fisher’s test-based approach for fault localization

Author: Rajib Mall, Krishna Kunal, Shubham Shankar, Saksham Sahai Srivastava, and Arpita Dutta
Subjects: Statement (computer science), Test case, Computer science, Rank (computer programming), Code (cryptography), Extension (predicate logic), Executable, computer.file_format, Fault (power engineering), computer, Algorithm, Software, Test (assessment)
Abstract: For effective fault localization, we propose a modified Fisher’s test-based statistical method that makes use of test execution results as well as statement coverage information to determine the suspiciousness of each executable statement. Our technique returns a rank list of statements based on their suspiciousness of containing a fault. We also discuss an extension to our proposed approach for localizing programs with multiple faults. This involves partitioning the failed test cases into clusters such that they target different faults. Our experimental studies show that on an average, our proposed fault localization technique requires examination of 37.09% less code than existing techniques for localizing faults.
Published: 2021

46. Educational Videogame to Learn the Periodic Table: Design Rationale and Lessons Learned

Author: V. Javier Traver, Vicente Martí-Centelles, Luis A. Leiva, and Jenifer Rubio-Magnieto
Subjects: humor/puzzles/games, internet/web-based learning, Computer science, first-year undergraduate/general, General Chemistry, computer.file_format, elementary/middle school science, high school/introductory chemistry, Symbol (chemistry), Memorization, Education, Variety (cybernetics), Chemistry [G01] [Physical, chemical, mathematical & earth Sciences], periodicity/periodic table, Entertainment, Human–computer interaction, Group (periodic table), Design rationale, Chimie [G01] [Physique, chimie, mathématiques & sciences de la terre], multimedia-based learning, second-year undergraduate, Executable, computer-based learning, computer, Complement (set theory)
Abstract: The periodic table allows students to easily understand the chemical elements and predict the behavior of theoretical yet undiscovered new elements. Many memorization techniques have been used for learning the periodic table, yet serious games (i.e., designed for a primary purpose other than pure entertainment) have been underexplored to complement or even replace such memorization techniques. Since CHEMMEND, an existing physical card game, was found to assist with learning the periodic table, we explore the potential of E-CHEMMEND, a digital version of the game as an aid to memorize the group and period numbers of the elements. E-CHEMMEND is a single-player serious game to explore the effect of four different game conditions involving two experimental factors that account for different educational scenarios. The first factor investigates the role of playing through levels of increasing difficulty versus playing with all elements from the very beginning. The second factor investigates the role of displaying the group and period numbers of the chemical element along with its symbol versus only displaying the element symbol. Preliminary results show that E-CHEMMEND is perceived as more enjoyable when the group and period numbers are displayed. In contrast, the game is found to better assist learning when this information is hidden and levels are shown. Taken together, our results suggest that a variety of educational purposes can be accommodated with a range of game settings. Ultimately, the design rationale and the lessons learned while testing E-CHEMMEND will be valuable for chemistry instructors and education researchers. A desktop-based Windows executable version of the game is available at http://www.chemmend.uji.es/game
Published: 2021

47. Applying NLP techniques to malware detection in a practical environment

Author: Ryo Ito and Mamoru Mimura
Subjects: Computer Networks and Communications, Computer science, business.industry, Cryptography, 02 engineering and technology, computer.file_format, computer.software_genre, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Malware, 020201 artificial intelligence & image processing, The Internet, Artificial intelligence, Executable, Safety, Risk, Reliability and Quality, business, computer, Computer communication networks, Software, Natural language processing, Information Systems
Abstract: Executable files still remain popular to compromise the endpoint computers. These executable files are often obfuscated to avoid anti-virus programs. To examine all suspicious files from the Internet, dynamic analysis requires too much time. Therefore, a fast filtering method is required. With the recent development of natural language processing (NLP) techniques, printable strings became more effective to detect malware. The combination of the printable strings and NLP techniques can be used as a filtering method. In this paper, we apply NLP techniques to malware detection. This paper reveals that printable strings with NLP techniques are effective for detecting malware in a practical environment. Our dataset consists of more than 500,000 samples obtained from multiple sources. Our experimental results demonstrate that our method is effective to not only subspecies of the existing malware, but also new malware. Our method is effective against packed malware and anti-debugging techniques.
Published: 2021

48. Bayesian <scp>single‐arm</scp> phase <scp>II</scp> trial designs with <scp>time‐to‐event</scp> endpoints

Author: Haitao Pan, Jianrong Wu, and Chia-Wei Hsu
Subjects: Statistics and Probability, Computer science, Bayesian probability, Phase (waves), Machine learning, computer.software_genre, 01 natural sciences, Article, 010104 statistics & probability, 03 medical and health sciences, 0302 clinical medicine, Frequentist inference, Pharmacology (medical), 030212 general & internal medicine, 0101 mathematics, Event (probability theory), Pharmacology, business.industry, Bayes Theorem, Small sample, computer.file_format, R package, Research Design, Sample size determination, Sample Size, Immunotherapy, Artificial intelligence, Executable, business, computer, Algorithms
Abstract: For the cancer clinical trials with immunotherapy and molecularly targeted therapy, time-to-event endpoint is often a desired endpoint. In this paper, we present an event-driven approach for Bayesian one-stage and two-stage single-arm phase II trial designs. Two versions of Bayesian one-stage designs were proposed with executable algorithms and meanwhile, we also develop theoretical relationships between the frequentist and Bayesian designs. These findings help investigators who want to design a trial using Bayesian approach have an explicit understanding of how the frequentist properties can be achieved. Moreover, the proposed Bayesian designs using the exact posterior distributions accommodate the single-arm phase II trials with small sample sizes. We also proposed an optimal two-stage approach, which can be regarded as an extension of Simon's two-stage design with the time-to-event endpoint. Comprehensive simulations were conducted to explore the frequentist properties of the proposed Bayesian designs and an R package BayesDesign can be assessed via R CRAN for convenient use of the proposed methods.
Published: 2021

49. A Declarative Approach for Transforming SysML Models to Executable Simulation Models

Author: Mara Nikolaidou, Anargyros Tsadimas, Christos Kotronis, George-Dimitrios Kapos, Vassilis Dalakas, and Dimosthenis Anagnostopoulos
Subjects: business.industry, Computer science, Model transformation, 020208 electrical & electronic engineering, 020207 software engineering, 02 engineering and technology, computer.file_format, Object (computer science), Computer Science Applications, Human-Computer Interaction, Unified Modeling Language, Control and Systems Engineering, Systems Modeling Language, Component (UML), 0202 electrical engineering, electronic engineering, information engineering, Code generation, Executable, Electrical and Electronic Engineering, Software engineering, business, computer, Software, computer.programming_language, Declarative programming
Abstract: Systems Modeling Language (SysML) is an object management group standard for systems-of-systems engineering. It enables the description of complex system models; however, it cannot effectively support all system engineering activities. For instance, system performance evaluation is usually performed via simulation. In this case, the transformation of SysML system models to executable simulation models for specific simulation methodologies and tools is required. Model transformation is a key component for addressing the challenges of seamless integration of SysML model simulation in model-based system engineering. In this paper, we explore a declarative approach, based on the query/view/transformation-relations (QVT-R) standard, for the transformation of SysML models to executable simulation models, fully adhering model-driven architecture (MDA) concepts. It is supported by a framework implemented to provide executable simulation model and code generation. Methodological guidelines, for the effective use of a declarative language as QVT-R, for model transformation, are provided, emphasizing the utilization of existing domain-specific SysML profiles, as well as executable simulation library components. The experience obtained from two different domains, namely, enterprise information and railway transportation systems, modeled as systems-of-systems via SysML, is discussed, based on a quantitative analysis of the respective QVT-R transformations.
Published: 2021

50. A Multi-Dimensional Deep Learning Framework for IoT Malware Classification and Family Attribution

Author: Chadi Assi, Mirabelle Dib, Elias Bou-Harb, and Sadegh Torabi
Subjects: Computer Networks and Communications, business.industry, Computer science, Deep learning, Feature extraction, Botnet, 020206 networking & telecommunications, 02 engineering and technology, computer.file_format, computer.software_genre, Computer security, Obfuscation, 0202 electrical engineering, electronic engineering, information engineering, Malware, The Internet, Artificial intelligence, Executable, Electrical and Electronic Engineering, business, computer, 5G
Abstract: The emergence of Internet of Things malware, which leverages exploited IoT devices to perform large-scale cyber attacks (e.g., Mirai botnet), is considered as a major threat to the Internet ecosystem. To mitigate such threat, there is an utmost need for effective IoT malware classification and family attribution, which provide essential steps towards initiating attack mitigation/prevention countermeasures. In this paper, motivated by the lack of sophisticated malware obfuscation in the implementation of IoT malware, we utilize features extracted from strings- and image-based representations of the executable binaries to propose a novel multi-dimensional classification approach using Deep Learning (DL) architectures. To this end, we analyze more than 70,000 recently detected IoT malware samples. Our in-depth experiments with four prominent IoT malware families highlight the significant accuracy of the approach (99.78%), which outperforms conventional single-level classifiers. Additionally, we utilize our IoT-tailored approach for labeling newly detected “unknown” malware samples, which were mainly attributed to a few predominant families. Finally, this work contributes to the security of future networks (e.g., 5G) through the implementation of effective tools/techniques for timely IoT malware classification, and attack mitigation.
Published: 2021

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

10,049 results on '"Executable"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources