Author: "Aridhi A" / Topic: computer science - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Aridhi A"' showing total 50 results

Start Over Author "Aridhi A" Topic computer science

50 results on '"Aridhi A"'

1. A data sampling and attribute selection strategy for improving decision tree construction

Author: Didier Rémond, Wajdi Dhifli, Nour El Islem Karabadji, Hassina Seridi, Sabeur Aridhi, Ilyes Khelf, Laboratoire de Gestion Electronique de Document [Annaba] (LabGED), Université Badji Mokhtar Annaba (UBMA), Ecole Supérieure de Technologies Industrielles Annaba, Laboratoire de Mécanique des Contacts et des Structures [Villeurbanne] (LaMCoS), Institut National des Sciences Appliquées de Lyon (INSA Lyon), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS), Laboratoire Vibrations Acoustique (LVA), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA), Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Université de Lille, Laboratoire de gestion electronique de documents [Annaba] (LabGED), Université Badji Mokhtar - Annaba [Annaba] (UBMA), Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon-Centre National de la Recherche Scientifique (CNRS), Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon, Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)
Subjects: 0209 industrial biotechnology, Computer science, Decision tree, Feature selection, 02 engineering and technology, Variation (game tree), Residual, computer.software_genre, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], 020901 industrial engineering & automation, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, Sampling, Fault diagnosis, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Particle swarm optimization, General Engineering, Condition monitoring, Sampling (statistics), Attribute selection, Computer Science Applications, Instantaneous angular seed, 020201 artificial intelligence & image processing, Noise (video), Data mining, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], computer
Abstract: International audience; Decision trees are efficient means for building classification models due to the compressibility, simplicity and ease of interpretation of their results. However, during the construction phase of decision trees, the outputs are often large trees that are affected by many uncertainties in the data (particularity, noise and residual variation). Combining attribute selection and data sampling presents one of the most promising research directions to overcome decision tree construction problems. However, the search space composed of all possible combinations of subsets of training samples and attributes is extremely large. In this paper, a novel approach is presented that allows generating an optimized decision tree by selecting an optimal couple of training samples and attributes subsets for training. As the search space of candidate couples of training samples and attributes subsets is extremely large, we use particle swarm optimization to make the search of an “optimal” solution tractable. The selected optimized solution helps in avoiding over-fitting and complexity problems suffered in the construction phase of decision trees. We conducted an extensive experimental evaluation on 22 datasets from the UCI Machine Learning Repository. The obtained results show that the proposed approach outperforms state-of-the-art classical as well as evolutionary decision tree construction methods in terms of simplicity, accuracy, and F-measure. We further evaluate our approach on a real-world engineering application for condition monitoring of rotating machinery under severe non-stationary conditions. The obtained results showed that the proposed approach allowed to optimize the use of instantaneous angular speed to diagnose gears defects.
Published: 2019
Full Text: View/download PDF

2. An Experimental Evaluation of Similarity-Based and Embedding-Based Link Prediction Methods on Graphs

Author: Malika Smaïl-Tabbone, Kamrul Islam, Sabeur Aridhi, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Theoretical computer science, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Computer science, Homogeneous Graph & Node Embedding, Graph Neural Network, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Similarity (network science), [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Prediction methods, Embedding, Link Prediction, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], Link (knot theory)
Abstract: International audience; The task of inferring missing links or predicting future ones in a graph based on its current structure is referred to as link prediction. Link prediction methods that are based on pairwise node similarity are well-established approaches in the literature and show good prediction performance in many realworld graphs though they are heuristic. On the other hand, graph embedding approaches learn lowdimensional representation of nodes in graph and are capable of capturing inherent graph features, and thus support the subsequent link prediction task in graph. This paper studies a selection of methods from both categories on several benchmark (homogeneous) graphs with different properties from various domains. Beyond the intra and inter category comparison of the performances of the methods, our aim is also to uncover interesting connections between Graph Neural Network(GNN)based methods and heuristic ones as a means to alleviate the black-box well-known limitation.
Published: 2021
Full Text: View/download PDF

3. Graph Based Automatic Protein Function Annotation Improved by Semantic Similarity

Author: Navya Khare, Bishnu Sarker, Marie-Dominique Devignes, Sabeur Aridhi, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), International Institute of Information Technology, Hyderabad [Hyderabad] (IIIT-H), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Bioinformatics, Computer science, 0206 medical engineering, Protein domain, 02 engineering and technology, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], 03 medical and health sciences, Annotation, Protein sequencing, Semantic similarity, Knowledge extraction, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], ComputingMilieux_MISCELLANEOUS, 030304 developmental biology, 0303 health sciences, GrAPFI, Information retrieval, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Enzyme Commission number, Knowledge Discovery, Graph (abstract data type), UniProt, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], Network Inference, Protein Function Annotation, Graph Mining, 020602 bioinformatics
Abstract: International audience; Functional annotation of protein is a very challenging task primarily because manual annotation requires a great amount of human efforts and still it’s nearly impossible to keep pace with the exponentially growing number of protein sequences coming into the public databases, thanks to the high throughput sequencing technology. For example, the UniProt Knowledge-base (UniProtKB) is currently the largest and most comprehensive resource for protein sequence and annotation data. According to the November, 2019 release of UniProtKB, some 561,000 sequences are manually reviewed but over 150 million sequences lack reviewed functional annotations. Moreover, it is an expensive deal in terms of the cost it incurs and the time it takes. On the contrary, exploiting this huge quantity of data is important to understand life at the molecular level, and is central to understanding human disease processes and drug discovery. To be useful, protein sequences need to be annotated with functional properties such as Enzyme Commission (EC) numbers and Gene Ontology(GO) terms. The ability to automatically annotate protein sequences in UniProtKB/TrEMBL, the non-reviewed UniProt sequence repository, would represent a major step towards bridging the gap between annotated and un-annotated protein sequences. In this paper, we extend a neighborhood based network inference technique for automatic GO annotation using protein similarity graph built on protein domain and family information. The underlying philosophy of our approach assumes that proteins can be linked through the domains, families, and superfamilies that they share. We propose an efficient pruning and post-processing technique by integrating semantic similarity of GO terms. We show by empirical results that the proposed hierarchical post-processing potentially improves the performance of other GO annotation tools as well.
Published: 2020
Full Text: View/download PDF

4. Special issue on 'Advances on Large Evolving Graphs'

Author: Karine Zeitouni, José Antônio Fernandes de Macêdo, Sabeur Aridhi, Engelbert Mephu Nguifo, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Universidade Federal do Ceará = Federal University of Ceará (UFC), Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS), Données et algorithmes pour une ville intelligente et durable - DAVID (DAVID), Université de Versailles Saint-Quentin-en-Yvelines (UVSQ), and Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])
Subjects: Theoretical computer science, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Computer Networks and Communications, Computer science, 020206 networking & telecommunications, 02 engineering and technology, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Hardware and Architecture, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], Software, ComputingMilieux_MISCELLANEOUS
Abstract: International audience
Published: 2020
Full Text: View/download PDF

5. A Structure Based Multiple Instance Learning Approach for Bacterial Ionizing Radiation Resistance Prediction

Author: Manel Zoghlami, Mondher Maddouri, Sabeur Aridhi, and Engelbert Mephu Nguifo
Subjects: 0301 basic medicine, biology, Computer science, In silico, 02 engineering and technology, computer.software_genre, biology.organism_classification, 03 medical and health sciences, 030104 developmental biology, Bioremediation, Discriminative model, 0202 electrical engineering, electronic engineering, information engineering, General Earth and Planetary Sciences, Structure based, 020201 artificial intelligence & image processing, Data mining, computer, Classifier (UML), Bacteria, General Environmental Science
Abstract: Ionizing-radiation-resistant bacteria (IRRB) could be used for bioremediation of radioactive wastes and in the therapeutic industry. Limited computational works are available for the prediction of bacterial ionizing radiation resistance (IRR). In this work, we present ABClass, an in silico approach that predicts if an unknown bacterium belongs to IRRB or ionizing-radiation-sensitive bacteria (IRSB). This approach is based on a multiple instance learning (MIL) formulation of the IRR prediction problem. It takes into account the relation between semantically related instances across bags. In ABClass, a preprocessing step is performed in order to extract substructures/motifs from each set of related sequences. These motifs are then used as attributes to construct a vector representation for each set of sequences. In order to compute partial prediction results, a discriminative classifier is applied to each sequence of the unknown bag and its correspondent related sequences in the learning dataset. Finally, an aggregation method is applied to generate the final result. The algorithm provides good overall accuracy rates. ABClass can be downloaded at the following link: http://homepages.loria.fr/SAridhi/software/MIL/.
Published: 2019
Full Text: View/download PDF

6. Design and FPGA-implementation of a PID controller for temperature control in a refrigeration system

Author: Rim Ben Ali, Abdelkader Mami, Emna Aridhi, and Mohamed Akram Jaballah
Subjects: Multidisciplinary, Temperature control, Computer science, business.industry, 020209 energy, Hardware description language, PID controller, Refrigeration, ComputerApplications_COMPUTERSINOTHERSYSTEMS, Control engineering, 02 engineering and technology, Automation, Control theory, VHDL, 0202 electrical engineering, electronic engineering, information engineering, Field-programmable gate array, business, computer, Hardware_LOGICDESIGN, computer.programming_language
Abstract: Objectives: The universal controller PID (Proportional-Integra-Derivative) is widely used in automation systems. Methods: This work is focused to develop a PID controller for a refrigeration system in order to implement it in a Field Programmable Gate Array (FPGA) by using the Very High Speed Integrated Circuit Hardware Description Language (VHDL). Firstly, the nonlinear model of the refrigeration system is presented by a third order state system and simulated under Matlab/Simulink environment to develop its PID controller, which is aimed to adjust the indoor temperature. Further, the synthetizable program of the PID algorithm has been implemented on a Map Altera DE0-nano Kit using the Quartus II software. Findings: The performance of the proposed controller has been successfully validated with good tracking results. Application: The FPGA target presents a good solution to implement the PID algorithm. Keywords: FPGA-Implementation, PID Controller, Quartus II, Refrigeration System, Temprature Control, Altera DE0-Nano Kit.
Published: 2018
Full Text: View/download PDF

7. Special issue on 'Uncertainty in Cloud Computing: Concepts, Challenges and Current Solutions'

Author: Allel Hadjali, Haithem Mezni, Andrei Tchernykh, Sabeur Aridhi, Laboratoire d'Informatique et d'Automatique pour les Systèmes (LIAS), Université de Poitiers-ENSMA, Taibah University, Strategies for Modelling and ARtificial inTelligence Laboratory (SMART-LAB), Université de Tunis, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), Centro de Investigacion Cientifica y de Education Superior de Ensenada [Mexico] (CICESE), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
Subjects: [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Computer science, Applied Mathematics, Cloud computing, Data science, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Theoretical Computer Science, 03 medical and health sciences, 0302 clinical medicine, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, 030221 ophthalmology & optometry, 030212 general & internal medicine, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], Current (fluid), business, ComputingMilieux_MISCELLANEOUS, Software
Abstract: International audience
Published: 2019
Full Text: View/download PDF

8. FPGA based co-design of a speed fuzzy logic controller applied to an autonomous car

Author: Abdelkader Mami, Emna Aridhi, and Decebal Popescu
Subjects: Hardware architecture, business.industry, Computer science, Speed control, Controller (computing), Hardware description language, Fuzzy logic, Embedded system, VHDL, Software design, Hardware design, Field-programmable gate array, business, computer, FPGA, Digital signal processing, computer.programming_language
Abstract: This paper invests in FPGA technology to control the speed of an autonomous car using fuzzy logic. For that purpose, we propose a co-design based on a novel fuzzy controller IP. It was developed using the hardware language VHDL and driven by the Zynq processor through an SDK software design written in C. The proposed IP acts according to the ambient temperature and the presence or absence of an obstacle and its distance from the car. The partitioning of the co-design tasks divides them into hardware and software parts. The simulation results of the fuzzy IP and those of the complete co-design implementation on a Xilinx Zynq board showed the effectiveness of the proposed controller to meet the target constraints and generate suitable PWM signals. The proposed hardware architecture based on 6-LUT blocks uses 11 times fewer logic resources than other previous similar designs. Also, it can be easily updated when new constraints on the system are to be considered, which makes it suitable for many related applications. Fuzzy computing was accelerated thanks to the use of digital signal processing blocks that ensure parallel processing. Indeed, a complete execution cycle takes only 7 us.
Published: 2021
Full Text: View/download PDF

9. Exploiting bounds optimization for the semi-formal verification of analog circuits

Author: Ons Lahiouel, Mohamed H. Zaki, Sofiène Tahar, and Henda Aridhi
Subjects: State variable, Analogue electronics, Computer science, 020208 electrical & electronic engineering, Enclosure, 02 engineering and technology, Multivariate optimization, Semi-formal, 020202 computer hardware & architecture, Hardware and Architecture, 0202 electrical engineering, electronic engineering, information engineering, Electronic engineering, Transient response, Electrical and Electronic Engineering, Differential (infinitesimal), Software
Abstract: This paper proposes a semi-formal methodology for modeling and verification of analog circuits behavioral properties using multivariate optimization techniques. Analog circuit differential models are automatically extracted and their qualitative behavior is computed for interval-valued parameters, inputs and initial conditions. The method has the advantage of guaranteeing the rough enclosure of any possible dynamical behavior of analog circuits. The circuit behavioral properties are then verified on the generated transient response bounds. Experimental results show that the resulting state variable envelopes can be effectively employed for a sound verification of analog circuit properties, in an acceptable run-time.
Published: 2017
Full Text: View/download PDF

10. Functional Annotation of Proteins using Domain Embedding based Sequence Classification

Author: Sabeur Aridhi, David W. Ritchie, Bishnu Sarker, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), TELECOM Nancy, Université de Lorraine (UL), Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), and Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Word embedding, Computer science, Bioinformatics, Representation Learning, Computational biology, Domain (software engineering), [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Machine Learning, 03 medical and health sciences, Annotation, 0302 clinical medicine, Protein sequencing, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], ComputingMilieux_MISCELLANEOUS, 030304 developmental biology, Domain Embed- ding, 0303 health sciences, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Enzyme Commission number, Knowledge base, 030220 oncology & carcinogenesis, UniProt, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], business, Feature learning, Protein Function Annotation
Abstract: International audience; Due to the recent advancement in genomic sequencing technologies, the number of protein sequences in public databases is growing exponentially. The UniProt Knowledgebase (UniProtKB) is currently the largest and most comprehensive resource for protein sequence and annotation data. The May 2019 release of the Uniprot Knowledge base (UniprotKB) contains around 158 million protein sequences. For the complete exploitation of this huge knowledge base, protein sequences need to be annotated with functional properties such as Enzyme Commission (EC) numbers and Gene Ontology terms. However, there is only about half a million sequences (UniprotKB/SwissProt) are reviewed and functionally annotated by expert curators using information extracted from the published literature and computational analyses. The manual annotation by experts are expensive, slow and insufficient to fill the gap between the annotated and unannotated protein sequences. In this paper, we present an automatic functional annotation technique using neural network based based word embedding exploiting domain and family information of proteins. Domains are the most conserved regions in protein sequences and constitute the building blocks of 3D protein structures. To do the experiment, we used fastText a , a library for learning of word embeddings and text classification developed by Facebook's AI Research lab. The experimental results show that domain embeddings perform much better than k-mer based word embeddings. a https://github.com/facebookresearch/fasttext
Published: 2019
Full Text: View/download PDF

11. The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Author: Renzhi Cao, Alice C. McHardy, Cen Wan, Jonathan G. Lees, Vedrana Vidulin, Alex Warwick Vesztrocy, Huy N Nguyen, Devon Johnson, Ian Sillitoe, Alessandro Petrini, Richard Bonneau, Hans Moen, Peter L. Freddolino, Rui Fa, Alfredo Benso, Jianlin Cheng, Indika Kahanda, Qizhong Mao, Zihan Zhang, Chenguang Zhao, Rebecca L. Hurto, Predrag Radivojac, Stefano Di Carlo, Sayoni Das, Suwisa Kaewphan, Sabeur Aridhi, Alan Medlar, Casey S. Greene, Constance J. Jeffery, Christophe Dessimoz, Jose Manuel Rodriguez, Gianfranco Politano, Michele Berselli, Jia-Ming Chang, Deborah A. Hogan, Julian Gough, Tunca Doğan, David T. Jones, Claire O'Donovan, Volkan Atalay, Paolo Fontana, Feng Zhang, Shuwei Yao, Robert Hoehndorf, Olivier Lichtarge, Alex W. Crocker, Ahmet Sureyya Rifaioglu, Rabie Saidi, Farrokh Mehryary, Neven Sumonja, Yang Zhang, Florian Boecker, Jie Hou, Christine A. Orengo, Matteo Re, Natalie Thurlby, Chengxin Zhang, Stefano Pascarelli, Alberto Paccanaro, Hafeez Ur Rehman, Yuxiang Jiang, Mohammad R. K. Mofrad, Naihui Zhou, Asa Ben-Hur, Steven E. Brenner, Martti Tolvanen, Filip Ginter, Mark N. Wass, Patricia C. Babbitt, David W. Ritchie, George Georghiou, Stefano Toppo, Caleb Chandler, Larry Davis, Da Chen Emily Koo, Itamar Borukhov, Petri Törönen, Rengul Cetin-Atalay, Fabio Fabris, Haixuan Yang, Kai Hakala, Silvio C. E. Tosatto, Domenico Cozzetto, Slobodan Vucetic, Balint Z. Kacsoh, Luke W Sagers, Alex A. Freitas, Tapio Salakoski, Fran Supek, Alfonso E. Romero, Angela D. Wilkins, Elaine Zosa, Shanshan Zhang, Yotam Frank, Jonathan B. Dayton, Jeffrey M. Yunes, Pier Luigi Martelli, Dallas J. Larsen, Giuliano Grossi, Alexandra J. Lee, Marco Mesiti, Yi-Wei Liu, Jonas Reeb, Damiano Piovesan, Sean D. Mooney, Magdalena Antczak, Erica Suh, Marco Falda, Marie-Dominique Devignes, Castrense Savojardo, Zheng Wang, Danielle A Brackenridge, Peter W. Rose, Enrico Lavezzo, Dane Jo, Ronghui You, Tomislav Šmuc, Liam J. McGuffin, Michael L. Tress, Ilya Novikov, Adrian M. Altenhoff, Burkhard Rost, Miguel Amezola, Mateo Torres, Prajwal Bhat, Wen-Hung Liao, Meet Barot, Marco Notaro, Suyang Dai, Giorgio Valentini, Jari Björne, Nevena Veljkovic, Wei-Cheng Tseng, Po-Han Chi, Alperen Dalkiran, Maxat Kulmanov, Nafiz Hamid, Aashish Jain, Branislava Gemovic, Alexandre Renaux, Ashton Omdahl, Daniel B. Roche, Vladimir Perovic, Iddo Friedberg, Daisuke Kihara, Giovanni Bosco, Gage S. Black, Saso Dzeroski, Liisa Holm, Marco Frasca, Michal Linial, Ehsaneddin Asgari, Tatyana Goldberg, Maria Jesus Martin, Vladimir Gligorijević, Marco Carraro, Shanfeng Zhu, Radoslav Davidovic, Timothy Bergquist, Hai Fang, José M. Fernández, Giuseppe Profiti, Weidong Tian, Imane Boudellioua, Kimberley A. Lewis, Seyed Ziaeddin Alborzi, and Rita Casadio
Subjects: 0303 health sciences, Protein function, biology, Computer science, 030302 biochemistry & molecular biology, Pseudomonas, Computational biology, Biological process, biology.organism_classification, Genome, 3. Good health, 03 medical and health sciences, Molecular function, Cellular component, Mutation screening, Critical assessment, Protein function prediction, Gene, Function (biology), 030304 developmental biology
Abstract: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Here we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility (P. aureginosa only). We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. We conclude that, while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. We finally report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bioontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.
Published: 2019
Full Text: View/download PDF

12. GrAPFI: predicting enzymatic function of proteins from domain similarity graphs

Author: Bishnu Sarker, David W. Ritchie, Sabeur Aridhi, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), and This work was partially supported by the CNRS-INRIA/FAPs project 'TempoGraphs' (PRC2243).
Subjects: Proteome, K-nearest neighbor, Computer science, Arabidopsis, Computational biology, Saccharomyces cerevisiae, Label propagation, lcsh:Computer applications to medicine. Medical informatics, Biochemistry, Domain (software engineering), [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], 03 medical and health sciences, Mice, Domain similarity graph, Similarity (network science), [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Protein Domains, Structural Biology, Protein network, Animals, Humans, Databases, Protein, lcsh:QH301-705.5, Molecular Biology, 030304 developmental biology, chemistry.chemical_classification, 0303 health sciences, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], GrAPFI, Applied Mathematics, Methodology Article, 030302 biochemistry & molecular biology, Proteins, Computer Science Applications, Enzymes, Rats, Enzyme, lcsh:Biology (General), chemistry, Protein function annotation, lcsh:R858-859.7, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], EC annotation, Function (biology), Algorithms, Software
Abstract: This work is dedicated to the memory of David W. Ritchie, who recently passed away.; International audience; Background: Thanks to recent developments in genomic sequencing technologies, the number of protein sequences in public databases is growing enormously. To enrich and exploit this immensely valuable data, it is essential to annotate these sequences with functional properties such as Enzyme Commission (EC) numbers, for example. The January 2019 release of the Uniprot Knowledge base (UniprotKB) contains around 140 million protein sequences. However, only about half of a million of these (UniprotKB/SwissProt) have been reviewed and functionally annotated by expert curators using data extracted from the literature and computational analyses. To reduce the gap between the annotated and unannotated protein sequences, it is essential to develop accurate automatic protein function annotation techniques. Results: In this work, we present GrAPFI (Graph-based Automatic Protein Function Inference) for automatically annotating proteins with EC number functional descriptors from a protein domain similarity graph. We validated the performance of GrAPFI using six reference proteomes in UniprotKB/SwissProt, namely Human, Mouse, Rat, Yeast, E. Coli and Arabidopsis thaliana. We also compared GrAPFI with existing EC prediction approaches such as ECPred, DEEPre, and SVMProt. This shows that GrAPFI achieves better accuracy and comparable or better coverage with respect to these earlier approaches. Conclusions: GrAPFI is a novel protein function annotation tool that performs automatic inference on a network of proteins that are related according to their domain composition. Our evaluation of GrAPFI shows that it gives better performance than other state of the art methods. GrAPFI is available at https://gitlab.inria.fr/bsarker/bmc_grapfi.git as a stand alone tool written in Python.
Published: 2019
Full Text: View/download PDF

13. Supporting tools to technical advance in the early stages of design

Author: Abderrahmen Aridhi, Juan Pedro Berro, Mireia Mesas, and Jose Jorge Espi
Subjects: lcsh:TA1-2040, Computer science, 020209 energy, 0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology, 010501 environmental sciences, lcsh:Engineering (General). Civil engineering (General), 01 natural sciences, 0105 earth and related environmental sciences
Abstract: The development of methods and tools to support the advancement in scientific and technical solutions has emerged in recent years in order to assist decisions that allow to avoid extra efforts due to the common traditional “trial and error” approach. At the same time, new challenges in terms of environmental protection have also engaged to adopt decisions having into account that climate protection needs to remain as a primer driver in the development of the aviation sector. AIRPOXY will recover all current requirements through an integrated approach where LCA (Life Cycle Assessment), LCC (Life Cycle Cost analysis), HHRA (Human Health Risk Assessment) and numerical simulation of manufacturing processes will work together in order to demonstrate and support the development of thermoformable, repairable and bondable smart epoxy based composites for aero structures. By considering all stated before, the final aim will be double. On one hand, to be informed about technical, environmental, economic and safety requirements during key stages, in order to take informed decisions and optimise it following the Eco-design principles. On the other hand, to obtain objective data to support performance in order to increase the impact of the project and support the further implementation of the technologies as the AIRPOXY solutions reach higher TRLs.
Published: 2019
Full Text: View/download PDF

14. Multiple instance learning for sequence data with across bag dependencies

Author: Sabeur Aridhi, Engelbert Mephu Nguifo, Manel Zoghlami, Mondher Maddouri, Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS), Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Taibah University, Laboratoire d'Informatique, de Modélisation et d'optimisation des Systèmes (LIMOS), SIGMA Clermont (SIGMA Clermont)-Université d'Auvergne - Clermont-Ferrand I (UdA)-Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Blaise Pascal - Clermont-Ferrand 2 (UBP), Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS), and Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-Université d'Auvergne - Clermont-Ferrand I (UdA)-SIGMA Clermont (SIGMA Clermont)-Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Centre National de la Recherche Scientifique (CNRS)
Subjects: 0301 basic medicine, FOS: Computer and information sciences, Computer Science - Machine Learning, Computer science, Computational intelligence, 02 engineering and technology, Similarity measure, ENCODE, Machine Learning (cs.LG), [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], 03 medical and health sciences, Data sequences, Discriminative model, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, Partial classification, ComputingMilieux_MISCELLANEOUS, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Pattern recognition, 030104 developmental biology, ComputingMethodologies_PATTERNRECOGNITION, Classification result, 020201 artificial intelligence & image processing, Computer Vision and Pattern Recognition, Artificial intelligence, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Classifier (UML), Software
Abstract: In Multiple Instance Learning (MIL) problem for sequence data, the instances inside the bags are sequences. In some real world applications such as bioinformatics, comparing a random couple of sequences makes no sense. In fact, each instance may have structural and/or functional relations with instances of other bags. Thus, the classification task should take into account this across bag relation. In this work, we present two novel MIL approaches for sequence data classification named ABClass and ABSim. ABClass extracts motifs from related instances and use them to encode sequences. A discriminative classifier is then applied to compute a partial classification result for each set of related sequences. ABSim uses a similarity measure to discriminate the related instances and to compute a scores matrix. For both approaches, an aggregation method is applied in order to generate the final classification result. We applied both approaches to solve the problem of bacterial Ionizing Radiation Resistance prediction. The experimental results of the presented approaches are satisfactory.
Published: 2019
Full Text: View/download PDF

15. Exploiting Complex Protein Domain Networks for Protein Function Annotation

Author: Bishnu Sarker, David W. Rtichie, Sabeur Aridhi, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), Université de Lorraine (UL), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), and SARKER, BIshnu
Subjects: [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], 0301 basic medicine, InterPro, Protein family, complex protein domain networks, Computer science, Protein domain, Computational biology, PROSITE, [INFO.INFO-SI]Computer Science [cs]/Social and Information Networks [cs.SI], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], 03 medical and health sciences, Annotation, 0302 clinical medicine, [INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM], label propagation, Biological data, GrAPFI, [INFO.INFO-SI] Computer Science [cs]/Social and Information Networks [cs.SI], Enzyme Commission number, bioinformatics, 030104 developmental biology, protein function annotation, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], UniProt, 030217 neurology & neurosurgery
Abstract: International audience; Huge numbers of protein sequences are now available in public databases. In order to exploit more fully this valuable biological data, these sequences need to be annotated with functional properties such as Enzyme Commission (EC) numbers and Gene Ontology terms. The UniProt Knowledgebase (UniProtKB) is currently the largest and most comprehensive resource for protein sequence and annotation data. In the March 2018 release of UniProtKB, some 556,000 sequences have been manually curated but over 111 million sequences still lack functional annotations. The ability to annotate automatically these unannotated sequences would represent a major advance for the field of bioinformatics. Here, we present a novel network-based approach called GrAPFI for the automatic functional annotation of protein sequences. The underlying assumption of GrAPFI is that proteins may be related to each other by the protein domains, families, and super-families that they share. Several protein domain databases exist such as In-terPro, Pfam, SMART, CDD, Gene3D, and Prosite, for example. Our approach uses Interpro domains, because the InterPro database contains information from several other major protein family and domain databases. Our results show that GrAPFI achieves better EC number annotation performance than several other previously described approaches.
Published: 2018
Full Text: View/download PDF

16. The uncertain cloud: State of the art and research challenges

Author: Haithem Mezni, Allel Hadjali, Sabeur Aridhi, Université de Jendouba (UJ), Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), and ENSMA
Subjects: Computer science, Data_MISCELLANEOUS, Cloud computing, 02 engineering and technology, Theoretical Computer Science, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, 020204 information systems, Taxonomy (general), 0202 electrical engineering, electronic engineering, information engineering, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Uncertain data, business.industry, Applied Mathematics, Uncertainty, Provisioning, Data science, Uncertain cloud services, Uncertainty models, 020201 artificial intelligence & image processing, State (computer science), [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Host (network), Software
Abstract: International audience; During the last decade, cloud computing became a natural choice to host and provide various computing resources as on-demand services. The correct characterization and management of cloud environment objects (clouds, data centers, providers, services, data, users, etc.) is the first step towards effective provisioning and integration of cloud services. However, cloud computing environment is often subject to uncertainty. This could be attributed to the incompleteness and imprecision of cloud available information, as well as the highly changing conditions. The purpose of this survey is to study, criticize and classify the already existing works that deal with uncertainty in the cloud. We present a taxonomy on the uncertainty in the cloud and we study how such concept was tackled by researchers in cloud environments. Finally, we identify the challenges and the requirements to deal with uncertain data in the cloud, as well as the future directions.
Published: 2018
Full Text: View/download PDF

17. Improving memory-based user collaborative filtering with evolutionary multi-objective optimization

Author: Sabeur Aridhi, Wajdi Dhifli, Nour El Islem Karabadji, Hassina Seridi, Samia Beldjoudi, Laboratoire de gestion electronique de documents [Annaba] ( LabGED ), Université Badji Mokhtar - Annaba [Annaba] ( UBMA ), Badji Mokhtar University, Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes ( LIMOS ), Sigma CLERMONT ( Sigma CLERMONT ) -Université Clermont Auvergne ( UCA ) -Centre National de la Recherche Scientifique ( CNRS ), Laboratoire de Gestion Electronique de Document [Annaba] (LabGED), Université Badji Mokhtar Annaba (UBMA), Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS), Laboratoire de gestion electronique de documents [Annaba] (LabGED), Université Badji Mokhtar - Annaba [Annaba] (UBMA), and Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])
Subjects: Computer science, 02 engineering and technology, Recommender system, Space (commercial competition), Machine learning, computer.software_genre, Multi-objective optimization, MovieLens, [ INFO.INFO-LG ] Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Task (project management), [ INFO.INFO-DC ] Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, 020204 information systems, [ INFO.INFO-BI ] Computer Science [cs]/Bioinformatics [q-bio.QM], 0202 electrical engineering, electronic engineering, information engineering, Collaborative filtering, [ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI], ComputingMilieux_MISCELLANEOUS, Focus (computing), [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, General Engineering, Computer Science Applications, [ INFO.INFO-DB ] Computer Science [cs]/Databases [cs.DB], Benchmark (computing), 020201 artificial intelligence & image processing, Artificial intelligence, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, computer
Abstract: The primary task of a memory-based collaborative filtering (CF) recommendation system is to select a group of nearest (similar) user neighbors for an active user. Traditional memory-based CF schemes tend to only focus on improving as much as possible the accuracy by recommending familiar items (i.e., popular items over the group). Yet, this may reduce the number of items that could be recommended and consequently weakens the chances of recommending novel items. To address this problem, it is desirable to consider recommendation coverage when selecting the appropriate group. This could help in simultaneously making both accurate and diverse recommendations. In this paper, we propose to focus mainly on the growing of the large search space of users’ profiles and to use an evolutionary multi-objective optimization-based recommendation system to pull up a group of profiles that maximizes both similarity with the active user and diversity between its members. In such manner, the recommendation system will provide high performances in terms of both accuracy and diversity. The experimental results on the Movielens benchmark and on a real-world insurance dataset show the efficiency of our approach in terms of accuracy and diversity compared to state-of-the-art competitors.
Published: 2018
Full Text: View/download PDF

18. An experimental survey on big data frameworks

Author: Sabeur Aridhi, Wissem Inoubli, Mondher Maddouri, Haithem Mezni, Engelbert Mephu Nguifo, Université Tunis El Manar ( UTM ), Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes ( LIMOS ), Sigma CLERMONT ( Sigma CLERMONT ) -Université Clermont Auvergne ( UCA ) -Centre National de la Recherche Scientifique ( CNRS ), Taibah University, Université de Tunis El Manar (UTM), Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Université de Jendouba (UJ), Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020]), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), and Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS)
Subjects: FOS: Computer and information sciences, Big Data, Computer Networks and Communications, Computer science, Best practice, Big data, 02 engineering and technology, [ INFO.INFO-LG ] Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Samza, [ INFO.INFO-DC ] Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], 020204 information systems, [ INFO.INFO-BI ] Computer Science [cs]/Bioinformatics [q-bio.QM], Storm, 0202 electrical engineering, electronic engineering, information engineering, MapReduce, [ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI], ComputingMilieux_MISCELLANEOUS, Spark, HDFS, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Flink, batch/stream processing, Data science, [ INFO.INFO-DB ] Computer Science [cs]/Databases [cs.DB], Computer Science - Distributed, Parallel, and Cluster Computing, Hardware and Architecture, Hadoop, Graph (abstract data type), 020201 artificial intelligence & image processing, Distributed, Parallel, and Cluster Computing (cs.DC), [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Software
Abstract: Recently, increasingly large amounts of data are generated from a variety of sources.Existing data processing technologies are not suitable to cope with the huge amounts of generated data. Yet, many research works focus on Big Data, a buzzword referring to the processing of massive volumes of (unstructured) data. Recently proposed frameworks for Big Data applications help to store, analyze and process the data. In this paper, we discuss the challenges of Big Data and we survey existing Big Data frameworks. We also present an experimental evaluation and a comparative study of the most popular Big Data frameworks with several representative batch and iterative workloads. This survey is concluded with a presentation of best practices related to the use of studied frameworks in several application domains such as machine learning, graph processing and real-world applications.
Published: 2018
Full Text: View/download PDF

19. On Memory Reuse Between Inputs and Outputs of Dataflow Actors

Author: Karol Desnos, Maxime Pelcat, Jean-Francois Nezan, Slaheddine Aridhi, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Texas Insruments (TI), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Nantes Université (NU)-Université de Rennes 1 (UR1)
Subjects: business.industry, Computer science, Dataflow, Multiprocessing, 02 engineering and technology, Reuse, buffer merging, memory optimization, 020202 computer hardware & architecture, Set (abstract data type), Reduction (complexity), Computer architecture, Hardware and Architecture, 0202 electrical engineering, electronic engineering, information engineering, Memory footprint, [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, 020201 artificial intelligence & image processing, Minification, business, Software, Digital signal processing
Abstract: International audience; This article introduces a new technique to minimize the memory footprints of Digital Signal Processing (DSP) applications specified with Synchronous Dataflow (SDF) graphs and implemented on shared-memory Multiprocessor Systems-on-Chips (MPSoCs). In addition to the SDF specification, which captures data dependencies between coarse-grained tasks called actors, the proposed technique relies on two optional inputs abstracting the internal data dependencies of actors: annotations of the ports of actors, and script-based specifications of merging opportunities between input and output buffers of actors. Experimental results on a set of applications show a reduction of the memory footprint by 48% compared to state-of-the-art minimization techniques.
Published: 2016
Full Text: View/download PDF

20. Density-based data partitioning strategy to approximate large-scale subgraph mining

Author: Laurent d'Orazio, Engelbert Mephu Nguifo, Mondher Maddouri, Sabeur Aridhi, Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS), and Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])
Subjects: [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Theoretical computer science, Graph database, Dense graph, Computer science, Graph partition, 02 engineering and technology, computer.software_genre, Graph, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Task (computing), [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Hardware and Architecture, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Graph (abstract data type), 020201 artificial intelligence & image processing, computer, Software, MathematicsofComputing_DISCRETEMATHEMATICS, Information Systems
Abstract: Recently, graph mining approaches have become very popular, especially in certain domains such as bioinformatics, chemoinformatics and social networks. One of the most challenging tasks is frequent subgraph discovery. This task has been highly motivated by the tremendously increasing size of existing graph databases. Due to this fact, there is an urgent need of efficient and scaling approaches for frequent subgraph discovery. In this paper, we propose a novel approach for large-scale subgraph mining by means of a density-based partitioning technique, using the MapReduce framework. Our partitioning aims to balance computational load on a collection of machines. We experimentally show that our approach decreases significantly the execution time and scales the subgraph discovery process to large graph databases.
Published: 2015
Full Text: View/download PDF

21. An evolutionary scheme for decision tree construction

Author: Nour El Islem Karabadji, Hassina Seridi, Wajdi Dhifli, Sabeur Aridhi, Fouad Bousetouane, Laboratoire de Gestion Electronique de Document [Annaba] (LabGED), Université Badji Mokhtar Annaba (UBMA), University of Nevada [Las Vegas] (WGU Nevada), Institut de biologie systémique et synthétique (ISSB), Université d'Évry-Val-d'Essonne (UEVE)-Centre National de la Recherche Scientifique (CNRS), Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Laboratoire de gestion electronique de documents [Annaba] (LabGED), Université Badji Mokhtar - Annaba [Annaba] (UBMA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)
Subjects: 0301 basic medicine, Scheme (programming language), Information Systems and Management, Computer science, Decision tree, 02 engineering and technology, computer.software_genre, Machine learning, Management Information Systems, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Set (abstract data type), 03 medical and health sciences, Naive Bayes classifier, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Artificial Intelligence, Genetic algorithm, 0202 electrical engineering, electronic engineering, information engineering, computer.programming_language, Training set, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Genetic Algorithms, Decision Trees, Decision problem, Attributes Selection, Support vector machine, Data Reduction, 030104 developmental biology, Benchmark (computing), 020201 artificial intelligence & image processing, Data mining, Artificial intelligence, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, computer, Software
Abstract: International audience; Classification is a central task in machine learning and data mining. Decision tree (DT) is one of the most popular learning models in data mining. The performance of a DT in a complex decision problem depends on the efficiency of its construction. However, obtaining the optimal DT is not a straightforward process. In this paper, we propose a new evolutionary meta-heuristic optimization based approach for identifying the best settings during the construction of a DT. We designed a genetic algorithm coupled with a multi-task objective function to pull out the optimal DT with the best parameters. This objective function is based on three main factors: (1) Precision over the test samples, (2) Trust in the construction and validation of a DT using the smallest possible training set and the largest possible testing set, and (3) Simplicity in terms of the size of the generated candidate DT, and the used set of attributes. We extensively evaluate our approach on 13 benchmark datasets and a fault diagnosis dataset. The results show that it outperforms classical DT construction methods in terms of accuracy and simplicity. They also show that the proposed approach outperforms Ant-Tree-Miner (an evolutionary DT construction approach), Naive Bayes and Support Vector Machine in terms of accuracy and F-measure.
Published: 2017
Full Text: View/download PDF

22. MR-SimLab: Scalable subgraph selection with label similarity for big data

Author: Engelbert Mephu Nguifo, Wajdi Dhifli, Sabeur Aridhi, Institut de biologie systémique et synthétique (ISSB), Université d'Évry-Val-d'Essonne (UEVE)-Centre National de la Recherche Scientifique (CNRS), Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Laboratoire d'Informatique, de Modélisation et d'optimisation des Systèmes (LIMOS), SIGMA Clermont (SIGMA Clermont)-Université d'Auvergne - Clermont-Ferrand I (UdA)-Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Blaise Pascal - Clermont-Ferrand 2 (UBP), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), and Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-Université d'Auvergne - Clermont-Ferrand I (UdA)-SIGMA Clermont (SIGMA Clermont)-Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Centre National de la Recherche Scientifique (CNRS)
Subjects: subgraph mining, label similarity, Theoretical computer science, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], Computer science, business.industry, Big data, Feature selection, 02 engineering and technology, Graph, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Hardware and Architecture, 020204 information systems, Scalability, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, MapReduce, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Software, Information Systems
Abstract: International audience; With the increasing size and complexity of available databases, existing machine learning and data mining algorithms are facing a scalability challenge. In many applications, the number of features describing the data could be extremely high. This hinders or even could make any further exploration infeasible. In fact, many of these features are redundant or simply irrelevant. Hence, feature selection plays a key role in helping to overcome the problem of information overload especially in big data applications. Since many complex datasets could be modeled by graphs of interconnected labeled elements, in this work, we are particularly interested in feature selection for subgraph patterns. In this paper, we propose MR-SimLab, a MapReduce-based approach for subgraph selection from large input subgraph sets. In many applications, it is easy to compute pairwise similarities between labels of the graph nodes. Our approach leverages such rich information to measure an approximate subgraph matching by aggre-gating the elementary label similarities between the matched nodes. Based on the aggregated similarity scores, our approach selects a small subset of informative representative subgraphs. We provide a distributed implementation of our algorithm on top of the MapReduce framework that optimizes the computational efficiency of our approach for big data applications. We experimentally evaluate MR-SimLab on real datasets. The obtained results show that our approach is scalable and that the selected subgraphs are informative.
Published: 2017
Full Text: View/download PDF

23. BLADYG: A Graph Processing Framework for Large Dynamic Graphs

Author: Yannis Velegrakis, Alberto Montresor, Sabeur Aridhi, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), University of Trento [Trento], Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
Subjects: FOS: Computer and information sciences, Power graph analysis, Information Systems and Management, Theoretical computer science, graph partitioning, Computer science, Computation, Vertex connectivity, 02 engineering and technology, Management Information Systems, Spatial network, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, k-Core decomposition, Dynamic graphs, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], akka framework, Distributed graph processing, Graph partitioning, AKKA framework, Information Systems, Computer Science Applications1707 Computer Vision and Pattern Recognition, Graph, Computer Science Applications, Vertex (geometry), Computer Science - Distributed, Parallel, and Cluster Computing, 020201 artificial intelligence & image processing, Distributed, Parallel, and Cluster Computing (cs.DC), k-core decomposition, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], MathematicsofComputing_DISCRETEMATHEMATICS
Abstract: International audience; Recently, distributed processing of large dynamic graphs has become very popular , especially in certain domains such as social network analysis, Web graph analysis and spatial network analysis. In this context, many distributed/parallel graph processing systems have been proposed, such as Pregel, PowerGraph, GraphLab, and Trinity. However, these systems deal only with static graphs and do not consider the issue of processing evolving and dynamic graphs. In this paper, we are considering the issues of scale and dynamism in the case of graph processing systems. We present BLADYG, a graph processing framework that addresses the issue of dynamism in large-scale graphs. We present an implementation of BLADYG on top of akka framework. We experimentally evaluate the performance of the proposed framework by applying it to problems such as distributed k-core decomposition and partitioning of large dynamic graphs. The experimental results show that the performance and scalability of BLADYG are satisfying for large-scale dynamic graphs.
Published: 2017
Full Text: View/download PDF

24. Word-Level Identification of Romanized Tunisian Dialect

Author: Jihene Younes, Emna Souissi, Hadhemi Achour, and Chaima Aridhi
Subjects: Language identification, Computer science, business.industry, 02 engineering and technology, computer.software_genre, Social web, Romanization, Support vector machine, Identification (information), Margin (machine learning), 020204 information systems, ComputingMethodologies_DOCUMENTANDTEXTPROCESSING, 0202 electrical engineering, electronic engineering, information engineering, Latin alphabet, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Natural language processing, Word (computer architecture)
Abstract: In the Arabic-speaking world, textual productions on social networks are often informal and generally characterized by the use of various dialects, which can be transcribed in Latin or Arabic characters. More specifically, electronic writing in Tunisia is characterized in large part by a mixture of Tunisian dialect with other languages and by a margin of individualization giving users the freedom to write without depending on orthographic or grammatical constraints. In this work, we address the problem of the automatic Tunisian dialect identification within the electronic writings that are produced on social networks using the Latin alphabet. We propose to study and experiment two different identification approaches. Our experiments show that the best performance is obtained using a machine learning based approach using Support Vector Machines.
Published: 2017
Full Text: View/download PDF

25. Big Graph Mining: Frameworks and Techniques

Author: Sabeur Aridhi, Engelbert Mephu Nguifo, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Aalto University, Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS), Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020]), and DOREAU, Bastien
Subjects: FOS: Computer and information sciences, [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Information Systems and Management, Computer science, Big data, Big graph, 02 engineering and technology, Feature scaling, Data mining algorithm, Management Information Systems, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Computer Science - Databases, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, [INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB], Computer Science::Databases, ta113, graph processing frameworks, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Computer Science::Information Retrieval, Databases (cs.DB), [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], data mining, Data science, Graph, Computer Science Applications, Important research, Categorization, Computer Science - Distributed, Parallel, and Cluster Computing, Cheminformatics, 020201 artificial intelligence & image processing, Distributed, Parallel, and Cluster Computing (cs.DC), Big graphs, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Information Systems, pattern mining
Abstract: Big graph mining is an important research area and it has attracted considerable attention. It allows to process, analyze, and extract meaningful information from large amounts of graph data. Big graph mining has been highly motivated not only by the tremendously increasing size of graphs but also by its huge number of applications. Such applications include bioinformatics, chemoinformatics and social networks. One of the most challenging tasks in big graph mining is pattern mining in big graphs. This task consists on using data mining algorithms to discover interesting, unexpected and useful patterns in large amounts of graph data. It aims also to provide deeper understanding of graph data. In this context, several graph processing frameworks and scaling data mining/pattern mining techniques have been proposed to deal with very big graphs. This paper gives an overview of existing data mining and graph processing frameworks that deal with very big graphs. Then it presents a survey of current researches in the field of data mining / pattern mining in big graphs and discusses the main research issues related to this field. It also gives a categorization of both distributed data mining and machine learning techniques, graph processing frameworks and large scale pattern mining approaches., Submitted to Big Data Research, Elsevier
Published: 2016
Full Text: View/download PDF

26. Distributed Memory Allocation Technique for Synchronous Dataflow Graphs

Author: Maxime Pelcat, Slaheddine Aridhi, Karol Desnos, Jean-Francois Nezan, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Texas Insruments (TI), IEEE, IEEE Signal Processing Society, IEEE CAS, Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Nantes Université (NU)-Université de Rennes 1 (UR1)
Subjects: Distributed shared memory, Hardware_MEMORYSTRUCTURES, Flat memory model, Computer science, Dataflow, Distributed computing, Cache-only memory architecture, Uniform memory access, Multiprocessing, 02 engineering and technology, Parallel computing, MPSoC, Static memory allocation, 020202 computer hardware & architecture, Non-uniform memory access, Memory bank, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Shared memory, 0202 electrical engineering, electronic engineering, information engineering, [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, 020201 artificial intelligence & image processing, Computing with Memory, Distributed memory, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC]
Abstract: International audience; This paper introduces a new distributed memory allocation technique for applications modeled with Synchronous Dataflow (SDF) graphs. This technique builds on a State-of-the-Art shared memory allocation technique based on a weighted graph, called Memory Exclusion Graph (MEG). A MEG captures the memory reuse opportunities between memory objects that must be allocated before the execution of an SDF graph. The algorithms detailed in this paper enable a single MEG to be split into separate MEGs, each of which is associated with a memory bank accessible only by one core of the architecture. The proposed technique is implemented within a rapid prototyping framework and is evaluated by deploying real computer vision applications on a Multiprocessor System-on-Chip (MPSoC). Results show a systematic performance improvement due to better memory usage, with application speedups ranging from 2% up to 380%.
Published: 2016
Full Text: View/download PDF

27. Distributed k-core decomposition and maintenance in large dynamic graphs

Author: Alberto Montresor, Yannis Velegrakis, Sabeur Aridhi, Martin Brugnara, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), Aalto University, University of Trento [Trento], Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Theoretical computer science, Exploit, Computer Networks and Communications, Computer science, Distributed computing, Distributedk-core decomposition, k-core maintenance, akkaframework, 02 engineering and technology, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], 020204 information systems, Partial k-tree, 0202 electrical engineering, electronic engineering, information engineering, Electrical and Electronic Engineering, Dynamic graphs, ta113, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Efficient algorithm, K-core maintenance, Maintenance strategy, AKKA framework, Distributed K-core decomposition, Computer Science Applications1707 Computer Vision and Pattern Recognition, Graph, Modular decomposition, Analytics, 020201 artificial intelligence & image processing, Dy-namic graphs, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Graph product
Abstract: International audience; Distributed processing of large, dynamic graphs has recentlyreceived considerable attention, especially in domains suchas the analytics of social networks, web graphs and spatialnetworks.k-core decomposition is one of the significant fig-ures of merit that can be analyzed in graphs. Efficient algo-rithms to computek-cores exist already, both in centralizedand decentralized setting. Yet, these algorithms have beendesigned for static graphs, without significant support todeal with the addition or removal of nodes and edges. Typi-cally, this challenge is handled by re-executing the algorithmagain on the updated graph. In this work, we propose dis-tributedk-core decomposition and maintenance algorithmsfor large dynamic graphs. The proposed algorithms exploit,as much as possible, the topology of the graph to compute allthek-cores and maintain them in streaming settings whereedge insertions and removals happen frequently. The keyidea of the maintenance strategy is that whenever the orig-inal graph is updated by the insertion/deletion of one ormore edges, only a limited number of nodes need their core-ness to be re-evaluated. We present an implementation ofthe proposed approach on top of theakkaframework, andexperimentally show the efficiency of our approach in thecase of large dynamic networks.
Published: 2016
Full Text: View/download PDF

28. BLADYG A novel block-centric framework for the analysis of large dynamic graphs

Author: Alberto Montresor, Yannis Velegrakis, and Sabeur Aridhi
Subjects: ta113, 020203 distributed computing, Theoretical computer science, Computer science, Distributed computing, Distributed graph processing, Graph theory, 02 engineering and technology, Modular decomposition, 020204 information systems, Partial k-tree, 0202 electrical engineering, electronic engineering, information engineering, Topological graph theory, AKKA framework, Graph operations, Graph product, MathematicsofComputing_DISCRETEMATHEMATICS, Universal graph, Distance-hereditary graph, Dynamic graphs
Abstract: Recently, distributed processing of large dynamic graphs has become very popular, especially in certain domains such as social network analysis, Web graph analysis and spatial network analysis. In this context, many distributed/parallel graph processing systems have been proposed, such as Pregel, GraphLab, and Trinity. These systems can be divided into two categories: (1) vertex-centric and (2) block-centric approaches. In vertex-centric approaches, each vertex corresponds to a process, and message are exchanged among vertices. In block-centric approaches, the unit of computation is a block, a connected subgraph of the graph, and message exchanges occur among blocks. In this paper, we are considering the issues of scale and dynamism in the case of block-centric approaches. We present BLADYG, a block-centric framework that addresses the issue of dynamism in large-scale graphs. We present an implementation of BLADYG on top of AKKA framework. We experimentally evaluate the performance of the proposed framework.
Published: 2016

29. DynamicDFEP

Author: Sabeur Aridhi, Alessio Guerrieri, Chayma Sakouhi, Alberto Montresor, and Salma Sassi
Subjects: ta113, Block graph, Theoretical computer science, Computer science, 02 engineering and technology, Modular decomposition, Treewidth, Indifference graph, Pathwidth, Chordal graph, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Topological graph theory, 020201 artificial intelligence & image processing, Graph product, MathematicsofComputing_DISCRETEMATHEMATICS
Abstract: Distributed graph processing has become a very popular research topic recently, particularly in domains such as the analysis of social networks, web graphs and spatial networks. In this context, graph partitioning is an important task. Several partitioning algorithms have been proposed, such as DFEP, JABEJA and POWERGRAPH, but they are limited to static graphs only. In fact, they do not consider dynamic graphs in which vertices and edges are added and/or removed. In this paper, we propose a graph partitioning method for large dynamic graphs. We present an implementation of the proposed approach on top of the AKKA framework, and we experimentally show that our approach is efficient in the case of large dynamic graphs.
Published: 2016
Full Text: View/download PDF

30. Memory Analysis and Optimized Allocation of Dataflow Applications on Shared-Memory MPSoCs

Author: Slaheddine Aridhi, Karol Desnos, Maxime Pelcat, Jean-Francois Nezan, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), CIV Texas Instruments, Texas Instruments, Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Nantes Université (NU)-Université de Rennes 1 (UR1)
Subjects: Flat memory model, Computer science, Multiprocessing, Parallel computing, MPSoC, Static memory allocation, Theoretical Computer Science, Non-uniform memory access, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, multiprocessor, system-on-chip, Computing with Memory, Multi-core processor, Hardware_MEMORYSTRUCTURES, business.industry, Cache-only memory architecture, Uniform memory access, [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], stereo vision, synchronous dataflow, Memory management, Shared memory, Hardware and Architecture, Control and Systems Engineering, Modeling and Simulation, Embedded system, [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV], Signal Processing, Memory footprint, memory allocation, Distributed memory, [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Cache coherence, Information Systems
Abstract: International audience; The majority of applications, ranging from the low complexity to very multifaceted entities requiring dedicated hardware accelerators, are very well suited for Multiprocessor Systems-on-Chips (MPSoCs). It is critical to understand the general characteristics of a given embedded application: its behavior and its requirements in terms of MPSoC resources.This paper presents a complete method to study the important aspect of memory characteristic of an application. This method spans the theoretical, architecture-independent memory characterization to the quasi optimal static memory allocation of an application on a real shared-memory MPSoC. The application is modeled as an Synchronous Dataflow (SDF) graph which is used to derive a Memory Exclusion Graph (MEG) essential for the analysis and allocation techniques. Practical considerations, such as cache coherence and memory broad-casting, are extensively treated. Memory footprint optimization is demonstrated using the example of a stereo matching algorithm from the computer vision domain. Experimental results show a reduction of the memory footprint by up to 43% compared to a state-of-the-art minimization technique, a throughput improvement of 33% over dynamic allocation, and the introduction of a tradeoff between multi-core scheduling flexibility and memory footprint.
Published: 2015
Full Text: View/download PDF

31. Buffer Merging Technique for Minimizing Memory Footprints of Synchronous Dataflow Specifications

Author: Maxime Pelcat, Jean-Francois Nezan, Slaheddine Aridhi, Karol Desnos, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Texas Insruments (TI), Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Nantes Université (NU)-Université de Rennes 1 (UR1)
Subjects: Input/output, Hardware_MEMORYSTRUCTURES, business.industry, Computer science, Dataflow, Process (computing), Multiprocessing, Buffer Merging, Memory management, Computer architecture, Memory, Memory footprint, [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, Resource management (computing), business, Digital signal processing
Abstract: International audience; This paper introduces and assesses a new technique to minimize the memory footprints of Digital Signal Processing (DSP) applications specified with Synchronous Dataflow (SDF) graphs and implemented on shared-memory Multiprocessor Systems-on-Chips (MP-SoCs). In addition to the SDF specification, which captures data dependencies between coarse-grained tasks called actors, the proposed technique relies on two optional inputs abstracting the internal data dependencies of actors: annotations of the ports of SDF actors, and script-based specifications of merging opportunities between input and output buffers of actors. An automated optimization process is used to exploit these buffer merging opportunities and to minimize the memory footprints of applications. Experimental results on a computer vision application show a reduction of the memory footprint by 34% compared to state-of-the-art minimization techniques.
Published: 2015
Full Text: View/download PDF

32. A MapReduce-based approach for shortest path problem in large-scale networks

Author: Benjamin Vincent, Sabeur Aridhi, Libo Ren, Philippe Lacomme, Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne-Centre National de la Recherche Scientifique (CNRS)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020]), DOREAU, Bastien, and Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS)
Subjects: Theoretical computer science, [INFO.INFO-RO] Computer Science [cs]/Operations Research [cs.RO], business.industry, Computer science, Distributed computing, Cloud computing, 02 engineering and technology, [INFO.INFO-RO]Computer Science [cs]/Operations Research [cs.RO], Graph, Distance matrix, Artificial Intelligence, Control and Systems Engineering, 020204 information systems, Parallel programming model, Shortest path problem, 0202 electrical engineering, electronic engineering, information engineering, Graph (abstract data type), 020201 artificial intelligence & image processing, Electrical and Electronic Engineering, business, Dijkstra's algorithm, Constrained Shortest Path First
Abstract: The cloud computing allows to use virtually infinite resources, and seems to be a new promising opportunity to solve scientific computing problems. The MapReduce parallel programming model is a new framework favoring the design of algorithms for cloud computing. Such framework favors processing of problems across huge datasets using a large number of heterogeneous computers over the web. In this paper, we are interested in evaluating how the MapReduce framework can create an innovative way for solving operational research problems. We proposed a MapReduce-based approach for the shortest path problem in large-scale real-road networks. Such a problem is the cornerstone of any real-world routing problem including the dial-a-ride problem (DARP), the pickup and delivery problem (PDP) and its dynamic variants. Most of efficient methods dedicated to these routing problems have to use the shortest path algorithms to construct the distance matrix between each pair of nodes and it could be a time-consuming task on a large-scale network due to its size. We focus on the design of an efficient MapReduce-based approach since a classical shortest path algorithm is not suitable to accomplish efficiently such task. Our objective is not to guarantee the optimality but to provide high quality solutions in acceptable computational time. The proposed approach consists in partitioning the original graph into a set of subgraphs, then solving the shortest path on each subgraph in a parallel way to obtain a solution for the original graph. An iterative improvement procedure is introduced to improve the solution. It is benchmarked on a graph modeling French road networks extracted from OpenStreetMap. The results of the experiment show that such approach achieves significant gain of computational time.
Published: 2015

33. Demonstrating a Dataflow-based RTOS for Heterogeneous MPSoC by means of a Stereo Matching Application

Author: Muriel Pressigout, Julien Heulot, Jean-Francois Nezan, Luce Morin, Slaheddine Aridhi, Judicael Menant, Maxime Pelcat, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Institut de Recherche Technologique b-com (IRT b-com), CIV Texas Instruments, Texas Instruments, ANR-11-INSE-0012,COMPA,Conception Orientée Modèle de calcul pour multi-Processeurs Adaptables(2011), Nantes Université (NU)-Université de Rennes 1 (UR1), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), and Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)
Subjects: Digital signal processor, Multi-core processor, business.industry, Computer science, Dataflow, Multiprocessing, 02 engineering and technology, Parallel computing, MPSoC, 030218 nuclear medicine & medical imaging, 03 medical and health sciences, 0302 clinical medicine, Shared memory, Embedded system, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, business, Real-time operating system, Digital signal processing
Abstract: International audience; This demonstration paper presents a multicore Real Time Operating System (RTOS) that schedules a param-eterized dataflow Model of Computation (MoC) onto a multicore Digital Signal Processor (DSP) at runtime. This RTOS called Synchronous Parameterized and Interfaced Dataflow Embedded Runtime (SPIDER) exploits the Parameterized and Interfaced Synchronous Dataflow (PiSDF) MoC and its features at runtime to identify locally static regions and to optimize their execution onto multicore platforms. The RTOS is used to dispatch a stereo matching algorithm tasks with a varying range of disparities. The platform used for this demonstration is a Texas Instruments Keystone II Multiprocessor System-on-Chip (MPSoC) device composed of 8 DSP cores, 4 ARM cores, a shared memory sub-system, Multicore Navigator and multiple dedicated accelerators.
Published: 2014

34. SPIDER: A Synchronous Parameterized and Interfaced Dataflow-Based RTOS for Multicore DSPs

Author: Julien Heulot, Karol Desnos, Jean-Francois Nezan, Maxime Pelcat, Slaheddine Aridhi, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), CIV Texas Instruments, Texas Instruments, ANR-11-INSE-0012,COMPA,Conception Orientée Modèle de calcul pour multi-Processeurs Adaptables(2011), Nantes Université (NU)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Université de Nantes (UN)-Université de Rennes 1 (UR1)
Subjects: Digital signal processor, Multi-core processor, business.industry, Dataflow, Computer science, 05 social sciences, 050301 education, Parameterized complexity, 02 engineering and technology, Parallel computing, 020202 computer hardware & architecture, Scheduling (computing), [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, business, 0503 education, Real-time operating system, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing, Digital signal processing
Abstract: International audience; This paper introduces a novel Real-Time Operating System (RTOS) based on a parameterized dataflow Model of Computation (MoC). This RTOS, called Synchronous Parameterized and Interfaced Dataflow Embedded Runtime (SPiDER), aims at efficiently scheduling Parameterized and Interfaced Synchronous Dataflow (PiSDF) graphs on multicore architectures. It exploits features of PiSDF to locate locally static regions that exhibit predictable application behavior. This paper uses a multicore signal processing benchmark to demonstrate that the SPiDER runtime can exploit more parallelism than a conventional multicore task scheduler. By comparing experimental results of the SPiDER runtime on an 8-core Texas Instruments Keystone I Digital Signal Processor (DSP) with those obtained from the OpenMP framework, latency improvements of up to 26% are demonstrated.
Published: 2014

35. A semi-formal approach for analog circuits behavioral properties verification

Author: Henda Aridhi, Ons Lahiouel, Mohamed H. Zaki, and Sofiène Tahar
Subjects: Analogue electronics, Computer science, Hardware_INTEGRATEDCIRCUITS, Tunnel diode, Electronic engineering, State space, Hardware_PERFORMANCEANDRELIABILITY, Qualitative simulation, Global optimization, Semi-formal, Hardware_LOGICDESIGN
Abstract: We propose an environment for the verification of analog circuits behavioral properties, where the circuit state space bounds are first computed using qualitative simulation. Then, their specified behavioral properties are verified on these bounds. The effectiveness of the method is illustrated with a tunnel diode oscillator.
Published: 2014
Full Text: View/download PDF

36. PREESM: A Dataflow-Based Rapid Prototyping Framework For Simplifying Multicore DSP Programming

Author: Maxime Pelcat, Julien Heulot, Slaheddine Aridhi, Karol Desnos, Clement Guy, Jean-Francois Nezan, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), CIV Texas Instruments, Texas Instruments, Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Nantes Université (NU)-Université de Rennes 1 (UR1)
Subjects: Multi-core processor, Digital signal processor, Computer science, Dataflow, Dataflow programming, Multiprocessing, 02 engineering and technology, 020202 computer hardware & architecture, Parallel processing (DSP implementation), Computer architecture, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Code generation, Algorithm design, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
Abstract: International audience; The high performance Digital Signal Processors (DSP) currently manufactured by Texas Instruments are heterogeneous multiprocessor architectures. Programming these architectures is a complex task often reserved to specialized engineers because the bottlenecks of both the algorithm and the architecture need to be deeply understood in order to obtain a fairly parallel execution. The PREESM framework objective is to simplify the programming of multicore DSP systems by building on dataflow programming methods. The current functionalities of this scalable framework cover memory and time analysis, as well as automatic deadlock-free code generation. Several tutorials are provided with the tool for fast initiation of C programmers to multicore DSP programming. This paper demonstrates PREESM capabilities by comparing simulation and execution performances on a stereo matching algorithm prototyped on the TMS320C6678 8-core DSP device.
Published: 2014
Full Text: View/download PDF

37. PiMM: Parameterized and Interfaced dataflow Meta-Model for MPSoCs runtime reconfiguration

Author: Jean-Francois Nezan, Shuvra S. Bhattacharyya, Karol Desnos, Slaheddine Aridhi, Maxime Pelcat, Institut d'Électronique et des Technologies du numéRique (IETR), Nantes Université (NU)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), University of Maryland [College Park], University of Maryland System, CIV Texas Instruments, Texas Instruments, Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Université de Nantes (UN)-Université de Rennes 1 (UR1)
Subjects: Reconfigurable, [INFO.INFO-AR]Computer Science [cs]/Hardware Architecture [cs.AR], Computer science, Design space exploration, Dataflow, Model of computation, Modeling, Control reconfiguration, 020206 networking & telecommunications, Context (language use), 02 engineering and technology, Parallel computing, MPSoC, Pipeline (software), [INFO.INFO-MO]Computer Science [cs]/Modeling and Simulation, 020202 computer hardware & architecture, Metamodeling, 0202 electrical engineering, electronic engineering, information engineering, Meta-Model, [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], Embedded Systems
Abstract: International audience; —Dataflow models of computation are widely used for the specification, analysis, and optimization of Digital Signal Processing (DSP) applications. In this paper a new meta-model called PiMM is introduced to address the important challenge of managing dynamics in DSP-oriented representations. PiMM extends a dataflow model by introducing an explicit parameter dependency tree and an interface-based hierarchical compositionality mechanism. PiMM favors the design of highly-efficient heterogeneous multicore systems, specifying algorithms with customizable trade-offs among predictability and exploita-tion of both static and adaptive task, data and pipeline paral-lelism. PiMM fosters design space exploration and reconfigurable resource allocation in a flexible dynamic dataflow context.
Published: 2013
Full Text: View/download PDF

38. Applying the Adaptive Hybrid Flow-Shop Scheduling Method to Schedule a 3GPP LTE Physical Layer Algorithm onto Many-Core Digital Signal Processors

Author: Jani Boutellier, Jean-Francois Nezan, Slaheddine Aridhi, Maxime Pelcat, Julien Heulot, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Machine Vision Group (MVG), University of Oulu, Université européenne de Bretagne - European University of Brittany (UEB), CIV Texas Instruments, Texas Instruments, ANR-11-INSE-0012,COMPA,Conception Orientée Modèle de calcul pour multi-Processeurs Adaptables(2011), Nantes Université (NU)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), and Université de Nantes (UN)-Université de Rennes 1 (UR1)
Subjects: Multi-core processor, 021103 operations research, business.industry, Computer science, 0211 other engineering and technologies, 020206 networking & telecommunications, 02 engineering and technology, Flow shop scheduling, Round-robin scheduling, Fair-share scheduling, Scheduling (computing), Fixed-priority pre-emptive scheduling, Two-level scheduling, Embedded system, 0202 electrical engineering, electronic engineering, information engineering, [INFO.INFO-ES]Computer Science [cs]/Embedded Systems, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Digital signal processing
Abstract: International audience; Currently, Multicore Digital Signal Processor (DSP) platforms are commonly used in telecommunications baseband processing. In the next few years, high performance DSPs are likely to combine many more DSP cores for signal processing with some General-Purpose Processor (GPP) cores for application control. As the number of cores increases in new DSP platform designs, scheduling of applications is becoming a complex operation. Meanwhile, the variability of the scheduled applications also tends to increase as applications become more sophisticated. Such variations require runtime adaptivity of application scheduling. This paper extends the previous work on adaptive scheduling by using the Hybrid Flow-Shop (HFS) scheduling method, which enables the device architecture to be modeled as a pipeline of Processing Elements (PEs) with multiple alternate PEs for each pipeline stage. HFS scheduling is applied to the scheduling of 3rd Generation Partnership Project (3GPP) Long Term Evolution (LTE) telecommunication standard Uplink Physical Layer data processing (PUSCH). The experiments, conducted on an ARM Cortex-A9 GPP, show that an HFS scheduling algorithm has an overhead that increases very slowly with the number of PEs. This makes the method suitable for executing the adaptive scheduling in less than 1 ms for the 501 actors of a LTE PUSCH dataflow description executed on a 256-core architecture.
Published: 2013

39. Modeling and Control of Micro-grid Powered by Solar and Wind Energies

Author: Emna Aridhi, Abdelkader Mami, and Sameh Zenned
Subjects: Battery (electricity), Computer science, business.industry, Photovoltaic system, Energy Engineering and Power Technology, Turbine, Automotive engineering, Energy storage, Power (physics), Stand-alone power system, Hybrid system, Electricity, Electrical and Electronic Engineering, business, Simulation
Abstract: The number of installations of Micro-Grid or intelligent micro power networks will increase to quadruple by 2020.The purpose is to reduce the cost and the consumption of electricity in transmission and distribution networks, using a hybrid system powered by solar and wind sources, as well as integrating storage devices. This paper reviews and discusses the Micro-Grid Model. It describes various Micro-Grid components and different configurations. It also presents the model of two generation units (Photovoltaic and Wind Turbine). Then, a comparative study of different battery types used for large-scale electricity storage is carried out, followed by a review of control strategies. Full Text: PDF DOI: http://dx.doi.org/10.11591/ijpeds.v8.i1.pp402-416
Published: 2017
Full Text: View/download PDF

40. Feature extraction in protein sequences classification: a new stability measure

Author: Mondher Maddouri, Engelbert Mephu Nguifo, Rabie Saidi, Sabeur Aridhi, European Bioinformatics Institute [Hinxton] (EMBL-EBI), EMBL Heidelberg, Computational Algorithms for Protein Structures and Interactions (CAPSID), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes (LIMOS), Ecole Nationale Supérieure des Mines de St Etienne-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS), Taibah University, Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), and Ecole Nationale Supérieure des Mines de St Etienne (ENSM ST-ETIENNE)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020])-Centre National de la Recherche Scientifique (CNRS)
Subjects: FOS: Computer and information sciences, Computer science, Feature extraction, 02 engineering and technology, Quantitative Biology - Quantitative Methods, Machine Learning (cs.LG), [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Computational Engineering, Finance, and Science (cs.CE), 03 medical and health sciences, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], 020204 information systems, Subsequence, 0202 electrical engineering, electronic engineering, information engineering, Computer Science - Computational Engineering, Finance, and Science, Quantitative Methods (q-bio.QM), 030304 developmental biology, 0303 health sciences, [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB], business.industry, Pattern recognition, Computer Science - Learning, ComputingMethodologies_PATTERNRECOGNITION, FOS: Biological sciences, Artificial intelligence, [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business
Abstract: Feature extraction is an unavoidable task, especially in the critical step of preprocessing biological sequences. This step consists for example in transforming the biological sequences into vectors of motifs where each motif is a subsequence that can be seen as a property (or attribute) characterizing the sequence. Hence, we obtain an object-property table where objects are sequences and properties are motifs extracted from sequences. This output can be used to apply standard machine learning tools to perform data mining tasks such as classification. Several previous works have described feature extraction methods for bio-sequence classification, but none of them discussed the robustness of these methods when perturbing the input data. In this work, we introduce the notion of stability of the generated motifs in order to study the robustness of motif extraction methods. We express this robustness in terms of the ability of the method to reveal any change occurring in the input data and also its ability to target the interesting motifs. We use these criteria to evaluate and experimentally compare four existing extraction methods for biological sequences., The paper has been accepted by the ACM Conference on Bioinformatics, Computational Biology and Biomedicine (ACM BCB) 2012. We want to cancel the submission because of the double entries of the paper in DBLP. Thank you for your understanding
Published: 2012
Full Text: View/download PDF

41. Dataflow Model of Computation

Author: Slaheddine Aridhi, Jean-Francois Nezan, Maxime Pelcat, and Jonathan Piat
Subjects: Signal programming, Hardware_GENERAL, Computer science, Dataflow, Model of computation, Computation, Parallelism (grammar), Physical layer, Parallel computing, Precedence graph, Dataflow architecture
Abstract: To study the LTE physical layer on multi-core architectures, a Model of Computation (MoC) is needed to specify the LTE algorithms. This MoC must have the necessary expressivity, must show the algorithm parallelism and must be capable of locating system bottlenecks.
Published: 2012
Full Text: View/download PDF

42. Generating Code from LTE Models

Author: Jean-Francois Nezan, Maxime Pelcat, Slaheddine Aridhi, and Jonathan Piat
Subjects: Computer science, Code generation, Parallel computing, Scheduling (computing)
Abstract: Literature on automatic multi-core code generation was reviewed in Sect. 4.5 and scheduling strategies in Sect. 4.4.1. In this section, generated code execution schemes are defined, detailing how code is generated from a given scheduling strategy.
Published: 2012
Full Text: View/download PDF

43. Rapid Prototyping and Programming Multi-Core Architectures

Author: Slaheddine Aridhi, Jonathan Piat, Jean-Francois Nezan, and Maxime Pelcat
Subjects: Rapid prototyping, Multi-core processor, Digital signal processor, Parallelism (rhetoric), Computer architecture, Computer science, Software deployment, Very long instruction word, business.industry, Transaction-level modeling, business, Digital signal processing
Abstract: This chapter gives an over view of the existing work on rapid prototyping and multi-core deployment in the signal processing world. The concept of rapid prototyping was introduced in Fig. 1.2 when outlining the structure of this document. It consists of automatically generating a system simulation or a system prototype from quickly constructed models. Rapid prototyping may be used for several purposes; this study uses it to manage the parallelism of DSP architectures. Parallelism must be handled differently for the macroscopic or microscopic views of a system.
Published: 2012
Full Text: View/download PDF

44. Physical Layer Multi-Core Prototyping: A Dataflow-Based Approach for LTE eNodeB

Author: Maxime Pelcat, Jean-Franois Nezan, Slaheddine Aridhi, Jonathan Piat, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Université européenne de Bretagne - European University of Brittany (UEB), CIV Texas Instruments, Texas Instruments, Institut d'Electronique et de Télécommunications de Rennes (IETR), Centre National de la Recherche Scientifique (CNRS)-Ecole Supérieure d'Electricité - SUPELEC (FRANCE)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES), Nantes Université (NU)-Université de Rennes 1 (UR1), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), and Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Ecole Supérieure d'Electricité - SUPELEC (FRANCE)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Multi-core processor, Dataflow, Computer science, Distributed computing, Physical layer, 020206 networking & telecommunications, 02 engineering and technology, Load balancing (computing), Porting, 020202 computer hardware & architecture, Scheduling (computing), EnodeB, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Systems design, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
Abstract: International audience; Base stations developed according to the 3GPP Long Term Evolution (LTE) standard require unprecedented processing power. 3GPP LTE enables data rates beyond hundreds of Mbits/s by using advanced technologies, necessitating a highly complex LTE physical layer. The operating power of base stations is a significant cost for operators, and is currently optimized using state-of-the-art hardware solutions, such as heterogeneous distributed systems. The traditional system design method of porting algorithms to heterogeneous distributed systems based on test-and-refine methods is a manual, thus time-expensive, task. Physical Layer Multi-Core Prototyping: A Dataflow-Based Approach provides a clear introduction to the 3GPP LTE physical layer and to dataflow-based prototyping and programming. The difficulties in the process of 3GPP LTE physical layer porting are outlined, with particular focus on automatic partitioning and scheduling, load balancing and computation latency reduction, specifically in systems based on heterogeneous multi-core Digital Signal Processors. Multi-core prototyping methods based on algorithm dataflow modeling and architecture system-level modeling are assessed with the goal of automating and optimizing algorithm porting. With its analysis of physical layer processing and proposals of parallel programming methods, which include automatic partitioning and scheduling, Physical Layer Multi-Core Prototyping: A Dataflow-Based Approach is a key resource for researchers and students. This study of LTE algorithms which require dynamic or static assignment and dynamic or static scheduling, allows readers to reassess and expand their knowledge of this vital component of LTE base station design.
Published: 2012
Full Text: View/download PDF

45. Enhanced Rapid Prototyping

Author: Jean-Francois Nezan, Slaheddine Aridhi, Maxime Pelcat, and Jonathan Piat
Subjects: Rapid prototyping, Job shop scheduling, business.industry, Computer science, Embedded system, Process (computing), business, Directed acyclic graph, Critical path method, Digital signal processing
Abstract: In Chap. 4, an overview of the multi-core scheduling problem and solutions presented in the literature were summarized. A flexible rapid prototyping process has an important role to play in all the design steps of a multi-core DSP system.
Published: 2012
Full Text: View/download PDF

46. A System-Level Architecture Model

Author: Maxime Pelcat, Jonathan Piat, Slaheddine Aridhi, and Jean-Francois Nezan
Subjects: Enterprise architecture framework, Architecture framework, Multilayered architecture, Computer science, Distributed computing, Applications architecture, Systems architecture, Data architecture, Reference architecture, Software architecture description
Abstract: For the LTE physical layer to be properly prototyped, the target hardware architectures need to be specified at system-level, using a simple model focusing on architectural limitations. The System-Level Architecture Model (S-LAM), which enables such specifications. Sections 5.2.4 and 5.3 explain how to compute routes between operators from an S-LAM specification and Sect. 5.4 shows how transfers on these routes are simulated. Finally, the role of the S-LAM model in the rapid prototyping process.
Published: 2012
Full Text: View/download PDF

47. Memory bounds for the distributed execution of a hierarchical Synchronous Data-Flow graph

Author: Slaheddine Aridhi, Maxime Pelcat, Jean-Francois Nezan, Karol Desnos, Institut d'Électronique et des Technologies du numéRique (IETR), Université de Nantes (UN)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), Université européenne de Bretagne - European University of Brittany (UEB), CIV Texas Instruments, Texas Instruments, Nantes Université (NU)-Université de Rennes 1 (UR1), Université de Nantes (UN)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), and Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)
Subjects: 010302 applied physics, Wait-for graph, Theoretical computer science, Computer science, Multiprocessing, Graph theory, 02 engineering and technology, Parallel computing, MPSoC, [INFO.INFO-MO]Computer Science [cs]/Modeling and Simulation, 01 natural sciences, Synchronous Data Flow, 020202 computer hardware & architecture, Scheduling (computing), IEEE, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Shared memory, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Systems architecture, [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], Memory Bounds, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
Abstract: International audience; This paper presents an application analysis technique to define the boundary of shared memory requirements of Multiprocessor System-on-Chip (MPSoC) in early stages of development. This technique is part of a rapid prototyping process and is based on the analysis of a hierarchical Synchronous Data-Flow (SDF) graph description of the system application. The analysis does not require any knowledge of the system architecture, the mapping or the scheduling of the system application tasks. The initial step of the method consists of applying a set of transformations to the SDF graph so as to reveal its memory characteristics. These transformations produce a weighted graph that represents the different memory objects of the application as well as the memory allocation constraints due to their relationships. The memory boundaries are then derived from this weighted graph using analogous graph theory problems, in particular the Maximum-Weight Clique (MWC) problem. Stateof-the-art algorithms to solve these problems are presented and a heuristic approach is proposed to provide a near-optimal solution of the MWC problem. A performance evaluation of the heuristic approach is presented, and is based on hierarchical SDF graphs of realistic applications. This evaluation shows the efficiency of proposed heuristic approach in finding near optimal solutions.
Published: 2012
Full Text: View/download PDF

48. FPGA implementation of predictive control

Author: Abdelkader Mami, Emna Aridhi, Mehdi Abbes, LACS, Ecole Nationale d'Ingénieurs de Tunis (ENIT), and Université de Tunis El Manar (UTM)-Université de Tunis El Manar (UTM)
Subjects: 0209 industrial biotechnology, Computer science, business.industry, Hardware description language, 02 engineering and technology, Parallel computing, Application software, computer.software_genre, 020202 computer hardware & architecture, Model predictive control, [SPI]Engineering Sciences [physics], 020901 industrial engineering & automation, Software, Computer engineering, Control system, VHDL, 0202 electrical engineering, electronic engineering, information engineering, business, MATLAB, Field-programmable gate array, computer, Hardware_REGISTER-TRANSFER-LEVELIMPLEMENTATION, Hardware_LOGICDESIGN, computer.programming_language
Abstract: International audience; In this paper, we propose an implementation of a synthesizable VHDL program of generalized predictive control (GPC) without constraints on a map XC3S700A Xilinx Starter Kit using the Xilinx ISE 10.1 software. The control strategy was applied to a second order state system. The VHDL language was used as a programming tool. Real variables were described with the fixed-point representation to overcome the overflow problems during the computations in the VHDL program. The use of FPGA circuits presents a good choice regarding to the problem of computation time encountered in predictive algorithms. A GPC Matlab program was also implemented in order to make a performance comparison. The simulation results show a good set-point tracking.
Published: 2012
Full Text: View/download PDF

49. Building a RTOS for MPSoC Dataflow Programming

Author: Yaset Oliva, Jean-Francois Nezan, Maxime Pelcat, Slaheddine Aridhi, Jean-Christophe Prévotet, Institut d'Electronique et de Télécommunications de Rennes (IETR), Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Ecole Supérieure d'Electricité - SUPELEC (FRANCE)-Centre National de la Recherche Scientifique (CNRS), CIV Texas Instruments, Texas Instruments, Centre National de la Recherche Scientifique (CNRS)-Ecole Supérieure d'Electricité - SUPELEC (FRANCE)-Institut National des Sciences Appliquées - Rennes (INSA Rennes), Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Institut National des Sciences Appliquées (INSA)-Université de Rennes (UNIV-RENNES)-Université de Rennes 1 (UR1), and Université de Rennes (UNIV-RENNES)
Subjects: business.industry, Dataflow, Computer science, Dataflow programming, 020206 networking & telecommunications, Multiprocessing, 02 engineering and technology, ComputerSystemsOrganization_PROCESSORARCHITECTURES, Load balancing (computing), MPSoC, 020202 computer hardware & architecture, Computer architecture, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Symmetric multiprocessing, Embedded system, 0202 electrical engineering, electronic engineering, information engineering, System on a chip, ComputerSystemsOrganization_SPECIAL-PURPOSEANDAPPLICATION-BASEDSYSTEMS, [INFO.INFO-OS]Computer Science [cs]/Operating Systems [cs.OS], [INFO.INFO-DC]Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], business, Real-time operating system, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
Abstract: International audience; Multiprocessor Systems-on-Chip (MPSoC) are becoming the standard high performance Digital Signal Processing (DSP) systems. Hardware complexity abstraction is needed to enable efficient MPSoC programming. A major challenge of MPSoC programming is efficiently handling the combination of new features necessary in a MPSoC operating system: load balancing and efficient use of the parallel resources, with the more traditional features of Real-Time Operating Systems (RTOS): resource sharing between applications, task priorities and reactivity to events. This paper presents a method to combine dataflow methods and RTOS features. The resulting system prototypes an RTOS for symmetric multiprocessing MPSoCs whose inputs are dataflow graphs of applications. The prototype is built on the uC/OS-II RTOS. Experimental results are given on a 3GPP Long Term Evolution algorithm executed on a 4-core MPSoC.
Published: 2011

50. Performance analysis of type-I and type-II hybrid ARQ protocols using concatenated codes in a DS-CDMA Rayleigh fading channel

Author: Charles Despins and S. Aridhi
Subjects: Go-Back-N ARQ, business.industry, Computer science, Code division multiple access, Automatic repeat request, Retransmission, Concatenated error correction code, ComputerSystemsOrganization_COMPUTER-COMMUNICATIONNETWORKS, Concatenation, Hybrid automatic repeat request, Throughput, Data_CODINGANDINFORMATIONTHEORY, Selective Repeat ARQ, Convolutional code, Sliding window protocol, Electronic engineering, Fading, business, Error detection and correction, Algorithm, Computer network, Rayleigh fading, Power control, Data transmission, Communication channel
Abstract: To achieve reliable data transmission in a power-controlled, direct-sequence code division multiple access cellular system, we investigate two error-control schemes based on the concatenation of Reed-Solomon and convolutional codes. The first scheme is a type-I hybrid selective repeat ARQ protocol (CC/HARQ) where the error detection capability of the RS outer code is used to trigger retransmission requests. The second scheme is a type-II hybrid selective repeat ARQ protocol where the error correction capability is adapted to the varying channel conditions, The performance criteria are the protocol error probability, the throughput and delay efficiencies over a Rayleigh fading DS-CDMA channel with a Gaussian approximation for the multiple access interference. While type-I CC/HARQ yields superior throughput when the number of active users is small, the results show that type-II CC/HARQ provides reliable data transfer to the largest number of users, even in the presence of imperfect power control.
Published: 2002
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

50 results on '"Aridhi A"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources