38 results on '"Piotr Kopta"'
Search Results
2. Introducing VECMAtk - Verification, Validation and Uncertainty Quantification for Multiscale and HPC Simulations.
- Author
-
Derek Groen, Robin A. Richardson, David W. Wright 0001, Vytautas Jancauskas, Robert Sinclair, Paul Karlshoefer, Maxime Vassaux, Hamid Arabnejad, Tomasz Piontek, Piotr Kopta, Bartosz Bosak, Jalal Lakhlili, Olivier Hoenen, Diana Suleimenova, Wouter Edeling, Daan Crommelin, Anna Nikishova, and Peter V. Coveney
- Published
- 2019
- Full Text
- View/download PDF
3. Patterns for High Performance Multiscale Computing.
- Author
-
Saad Alowayyed, Tomasz Piontek, James L. Suter, Olivier Hoenen, Derek Groen, Onnie Luk, Bartosz Bosak, Piotr Kopta, Krzysztof Kurowski, Oliver Perks, Keeran Brabazon, Vytautas Jancauskas, David Coster, Peter V. Coveney, and Alfons G. Hoekstra
- Published
- 2019
- Full Text
- View/download PDF
4. VECMAtk: A Scalable Verification, Validation and Uncertainty Quantification Toolkit for Scientific Simulations.
- Author
-
Derek Groen, Hamid Arabnejad, Vytautas Jancauskas, Wouter Edeling, Fredrik Jansson, Robin A. Richardson, Jalal Lakhlili, Linda van Veen, Bartosz Bosak, Piotr Kopta, D. W. Wright, Nicolas Monnier, Paul Karlshoefer, Diana Suleimenova, Robert Sinclair, Maxime Vassaux, Anna Nikishova, Mariusz Bieniek, Onnie O. Luk, Michal Kulczewski, Erwan Raffin, Daan Crommelin, Olivier Hoenen, David P. Coster, Tomasz Piontek, and Peter V. Coveney
- Published
- 2020
5. Scaling the GCR Solver Using a High-Level Stencil Framework on Multi- and Many-Core Architectures.
- Author
-
Milosz Ciznicki, Michal Kulczewski, Piotr Kopta, and Krzysztof Kurowski
- Published
- 2015
- Full Text
- View/download PDF
6. MultiGrain/MAPPER: A distributed multiscale computing approach to modeling and simulating gene regulation networks.
- Author
-
Alexandru E. Mizeranschi, Martin T. Swain, Raluca Scona, Quentin Fazilleau, Bartosz Bosak, Tomasz Piontek, Piotr Kopta, Paul Thompson, and Werner Dubitzky
- Published
- 2016
- Full Text
- View/download PDF
7. Development of Science Gateways Using QCG - Lessons Learned from the Deployment on Large Scale Distributed and HPC Infrastructures.
- Author
-
Tomasz Piontek, Bartosz Bosak, Milosz Ciznicki, Piotr Grabowski, Piotr Kopta, Michal Kulczewski, Dawid Szejnfeld, and Krzysztof Kurowski
- Published
- 2016
- Full Text
- View/download PDF
8. Adaptation of fluid model EULAG to graphics processing unit architecture.
- Author
-
Krzysztof Andrzej Rojek, Milosz Ciznicki, Bogdan Rosa, Piotr Kopta, Michal Kulczewski, Krzysztof Kurowski, Zbigniew Pawel Piotrowski, Lukasz Szustak, Damian Karol Wójcik, and Roman Wyrzykowski
- Published
- 2015
- Full Text
- View/download PDF
9. Methods to Load Balance a GCR Pressure Solver Using a Stencil Framework on Multi- and Many-Core Architectures.
- Author
-
Milosz Ciznicki, Michal Kulczewski, Piotr Kopta, and Krzysztof Kurowski
- Published
- 2015
- Full Text
- View/download PDF
10. Elliptic Solver Performance Evaluation on Modern Hardware Architectures.
- Author
-
Milosz Ciznicki, Piotr Kopta, Michal Kulczewski, Krzysztof Kurowski, and Pawel Gepner
- Published
- 2013
- Full Text
- View/download PDF
11. Two-Dimensional Discrete Wavelet Transform on Large Images for Hybrid Computing Architectures: GPU and CELL.
- Author
-
Marek Blazewicz, Milosz Ciznicki, Piotr Kopta, Krzysztof Kurowski, and Pawel Lichocki
- Published
- 2011
- Full Text
- View/download PDF
12. Vine Toolkit - Grid-Enabled Portal Solution for Community Driven Computing Workflows with Meta-scheduling Capabilities.
- Author
-
Dawid Szejnfeld, Piotr Domagalski, Piotr Dziubecki, Piotr Kopta, Michal Krysinski, Tomasz Kuczynski, Krzysztof Kurowski, Bogdan Ludwiczak, Jaroslaw Nabrzyski, Tomasz Piontek, Dominik Tarnawczyk, Krzysztof Witkowski, and Malgorzata Wolniewicz
- Published
- 2009
- Full Text
- View/download PDF
13. Benchmarking JPEG 2000 implementations on modern CPU and GPU architectures.
- Author
-
Milosz Ciznicki, Michal Kierzynka, Piotr Kopta, Krzysztof Kurowski, and Pawel Gepner
- Published
- 2014
- Full Text
- View/download PDF
14. Grid Access and User Interface in CLUSTERIX Project.
- Author
-
Piotr Kopta, Tomasz Kuczynski, and Roman Wyrzykowski
- Published
- 2005
- Full Text
- View/download PDF
15. Benchmarking Data and Compute Intensive Applications on Modern CPU and GPU Architectures.
- Author
-
Milosz Ciznicki, Michal Kierzynka, Piotr Kopta, Krzysztof Kurowski, and Pawel Gepner
- Published
- 2012
- Full Text
- View/download PDF
16. Parallel application benchmarks and performance evaluation of the Intel Xeon 7500 family processors.
- Author
-
Piotr Kopta, Michal Kulczewski, Krzysztof Kurowski, Tomasz Piontek, Pawel Gepner, Mariusz Puchalski, and Jacek Komasa
- Published
- 2011
- Full Text
- View/download PDF
17. VECMAtk: a scalable verification, validation and uncertainty quantification toolkit for scientific simulations
- Author
-
Linda van Veen, O. O. Luk, Peter V. Coveney, Robert C. Sinclair, Jalal Lakhlili, Maxime Vassaux, Tomasz Piontek, D. P. Coster, Paul Karlshoefer, David W. Wright, N. Monnier, Diana Suleimenova, Erwan Raffin, Piotr Kopta, Michal Kulczewski, Daan Crommelin, Robin A. Richardson, Fredrik Jansson, Mariusz Bieniek, Derek Groen, Hamid Arabnejad, Anna Nikishova, Vytautas Jancauskas, O. Hoenen, Bartosz Bosak, Wouter Edeling, Brunel University London [Uxbridge], Centrum Wiskunde & Informatica, Amsterdam (CWI), Department of Chemistry [UCL, London], University College of London [London] (UCL), Computational Science Lab (IVI, FNWI), Analysis (KDV, FNWI), KdV Other Research (FNWI), Faculty of Science, and Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands
- Subjects
FOS: Computer and information sciences ,010504 meteorology & atmospheric sciences ,uncertainty quantification ,General Mathematics ,General Physics and Astronomy ,[INFO.INFO-CG]Computer Science [cs]/Computational Geometry [cs.CG] ,01 natural sciences ,010305 fluids & plasmas ,[SPI.MAT]Engineering Sciences [physics]/Materials ,[PHYS.MECA.MEMA]Physics [physics]/Mechanics [physics]/Mechanics of materials [physics.class-ph] ,multiscale simulations ,Software ,[PHYS.MECA.STRU]Physics [physics]/Mechanics [physics]/Structural mechanics [physics.class-ph] ,0103 physical sciences ,Validation ,[PHYS.MECA.SOLID]Physics [physics]/Mechanics [physics]/Solid mechanics [physics.class-ph] ,[SPI.GCIV.RISQ]Engineering Sciences [physics]/Civil Engineering/Risques ,[MATH.MATH-AP]Mathematics [math]/Analysis of PDEs [math.AP] ,Sensitivity (control systems) ,Uncertainty quantification ,Reliability (statistics) ,Research Articles ,0105 earth and related environmental sciences ,validation ,business.industry ,[SPI.GCIV.GEOTECH]Engineering Sciences [physics]/Civil Engineering/Géotechnique ,General Engineering ,Verification ,Articles ,[PHYS.MECA.MSMECA]Physics [physics]/Mechanics [physics]/Materials and structures in mechanics [physics.class-ph] ,[INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA] ,cs.MS ,Range (mathematics) ,Scalability ,Key (cryptography) ,Computer Science - Mathematical Software ,[SPI.GCIV.STRUCT]Engineering Sciences [physics]/Civil Engineering/Structures ,Multiscale simulations ,[SPI.GCIV.MAT]Engineering Sciences [physics]/Civil Engineering/Matériaux composites et construction ,verification ,Software engineering ,business ,Focus (optics) ,Mathematical Software (cs.MS) ,[SPI.GCIV.GCN]Engineering Sciences [physics]/Civil Engineering/Génie civil nucléaire - Abstract
We present the VECMA toolkit (VECMAtk), a flexible software environment for single and multiscale simulations that introduces directly applicable and reusable procedures for verification, validation (V&V), sensitivity analysis (SA) and uncertainty quantification (UQ). It enables users to verify key aspects of their applications, systematically compare and validate the simulation outputs against observational or benchmark data, and run simulations conveniently on any platform from the desktop to current multi-petascale computers. In this sequel to our paper on VECMAtk which we presented last year, we focus on a range of functional and performance improvements that we have introduced, cover newly introduced components, and applications examples from seven different domains such as conflict modelling and environmental sciences. We also present several implemented patterns for UQ/SA and V&V, and guide the reader through one example concerning COVID-19 modelling in detail., 23 pages, 10 figures
- Published
- 2021
18. Verification, Validation and Uncertainty Quantification of Large-Scale Applications with QCG-PilotJob
- Author
-
Erwan Raffin, Jalal Lakhlili, Bartosz Bosak, Tomasz Piontek, Paul Karlshoefer, and Piotr Kopta
- Subjects
Focus (computing) ,business.industry ,Computer science ,media_common.quotation_subject ,Distributed computing ,Scale (chemistry) ,Interdependence ,Software ,Workflow ,Computer cluster ,Scalability ,Uncertainty quantification ,business ,media_common - Abstract
Efficient execution of large-scale and extremely demanding computational scenarios is a challenge for both the infrastructure providers and end-users, usually scientists, that need to develop highly scalable computational codes. Nevertheless, at this time, on the eve of exa-scale supercomputers, the particular role has to be given also to the intermediate software that can help in the preparation of applications so they can be efficiently executed on the emerging HPC systems. The efficiency and scalability of such software can be seen as priorities, however, these are not the only elements that should be addressed. Equally important is to offer software that is elastic, portable between platforms of different sizes, and easy to use. Trying to fulfill all the above needs we present QCG-PilotJob, a tool designed to enable flexible execution of numerous potentially dynamic and interdependent computing tasks in a single allocation on a computing cluster. QCG-PilotJob is built on many years of collaboration with computational scientists representing various domains and it responses to the practical requirements of real scientific use-cases. In this paper, we focus on the recent integration of QCG-PilotJob with the EasyVVUQ library and its successful use for Uncertainty Quantification workflows of several complex multiscale applications being developed within the VECMA project. However, we believe that with a well-thought-out design that allows for fully user-space execution and straightforward installation, QCG-PilotJob may be easily exploited in many other application scenarios, even by inexperienced users.
- Published
- 2021
19. Vine Toolkit - Towards portal based production solutions for scientific and engineering communities with grid-enabled resources support.
- Author
-
Dawid Szejnfeld, Piotr Dziubecki, Piotr Kopta, Michal Krysinski, Tomasz Kuczynski, Krzysztof Kurowski, Bogdan Ludwiczak, Tomasz Piontek, Dominik Tarnawczyk, Malgorzata Wolniewicz, Piotr Domagalski, Jaroslaw Nabrzyski, and Krzysztof Witkowski
- Published
- 2010
20. Building Confidence in Simulation: Applications of EasyVVUQ
- Author
-
Derek Groen, Hamid Arabnejad, Peter V. Coveney, Irina Chirca, Piotr Kopta, Michal Kulczewski, Robin A. Richardson, Daan Crommelin, Tomasz Piontek, Diana Suleimenova, Jan Węglarz, Bartosz Bosak, Jalal Lakhlili, O. O. Luk, David W. Wright, Wouter Edeling, O. Hoenen, Robert C. Sinclair, Vytautas Jancauskas, Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands, Analysis (KDV, FNWI), and Computational Science Lab (IVI, FNWI)
- Subjects
Statistics and Probability ,Numerical Analysis ,Multidisciplinary ,010504 meteorology & atmospheric sciences ,Computer science ,uncertainty quantification ,high‐performance computing ,high-performance computing ,Python (programming language) ,Supercomputer ,01 natural sciences ,Multiscale modeling ,010305 fluids & plasmas ,Workflow ,multiscale simulations ,Application areas ,Modeling and Simulation ,0103 physical sciences ,Systems engineering ,Uncertainty quantification ,computer ,0105 earth and related environmental sciences ,computer.programming_language - Abstract
© 2020 The Authors. Validation, verification, and uncertainty quantification (VVUQ) of simulation workflows are essential for building trust in simulation results, and their increased use in decision‐making processes. The EasyVVUQ Python library is designed to facilitate implementation of advanced VVUQ techniques in new or existing workflows, with a particular focus on high‐performance computing, middleware agnosticism, and multiscale modeling. Here, the application of EasyVVUQ to five very diverse application areas is demonstrated: materials properties, ocean circulation modeling, fusion reactors, forced human migration, and urban air quality prediction. Horizon 2020 Framework Programme. Grant Number: 800925; Engineering and Physical Sciences Research Council
- Published
- 2020
21. Application of the extreme scaling computing pattern on multiscale fusion plasma modelling
- Author
-
Alberto Bottino, Oliver Perks, Bruce D. Scott, D. P. Coster, O. O. Luk, O. Hoenen, Tomasz Piontek, Piotr Kopta, Bartosz Bosak, and Keeran Brabazon
- Subjects
Profiling (computer programming) ,Fusion ,Computer science ,Turbulence ,General Mathematics ,General Engineering ,Fusion plasma ,General Physics and Astronomy ,Articles ,Supercomputer ,01 natural sciences ,7. Clean energy ,010305 fluids & plasmas ,Computational science ,Workflow ,0103 physical sciences ,010306 general physics ,Scaling - Abstract
The extreme scaling pattern of the ComPat project is applied to a multi-scale workflow relevant to the magnetically confined fusion problem. This workflow combines transport, turbulence and equilibrium codes (together with additional auxiliaries such as initial conditions and numerical module), which aims at calculating the behaviour of a fusion plasma on long (transport) time scales based on information from much faster (turbulence) time scales. Initial findings of profile measurements are reported in this paper and indicate that, depending on the chosen performance metric for defining ‘cost’, such as time to completion, efficiency and total energy consumption of the mutliscale workflow, different choices on the number of cores would be made when determining the optimal execution configuration. A variant of the workflow which increases the inherent parallelism is presented, and shown to produce equivalent results at (typically) lower cost compared with the original workflow. This article is part of the theme issue ‘Multiscale modelling, simulation and computing: from the desktop to the exascale’.
- Published
- 2019
22. Patterns for High Performance Multiscale Computing
- Author
-
Derek Groen, Vytautas Jancauskas, Saad Alowayyed, Peter V. Coveney, Piotr Kopta, Bartosz Bosak, Oliver Perks, O. O. Luk, Tomasz Piontek, O. Hoenen, James L. Suter, Krzysztof Kurowski, Alfons G. Hoekstra, Keeran Brabazon, D. P. Coster, and Computational Science Lab (IVI, FNWI)
- Subjects
Modelling methodology ,Computer Networks and Communications ,Computer science ,business.industry ,Distributed computing ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020206 networking & telecommunications ,02 engineering and technology ,Load balancing (computing) ,Supercomputer ,Model coupling ,Multiscale computing ,Software ,Hardware and Architecture ,Middleware ,0202 electrical engineering, electronic engineering, information engineering ,Graph (abstract data type) ,Leverage (statistics) ,020201 artificial intelligence & image processing ,High performance computing ,business - Abstract
We describe our Multiscale Computing Patterns software for High Performance Multiscale Computing. Following a short review of Multiscale Computing Patterns, this paper introduces the Multiscale Computing Patterns Software, which consists of description, optimisation and execution components. First, the description component translates the task graph, representing a multiscale simulation, to a particular type of multiscale computing pattern. Second, the optimisation component selects and applies algorithms to find the most suitable mapping between submodels and available HPC resources. Third, the execution component which a middleware layer maps submodels to the number and type of physical resources based on the suggestions emanating from the optimisation part together with infrastructure-specific metrics such as queueing time and resource availability. The main purpose of the Multiscale Computing Patterns software is to leverage the Multiscale Computing Patterns to simplify and automate the execution of complex multiscale simulations on high performance computers, and to provide both application-specific and pattern-specific performance optimisation. We test the performance and the resource usage for three multiscale models, which are expressed in terms of two Multiscale Computing Patterns. In doing so, we demonstrate how the software automates resource selection and load balancing, and delivers performance benefits from both the end-user and the HPC system level perspectives.
- Published
- 2019
23. Development of Science Gateways Using QCG — Lessons Learned from the Deployment on Large Scale Distributed and HPC Infrastructures
- Author
-
Bartosz Bosak, Piotr Grabowski, Krzysztof Kurowski, Tomasz Piontek, Dawid Szejnfeld, Piotr Kopta, Michal Kulczewski, and M. CiźNicki
- Subjects
Communication design ,Computer Networks and Communications ,Computer science ,business.industry ,020206 networking & telecommunications ,Cloud computing ,02 engineering and technology ,Supercomputer ,Grid ,computer.software_genre ,World Wide Web ,Software ,User experience design ,Hardware and Architecture ,Software deployment ,Middleware (distributed applications) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,business ,computer ,Information Systems - Abstract
Today, various Science Gateways created in close collaboration with scientific communities provide access to remote and distributed HPC, Grid and Cloud computing resources and large-scale storage facilities. However, as we have observed there are still many entry barriers for new users and various limitations for active scientists. In this paper we present our latest achievements and software solutions that significantly simplify the use of large scale and distributed computing. We describe several Science Gateways that have been successfully created with the help of our application tools and the QCG (Quality in Cloud and Grid) middleware, in particular Vine Toolkit, QCG-Portal and QCG-Now, and make the use of HPC, Grid and Cloud more straightforward and transparent. Additionally, we share the best practices and lessons learned after creating jointly with user communities many domain-specific Science Gateways, e.g. dedicated for physicists, medical scientists, chemists, engineers and external communities performing multi-scale simulations. As our deployed software solutions have reached recently a critical mass of active users in the PLGrid e-infrastructure in Poland, we also discuss in this paper how changing technologies, visual design and user experience could impact the way we should re-design Science Getaways or even develop new attractive tools, e.g. desktop or mobile-based applications in the future. Finally, we present information and statistics regarding the behaviour of users to help readers understand how new capabilities and functionalities may influence the growth of user interest in Science Gateways and HPC technologies.
- Published
- 2016
24. MultiGrain/MAPPER: A distributed multiscale computing approach to modeling and simulating gene regulation networks
- Author
-
Tomasz Piontek, Martin T. Swain, Alexandru Mizeranschi, Werner Dubitzky, Bartosz Bosak, Paul Thompson, Quentin Fazilleau, Raluca Scona, and Piotr Kopta
- Subjects
0301 basic medicine ,Regulation of gene expression ,Distributed Computing Environment ,Theoretical computer science ,Computational complexity theory ,Computer Networks and Communications ,business.industry ,Computer science ,Distributed computing ,Systems biology ,Gene regulatory network ,Particle swarm optimization ,02 engineering and technology ,Modeling and simulation ,03 medical and health sciences ,030104 developmental biology ,Software ,Hardware and Architecture ,0202 electrical engineering, electronic engineering, information engineering ,Key (cryptography) ,020201 artificial intelligence & image processing ,business - Abstract
Modeling and simulation of gene-regulatory networks (GRNs) has become an important aspect of modern systems biology investigations into mechanisms underlying gene regulation. A key task in this area is the automated inference or reverse-engineering of dynamic mechanistic GRN models from gene expression time-course data. Besides a lack of suitable data (in particular multi-condition data from the same system), one of the key challenges of this task is the computational complexity involved. The more genes in the GRN system and the more parameters a GRN model has, the higher the computational load. The computational challenge is likely to increase substantially in the near future when we tackle larger GRN systems. The goal of this study was to develop a distributed computing framework and system for reverse-engineering of GRN models. We present the resulting software called MultiGrain/MAPPER. This software is based on a new architecture and tools supporting multiscale computing in a distributed computing environment. A key feature of MultiGrain/MAPPER is the realization of GRN reverse-engineering based on the underlying distributed computing framework and multi-swarm particle swarm optimization. We demonstrate some of the features of MultiGrain/MAPPER and evaluate its performance using both real and artificial gene expression data.
- Published
- 2016
25. Methods to Load Balance a GCR Pressure Solver Using a Stencil Framework on Multi- and Many-Core Architectures
- Author
-
Krzysztof Kurowski, Milosz Ciznicki, Piotr Kopta, and Michal Kulczewski
- Subjects
Article Subject ,Preconditioner ,business.industry ,Computer science ,Eulerian path ,Parallel computing ,Solver ,Stencil ,Bottleneck ,Exascale computing ,Computer Science Applications ,QA76.75-76.765 ,symbols.namesake ,Software ,symbols ,Computer software ,Graphics ,business - Abstract
The recent advent of novel multi- and many-core architectures forces application programmers to deal with hardware-specific implementation details and to be familiar with software optimisation techniques to benefit from new high-performance computing machines. Extra care must be taken for communication-intensive algorithms, which may be a bottleneck for forthcoming era of exascale computing. This paper aims to present a high-level stencil framework implemented for the EULerian or LAGrangian model (EULAG) that efficiently utilises multi- and many-cores architectures. Only an efficient usage of both many-core processors (CPUs) and graphics processing units (GPUs) with the flexible data decomposition method can lead to the maximum performance that scales the communication-intensive Generalized Conjugate Residual (GCR) elliptic solver with preconditioner.
- Published
- 2015
26. Adaptation of fluid model EULAG to graphics processing unit architecture
- Author
-
Piotr Kopta, Bogdan Rosa, Michal Kulczewski, Krzysztof Rojek, Milosz Ciznicki, Roman Wyrzykowski, Krzysztof Kurowski, Lukasz Szustak, Zbigniew P. Piotrowski, and Damian K. Wójcik
- Subjects
Computer Networks and Communications ,Computer science ,Computation ,Graphics processing unit ,Parallel computing ,Solver ,Supercomputer ,Computer Science Applications ,Theoretical Computer Science ,symbols.namesake ,CUDA ,Computational Theory and Mathematics ,Scalability ,symbols ,Graphics ,General-purpose computing on graphics processing units ,Software ,Lagrangian - Abstract
The goal of this study is to adapt the multiscale fluid solver EULerian or LAGrangian framewrok EULAG to future graphics processing units GPU platforms. The EULAG model has the proven record of successful applications, and excellent efficiency and scalability on conventional supercomputer architectures. Currently, the model is being implemented as the new dynamical core of the COSMO weather prediction framework. Within this study, two main modules of EULAG, namely the multidimensional positive definite advection transport algorithm MPDATA and the variational generalized conjugate residual, elliptic pressure solver Generalized Conjugate Residual GCR are analyzed and optimized. In this paper, a method is proposed, which ensures a comprehensive analysis of the resource consumption including registers, shared, and global memories. This method allows us to identify bottlenecks of the algorithm, including data transfers between host and global memory, global and shared memories, as well as GPU occupancy. We put the emphasis on providing a fixed memory access pattern, padding as well as organizing computation in the MPDATA algorithm. The testing and validation of the new GPU implementation have been carried out based on modeling decaying turbulence of a homogeneous incompressible fluid in a triply-periodic cube. Simulations performed using the standard version of EULAG and its new GPU implementation give similar solutions. Preliminary results show a promising increase in terms of computational efficiency. Copyright © 2014 John Wiley & Sons, Ltd.
- Published
- 2014
27. Benchmarking JPEG 2000 implementations on modern CPU and GPU architectures
- Author
-
Krzysztof Kurowski, Milosz Ciznicki, Piotr Kopta, Pawel Gepner, and Michal Kierzynka
- Subjects
Signal processing ,General Computer Science ,business.industry ,Computer science ,Graphics hardware ,Benchmarking ,computer.file_format ,Theoretical Computer Science ,Software ,Computer architecture ,Modeling and Simulation ,JPEG 2000 ,business ,Implementation ,computer ,Data compression ,Coding (social sciences) - Abstract
The use of graphics hardware for non-graphics applications has become popular among many scientific programmers and researchers as we have observed a higher rate of theoretical performance increase than the CPUs in recent years. However, performance gains may be easily lost in the context of a specific parallel application due to various both hardware and software factors. JPEG 2000 is a complex standard for data compression and coding, that provides many advanced capabilities demanded by more specialized applications. There are several JPEG 2000 implementations that utilize emerging parallel architectures with the built-in support for parallelism at different levels. Unfortunately, many available implementations are only optimized for a certain parallel architecture or they do not take advantage of recent capabilities provided by modern hardware and low level APIs. Thus, the main aim of this paper is to present a comprehensive real performance analysis of JPEG 2000. It consists of a chain of data and compute intensive tasks that can be treated as good examples of software benchmarks for modern parallel hardware architectures. In this paper we compare achieved performance results of various JPEG 2000 implementations executed on selected architectures for different data sets to identify possible bottlenecks. We discuss also best practices and advices for parallel software development to help users to evaluate in advance and then select appropriate solutions to accelerate the execution of their applications.
- Published
- 2014
28. Scaling the GCR Solver Using a High-Level Stencil Framework on Multi- and Many-Core Architectures
- Author
-
Krzysztof Kurowski, Milosz Ciznicki, Piotr Kopta, and Michal Kulczewski
- Subjects
Elliptic solver ,Preconditioner ,Computer science ,020206 networking & telecommunications ,02 engineering and technology ,Parallel computing ,Program optimization ,Solver ,Stencil ,Bottleneck ,Exascale computing ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Scaling - Abstract
The recent advent of novel multi- and many-core architectures forces application programmers to deal with hardware-specific implementation details and to be familiar with software optimization techniques to benefit from new high-performance computing machines. An extra care must be taken for communication-intensive algorithms, which may be a bottleneck for forthcoming era of exascale computing. This paper aims to present a high level stencil framework implemented for the EULAG model that efficiently utilizes heterogeneous clusters. Only an efficient usage of both CPUs and GPUs with the flexible data decomposition method can lead to the maximum performance that scales communication-intensive elliptic solver with preconditioner.
- Published
- 2016
29. Convergence of Explicitly Correlated Gaussian Wave Functions
- Author
-
Piotr Kopta, Tomasz Piontek, Krzysztof Kurowski, Jacek Komasa, and Mariusz Puchalski
- Subjects
symbols.namesake ,Matrix (mathematics) ,Basis (linear algebra) ,Gaussian ,Diagonalizable matrix ,Mathematical analysis ,symbols ,Extrapolation ,Function (mathematics) ,Statistical physics ,Wave function ,Mathematics ,Schrödinger equation - Abstract
Results of high precision quantum-chemical calculations on selected diatomic molecular systems are reported. The wave function is expanded in the basis of exponentially correlated Gaussian functions. For each of the systems the Schrodinger equation is solved variationally with several lengths of this expansion, which enables the energy convergence to be studied as well as an extrapolation to infinite basis set size and an error estimation to be performed. The algorithms applied to evaluate matrix elements and the matrix diagonalization are analyzed for their scalability, and their strong and weak points are revealed.
- Published
- 2014
30. New QosCosGrid Middleware Capabilities and Its Integration with European e-Infrastructure
- Author
-
Mariusz Mamonski, Tomasz Piontek, Piotr Kopta, Krzysztof Kurowski, and Bartosz Bosak
- Subjects
Computer science ,End user ,business.industry ,Distributed computing ,Reservation ,Scenario ,computer.software_genre ,Workflow ,Grid computing ,Middleware (distributed applications) ,Message oriented middleware ,Resource management ,Software engineering ,business ,computer - Abstract
QosCosGrid QCG is an integrated system offering leading job and resource management capabilities in order to deliver supercomputer-like performance and structure to end users. By combining many distributed computing resources together, QCG offers highly efficient mapping, execution and monitoring capabilities for a variety of applications, such as parameter sweep, workflows, multi-scale, MPI or hybrid MPI-OpenMP. The QosCosGrid middleware also provides ai¾źset of unique features, such as advance reservation, co-allocation of distributed computing resources, support for interactive tasks and monitoring of ai¾źprogress of running applications. The middleware is offered to end users by well-designed and easy-to-use client tools. At the time of writing, QosCosGrid is the most popular middleware within the PL-Grid Infrastructure. After its successful adoption within the Polish research communities, it has been integrated with the EGI infrastructure and through a release in UMD and EGI-AppDB it is also available at European level. In this article, we focus on the extensions that were introduced to QosCosGrid during the period of the PL-Grid and PLGrid Plus projects in order to support advanced user scenarios and to integrate the stack with the Polish and European e-Infrastructures.
- Published
- 2014
31. New Capabilities in QosCosGrid Middleware for Advanced Job Management, Advance Reservation and Co-allocation of Computing Resources – Quantum Chemistry Application Use Case
- Author
-
ski, Jacek Komasa, Tomasz Piontek, Krzysztof Kurowski, Piotr Kopta, Mariusz Mamo, and Bartosz Bosak
- Subjects
Workflow ,Utility computing ,Computer science ,Middleware ,Distributed computing ,Programming paradigm ,High-throughput computing ,Supercomputer ,Resource management (computing) ,Grid - Abstract
In this chapter we present the new capabilities of QosCosGrid (QCG) middleware for advanced job and resource management in the grid environment. By connecting many computing clusters together, QosCosGrid offers easy-to-use mapping, execution and monitoring capabilities for a variety of complex computations, such as parameter sweep, workflows, MPI or hybrid MPI-OpenMP as well as multiscale simulations. Thanks to QosCosGrid, large-scale programming models written in Fortran, C, C++ or Java can be automatically distributed over a network of computing resources with guaranteed Quality of Service --- for example guaranteed startup time of a job. Consequently, applications can be run at specified periods with reduced execution time and waiting times. This enables more complex problem instances to be addressed. In order to prove the usefulness of the new functionality of QosCosGrid a detailed description of the system along with a real use case scenario from the quantum chemistry science domain will be presented in this chapter.
- Published
- 2012
32. Vine Toolkit - Grid-Enabled Portal Solution for Community Driven Computing Workflows with Meta-scheduling Capabilities
- Author
-
Krzysztof Kurowski, Bogdan Ludwiczak, Tomasz Kuczynski, Jaroslaw Nabrzyski, Krzysztof Witkowski, Piotr Domagalski, Dominik Tarnawczyk, Dawid Szejnfeld, Piotr Kopta, Piotr Dziubecki, Michal Krysinski, Tomasz Piontek, and Malgorzata Wolniewicz
- Subjects
Workflow ,Computer science ,Computation ,Distributed computing ,Load balancing (computing) ,User interface ,Grid ,Workflow engine ,Meta-scheduling ,Grid security - Abstract
In large scale production environments, the information sets to perform calculations on come from various sources. In particular, some computations may require the information obtained as a result of previous computations. Workflow description offers an attractive approach to formally deal with such complex processes. Vine Toolkit [1] solution addresses some major challenges here such as the synchronization of distributed workflows, establishing a community driven Grid environment for the seamless results sharing and collaboration. In order to accomplish these goals Vine Toolkit offers integration on different layers starting from rich user interface web components, integration with workflow engine and Grid security and ending up with a built-in meta-scheduling mechanisms, that allow IT administrators to perform load balancing automatically among computing clusters and data centers to meet peak demands. As a result of this particular project a complete solution has been developed and delivered.
- Published
- 2010
33. Parallel Large Scale Simulations in the PL-Grid Environment
- Author
-
Krzysztof Kurowski, Tomasz Piontek, Piotr Kopta, Mariusz Mamoński, and Bartosz Bosak
- Subjects
Java ,Fortran ,Computer science ,Distributed computing ,General Medicine ,Grid ,Supercomputer ,computer.software_genre ,Resource (project management) ,Semantic grid ,Grid computing ,Middleware ,computer ,computer.programming_language - Abstract
The growing demand for computational power causes that Grids are becoming mission-critical components in research and industry, offering sophisticated solutions in leveraging large-scale computing and storage resources. The nature a Grid in which resources are usually shared among multiple organizations offering resources under their control based on the "best effort" approach with no guarantee concerning the quality-of-service may be inadequate to support large-scale simulations. Requirements of such simulations often exceed capabilities of a single computing center causing the need to simultaneously allocate and synchronize resources belonging to many administrative domains whose functionality is missing in leading grid middlewares preventing researchers from executing large-scale simulations in grids. The paper presents tools and services that were designed to build multilayered infrastructure capable of dealing with computationally intensive large-scale simulations in the grid environment. The developed and deployed middleware enables computing clusters in different administrative domains to be virtually welded into a single powerful compute resource that can be treated as a quasi-opportunistic supercomputer. We describe the middleware developed in the QosCosGrid project and being enhanced under the PL-Grid national grid initiative, which provides advance reservation and resource co-allocation functionality as well as support for parallel large-scale applications based on OpenMPI (for C/C++ and Fortran) or ProActive for Java.
- Published
- 2010
- Full Text
- View/download PDF
34. Grid Access and User Interface in CLUSTERIX Project
- Author
-
Piotr Kopta, Tomasz Kuczynski, and Roman Wyrzykowski
- Subjects
Command-line interface ,Computer science ,computer.internet_protocol ,Interface (Java) ,Shell (computing) ,Scalable Vector Graphics ,computer.file_format ,computer.software_genre ,X3D ,Operating system ,Session (computer science) ,User interface ,computer ,XML - Abstract
This paper presents the SSH Session Server Framework for the dynamic generation of interfaces of user applications. The original goal of this framework, developed for the CLUSTERIX grid project [2], is to create a flexible portal interface that can be easily extended and adapted for utilization with different shell applications. The approach is based on describing command line interface models with XML files. During the development, the main pressure is put on separation of the visualization layer from the application logic, as well as providing possibility of the framework extension at run-time. The support for VRML, X3D, SVG, charts (JPEG, PNG) formats of visualization is also an important feature. Full language localization is supported with any type of the visualisation. The SSH Session Server Framework is integrated with the GridSphere portal framework [5] – one of the products of the GridLab project [4].
- Published
- 2006
35. Benchmarking Data and Compute Intensive Applications on Modern CPU and GPU Architectures
- Author
-
Krzysztof Kurowski, Piotr Kopta, Michal Kierzynka, Pawel Gepner, and Milosz Ciznicki
- Subjects
Signal processing ,Computer science ,business.industry ,Graphics hardware ,GPU ,JPEG 2000 ,Benchmarking ,multi-core CPU ,Software ,Computer architecture ,benchmarks ,General Earth and Planetary Sciences ,business ,signal processing ,General Environmental Science ,Data compression - Abstract
The use of graphics hardware for non-graphics applications has become popular among many scientific programmers and researchers as we have observed a higher rate of theoretical performance increase than the CPUs in recent years. However, performance gains may be easily lost in the context of a specific parallel application due to various both hardware and software factors. Consequently, software benchmarks and performance testing are still the best techniques to compare the effciency of emerging parallel architectures with the built-in support for parallelism at different levels. Unfortunately, many available benchmarks are either relatively simple application kernels, they have been optimized only for a certain parallel architecture or they do not take advantage of recent capabilities provided by modern hardware and low level APIs. Thus, the main aim of this paper is to present a comprehensive real performance analysis of selected applications following the complex standard for data compression and coding -JPEG 2000. It consists of a chain of data and compute intensive tasks that can be treated as good examples of software benchmarks for modern parallel hardware architectures. In this paper we compare achieved performance results of our standard based benchmarks executed on selected architectures for different data sets to identify possible bottlenecks. We discuss also best practices and advices for parallel software development to help users to evaluate in advance and then select appropriate solutions to accelerate the execution of their applications.
- Full Text
- View/download PDF
36. Parallel application benchmarks and performance evaluation of the Intel Xeon 7500 family processors
- Author
-
Piotr Kopta, Tomasz Piontek, Krzysztof Kurowski, Mariusz Puchalski, Michal Kulczewski, Jacek Komasa, and Pawel Gepner
- Subjects
petascale computing ,Xeon ,Computer science ,Parallel computing ,Program optimization ,Supercomputer ,Data structure ,parallel applications ,Intel Xeon 7500 ,high performance computing ,quantum chemistry ,Petascale computing ,Computer architecture ,Parallel processing (DSP implementation) ,Synchronization (computer science) ,Scalability ,General Earth and Planetary Sciences ,CFD ,General Environmental Science - Abstract
With the recent advent of novel multi- and many-core hardware architectures, application programmers have to deal with many hardware-specific implementation details and have to be familiar with software optimization techniques to benefit from new high-performance computing machines. Highly effcient parallel application design is in fact an interdisciplinary process involving domain specific and IT experts. Therefore, this paper aims to present early experiences with computationally demanding applications, development efforts and evaluation of their performance on the new family of Intel Xeon 7500 processors. We selected two application benchmarks applicable to real quantum chemistry and Computational Fluid Dynamics (CFD) problems as they can potentially take advantage of parallel processing on novel hardware architectures and built-in new features. Additionally, we discuss various parallel software improvements to mentioned applications, including appropriate changes to data structures as well as to communication and synchronization routines to deal with multi-level parallelism and hybrid hardware architectures. The obtained results confirmed that new hardware solutions can improve the overall application performance. However, in order to obtain a high level of parallel scalability various application modifications and tuning procedures are required as hardware configurations, including processors characteristics, interconnects and topologies, and they have a great influence on large-scale simulations.
- Full Text
- View/download PDF
37. Two-Dimensional Discrete Wavelet Transform on Large Images for Hybrid Computing Architectures: GPU and CELL
- Author
-
Pawel Lichocki, Piotr Kopta, Milosz Ciznicki, Krzysztof Kurowski, Marek Blazewicz, Alexander, Michael, D’Ambra, Pasqua, Roman, Jean, Scott, Stephen L., Traff, Jesper Larsson, Vallée, Geoffroy, Weidendorfer, Josef, Belloum, Adam, Bosilca, George, Cannataro, Mario, Danelutto, Marco, Martino, Beniamino, Gerndt, Michael, Jeannot, Emmanuel, and Namyst, Raymond
- Subjects
Discrete wavelet transform ,Signal processing ,Speedup ,Computer science ,JPEG 2000 ,Parallel algorithm ,Parallel computing ,computer.file_format ,computer ,Image compression ,Data compression - Abstract
The Discrete Wavelet Transform (DWT) has gained the momentum in signal processing and image compression over the last decade bringing the concept up to the level of new image coding standard JPEG2000. Thanks to many added values in DWT, in particular inherent multi-resolution nature, wavelet-coding schemes are suitable for various applications where scalability and tolerable degradation are rel- evant. Moreover, as we demonstrate in this paper, it can be used as a perfect benchmarking procedure for more sophisticated data compression and multimedia applications using General Purpose Graphical Processor Units (GPGPUs). Thus, in this paper we show and compare experiments performed on reference implementations of DWT on Cell Broadband Engine Architecture (Cell B.E) and nVidia Graphical Processing Units (GPUs). The achieved results show clearly that although both GPU and Cell B.E. are being considered as representatives of the same hybrid architecture devices class they differ greatly in programming style and optimization techniques that need to be taken into account during the development. In order to show the speedup, the parallel algorithm has been compared to sequential computation performed on the x86 architecture.
38. Predicting queue wait time probabilities for multi-scale computing
- Author
-
Bartosz Bosak, Tomasz Piontek, Vytautas Jancauskas, and Piotr Kopta
- Subjects
Scale (ratio) ,Computer science ,multi-scale computing ,General Mathematics ,Distributed computing ,General Engineering ,General Physics and Astronomy ,020206 networking & telecommunications ,02 engineering and technology ,Articles ,Supercomputer ,Wait time ,Naive Bayes classifier ,Naive Bayes ,Resource (project management) ,machine learning ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Queue ,Selection (genetic algorithm) ,Research Article - Abstract
We describe a method for queue wait time prediction in supercomputing clusters. It was designed for use as a part of multi-criteria brokering mechanisms for resource selection in a multi-site High Performance Computing environment. The aim is to incorporate the time jobs stay queued in the scheduling system into the selection criteria. Our method can also be used by the end users to estimate the time to completion of their computing jobs. It uses historical data about the particular system to make predictions. It returns a list of probability estimates of the form ( t i , p i ), where p i is the probability that the job will start before time t i . Times t i can be chosen more or less freely when deploying the system. Compared to regression methods that only return a single number as a queue wait time estimate (usually without error bars) our prediction system provides more useful information. The probability estimates are calculated using the Bayes theorem with the naive assumption that the attributes describing the jobs are independent. They are further calibrated to make sure they are as accurate as possible, given available data. We describe our service and its REST API and the underlying methods in detail and provide empirical evidence in support of the method's efficacy. This article is part of the theme issue ‘Multiscale modelling, simulation and computing: from the desktop to the exascale’.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.