1. Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures
- Authors
Jieyang Chen, Mark Raugas, Jesun Sahariar Firoz, Ang Li, Shuaiwen Leon Song, Chenhao Xie, Kevin J. Barker, and Jiajia Li
- Subjects
FOS: Computer and information sciences, Speedup, Computer science, Parallel computing, Solver, Supernode, Computer Science - Distributed, Parallel, and Cluster Computing, Hardware Architecture (cs.AR), Scalability, Synchronization (computer science), Overhead (computing), Distributed, Parallel, and Cluster Computing (cs.DC), Partitioned global address space, Computer Science - Hardware Architecture, Execution model
- Abstract
Designing efficient and scalable sparse linear algebra kernels on modern multi-GPU HPC systems is a challenging task due to highly irregular memory references and workload imbalance across GPUs. These challenges are particularly compounded in the case of the Sparse Triangular Solver (SpTRSV), which introduces the additional complexity of two-dimensional computation dependencies among subsequent computation steps. Dependency information may need to be exchanged and shared among GPUs, warranting efficient memory allocation, data partitioning, and workload distribution, as well as fine-grained communication and synchronization support. In this work, we focus on designing an SpTRSV algorithm for a single-node, multi-GPU setting. We demonstrate that directly adopting unified memory can adversely affect the performance of SpTRSV on multi-GPU architectures, even when the GPUs are linked via fast interconnects such as NVLink and NVSwitch. Instead, we employ the latest NVSHMEM technology, based on the Partitioned Global Address Space (PGAS) programming model, to enable efficient fine-grained communication and drastically reduce synchronization overhead. Furthermore, to handle workload imbalance, we propose a malleable task-pool execution model that further improves GPU utilization. Applying these techniques, our experiments on the NVIDIA V100 DGX-1 and DGX-2 multi-GPU supernode systems demonstrate that our design achieves an average speedup of 3.53× (up to 9.86×) on a DGX-1 system and 3.66× (up to 9.64×) on a DGX-2 system with four GPUs over the unified-memory design. Comprehensive sensitivity and scalability studies further show that the proposed zero-copy SpTRSV fully utilizes the computing and communication resources of multi-GPU systems.
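To make the dependency structure the abstract refers to concrete, the following is a minimal single-threaded sketch of the SpTRSV operation itself (forward substitution on a lower-triangular matrix in CSR format). This is not the paper's multi-GPU algorithm; it is an illustrative baseline showing why row i cannot be computed before every earlier row it references, which is the serialization the paper's NVSHMEM-based design works around. The function name and CSR layout here are generic conventions, not identifiers from the paper.

```python
def sptrsv_lower(indptr, indices, data, b):
    """Solve L x = b for a sparse lower-triangular L stored in CSR form.

    Row i depends on every x[j] with j < i that appears in row i's
    sparsity pattern -- these row-to-row dependencies are exactly what
    makes SpTRSV hard to parallelize across GPUs.
    """
    n = len(b)
    x = [0.0] * n
    for i in range(n):
        s = b[i]
        diag = None
        for k in range(indptr[i], indptr[i + 1]):
            j = indices[k]
            if j == i:
                diag = data[k]          # diagonal entry of row i
            else:
                s -= data[k] * x[j]     # consumes previously solved x[j]
        x[i] = s / diag
    return x
```

In practice, parallel SpTRSV implementations group rows with no mutual dependencies into "levels" that can be solved concurrently; distributing those levels across GPUs is where the communication and load-balance issues discussed in the abstract arise.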
- Published
- 2021