Author: "Nane, Razvan" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Nane, Razvan"' showing total 25 results

Start Over Author "Nane, Razvan"

25 results on '"Nane, Razvan"'

1. An Evaluation and Comparison of GPU Hardware and Solver Libraries for Accelerating the OPM Flow Reservoir Simulator

Author: Qiu, Tong Dong, Thune, Andreas, Blatt, Markus, Rustad, Alf Birger, and Nane, Razvan
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Hardware Architecture
Abstract: Realistic reservoir simulation is known to be prohibitively expensive in terms of computation time when increasing the accuracy of the simulation or by enlarging the model grid size. One method to address this issue is to parallelize the computation by dividing the model in several partitions and using multiple CPUs to compute the result using techniques such as MPI and multi-threading. Alternatively, GPUs are also a good candidate to accelerate the computation due to their massively parallel architecture that allows many floating point operations per second to be performed. The numerical iterative solver takes thus the most computational time and is challenging to solve efficiently due to the dependencies that exist in the model between cells. In this work, we evaluate the OPM Flow simulator and compare several state-of-the-art GPU solver libraries as well as custom developed solutions for a BiCGStab solver using an ILU0 preconditioner and benchmark their performance against the default DUNE library implementation running on multiple CPU processors using MPI. The evaluated GPU software libraries include a manual linear solver in OpenCL and the integration of several third party sparse linear algebra libraries, such as cuSparse, rocSparse, and amgcl. To perform our bench-marking, we use small, medium, and large use cases, starting with the public test case NORNE that includes approximately 50k active cells and ending with a large model that includes approximately 1 million active cells. We find that a GPU can accelerate a single dual-threaded MPI process up to 5.6 times, and that it can compare with around 8 dual-threaded MPI processes.
Published: 2023

2. SDC-based Resource Constrained Scheduling for Quantum Control Architectures

Author: Nane, Razvan
Subjects: Quantum Physics, Computer Science - Hardware Architecture
Abstract: Instruction scheduling is a key transformation in backend compilers that take an untimed description of an algorithm and assigns time slots to the algorithm's instructions so that they can be executed as efficiently as possible while taking into account the target processor limitations, such as the amount of computational units available. For example, for a superconducting quantum processor these restrictions include the amount of analogue instruments available to play the waveforms to drive the qubit rotations or on-chip connectivity between qubits. Current small-scale quantum processors contain only a few qubits; therefore, it is feasible to drive qubits individually albeit not scalable. Consequently, for NISQ and beyond NISQ devices, it is expected that classical instrument sharing to be designed in the future quantum control architectures where several qubits are connected to an instrument and multiplexing is used to activate only the qubits performing the same quantum operation at a time. Existing quantum scheduling algorithms either rely on ILP formulations, which do not scale well, or use heuristic based algorithms such as list scheduling which are not versatile enough to deal with quantum requirements such as scheduling with exact relative timing constraints between instructions, situation that might occur when decomposing complex instructions into native ones and requiring to keep a fixed timing between the primitive ones to guarantee correctness. In this paper, we propose a novel resource constrained scheduling algorithm that is based on the SDC formulation, which is the state-of-the-art algorithm used in the reconfigurable computing. We evaluate it against a list scheduler and describe the benefits of the proposed approach. We find that the SDC-based scheduling is not only able to find better schedules but also model flexible relative timing constraints.
Published: 2022

3. Hardware Acceleration of HPC Computational Flow Dynamics using HBM-enabled FPGAs

Author: Hogervorst, Tom, Qiu, Tong Dong, Marchiori, Giacomo, Birger, Alf, Blatt, Markus, and Nane, Razvan
Subjects: Computer Science - Hardware Architecture, Physics - Computational Physics
Abstract: Scientific computing is at the core of many High-Performance Computing applications, including computational flow dynamics. Because of the uttermost importance to simulate increasingly larger computational models, hardware acceleration is receiving increased attention due to its potential to maximize the performance of scientific computing. A Field-Programmable Gate Array is a reconfigurable hardware accelerator that is fully customizable in terms of computational resources and memory storage requirements of an application during its lifetime. Therefore, it is an ideal candidate to accelerate scientific computing applications because of the possibility to fully customize the memory hierarchy important in irregular applications such as iterative linear solvers found in scientific libraries. In this paper, we study the potential of using FPGA in HPC because of the rapid advances in reconfigurable hardware, such as the increase in on-chip memory size, increasing number of logic cells, and the integration of High-Bandwidth Memories on board. To perform this study, we first propose a novel ILU0 preconditioner tightly integrated with a BiCGStab solver kernel designed using a mixture of High-Level Synthesis and Register-Transfer Level hand-coded design. Second, we integrate the developed preconditioned iterative solver in Flow from the Open Porous Media (OPM) project, a state-of-the-art open-source reservoir simulator. Finally, we perform a thorough evaluation of the FPGA solver kernel in both standalone mode and integrated into the reservoir simulator that includes all the on-chip URAM and BRAM, on-board High-Bandwidth Memory, and off-chip CPU memory data transfers required in a complex simulator software such as OPM's Flow. We evaluate the performance on the Norne field, a real-world case reservoir model using a grid with more than 10^5 cells and using 3 unknowns per cell.
Published: 2021
Full Text: View/download PDF

4. Hardware/Software Compilation

Author: Nobre, Ricardo, Cardoso, João M. P., Olivier, Bryan, Nane, Razvan, Fitzpatrick, Liam, de F. Coutinho, José Gabriel, van Someren, Hans, Sima, Vlad-Mihai, Bertels, Koen, Diniz, Pedro C., Cardoso, João Manuel Paiva, editor, Diniz, Pedro C., editor, de Figueiredo Coutinho, José Gabriel, editor, and Petrov, Zlatko Marinov, editor
Published: 2013
Full Text: View/download PDF

5. LARA Experiments

Author: Gonçalves, Fernando, Petrov, Zlatko, de F. Coutinho, José Gabriel, Nane, Razvan, Sima, Vlad-Mihai, Cardoso, João M. P., Werner, Stephan, Bhattacharya, Sujit, Carvalho, Tiago, Nobre, Ricardo, de Sá, José, Teixeira, João, Diniz, Pedro C., Bertels, Koen, Constantinides, George, Luk, Wayne, Becker, Juergen, Alves, José Carlos, Ferreira, João Canas, Almeida, Gabriel M., Cardoso, João Manuel Paiva, editor, Diniz, Pedro C., editor, de Figueiredo Coutinho, José Gabriel, editor, and Petrov, Zlatko Marinov, editor
Published: 2013
Full Text: View/download PDF

6. The REFLECT Design-Flow

Author: Cardoso, João M. P., de F. Coutinho, José Gabriel, Nane, Razvan, Sima, Vlad-Mihai, Olivier, Bryan, Carvalho, Tiago, Nobre, Ricardo, Diniz, Pedro C., Petrov, Zlatko, Bertels, Koen, Gonçalves, Fernando, van Someren, Hans, Hübner, Michael, Constantinides, George, Luk, Wayne, Becker, Jürgen, Krátký, Kamil, Bhattacharya, Sujit, Alves, José Carlos, Ferreira, João Canas, Cardoso, João Manuel Paiva, editor, Diniz, Pedro C., editor, de Figueiredo Coutinho, José Gabriel, editor, and Petrov, Zlatko Marinov, editor
Published: 2013
Full Text: View/download PDF

7. REFLECT: Rendering FPGAs to Multi-core Embedded Computing

Author: Cardoso, João M. P., Diniz, Pedro C., Petrov, Zlatko, Bertels, Koen, Hübner, Michael, van Someren, Hans, Gonçalves, Fernando, de Coutinho, José Gabriel F., Constantinides, George A., Olivier, Bryan, Luk, Wayne, Becker, Juergen, Kuzmanov, Georgi, Thoma, Florian, Braun, Lars, Kühnle, Matthias, Nane, Razvan, Sima, Vlad Mihai, Krátký, Kamil, Alves, José Carlos, Ferreira, João Canas, Cardoso, João M. P., editor, and Hübner, Michael, editor
Published: 2011
Full Text: View/download PDF

8. Deriving Resource Efficient Designs Using the REFLECT Aspect-Oriented Approach : (Extended Abstract)

Author: Coutinho, José G. F., Cardoso, João M. P., Carvalho, Tiago, Nobre, Ricardo, Bhattacharya, Sujit, Diniz, Pedro C., Fitzpatrick, Liam, Nane, Razvan, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Brisk, Philip, editor, de Figueiredo Coutinho, José Gabriel, editor, and Diniz, Pedro C., editor
Published: 2013
Full Text: View/download PDF

9. Controlling a complete hardware synthesis toolchain with LARA aspects

Author: Cardoso, João M.P., Carvalho, Tiago, Coutinho, José G.F., Nobre, Ricardo, Nane, Razvan, Diniz, Pedro C., Petrov, Zlatko, Luk, Wayne, and Bertels, Koen
Published: 2013
Full Text: View/download PDF

10. Skeleton-Based Synthesis Flow for Computation-in-Memory Architectures

Author: Yu, Jintao, primary, Nane, Razvan, additional, Ashraf, Imran, additional, Taouil, Mottaqiallah, additional, Hamdioui, Said, additional, Corporaal, Henk, additional, and Bertels, Koen, additional
Published: 2020
Full Text: View/download PDF

11. Sparstition: A Partitioning Scheme for Large-Scale Sparse Matrix Vector Multiplication on FPGA

Author: Sigurbergsson, Bjorn, primary, Hogervorst, Tom, additional, Qiu, Tong Dong, additional, and Nane, Razvan, additional
Published: 2019
Full Text: View/download PDF

12. On the Implementation of Computation-in-Memory Parallel Adder

Author: Du Nguyen, Hoang Anh, primary, Xie, Lei, additional, Taouil, Mottaqiallah, additional, Nane, Razvan, additional, Hamdioui, Said, additional, and Bertels, Koen, additional
Published: 2017
Full Text: View/download PDF

13. A Domain-Specific Language and Compiler for Computation-in-Memory Skeletons

Author: Yu, Jintao, primary, Hogervorst, Tom, additional, and Nane, Razvan, additional
Published: 2017
Full Text: View/download PDF

14. An Image Processing VLIW Architecture for Real-Time Depth Detection

Author: Iorga, Dan, primary, Nane, Razvan, additional, Lu, Yi, additional, Van Dalen, Edwin, additional, and Bertels, Koen, additional
Published: 2016
Full Text: View/download PDF

15. A Survey and Evaluation of FPGA High-Level Synthesis Tools

Author: Nane, Razvan, primary, Sima, Vlad-Mihai, additional, Pilato, Christian, additional, Choi, Jongsok, additional, Fort, Blair, additional, Canis, Andrew, additional, Chen, Yu Ting, additional, Hsiao, Hsuan, additional, Brown, Stephen, additional, Ferrandi, Fabrizio, additional, Anderson, Jason, additional, and Bertels, Koen, additional
Published: 2016
Full Text: View/download PDF

16. Parallel matrix multiplication on memristor-based computation-in-memory architecture

Author: Haron, Adib, primary, Yu, Jintao, additional, Nane, Razvan, additional, Taouil, Mottaqiallah, additional, Hamdioui, Said, additional, and Bertels, Koen, additional
Published: 2016
Full Text: View/download PDF

17. Low-Cost Software Control-Flow Error Recovery

Author: Nazarian, Ghazaleh, primary, Nane, Razvan, additional, and Gaydadjiev, Georgi N., additional
Published: 2015
Full Text: View/download PDF

18. Computation-in-memory based parallel adder

Author: Du Nguyen, Hoang Anh, primary, Xie, Lei, additional, Taouil, Mottaqiallah, additional, Nane, Razvan, additional, Hamdioui, Said, additional, and Bertels, Koen, additional
Published: 2015
Full Text: View/download PDF

19. High-Level Synthesis in the Delft Workbench Hardware/Software Co-design Tool-Chain

Author: Nane, Razvan, primary, Sima, Vlad Mihai, additional, Pham Quoc, Cuong, additional, Goncalves, Fernando, additional, and Bertels, Koen, additional
Published: 2014
Full Text: View/download PDF

20. Computation-in-memory based parallel adder.

Author: Nguyen, Hoang Anh Du, Xie, Lei, Taouil, Mottaqiallah, Nane, Razvan, Hamdioui, Said, and Bertels, Koen
Published: 2015
Full Text: View/download PDF

21. Quipu

Author: Meeuws, Roel, primary, Ostadzadeh, S. Arash, additional, Galuzzi, Carlo, additional, Sima, Vlad Mihai, additional, Nane, Razvan, additional, and Bertels, Koen, additional
Published: 2013
Full Text: View/download PDF

22. A lightweight speculative and predicative scheme for hardware execution

Author: Nane, Razvan, primary, Sima, Vlad-Mihai, additional, and Bertels, Koen, additional
Published: 2012
Full Text: View/download PDF

23. Low-Cost Software Control-Flow Error Recovery.

Author: Nazarian, Ghazaleh, Nane, Razvan, and Gaydadjiev, Georgi N.
Published: 2014
Full Text: View/download PDF

24. DWARV 2.0: A CoSy-based C-to-VHDL hardware compiler

Author: Nane, Razvan, primary, Sima, Vlad-Mihai, additional, Olivier, Bryan, additional, Meeuws, Roel, additional, Yankova, Yana, additional, and Bertels, Koen, additional
Published: 2012
Full Text: View/download PDF

25. IP-XACT extensions for Reconfigurable Computing

Author: Nane, Razvan, primary, Haastregt, Sven van, additional, Stefanov, Todor, additional, Kienhuis, Bart, additional, Sima, Vlad Mihai, additional, and Bertels, Koen, additional
Published: 2011
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

25 results on '"Nane, Razvan"'

1. An Evaluation and Comparison of GPU Hardware and Solver Libraries for Accelerating the OPM Flow Reservoir Simulator

2. SDC-based Resource Constrained Scheduling for Quantum Control Architectures

3. Hardware Acceleration of HPC Computational Flow Dynamics using HBM-enabled FPGAs

4. Hardware/Software Compilation

5. LARA Experiments

6. The REFLECT Design-Flow

7. REFLECT: Rendering FPGAs to Multi-core Embedded Computing

8. Deriving Resource Efficient Designs Using the REFLECT Aspect-Oriented Approach : (Extended Abstract)

9. Controlling a complete hardware synthesis toolchain with LARA aspects

10. Skeleton-Based Synthesis Flow for Computation-in-Memory Architectures

11. Sparstition: A Partitioning Scheme for Large-Scale Sparse Matrix Vector Multiplication on FPGA

12. On the Implementation of Computation-in-Memory Parallel Adder

13. A Domain-Specific Language and Compiler for Computation-in-Memory Skeletons

14. An Image Processing VLIW Architecture for Real-Time Depth Detection

15. A Survey and Evaluation of FPGA High-Level Synthesis Tools

16. Parallel matrix multiplication on memristor-based computation-in-memory architecture

17. Low-Cost Software Control-Flow Error Recovery

18. Computation-in-memory based parallel adder

19. High-Level Synthesis in the Delft Workbench Hardware/Software Co-design Tool-Chain

20. Computation-in-memory based parallel adder.

21. Quipu

22. A lightweight speculative and predicative scheme for hardware execution

23. Low-Cost Software Control-Flow Error Recovery.

24. DWARV 2.0: A CoSy-based C-to-VHDL hardware compiler

25. IP-XACT extensions for Reconfigurable Computing

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

25 results on '"Nane, Razvan"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources