Search

Your search for the descriptor "Informàtica::Arquitectura de computadors::Arquitectures paral·leles [Àrees temàtiques de la UPC]" returned 304 results.

Search Results

1. Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures

2. Mitigating the NUMA effect on task-based runtime systems

3. Accelerating Edit-Distance Sequence Alignment on GPU Using the Wavefront Algorithm

4. DynAMO: Improving parallelism through dynamic placement of atomic memory operations

5. OmpSs-2 and OpenACC interoperation

6. GPU acceleration of Levenshtein distance computation between long strings

7. Improving the performance of classical linear algebra iterative methods via hybrid parallelism

8. Dynamic spawning of MPI processes applied to malleability

9. Seamless optimization of the GEMM kernel for task-based programming models

10. XFeatur: Hardware Feature Extraction for DNN Auto-tuning

11. A Data-Centric Directive-Based Framework to Accelerate Out-of-Core Stencil Computation on a GPU

12. Fine-grain task-parallel algorithms for matrix factorizations and inversion on many-threaded CPUs

13. Towards OmpSs-2 and OpenACC interoperation

14. Acceleration strategies for large-scale sequential simulations using parallel neighbour search: Non-LVA and LVA scenarios

15. A Novel Set of Directives for Multi-device Programming with OpenMP

16. Task-based acceleration of bidirectional recurrent neural networks on multi-core architectures

17. TD-NUCA: runtime driven management of NUCA caches in task dataflow programming models

18. OmpSs@cloudFPGA: An FPGA task-based programming model with message passing

19. Sargantana: A 1 GHz+ in-order RISC-V processor with SIMD vector extensions in 22nm FD-SOI

20. A model of checkpoint behavior for applications that have I/O

21. Resiliency in numerical algorithm design for extreme scale simulations

22. Aging-aware parallel execution

23. Implementation of a parallel tridiagonal solver for linear system of equations arising in Physicell-BioFVM

24. Arbitration Policies for On-Demand User-Level I/O Forwarding on HPC Platforms

25. Parallelware Tools: An Experimental Evaluation on POWER Systems

26. An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks

27. Multi-GPU parallelization of the NAS multi-zone parallel benchmarks

28. A new generation of task-parallel algorithms for matrix inversion in many-threaded CPUs

29. gem5 + rtl: A framework to enable RTL models inside a full-system simulator

30. Multi-GPU design and performance evaluation of homomorphic encryption on GPU clusters

31. Efficiently running SpMV on long vector architectures

32. A low overhead tasking model for OpenMP

33. TALP - A Lightweight Tool to Unveil Parallel Efficiency of Large-scale Executions

34. Enhancing OpenMP tasking model: performance and portability

35. Implementation of a high-accuracy phase unwrapping algorithm using parallel-hybrid programming approach for displacement sensing using self-mixing interferometry

36. OmpSs@FPGA framework for high performance FPGA computing

37. Combining dynamic concurrency throttling with voltage and frequency scaling on task-based programming models

38. PH-RLS: A parallel hybrid recursive least square algorithm for self-mixing interferometric laser sensor

39. Improving HPC system throughput and response time using memory disaggregation

40. Human biventricular electromechanical simulations on the progression of electrocardiographic and mechanical abnormalities in post-myocardial infarction

41. OpenMP to CUDA graphs

42. Worksharing Tasks: An Efficient Way to Exploit Irregular and Fine-Grained Loop Parallelism

43. sLASs: a fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library)

44. Shortest path computing in directed graphs with weighted edges mapped on random networks of memristors

45. Performance and energy effects on task-based parallelized applications

46. Static Analysis to Enhance Programmability and Performance in OmpSs-2

47. Breaking master-slave model between host and FPGAs

48. Wavefront parallelization of recurrent neural networks on multi-core architectures

49. Towards a qualifiable OpenMP framework for embedded systems

50. HRM: Merging Hardware Event Monitors for Improved Timing Analysis of Complex MPSoCs
