Search

Your search keyword '"PARALLEL processing"' showing total 620 results

Search Constraints

Start Over You searched for: Descriptor "PARALLEL processing" Remove constraint Descriptor: "PARALLEL processing" Journal ieee transactions on parallel & distributed systems Remove constraint Journal: ieee transactions on parallel & distributed systems
620 results on '"PARALLEL processing"'

Search Results

1. $TC-Stream$ T C - S t r e a m : Large-Scale Graph Triangle Counting on a Single Machine Using GPUs.

2. Guest Editorial.

3. Auto-GNAS: A Parallel Graph Neural Architecture Search Framework.

4. Mixing Activations and Labels in Distributed Training for Split Learning.

5. Accelerating Large Sparse Neural Network Inference Using GPU Task Graph Parallelism.

6. ReHy: A ReRAM-Based Digital/Analog Hybrid PIM Architecture for Accelerating CNN Training.

7. BLB-gcForest: A High-Performance Distributed Deep Forest With Adaptive Sub-Forest Splitting.

8. Exploring Data Analytics Without Decompression on Embedded GPU Systems.

9. Workload Balancing via Graph Reordering on Multicore Systems.

10. An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications.

11. Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms.

12. LB4OMP: A Dynamic Load Balancing Library for Multithreaded Applications.

13. Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication.

14. Compiler-Assisted Compaction/Restoration of SIMD Instructions.

15. gSoFa: Scalable Sparse Symbolic LU Factorization on GPUs.

16. Repurposing GPU Microarchitectures with Light-Weight Out-Of-Order Execution.

17. Tardiness Bounds for Sporadic Gang Tasks Under Preemptive Global EDF Scheduling.

18. Trust: Triangle Counting Reloaded on GPUs.

19. gIM: GPU Accelerated RIS-Based Influence Maximization Algorithm.

20. The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs With Hybrid Parallelism.

21. Model Parallelism Optimization for Distributed Inference Via Decoupled CNN Structure.

22. Analysis of Global and Local Synchronization in Parallel Computing.

23. High Performance Simulation of Spiking Neural Network on GPGPUs.

24. Boosting the Performance of SSDs via Fully Exploiting the Plane Level Parallelism.

25. Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures.

26. Partitioning Tree-Shaped Task Graphs for Distributed Platforms With Limited Memory.

27. Approximate NoC and Memory Controller Architectures for GPGPU Accelerators.

28. REACT: Scalable and High-Performance Regular Expression Pattern Matching Accelerator for In-Storage Processing.

29. cuTensor-Tubal: Efficient Primitives for Tubal-Rank Tensor Learning Operations on GPUs.

30. cuPC: CUDA-Based Parallel PC Algorithm for Causal Structure Learning on GPU.

31. Toward Designing Cost-Optimal Policies to Utilize IaaS Clouds with Online Learning.

32. Single Restart with Time Stamps for Parallel Task Processing with Known and Unknown Processors.

33. Data-Parallel Hashing Techniques for GPU Architectures.

34. Exploiting Parallelism and Vectorisation in Breadth-First Search for the Intel Xeon Phi.

35. Resource-Efficient Index Shard Replication in Large Scale Search Engines.

36. Fast Deep Neural Network Training on Distributed Systems and Cloud TPUs.

37. Massively Parallel Tree Search for High-Dimensional Sphere Decoders.

38. A Compiler for Agnostic Programming and Deployment of Big Data Analytics on Multiple Platforms.

39. Privacy Regulation Aware Process Mapping in Geo-Distributed Cloud Data Centers.

40. Exploring GPU-Accelerated Routing for FPGAs.

41. A Bi-layered Parallel Training Architecture for Large-Scale Convolutional Neural Networks.

42. Solving Linear Diophantine Systems on Parallel Architectures.

43. Performance-Aware Model for Sparse Matrix-Matrix Multiplication on the Sunway TaihuLight Supercomputer.

44. Integrating Concurrency Control in n-Tier Application Scaling Management in the Cloud.

45. Ultra-Fast Bloom Filters using SIMD Techniques.

46. Exploiting Parallelism for CNN Applications on 3D Stacked Processing-In-Memory Architecture.

47. A Virtual Multi-Channel GPU Fair Scheduling Method for Virtual Machines.

48. Comparative Analysis of Intra-Algorithm Parallel Multiobjective Evolutionary Algorithms: Taxonomy Implications on Bioinformatics Scenarios.

49. Adaptive Scheduling Parallel Jobs with Dynamic Batching in Spark Streaming.

50. M-Oscillating: Performance Maximization on Temperature-Constrained Multi-Core Processors.

Catalog

Books, media, physical & digital resources