Search

Showing total 487 results

Search Constraints

You searched for: Topic: computer architecture; Journal: IEEE Transactions on Parallel & Distributed Systems

Search Results

1. Critique of “MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization” by SCC Team From the University of Texas at Austin.

2. A Survey of Desktop Grid Scheduling.

3. A Novel Compute-Efficient Tridiagonal Solver for Many-Core Architectures.

4. A Survey of Techniques for Architecting and Managing GPU Register File.

5. Survey on Real-Time Networks-on-Chip.

6. Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors.

7. Predicting Throughput of Distributed Stochastic Gradient Descent.

8. TC-Stream: Large-Scale Graph Triangle Counting on a Single Machine Using GPUs.

9. Guest Editor's Introduction: Special Section on Power-Aware Parallel and Distributed Computing (PAPADS).

10. Solving Computation Slicing Using Predicate Detection.

11. Symmetric Indefinite Linear Solver Using OpenMP Task on Multicore Architectures.

12. Exploring Data Analytics Without Decompression on Embedded GPU Systems.

13. BFS-4K: An Efficient Implementation of BFS for Kepler GPU Architectures.

14. Exploring New Opportunities to Defeat Low-Rate DDoS Attack in Container-Based Cloud Environment.

15. Optimizing Finite Volume Method Solvers on Nvidia GPUs.

16. GPU Implementation of Bitplane Coding with Parallel Coefficient Processing for High Performance Image Compression.

17. Task Scheduling Techniques for Asymmetric Multi-Core Systems.

18. Trajectory Pattern Mining for Urban Computing in the Cloud.

19. Repurposing GPU Microarchitectures with Light-Weight Out-Of-Order Execution.

20. Performance-Aware Model for Sparse Matrix-Matrix Multiplication on the Sunway TaihuLight Supercomputer.

21. Parana: A Parallel Neural Architecture Considering Thermal Problem of 3D Stacked Memory.

22. A Self-Adaptive Network for HPC Clouds: Architecture, Framework, and Implementation.

23. Power/Performance/Thermal Design-Space Exploration for Multicore Architectures.

24. Towards Exploring Data-Intensive Scientific Applications at Extreme Scales through Systems and Simulations.

25. An Efficient Privacy-Preserving Ranked Keyword Search Method.

26. Reproducibility: Performance Evaluation of MemXCT on Azure CycleCloud Platform.

27. GPU Acceleration for Simulating Massively Parallel Many-Core Platforms.

28. On All-to-All Broadcast in Dense Gaussian Network On-Chip.

29. Power-Aware Job Scheduling on Heterogeneous Multicore Architectures.

30. Distributed Randomized $k$-Clustering Based PCID Assignment for Ultra-Dense Femtocellular Networks.

31. A Hardware Architecture for Radial Basis Function Neural Network Classifier.

32. Architectural Synthesis of Multi-SIMD Dataflow Accelerators for FPGA.

33. A GPU-Architecture Optimized Hierarchical Decomposition Algorithm for Support Vector Machine Training.

34. A General-Purpose Architecture for Replicated Metadata Services in Distributed File Systems.

35. Multi-Core Embedded Wireless Sensor Networks: Architecture and Applications.

36. Hybrid Dataflow/von-Neumann Architectures.

37. Optimization of Duplication-Based Schedules on Network-on-Chip Based Multi-Processor System-on-Chips.

38. Automated Synthesis of Distributed Network Access Controls: A Formal Framework with Refinement.

39. Workflow Scheduling in Multi-Tenant Cloud Computing Environments.

40. Fast Consensus Using Bounded Staleness for Scalable Read-Mostly Synchronization.

41. Shield: A Reliable Network-on-Chip Router Architecture for Chip Multiprocessors.

42. Fast and Accurate Simulation of the Cray XMT Multithreaded Supercomputer.

43. Floating Point Calculation of the Cube Function on FPGAs.

44. Coupling-Based Internal Clock Synchronization for Large-Scale Dynamic Distributed Systems.

45. gMig: Efficient vGPU Live Migration with Overlapped Software-Based Dirty Page Verification.

46. cuTensor-Tubal: Efficient Primitives for Tubal-Rank Tensor Learning Operations on GPUs.

47. JSensor: A Parallel Simulator for Huge Wireless Sensor Networks Applications.

48. Optimizing Dual-Core Execution for Power Efficiency and Transient-Fault Recovery.

49. Throughput Region of Finite-Buffered Networks.

50. A Class of Multistage Conference Switching Networks for Group Communication.