518 results on '"Chunyuan Zhang"'
Search Results
52. Towards a Uniform Architecture for the Efficient Implementation of 2D and 3D Deconvolutional Neural Networks on FPGAs.
53. An Efficient Design Flow for Accelerating Complicated-connected CNNs on a Multi-FPGA Platform.
54. GENIE: QoS-guided Dynamic Scheduling for CNN-based Tasks on SME Clusters.
55. Estimation of the parameters of a weighted nuclear norm model and its application in image denoising.
56. Toward an Efficient Deep Pipelined Template-Based Architecture for Accelerating the Entire 2-D and 3-D CNNs on FPGA.
57. Deep Learning Research and Development Platform: Characterizing and Scheduling with QoS Guarantees on GPU Clusters.
58. Efficient Parallel TLD on CPU-GPU Platform for Real-Time Tracking.
59. Recursive Least Squares Advantage Actor-Critic Algorithms.
60. Recursive Least Squares Policy Control with Echo State Network.
61. Recursive Least Squares for Training and Pruning Convolutional Neural Networks.
62. Multiple CNN-based Tasks Scheduling across Shared GPU Platform in Research and Development Scenarios.
63. High performance graph analytics with productivity on hybrid CPU-GPU platforms.
64. Towards a Multi-array Architecture for Accelerating Large-scale Matrix Multiplication on FPGAs.
65. Parallel programming course development based on parallel computational thinking.
66. Towards a Uniform Template-based Architecture for Accelerating 2D and 3D CNNs on FPGA.
67. Revisiting Recursive Least Squares for Training Deep Neural Networks.
68. Winograd Algorithm for 3D Convolution Neural Networks.
69. Optimizing OpenCL Implementation of Deep Convolutional Neural Network on FPGA.
70. RVNet: A fast and high energy efficiency network packet processing system on RISC-V.
71. DCC: Distributed Cache Consistency.
72. HPGraph: High-Performance Graph Analytics with Productivity on the GPU.
73. Targeting erythrocyte-mediated hypoxia to alleviate lung injury induced by pyrrolizidine alkaloids
74. Multikernel Recursive Least-Squares Temporal Difference Learning.
75. Enabling Tissue-Scale Cardiac Simulations Using Heterogeneous Computing on Tianhe-2.
76. Improve security and availability for cloud storage.
77. Sociodemographic and Health Profiles of the Oldest Old in China
78. Scalable FPGA-based Architecture for High-Performance Per-Flow Traffic Measurement.
79. The Healthy Longevity Survey and the Active Life Expectancy of the Oldest Old in China
80. Effects of swirl injection on the combustion of a novel composite hybrid rocket fuel grain
81. Poster Abstract: A Template-based Framework for Generating Network Processor in FPGA.
82. Poster Abstract: Deep Learning Workloads Scheduling with Reinforcement Learning on GPU Clusters.
83. High Performance Implementation of 3D Convolutional Neural Networks on a GPU.
84. A Highly Parallel and Scalable Motion Estimation Algorithm with GPU for HEVC.
85. Exploiting a depth context model in visual tracking with correlation filter.
86. Applying Detection Proposals to Visual Tracking for Scale and Aspect Ratio Adaptability.
87. Enable Scale and Aspect Ratio Adaptability in Visual Tracking with Detection Proposals.
88. Fast tracking via context depth model learning.
89. Kernel Recursive Least-Squares Temporal Difference Algorithms with Sparsification and Regularization.
90. Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUs.
91. A fault detection mechanism in a Data-flow scheduled Multithreaded processor.
92. Rethread: A Low-Cost Transient Fault Recovery Scheme for Multithreaded Processors.
93. Utilizing Multiple Xeon Phi Coprocessors on One Compute Node.
94. Accelerating 3D CNN-based Lung Nodule Segmentation on a Multi-FPGA System.
95. Scale-out Acceleration for 3D CNN-based Lung Nodule Segmentation on a Multi-FPGA System.
96. Towards simulation of subcellular calcium dynamics at nanometre resolution.
97. An analytical GPU performance model for 3D stencil computations from the angle of data traffic.
98. Enabling a Uniform OpenCL Device View for Heterogeneous Platforms.
99. Communication-hiding programming for clusters with multi-coprocessor nodes.
100. A Computational Model of the Short-Cut Rule for 2D Shape Decomposition.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.