Author: "Barry Chen" / Topic: computer science - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Barry Chen"' showing total 8 results

Start Over Author "Barry Chen" Topic computer science

8 results on '"Barry Chen"'

1. Fast neural network training on a cluster of GPUs for action recognition with high accuracy

Author: Fan Zhou, Barry Chen, Guojing Cong, Joshua Shapiro, Giacomo Domeniconi, and Chih-Chieh Yang
Subjects: Artificial neural network, Computer Networks and Communications, Computer science, business.industry, Training (meteorology), 020206 networking & telecommunications, 02 engineering and technology, Machine learning, computer.software_genre, Residual neural network, Theoretical Computer Science, Artificial Intelligence, Hardware and Architecture, Distributed algorithm, 0202 electrical engineering, electronic engineering, information engineering, Cluster (physics), Action recognition, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Software
Abstract: We propose algorithms and techniques to accelerate training of deep neural networks for action recognition on a cluster of GPUs. The convergence analysis of our algorithm shows it is possible to reduce communication cost and at the same time minimize the number of iterations needed for convergence. We customize the Adam optimizer for our distributed algorithm to improve efficiency. In addition, we employ transfer-learning to further reduce training time while improving validation accuracy. For the UCF101 and HMDB51 datasets, the validation accuracies achieved are 93.1% and 67.9% respectively. With an additional end-to-end trained temporal stream, the validation accuracies achieved for UCF101 and HMDB51 are 93.47% and 81.24% respectively. As far as we know, these are the highest accuracies achieved with the two-stream approach using ResNet that does not involve computationally expensive 3D convolutions or pretraining on much larger datasets.
Published: 2019
Full Text: View/download PDF

2. 4.1 A 7nm 5G Mobile SoC Featuring a 3.0GHz Tri-Gear Application Processor Subsystem

Author: Jason Tsai, Manzur Rahman, Lee-Kee Yong, Rolf Lagerquist, Henry Hsieh, Vincent Lin, Sa Huang, Sudhakar Maruthi, Elly Chiang, Wade Wu, Ericbill Wang, Hsinchen Chen, Ashish Nayak, Anand Rajagopalan, Tao Chen, Gordon Gammie, Curtis Lin, Cheng-Yuh Wu, Hugh Mair, Ramu Madhavaram, Gokulakrishnan Manoharan, Amjad Sikiligiri, Daniel Dia, Efron Ho, Jenny Wiedemeier, Barry Chen, Achuta Thippana, Madhur Jagota, Chi-Jui Chung, and Po-Yang Hsu
Subjects: Application processor, business.industry, Operating environment, Computer science, Clock rate, CAD, Topology (electrical circuits), business, Computer hardware, 5G, Die (integrated circuit)
Abstract: This paper describes a new CPU subsystem featured in a 5G mobile SoC. The High-Performance (HP) core achieves a 3GHz clock frequency with full production yield across the fabrication range and operating environment. In contrast to previously published work [1], a third, balanced-performance (BP), gear is introduced which features a power-optimized implementation of the high-performance (HP) core. Physical implementation differences of the BP and HP cores are illustrated, while circuit techniques developed to enable full-yield 3GHz operation are detailed. A die photograph alongside a more detailed HP core CAD drawing are shown in Fig. 4.1.7 and the cluster topology, including architectural features, is summarized in Fig. 4.1.1.
Published: 2021
Full Text: View/download PDF

3. Preparation and optimization of a diverse workload for a large-scale heterogeneous system

Author: Martin Schulz, Ulrike Meier Yang, David F. Richards, Tong Chen, Shiv Sundram, Todd Gamblin, Shelby Lockhart, Phil Regier, David Beckingsale, Ed Zywicz, Ruipeng Li, Giacomo Domeniconi, James C. Sexton, Bob Walkup, Jarom Nelson, Carlos Costa, Hui-Fang Wen, Ramesh Pankajakshan, John A. Gunnels, Xiaohua Zhang, Brian Van Essen, Kathryn M. O'Brien, I-Feng W. Kuo, Johann Dahm, Guillaume Thomas-Collignon, Bert Still, Naoya Maruyama, Jamie A. Bramwell, David Boehme, Kathleen Shoga, Carol S. Woodward, Howard A. Scott, M. P. Katz, Ian Karlin, T Epperly, Tzanio V. Kolev, Eun Kyung Lee, Steven H. Langer, Christopher Ward, David J. Gardner, Sara I. L. Kokkila-Schumacher, Christopher Young, Kevin O'Brien, Barry Chen, Björn Sjögreen, Jose R. Brunheroto, Claudia Misale, Roger Pearce, Guojing Cong, Matthew Legendre, Lu Wang, Jaime H. Moreno, Kathleen McCandless, Cyril Zeller, Rao Nimmakayala, Bronis R. de Supinski, Xinyu Que, Sorin Bastea, Robert D. Falgout, Peng Wang, Charway R. Cooper, Aaron Fisher, Jim Brase, R. Neely, David Appelhans, Alexey Voronin, James N. Glosli, Slaven Peles, Pei-Hung Lin, Tony Degroot, Hai Le, Daniel A. White, Levi Barnes, Steve Rennich, Yoonho Park, Peter D. Barnes, Bob Anderson, Jonathan J. Wong, and Robert C. Blake
Subjects: 020203 distributed computing, geography, Summit, geography.geographical_feature_category, Computer science, business.industry, Emerging technologies, Scale (chemistry), Center of excellence, Workload, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Engineering management, 0202 electrical engineering, electronic engineering, information engineering, Systems architecture, Programming paradigm, Project management, business, 0105 earth and related environmental sciences
Abstract: Productivity from day one on supercomputers that leverage new technologies requires significant preparation. An institution that procures a novel system architecture often lacks sufficient institutional knowledge and skills to prepare for it. Thus, the "Center of Excellence" (CoE) concept has emerged to prepare for systems such as Summit and Sierra, currently the top two systems in the Top 500. This paper documents CoE experiences that prepared a workload of diverse applications and math libraries for a heterogeneous system. We describe our approach to this preparation, including our management and execution strategies, and detail our experiences with and reasons for using different programming approaches. Our early science and performance results show that the project enabled significant early seismic science with up to a l4X throughput increase over Cori. In addition to our successes, we discuss our challenges and failures so others may benefit from our experience.
Published: 2019
Full Text: View/download PDF

4. Video Action Recognition With an Additional End-to-End Trained Temporal Stream

Author: Joshua Shapiro, Giacomo Domeniconi, Chih-Chieh Yang, Barry Chen, and Guojing Cong
Subjects: Flexibility (engineering), Artificial neural network, Computer science, business.industry, Training (meteorology), Optical flow, Process (computing), Pattern recognition, 02 engineering and technology, Ensemble learning, Convolution, End-to-end principle, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business
Abstract: Detecting actions in videos requires understanding the temporal relationships among frames. Typical action recognition approaches rely on optical flow estimation methods to convey temporal information to a CNN. Recent studies employ 3D convolutions in addition to optical flow to process the temporal information. While these models achieve slightly better results than two-stream 2D convolutional approaches, they are significantly more complex, requiring more data and time to be trained. We propose an efficient, adaptive batch size distributed training algorithm with customized optimizations for training the two 2D streams. We introduce a new 2D convolutional temporal stream that is trained end-to-end with a neural network. The flexibility to freeze some network layers from training in this temporal stream brings the possibility of ensemble learning with more than one temporal streams. Our architecture that combines three streams achieves the highest accuracies as we know of on UCF101 and HMDB51 by systems that do not pretrain on much larger datasets (e.g., Kinetics). We achieve these results while keeping our spatial and temporal streams 4.67x faster to train than the 3D convolution approaches.
Published: 2019
Full Text: View/download PDF

5. Accelerating Deep Neural Network Training for Action Recognition on a Cluster of GPUs

Author: Joshua Shapiro, Barry Chen, Guojing Cong, Fan Zhou, and Giacomo Domeniconi
Subjects: Artificial neural network, Computer science, business.industry, Training (meteorology), 02 engineering and technology, 010501 environmental sciences, Cluster (spacecraft), Machine learning, computer.software_genre, 01 natural sciences, Stochastic gradient descent, Dimension (vector space), Asynchronous communication, Distributed algorithm, Convergence (routing), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, 0105 earth and related environmental sciences
Abstract: Due to the additional temporal dimension, large-scale video action recognition is even more challenging than image recognition and typically takes days to train on modern GPUs even for modest-sized datasets. We propose algorithms and techniques to accelerate training of deep neural networks for action recognition on a cluster of GPUs. In terms of convergence and scaling, our distributed training algorithm with adaptive batch size is provably superior to popular asynchronous stochastic gradient descent algorithms. The convergence analysis of our algorithm shows it is possible to reduce communication cost and at the same time minimize the number of iterations needed for convergence. We customize the Adam optimizer for our distributed algorithm to improve efficiency. In addition, we employ transfer-learning to further reduce training time while improving validation accuracy. Compared with the base-line single-GPU stochastic gradient descent implementation of the two-stream training approach, our implementation achieves super-linear speedups on 16 GPUs while improving validation accuracy. For the UCFI0l and HMDB51 datasets, the validation accuracies achieved are 93.1 % and 67.9% respectively. As far as we know, these are the highest accuracies achieved with the two-stream approach that does not involve computationally expensive 3D convolutions or pretraining on much larger datasets.
Published: 2018
Full Text: View/download PDF

6. Assessing semantic information in convolutional neural network representations of images via image annotation

Author: Barry Chen, Karl Ni, and Michael B. Mayhew
Subjects: Contextual image classification, Artificial neural network, Computer science, business.industry, Feature extraction, Pattern recognition, 02 engineering and technology, 010501 environmental sciences, Semantics, Machine learning, computer.software_genre, 01 natural sciences, Convolutional neural network, k-nearest neighbors algorithm, Automatic image annotation, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Image retrieval, 0105 earth and related environmental sciences
Abstract: Image annotation, or prediction of multiple tags for an image, is a challenging task. Most current algorithms are based on large sets of handcrafted features. Deep convolutional neural networks have recently outperformed humans in image classification, and these networks can be used to extract features highly predictive of an image's tags. In this study, we analyze semantic information in features derived from two pre-trained deep network classifiers by evaluating their performance in nearest neighbor-based approaches to tag prediction. We generally exceed performance of the manual features when using the deep features. We also find complementary information in the manual and deep features when used in combination for image annotation.
Published: 2016
Full Text: View/download PDF

7. LBANN

Author: Brian Van Essen, Hyojin Kim, Kofi Boakye, Roger Pearce, and Barry Chen
Subjects: Training set, Artificial neural network, Computer science, Data parallelism, business.industry, Scale (chemistry), Distributed computing, Deep learning, Artificial intelligence, business, Supercomputer
Abstract: Recent successes of deep learning have been largely driven by the ability to train large models on vast amounts of data. We believe that High Performance Computing (HPC) will play an increasingly important role in helping deep learning achieve the next level of innovation fueled by neural network models that are orders of magnitude larger and trained on commensurately more training data. We are targeting the unique capabilities of both current and upcoming HPC systems to train massive neural networks and are developing the Livermore Big Artificial Neural Network (LBANN) toolkit to exploit both model and data parallelism optimized for large scale HPC resources. This paper presents our preliminary results in scaling the size of model that can be trained with the LBANN toolkit.
Published: 2015
Full Text: View/download PDF

8. Geospatial image mining for nuclear proliferation detection: Challenges and new opportunities

Author: Aggelos K. Katsaggelos, Ranga Raju Vatsavai, Anil Cheriyadat, Reid B. Porter, Carl F. Diegert, Ryan E. Hohimer, Barry Chen, Eddie A Bright, James S. Bollinger, Lloyd F. Arrowood, Budhendra L. Bhaduri, Shaun S. Gleason, and Thrasos Pappas
Subjects: Nuclear technology, Geospatial analysis, Computer science, Key (cryptography), Nuclear proliferation, ComputerApplications_COMPUTERSINOTHERSYSTEMS, Context (language use), Earth observation satellite, computer.software_genre, Data science, computer, Nuclear program
Abstract: With increasing understanding and availability of nuclear technologies, and increasing persuasion of nuclear technologies by several new countries, it is increasingly becoming important to monitor the nuclear proliferation activities. There is a great need for developing technologies to automatically or semi-automatically detect nuclear proliferation activities using remote sensing. Images acquired from earth observation satellites is an important source of information in detecting proliferation activities. High-resolution remote sensing images are highly useful in verifying the correctness, as well as completeness of any nuclear program. DOE national laboratories are interested in detecting nuclear proliferation by developing advanced geospatial image mining algorithms. In this paper we describe the current understanding of geospatial image mining techniques and enumerate key gaps and identify future research needs in the context of nuclear proliferation.
Published: 2010
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

8 results on '"Barry Chen"'

1. Fast neural network training on a cluster of GPUs for action recognition with high accuracy

2. 4.1 A 7nm 5G Mobile SoC Featuring a 3.0GHz Tri-Gear Application Processor Subsystem

3. Preparation and optimization of a diverse workload for a large-scale heterogeneous system

4. Video Action Recognition With an Additional End-to-End Trained Temporal Stream

5. Accelerating Deep Neural Network Training for Action Recognition on a Cluster of GPUs

6. Assessing semantic information in convolutional neural network representations of images via image annotation

7. LBANN

8. Geospatial image mining for nuclear proliferation detection: Challenges and new opportunities

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

8 results on '"Barry Chen"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources