Author: "Emani, Murali" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Emani, Murali"' showing total 122 results

Start Over Author "Emani, Murali"

122 results on '"Emani, Murali"'

1. LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Author: Chitty-Venkata, Krishna Teja, Raskar, Siddhisanket, Kale, Bharat, Ferdaus, Farah, Tanikanti, Aditya, Raffenetti, Ken, Taylor, Valerie, Emani, Murali, and Vishwanath, Venkatram
Subjects: Computer Science - Machine Learning
Abstract: Large Language Models (LLMs) have propelled groundbreaking advancements across several domains and are commonly used for text generation applications. However, the computational demands of these complex models pose significant challenges, requiring efficient hardware acceleration. Benchmarking the performance of LLMs across diverse hardware platforms is crucial to understanding their scalability and throughput characteristics. We introduce LLM-Inference-Bench, a comprehensive benchmarking suite to evaluate the hardware inference performance of LLMs. We thoroughly analyze diverse hardware platforms, including GPUs from Nvidia and AMD and specialized AI accelerators, Intel Habana and SambaNova. Our evaluation includes several LLM inference frameworks and models from LLaMA, Mistral, and Qwen families with 7B and 70B parameters. Our benchmarking results reveal the strengths and limitations of various models, hardware platforms, and inference frameworks. We provide an interactive dashboard to help identify configurations for optimal performance for a given hardware platform.
Published: 2024

2. AI-coupled HPC Workflow Applications, Middleware and Performance

Author: Brewer, Wes, Gainaru, Ana, Suter, Frédéric, Wang, Feiyi, Emani, Murali, and Jha, Shantenu
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: AI integration is revolutionizing the landscape of HPC simulations, enhancing the importance, use, and performance of AI-driven HPC workflows. This paper surveys the diverse and rapidly evolving field of AI-driven HPC and provides a common conceptual basis for understanding AI-driven HPC workflows. Specifically, we use insights from different modes of coupling AI into HPC workflows to propose six execution motifs most commonly found in scientific applications. The proposed set of execution motifs is by definition incomplete and evolving. However, they allow us to analyze the primary performance challenges underpinning AI-driven HPC workflows. We close with a listing of open challenges, research issues, and suggested areas of investigation including the the need for specific benchmarks that will help evaluate and improve the execution of AI-driven HPC workflows.
Published: 2024

3. DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

Author: Song, Shuaiwen Leon, Kruft, Bonnie, Zhang, Minjia, Li, Conglong, Chen, Shiyang, Zhang, Chengming, Tanaka, Masahiro, Wu, Xiaoxia, Rasley, Jeff, Awan, Ammar Ahmad, Holmes, Connor, Cai, Martin, Ghanem, Adam, Zhou, Zhongzhu, He, Yuxiong, Luferenko, Pete, Kumar, Divya, Weyn, Jonathan, Zhang, Ruixiong, Klocek, Sylwester, Vragov, Volodymyr, AlQuraishi, Mohammed, Ahdritz, Gustaf, Floristean, Christina, Negri, Cristina, Kotamarthi, Rao, Vishwanath, Venkatram, Ramanathan, Arvind, Foreman, Sam, Hippe, Kyle, Arcomano, Troy, Maulik, Romit, Zvyagin, Maxim, Brace, Alexander, Zhang, Bin, Bohorquez, Cindy Orozco, Clyde, Austin, Kale, Bharat, Perez-Rivera, Danilo, Ma, Heng, Mann, Carla M., Irvin, Michael, Pauloski, J. Gregory, Ward, Logan, Hayot, Valerie, Emani, Murali, Xie, Zhen, Lin, Diangen, Shukla, Maulik, Foster, Ian, Davis, James J., Papka, Michael E., Brettin, Thomas, Balaprakash, Prasanna, Tourassi, Gina, Gounley, John, Hanson, Heidi, Potok, Thomas E, Pasini, Massimiliano Lupo, Evans, Kate, Lu, Dan, Lunga, Dalton, Yin, Junqi, Dash, Sajal, Wang, Feiyi, Shankar, Mallikarjun, Lyngaas, Isaac, Wang, Xiao, Cong, Guojing, Zhang, Pei, Fan, Ming, Liu, Siyan, Hoisie, Adolfy, Yoo, Shinjae, Ren, Yihui, Tang, William, Felker, Kyle, Svyatkovskiy, Alexey, Liu, Hang, Aji, Ashwin, Dalton, Angela, Schulte, Michael, Schulz, Karl, Deng, Yuntian, Nie, Weili, Romero, Josh, Dallago, Christian, Vahdat, Arash, Xiao, Chaowei, Gibbs, Thomas, Anandkumar, Anima, and Stevens, Rick
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique capabilities through AI system technology innovations to help domain experts to unlock today's biggest science mysteries. By leveraging DeepSpeed's current technology pillars (training, inference and compression) as base technology enablers, DeepSpeed4Science will create a new set of AI system technologies tailored for accelerating scientific discoveries by addressing their unique complexity beyond the common technical approaches used for accelerating generic large language models (LLMs). In this paper, we showcase the early progress we made with DeepSpeed4Science in addressing two of the critical system challenges in structural biology research.
Published: 2023

4. A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators

Author: Emani, Murali, Foreman, Sam, Sastry, Varuni, Xie, Zhen, Raskar, Siddhisanket, Arnold, William, Thakur, Rajeev, Vishwanath, Venkatram, and Papka, Michael E.
Subjects: Computer Science - Performance, Computer Science - Artificial Intelligence, Computer Science - Hardware Architecture, Computer Science - Machine Learning
Abstract: Artificial intelligence (AI) methods have become critical in scientific applications to help accelerate scientific discovery. Large language models (LLMs) are being considered as a promising approach to address some of the challenging problems because of their superior generalization capabilities across domains. The effectiveness of the models and the accuracy of the applications is contingent upon their efficient execution on the underlying hardware infrastructure. Specialized AI accelerator hardware systems have recently become available for accelerating AI applications. However, the comparative performance of these AI accelerators on large language models has not been previously studied. In this paper, we systematically study LLMs on multiple AI accelerators and GPUs and evaluate their performance characteristics for these models. We evaluate these systems with (i) a micro-benchmark using a core transformer block, (ii) a GPT- 2 model, and (iii) an LLM-driven science use case, GenSLM. We present our findings and analyses of the models' performance to better understand the intrinsic capabilities of AI accelerators. Furthermore, our analysis takes into account key factors such as sequence lengths, scaling behavior, sparsity, and sensitivity to gradient accumulation steps.
Published: 2023

5. HPC-GPT: Integrating Large Language Model for High-Performance Computing

Author: Ding, Xianzhong, Chen, Le, Emani, Murali, Liao, Chunhua, Lin, Pei-Hung, Vanderbruggen, Tristan, Xie, Zhen, Cerpa, Alberto E., and Du, Wan
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Large Language Models (LLMs), including the LLaMA model, have exhibited their efficacy across various general-domain natural language processing (NLP) tasks. However, their performance in high-performance computing (HPC) domain tasks has been less than optimal due to the specialized expertise required to interpret the model responses. In response to this challenge, we propose HPC-GPT, a novel LLaMA-based model that has been supervised fine-tuning using generated QA (Question-Answer) instances for the HPC domain. To evaluate its effectiveness, we concentrate on two HPC tasks: managing AI models and datasets for HPC, and data race detection. By employing HPC-GPT, we demonstrate comparable performance with existing methods on both tasks, exemplifying its excellence in HPC-related scenarios. Our experiments on open-source benchmarks yield extensive results, underscoring HPC-GPT's potential to bridge the performance gap between LLMs and HPC-specific tasks. With HPC-GPT, we aim to pave the way for LLMs to excel in HPC domains, simplifying the utilization of language models in complex computing applications., Comment: 9 pages
Published: 2023
Full Text: View/download PDF

6. Data Race Detection Using Large Language Models

Author: Chen, Le, Ding, Xianzhong, Emani, Murali, Vanderbruggen, Tristan, Lin, Pei-hung, and Liao, Chuanhua
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Large language models (LLMs) are demonstrating significant promise as an alternate strategy to facilitate analyses and optimizations of high-performance computing programs, circumventing the need for resource-intensive manual tool creation. In this paper, we explore a novel LLM-based data race detection approach combining prompting engineering and fine-tuning techniques. We create a dedicated dataset named DRB-ML, which is derived from DataRaceBench, with fine-grain labels showing the presence of data race pairs and their associated variables, line numbers, and read/write information. DRB-ML is then used to evaluate representative LLMs and fine-tune open-source ones. Our experiment shows that LLMs can be a viable approach to data race detection. However, they still cannot compete with traditional data race detection tools when we need detailed information about variable pairs causing data races.
Published: 2023

7. A Survey of Techniques for Optimizing Transformer Inference

Author: Chitty-Venkata, Krishna Teja, Mittal, Sparsh, Emani, Murali, Vishwanath, Venkatram, and Somani, Arun K.
Subjects: Computer Science - Machine Learning, Computer Science - Hardware Architecture, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent years have seen a phenomenal rise in performance and applications of transformer neural networks. The family of transformer networks, including Bidirectional Encoder Representations from Transformer (BERT), Generative Pretrained Transformer (GPT) and Vision Transformer (ViT), have shown their effectiveness across Natural Language Processing (NLP) and Computer Vision (CV) domains. Transformer-based networks such as ChatGPT have impacted the lives of common men. However, the quest for high predictive performance has led to an exponential increase in transformers' memory and compute footprint. Researchers have proposed techniques to optimize transformer inference at all levels of abstraction. This paper presents a comprehensive survey of techniques for optimizing the inference phase of transformer networks. We survey techniques such as knowledge distillation, pruning, quantization, neural architecture search and lightweight network design at the algorithmic level. We further review hardware-level optimization techniques and the design of novel hardware accelerators for transformers. We summarize the quantitative results on the number of parameters/FLOPs and accuracy of several models/techniques to showcase the tradeoff exercised by them. We also outline future directions in this rapidly evolving field of research. We believe that this survey will educate both novice and seasoned researchers and also spark a plethora of research efforts in this field.
Published: 2023

8. LM4HPC: Towards Effective Language Model Application in High-Performance Computing

Author: Chen, Le, Lin, Pei-Hung, Vanderbruggen, Tristan, Liao, Chunhua, Emani, Murali, and de Supinski, Bronis
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: In recent years, language models (LMs), such as GPT-4, have been widely used in multiple domains, including natural language processing, visualization, and so on. However, applying them for analyzing and optimizing high-performance computing (HPC) software is still challenging due to the lack of HPC-specific support. In this paper, we design the LM4HPC framework to facilitate the research and development of HPC software analyses and optimizations using LMs. Tailored for supporting HPC datasets, AI models, and pipelines, our framework is built on top of a range of components from different levels of the machine learning software stack, with Hugging Face-compatible APIs. Using three representative tasks, we evaluated the prototype of our framework. The results show that LM4HPC can help users quickly evaluate a set of state-of-the-art models and generate insightful leaderboards.
Published: 2023

9. A Multi-Level, Multi-Scale Visual Analytics Approach to Assessment of Multifidelity HPC Systems

Author: Shilpika, Lusch, Bethany, Emani, Murali, Simini, Filippo, Vishwanath, Venkatram, Papka, Michael E., and Ma, Kwan-Liu
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Computer Vision and Pattern Recognition
Abstract: The ability to monitor and interpret of hardware system events and behaviors are crucial to improving the robustness and reliability of these systems, especially in a supercomputing facility. The growing complexity and scale of these systems demand an increase in monitoring data collected at multiple fidelity levels and varying temporal resolutions. In this work, we aim to build a holistic analytical system that helps make sense of such massive data, mainly the hardware logs, job logs, and environment logs collected from disparate subsystems and components of a supercomputer system. This end-to-end log analysis system, coupled with visual analytics support, allows users to glean and promptly extract supercomputer usage and error patterns at varying temporal and spatial resolutions. We use multiresolution dynamic mode decomposition (mrDMD), a technique that depicts high-dimensional data as correlated spatial-temporal variations patterns or modes, to extract variation patterns isolated at specified frequencies. Our improvements to the mrDMD algorithm help promptly reveal useful information in the massive environment log dataset, which is then associated with the processed hardware and job log datasets using our visual analytics system. Furthermore, our system can identify the usage and error patterns filtered at user, project, and subcomponent levels. We exemplify the effectiveness of our approach with two use scenarios with the Cray XC40 supercomputer.
Published: 2023

10. Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation

Author: Verma, Gaurav, Raskar, Siddhisanket, Xie, Zhen, Malik, Abid M, Emani, Murali, and Chapman, Barbara
Subjects: Computer Science - Programming Languages, Computer Science - Machine Learning
Abstract: Tuning tensor program generation involves searching for various possible program transformation combinations for a given program on target hardware to optimize the tensor program execution. It is already a complex process because of the massive search space and exponential combinations of transformations make auto-tuning tensor program generation more challenging, especially when we have a heterogeneous target. In this research, we attempt to address these problems by learning the joint neural network and hardware features and transferring them to the new target hardware. We extensively study the existing state-of-the-art dataset, TenSet, perform comparative analysis on the test split strategies and propose methodologies to prune the dataset. We adopt an attention-inspired approach for tuning the tensor programs enabling them to embed neural network and hardware-specific features. Our approach could prune the dataset up to 45\% of the baseline without compromising the Pairwise Comparison Accuracy (PCA). Further, the proposed methodology can achieve on-par or improved mean inference time with 25%-40% of the baseline tuning time across different networks and target hardware.
Published: 2023
Full Text: View/download PDF

11. FAIR for AI: An interdisciplinary and international community building perspective.

Author: Huerta, E, Blaiszik, Ben, Brinson, L, Bouchard, Kristofer, Diaz, Daniel, Doglioni, Caterina, Emani, Murali, Foster, Ian, Fox, Geoffrey, Harris, Philip, Heinrich, Lukas, Jha, Shantenu, Katz, Daniel, Kindratenko, Volodymyr, Kirkpatrick, Christine, Lassila-Perini, Kati, Madduri, Ravi, Neubauer, Mark, Psomopoulos, Fotis, Roy, Avik, Rübel, Oliver, Zhao, Zhizhen, Zhu, Ruike, and Duarte, Javier
Abstract: A foundational set of findable, accessible, interoperable, and reusable (FAIR) principles were proposed in 2016 as prerequisites for proper data management and stewardship, with the goal of enabling the reusability of scholarly data. The principles were also meant to apply to other digital assets, at a high level, and over time, the FAIR guiding principles have been re-interpreted or extended to include the software, tools, algorithms, and workflows that produce data. FAIR principles are now being adapted in the context of AI models and datasets. Here, we present the perspectives, vision, and experiences of researchers from different countries, disciplines, and backgrounds who are leading the definition and adoption of FAIR principles in their communities of practice, and discuss outcomes that may result from pursuing and incentivizing FAIR AI research. The material for this report builds on the FAIR for AI Workshop held at Argonne National Laboratory on June 7, 2022.
Published: 2023

12. Towards Seamless Management of AI Models in High-Performance Computing

Author: Yu, Sixing, Emani, Murali, Liao, Chunhua, Lin, Pei-Hung, Vanderbruggen, Tristan, Shen, Xipeng, and Jannesari, Ali
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: With the increasing prevalence of artificial intelligence (AI) in diverse science/engineering communities, AI models emerge on an unprecedented scale among various domains. However, given the complexity and diversity of the software and hardware environments, reusing AI artifacts (models and datasets) is extremely challenging, especially with AI-driven science applications. Building an ecosystem to run and reuse AI applications/datasets at scale efficiently becomes increasingly essential for diverse science and engineering and high-performance computing (HPC) communities. In this paper, we innovate over an HPC-AI ecosystem -- HPCFair, which enables the Findable, Accessible, Interoperable, and Reproducible (FAIR) principles. HPCFair enables the collection of AI models/datasets allowing users to download/upload AI artifacts with authentications. Most importantly, our proposed framework provides user-friendly APIs for users to easily run inference jobs and customize AI artifacts to their tasks as needed. Our results show that, with HPCFair API, users irrespective of technical expertise in AI, can easily leverage AI artifacts to their tasks with minimal effort., Comment: Accepted at the 2nd Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)
Published: 2022

13. WActiGrad: Structured Pruning for Efficient Finetuning and Inference of Large Language Models on AI Accelerators

Author: Chitty-Venkata, Krishna Teja, Sastry, Varuni Katti, Emani, Murali, Vishwanath, Venkatram, Shanmugavelu, Sanjif, Howland, Sylvia, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Carretero, Jesus, editor, Shende, Sameer, editor, Garcia-Blas, Javier, editor, Brandic, Ivona, editor, Olcoz, Katzalin, editor, and Schreiber, Martin, editor
Published: 2024
Full Text: View/download PDF

14. Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study

Author: Lin, Pei-Hung, Liao, Chunhua, Chen, Winson, Vanderbruggen, Tristan, Emani, Murali, and Xu, Hailu
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: The FAIR Guiding Principles aim to improve the findability, accessibility, interoperability, and reusability of digital content by making them both human and machine actionable. However, these principles have not yet been broadly adopted in the domain of machine learning-based program analyses and optimizations for High-Performance Computing (HPC). In this paper, we design a methodology to make HPC datasets and machine learning models FAIR after investigating existing FAIRness assessment and improvement techniques. Our methodology includes a comprehensive, quantitative assessment for elected data, followed by concrete, actionable suggestions to improve FAIRness with respect to common issues related to persistent identifiers, rich metadata descriptions, license and provenance information. Moreover, we select a representative training dataset to evaluate our methodology. The experiment shows the methodology can effectively improve the dataset and model's FAIRness from an initial score of 19.1% to the final score of 83.0%.
Published: 2022

15. FAIR for AI: An interdisciplinary and international community building perspective

Author: Huerta, E. A., Blaiszik, Ben, Brinson, L. Catherine, Bouchard, Kristofer E., Diaz, Daniel, Doglioni, Caterina, Duarte, Javier M., Emani, Murali, Foster, Ian, Fox, Geoffrey, Harris, Philip, Heinrich, Lukas, Jha, Shantenu, Katz, Daniel S., Kindratenko, Volodymyr, Kirkpatrick, Christine R., Lassila-Perini, Kati, Madduri, Ravi K., Neubauer, Mark S., Psomopoulos, Fotis E., Roy, Avik, Rübel, Oliver, Zhao, Zhizhen, and Zhu, Ruike
Subjects: Computer Science - Computers and Society, Computer Science - Human-Computer Interaction, Computer Science - Machine Learning, High Energy Physics - Experiment, I.2.0, E.0
Abstract: A foundational set of findable, accessible, interoperable, and reusable (FAIR) principles were proposed in 2016 as prerequisites for proper data management and stewardship, with the goal of enabling the reusability of scholarly data. The principles were also meant to apply to other digital assets, at a high level, and over time, the FAIR guiding principles have been re-interpreted or extended to include the software, tools, algorithms, and workflows that produce data. FAIR principles are now being adapted in the context of AI models and datasets. Here, we present the perspectives, vision, and experiences of researchers from different countries, disciplines, and backgrounds who are leading the definition and adoption of FAIR principles in their communities of practice, and discuss outcomes that may result from pursuing and incentivizing FAIR AI research. The material for this report builds on the FAIR for AI Workshop held at Argonne National Laboratory on June 7, 2022., Comment: 10 pages, comments welcome!; v2: 12 pages, accepted to Scientific Data
Published: 2022
Full Text: View/download PDF

16. Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines

Author: Flynn, Patrick, Vanderbruggen, Tristan, Liao, Chunhua, Lin, Pei-Hung, Emani, Murali, and Shen, Xipeng
Subjects: Computer Science - Machine Learning, Computer Science - Programming Languages
Abstract: Programming Language Processing (PLP) using machine learning has made vast improvements in the past few years. Increasingly more people are interested in exploring this promising field. However, it is challenging for new researchers and developers to find the right components to construct their own machine learning pipelines, given the diverse PLP tasks to be solved, the large number of datasets and models being released, and the set of complex compilers or tools involved. To improve the findability, accessibility, interoperability and reusability (FAIRness) of machine learning components, we collect and analyze a set of representative papers in the domain of machine learning-based PLP. We then identify and characterize key concepts including PLP tasks, model architectures and supportive tools. Finally, we show some example use cases of leveraging the reusable components to construct machine learning pipelines to solve a set of PLP tasks.
Published: 2022

17. MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

Author: Farrell, Steven, Emani, Murali, Balma, Jacob, Drescher, Lukas, Drozd, Aleksandr, Fink, Andreas, Fox, Geoffrey, Kanter, David, Kurth, Thorsten, Mattson, Peter, Mu, Dawei, Ruhela, Amit, Sato, Kento, Shirahata, Koichi, Tabaru, Tsuguchika, Tsaris, Aristeidis, Balewski, Jan, Cumming, Ben, Danjo, Takumi, Domke, Jens, Fukai, Takaaki, Fukumoto, Naoto, Fukushi, Tatsuya, Gerofi, Balazs, Honda, Takumi, Imamura, Toshiyuki, Kasagi, Akihiko, Kawakami, Kentaro, Kudo, Shuhei, Kuroda, Akiyoshi, Martinasso, Maxime, Matsuoka, Satoshi, Mendonça, Henrique, Minami, Kazuki, Ram, Prabhat, Sawada, Takashi, Shankar, Mallikarjun, John, Tom St., Tabuchi, Akihiro, Vishwanath, Venkatram, Wahib, Mohamed, Yamazaki, Masafumi, and Yin, Junqi
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Scientific communities are increasingly adopting machine learning and deep learning models in their applications to accelerate scientific insights. High performance computing systems are pushing the frontiers of performance with a rich diversity of hardware resources and massive scale-out capabilities. There is a critical need to understand fair and effective benchmarking of machine learning applications that are representative of real-world scientific use cases. MLPerf is a community-driven standard to benchmark machine learning workloads, focusing on end-to-end performance metrics. In this paper, we introduce MLPerf HPC, a benchmark suite of large-scale scientific machine learning training applications driven by the MLCommons Association. We present the results from the first submission round, including a diverse set of some of the world's largest HPC systems. We develop a systematic framework for their joint analysis and compare them in terms of data staging, algorithmic convergence, and compute performance. As a result, we gain a quantitative understanding of optimizations on different subsystems such as staging and on-node loading of data, compute-unit utilization, and communication scheduling, enabling overall $>10 \times$ (end-to-end) performance improvements through system scaling. Notably, our analysis shows a scale-dependent interplay between the dataset size, a system's memory hierarchy, and training convergence that underlines the importance of near-compute storage. To overcome the data-parallel scalability challenge at large batch sizes, we discuss specific learning techniques and hybrid data-and-model parallelism that are effective on large systems. We conclude by characterizing each benchmark with respect to low-level memory, I/O, and network behavior to parameterize extended roofline performance models in future rounds.
Published: 2021

18. TrainBF: High-Performance DNN Training Engine Using BFloat16 on AI Accelerators

Author: Xie, Zhen, Raskar, Siddhisanket, Emani, Murali, Vishwanath, Venkatram, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Cano, José, editor, Dikaiakos, Marios D., editor, Papadopoulos, George A., editor, Pericàs, Miquel, editor, and Sakellariou, Rizos, editor
Published: 2023
Full Text: View/download PDF

19. Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines

Author: Flynn, Patrick, Vanderbruggen, Tristan, Liao, Chunhua, Lin, Pei-Hung, Emani, Murali, Shen, Xipeng, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Batista, Thais, editor, Bureš, Tomáš, editor, Raibulet, Claudia, editor, and Muccini, Henry, editor
Published: 2023
Full Text: View/download PDF

20. TrainBF: High-Performance DNN Training Engine Using BFloat16 on AI Accelerators

Author: Xie, Zhen, primary, Raskar, Siddhisanket, additional, Emani, Murali, additional, and Vishwanath, Venkatram, additional
Published: 2023
Full Text: View/download PDF

21. LM4HPC: Towards Effective Language Model Application in High-Performance Computing

Author: Chen, Le, primary, Lin, Pei-Hung, additional, Vanderbruggen, Tristan, additional, Liao, Chunhua, additional, Emani, Murali, additional, and de Supinski, Bronis, additional
Published: 2023
Full Text: View/download PDF

22. AI Benchmarking for Science: Efforts from the MLCommons Science Working Group

Author: Thiyagalingam, Jeyan, von Laszewski, Gregor, Yin, Junqi, Emani, Murali, Papay, Juri, Barrett, Gregg, Luszczek, Piotr, Tsaris, Aristeidis, Kirkpatrick, Christine, Wang, Feiyi, Gibbs, Tom, Vishwanath, Venkatram, Shankar, Mallikarjun, Fox, Geoffrey, Hey, Tony, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Anzt, Hartwig, editor, Bienz, Amanda, editor, Luszczek, Piotr, editor, and Baboulin, Marc, editor
Published: 2022
Full Text: View/download PDF

23. 2022 AI Testbed Expeditions Report

Author: Vishwanath, Venkat, primary, Emani, Murali, additional, Taylor, Valerie, additional, Foster, Ian, additional, Habib, Salman, additional, Papka, Michael, additional, Babu, Anakha, additional, Chan, Henry, additional, Cherukara, Mathew, additional, Monsalve Diaz, Jose, additional, Doerfert, Johannes, additional, Feinstein, Jeremy, additional, Harder, Ross, additional, Hickey, Kevin, additional, Huckelheim, Jan, additional, Kandel, Saugat, additional, Kettimuthu, Rajkumar, additional, Kumar, Tadbhagya, additional, Liu, Zhengchun, additional, MacDonell, Margaret, additional, Miceli, Antonino, additional, Ngom, Marieme, additional, Pal, Pinaki, additional, Paulson, Noah, additional, Picel, Kurt, additional, Raghavan, Krishnan, additional, Ramanathan, Arvind, additional, Rangel, Esteban, additional, Raskar, Siddhisanket, additional, Sastry, Varuni, additional, Sivaraman, Ganesh, additional, Sun, Baixi, additional, Trovato, Marco, additional, Valentino, Lauren, additional, Xie, Zhen, additional, Yan, Eugene, additional, Yao, Yudong, additional, Yoshii, Kazutomo, additional, Yu, Xiaodong, additional, and Zhou, Tao, additional
Published: 2022
Full Text: View/download PDF

24. Thorough Characterization and Analysis of Large Transformer Model Training At-Scale

Author: Cheng, Scott, primary, Lin, Jun-Liang, additional, Emani, Murali, additional, Raskar, Siddhisanket, additional, Foreman, Sam, additional, Xie, Zhen, additional, Vishwanath, Venkatram, additional, and Kandemir, Mahmut Taylan, additional
Published: 2024
Full Text: View/download PDF

25. Cross-Feature Transfer Learning for Efficient Tensor Program Generation

Author: Verma, Gaurav, primary, Raskar, Siddhisanket, additional, Emani, Murali, additional, and Chapman, Barbara, additional
Published: 2024
Full Text: View/download PDF

26. Adaptive parallelism mapping in dynamic environments using machine learning

Author: Emani, Murali Krishna, O'Boyle, Michael, and Franke, Bjoern
Subjects: 006.3, parallel mapping, machine learning
Abstract: Modern day hardware platforms are parallel and diverse, ranging from mobiles to data centers. Mainstream parallel applications execute in the same system competing for resources. This resource contention may lead to a drastic degradation in a program’s performance. In addition, the execution environment composed of workloads and hardware resources, is dynamic and unpredictable. Efficient matching of program parallelism to machine parallelism under uncertainty is hard. The mapping policies that determine the optimal allocation of work to threads should anticipate these variations. This thesis proposes solutions to the mapping of parallel programs in dynamic environments. It employs predictive modelling techniques to determine the best degree of parallelism. Firstly, this thesis proposes a machine learning-based model to determine the optimal thread number for a target program co-executing with varying workloads. For this purpose, this offline trained model uses static code features and dynamic runtime information as input. Next, this thesis proposes a novel solution to monitor the proposed offline model and adjust its decisions in response to the environment changes. It develops a second predictive model for determining how the future environment should be, if the current thread prediction was optimal. Depending on how close this prediction was to the actual environment, the predicted thread numbers are adjusted. Furthermore, considering the multitude of potential execution scenarios where no single policy is best suited in all cases, this work proposes an approach based on the idea of mixture of experts. It considers a number of offline experts or mapping policies, each specialized for a given scenario, and learns online the best expert that is optimal for the current execution. When evaluated on highly dynamic executions, these solutions are proven to surpass default, state-of-art adaptive and analytic approaches.
Published: 2015

27. Characterizing the Performance of Triangle Counting on Graphcore's IPU Architecture

Author: Barik, Reet, primary, Raskar, Siddhisanket, additional, Emani, Murali, additional, and Vishwanath, Venkatram, additional
Published: 2023
Full Text: View/download PDF

28. HPC-GPT: Integrating Large Language Model for High-Performance Computing

Author: Ding, Xianzhong, primary, Chen, Le, additional, Emani, Murali, additional, Liao, Chunhua, additional, Lin, Pei-Hung, additional, Vanderbruggen, Tristan, additional, Xie, Zhen, additional, Cerpa, Alberto, additional, and Du, Wan, additional
Published: 2023
Full Text: View/download PDF

29. Data Race Detection Using Large Language Models

Author: Chen, Le, primary, Ding, Xianzhong, additional, Emani, Murali, additional, Vanderbruggen, Tristan, additional, Lin, Pei-Hung, additional, and Liao, Chunhua, additional
Published: 2023
Full Text: View/download PDF

30. GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics

Author: Zvyagin, Maxim, primary, Brace, Alexander, additional, Hippe, Kyle, additional, Deng, Yuntian, additional, Zhang, Bin, additional, Bohorquez, Cindy Orozco, additional, Clyde, Austin, additional, Kale, Bharat, additional, Perez-Rivera, Danilo, additional, Ma, Heng, additional, Mann, Carla M., additional, Irvin, Michael, additional, Ozgulbas, Defne G., additional, Vassilieva, Natalia, additional, Pauloski, James Gregory, additional, Ward, Logan, additional, Hayot-Sasson, Valerie, additional, Emani, Murali, additional, Foreman, Sam, additional, Xie, Zhen, additional, Lin, Diangen, additional, Shukla, Maulik, additional, Nie, Weili, additional, Romero, Josh, additional, Dallago, Christian, additional, Vahdat, Arash, additional, Xiao, Chaowei, additional, Gibbs, Thomas, additional, Foster, Ian, additional, Davis, James J., additional, Papka, Michael E., additional, Brettin, Thomas, additional, Stevens, Rick, additional, Anandkumar, Anima, additional, Vishwanath, Venkatram, additional, and Ramanathan, Arvind, additional
Published: 2023
Full Text: View/download PDF

31. A survey of techniques for optimizing transformer inference

Author: Chitty-Venkata, Krishna Teja, primary, Mittal, Sparsh, additional, Emani, Murali, additional, Vishwanath, Venkatram, additional, and Somani, Arun K., additional
Published: 2023
Full Text: View/download PDF

32. Mapping Medley: Adaptive Parallelism Mapping with Varying Optimization Goals

Author: Emani, Murali Krishna, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Ding, Chen, editor, Criswell, John, editor, and Wu, Peng, editor
Published: 2017
Full Text: View/download PDF

33. Change Detection Based Parallelism Mapping: Exploiting Offline Models and Online Adaptation

Author: Emani, Murali Krishna, O’Boyle, Michael, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Brodman, James, editor, and Tu, Peng, editor
Published: 2015
Full Text: View/download PDF

34. Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation

Author: Verma, Gaurav, primary, Raskar, Siddhisanket, additional, Xie, Zhen, additional, Malik, Abid M, additional, Emani, Murali, additional, and Chapman, Barbara, additional
Published: 2023
Full Text: View/download PDF

35. Differentiable Neural Architecture, Mixed Precision and Accelerator Co-search

Author: Chitty-Venkata, Krishna Teja, primary, Bian, Yiming, additional, Emani, Murali, additional, Vishwanath, Venkatram, additional, and Somani, Arun K., additional
Published: 2023
Full Text: View/download PDF

36. Neural Architecture Search Benchmarks: Insights and Survey

Author: Chitty-Venkata, Krishna Teja, primary, Emani, Murali, additional, Vishwanath, Venkatram, additional, and Somani, Arun K., additional
Published: 2023
Full Text: View/download PDF

37. Early Experience with Transformer-Based Similarity Analysis for DataRaceBench

Author: Chen, Winson, primary, Vanderbruggen, Tristan, additional, Lin, Pei-Hung, additional, Liao, Chunhua, additional, and Emani, Murali, additional
Published: 2022
Full Text: View/download PDF

38. Interactive NLU-Powered Ontology-Based Workflow Synthesis for FAIR Support of HPC

Author: Nan, Zifan, primary, Dave, Mithil, additional, Shen, Xipeng, additional, Liao, Chunhua, additional, Vanderbruggen, Tristan, additional, Lin, Pei-Hung, additional, and Emani, Murali, additional
Published: 2022
Full Text: View/download PDF

39. A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads

Author: Emani, Murali, primary, Xie, Zhen, additional, Raskar, Siddhisanket, additional, Sastry, Varuni, additional, Arnold, William, additional, Wilson, Bruce, additional, Thakur, Rajeev, additional, Vishwanath, Venkatram, additional, Liu, Zhengchun, additional, Papka, Michael E., additional, Bohorquez, Cindy Orozco, additional, Weisner, Rick, additional, Li, Karen, additional, Sheng, Yongning, additional, Du, Yun, additional, Zhang, Jian, additional, Tsyplikhin, Alexander, additional, Khaira, Gurdaman, additional, Fowers, Jeremy, additional, Sivakumar, Ramakrishnan, additional, Godsoe, Victoria, additional, Macias, Adrian, additional, Tekur, Chetan, additional, and Boyd, Matthew, additional
Published: 2022
Full Text: View/download PDF

40. GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics

Author: Zvyagin, Maxim, primary, Brace, Alexander, additional, Hippe, Kyle, additional, Deng, Yuntian, additional, Zhang, Bin, additional, Bohorquez, Cindy Orozco, additional, Clyde, Austin, additional, Kale, Bharat, additional, Perez-Rivera, Danilo, additional, Ma, Heng, additional, Mann, Carla M., additional, Irvin, Michael, additional, Gregory Pauloski, J., additional, Ward, Logan, additional, Hayot-Sasson, Valerie, additional, Emani, Murali, additional, Foreman, Sam, additional, Xie, Zhen, additional, Lin, Diangen, additional, Shukla, Maulik, additional, Nie, Weili, additional, Romero, Josh, additional, Dallago, Christian, additional, Vahdat, Arash, additional, Xiao, Chaowei, additional, Gibbs, Thomas, additional, Foster, Ian, additional, Davis, James J., additional, Papka, Michael E., additional, Brettin, Thomas, additional, Stevens, Rick, additional, Anandkumar, Anima, additional, Vishwanath, Venkatram, additional, and Ramanathan, Arvind, additional
Published: 2022
Full Text: View/download PDF

41. Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study

Author: Lin, Pei-Hung, primary, Liao, Chunhua, additional, Chen, Winson, additional, Vanderbruggen, Tristan, additional, Emani, Murali, additional, and Xu, Hailu, additional
Published: 2022
Full Text: View/download PDF

42. Mapping Medley: Adaptive Parallelism Mapping with Varying Optimization Goals

Author: Emani, Murali Krishna, primary
Published: 2017
Full Text: View/download PDF

43. Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action

Author: Trifan, Anda, primary, Gorgun, Defne, additional, Salim, Michael, additional, Li, Zongyi, additional, Brace, Alexander, additional, Zvyagin, Maxim, additional, Ma, Heng, additional, Clyde, Austin, additional, Clark, David, additional, Hardy, David J, additional, Burnley, Tom, additional, Huang, Lei, additional, McCalpin, John, additional, Emani, Murali, additional, Yoo, Hyenseung, additional, Yin, Junqi, additional, Tsaris, Aristeidis, additional, Subbiah, Vishal, additional, Raza, Tanveer, additional, Liu, Jessica, additional, Trebesch, Noah, additional, Wells, Geoffrey, additional, Mysore, Venkatesh, additional, Gibbs, Thomas, additional, Phillips, James, additional, Chennubhotla, S Chakra, additional, Foster, Ian, additional, Stevens, Rick, additional, Anandkumar, Anima, additional, Vishwanath, Venkatram, additional, Stone, John E, additional, Tajkhorshid, Emad, additional, Harris, Sarah A, additional, and Ramanathan, Arvind, additional
Published: 2022
Full Text: View/download PDF

44. Efficient Design Space Exploration for Sparse Mixed Precision Neural Architectures

Author: Chitty-Venkata, Krishna Teja, primary, Emani, Murali, additional, Vishwanath, Venkatram, additional, and Somani, Arun K., additional
Published: 2022
Full Text: View/download PDF

45. Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action

Author: Trifan, Anda, Gorgun, Defne, Salim, Michael, Li, Zongyi, Brace, Alexander, Zvyagin, Maxim, Ma, Heng, Clyde, Austin, Clark, David, Hardy, David J., Burnley, Tom, Huang, Lei, McCalpin, John, Emani, Murali, Yoo, Hyenseung, Yin, Junqi, Tsaris, Aristeidis, Subbiah, Vishal, Raza, Tanveer, Liu, Jessica, Trebesch, Noah, Wells, Geoffrey, Mysore, Venkatesh, Gibbs, Thomas, Phillips, James, Chennubhotla, S. Chakra, Foster, Ian, Stevens, Rick, Anandkumar, Anima, Vishwanath, Venkatram, Stone, John E., Tajkhorshid, Emad, Harris, Sarah A., Ramanathan, Arvind, Trifan, Anda, Gorgun, Defne, Salim, Michael, Li, Zongyi, Brace, Alexander, Zvyagin, Maxim, Ma, Heng, Clyde, Austin, Clark, David, Hardy, David J., Burnley, Tom, Huang, Lei, McCalpin, John, Emani, Murali, Yoo, Hyenseung, Yin, Junqi, Tsaris, Aristeidis, Subbiah, Vishal, Raza, Tanveer, Liu, Jessica, Trebesch, Noah, Wells, Geoffrey, Mysore, Venkatesh, Gibbs, Thomas, Phillips, James, Chennubhotla, S. Chakra, Foster, Ian, Stevens, Rick, Anandkumar, Anima, Vishwanath, Venkatram, Stone, John E., Tajkhorshid, Emad, Harris, Sarah A., and Ramanathan, Arvind
Abstract: The severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) replication transcription complex (RTC) is a multi-domain protein responsible for replicating and transcribing the viral mRNA inside a human cell. Attacking RTC function with pharmaceutical compounds is a pathway to treating COVID-19. Conventional tools, e.g. cryo-electron microscopy and all-atom molecular dynamics (AAMD), do not provide sufficiently high resolution or timescale to capture important dynamics of this molecular machine. Consequently, we develop an innovative workflow that bridges the gap between these resolutions, using mesoscale fluctuating finite element analysis (FFEA) continuum simulations and a hierarchy of AI-methods that continually learn and infer features for maintaining consistency between AAMD and FFEA simulations. We leverage a multi-site distributed workflow manager to orchestrate AI, FFEA, and AAMD jobs, providing optimal resource utilization across HPC centers. Our study provides unprecedented access to study the SARS-CoV-2 RTC machinery, while providing general capability for AI-enabled multi-resolution simulations at scale.
Published: 2022

46. GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics

Author: Zvyagin, Maxim, Brace, Alexander, Hippe, Kyle, Deng, Yuntian, Zhang, Bin, Orozco Bohorquez, Cindy, Clyde, Austin, Kale, Bharat, Perez-Rivera, Danilo, Ma, Heng, Mann, Carla M., Irvin, Michael, Pauloski, J. Gregory, Ward, Logan, Hayot-Sasson, Valerie, Emani, Murali, Foreman, Sam, Xie, Zhen, Lin, Diangen, Shukla, Maulik, Nie, Weili, Romero, Josh, Dallago, Christian, Vahdat, Arash, Xiao, Chaowei, Gibbs, Thomas, Foster, Ian, Davis, James J., Papka, Michael E., Brettin, Thomas, Stevens, Rick, Anandkumar, Anima, Vishwanath, Venkatram, Ramanathan, Arvind, Zvyagin, Maxim, Brace, Alexander, Hippe, Kyle, Deng, Yuntian, Zhang, Bin, Orozco Bohorquez, Cindy, Clyde, Austin, Kale, Bharat, Perez-Rivera, Danilo, Ma, Heng, Mann, Carla M., Irvin, Michael, Pauloski, J. Gregory, Ward, Logan, Hayot-Sasson, Valerie, Emani, Murali, Foreman, Sam, Xie, Zhen, Lin, Diangen, Shukla, Maulik, Nie, Weili, Romero, Josh, Dallago, Christian, Vahdat, Arash, Xiao, Chaowei, Gibbs, Thomas, Foster, Ian, Davis, James J., Papka, Michael E., Brettin, Thomas, Stevens, Rick, Anandkumar, Anima, Vishwanath, Venkatram, and Ramanathan, Arvind
Abstract: We seek to transform how new and emergent variants of pandemiccausing viruses, specifically SARS-CoV-2, are identified and classified. By adapting large language models (LLMs) for genomic data, we build genome-scale language models (GenSLMs) which can learn the evolutionary landscape of SARS-CoV-2 genomes. By pretraining on over 110 million prokaryotic gene sequences and finetuning a SARS-CoV-2-specific model on 1.5 million genomes, we show that GenSLMs can accurately and rapidly identify variants of concern. Thus, to our knowledge, GenSLMs represents one of the first whole genome scale foundation models which can generalize to other prediction tasks. We demonstrate scaling of GenSLMs on GPU-based supercomputers and AI-hardware accelerators utilizing 1.63 Zettaflops in training runs with a sustained performance of 121 PFLOPS in mixed precision and peak of 850 PFLOPS. We present initial scientific insights from examining GenSLMs in tracking evolutionary dynamics of SARS-CoV-2, paving the path to realizing this on large biological data.
Published: 2022

47. Towards neural architecture-aware exploration of compiler optimizations in a deep learning {graph} compiler

Author: Verma, Gaurav, primary, Finviya, Swetang, additional, Malik, Abid M., additional, Emani, Murali, additional, and Chapman, Barbara, additional
Published: 2022
Full Text: View/download PDF

48. Toward an In-Depth Analysis of Multifidelity High Performance Computing Systems

Author: Shilpika, Shilpika, primary, Lusch, Bethany, additional, Emani, Murali, additional, Simini, Filippo, additional, Vishwanath, Venkatram, additional, Papka, Michael E., additional, and Ma, Kwan-Liu, additional
Published: 2022
Full Text: View/download PDF

49. Throughput-oriented and Accuracy-aware DNN Training with BFloat16 on GPU

Author: Xie, Zhen, primary, Raskar, Siddhisanket, additional, and Emani, Murali, additional
Published: 2022
Full Text: View/download PDF

50. FAIR for AI: An interdisciplinary, international, inclusive, and diverse community building perspective

Author: Huerta, E. A., Blaiszik, Ben, Brinson, L. Catherine, Bouchard, Kristofer E., Diaz, Daniel, Doglioni, Caterina, Duarte, Javier M., Emani, Murali, Foster, Ian, Fox, Geoffrey, Harris, Philip, Heinrich, Lukas, Jha, Shantenu, Katz, Daniel S., Kindratenko, Volodymyr, Kirkpatrick, Christine R., Lassila-Perini, Kati, Madduri, Ravi K., Neubauer, Mark S., Psomopoulos, Fotis E., Roy, Avik, Rübel, Oliver, Zhao, Zhizhen, and Zhu, Ruike
Subjects: FOS: Computer and information sciences, Computer Science - Computers and Society, Computer Science - Machine Learning, High Energy Physics - Experiment (hep-ex), I.2.0, E.0, Computers and Society (cs.CY), Computer Science - Human-Computer Interaction, FOS: Physical sciences, High Energy Physics - Experiment, Human-Computer Interaction (cs.HC), Machine Learning (cs.LG)
Abstract: A foundational set of findable, accessible, interoperable, and reusable (FAIR) principles were proposed in 2016 as prerequisites for proper data management and stewardship, with the goal of enabling the reusability of scholarly data. The principles were also meant to apply to other digital assets, at a high level, and over time, the FAIR guiding principles have been re-interpreted or extended to include the software, tools, algorithms, and workflows that produce data. FAIR principles are now being adapted in the context of AI models and datasets. Here, we present the perspectives, vision, and experiences of researchers from different countries, disciplines, and backgrounds who are leading the definition and adoption of FAIR principles in their communities of practice, and discuss outcomes that may result from pursuing and incentivizing FAIR AI research. The material for this report builds on the FAIR for AI Workshop held at Argonne National Laboratory on June 7, 2022., Comment: 10 pages, comments welcome!
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

122 results on '"Emani, Murali"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources