722 results on '"Kim John P"'
Search Results
2. LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System
- Author
-
Kwon, Hyucksung, Koo, Kyungmo, Kim, Janghyeon, Lee, Woongkyu, Lee, Minjae, Lee, Hyungdeok, Jung, Yousub, Park, Jaehan, Song, Yosub, Yang, Byeongsu, Choi, Haerang, Kim, Guhyun, Won, Jongsoon, Shin, Woojae, Kim, Changhyun, Shin, Gyeongcheol, Kwon, Yongkee, Kim, Ilkon, Lim, Euicheol, Kim, John, and Choi, Jungwook
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Artificial Intelligence - Abstract
The expansion of large language models (LLMs) with hundreds of billions of parameters presents significant challenges to computational resources, particularly data movement and memory bandwidth. Long-context LLMs, which process sequences of tens of thousands of tokens, further increase the demand on the memory system as the complexity in attention layers and key-value cache sizes is proportional to the context length. Processing-in-Memory (PIM) maximizes memory bandwidth by moving compute to the data and can address the memory bandwidth challenges; however, PIM is not necessarily scalable to accelerate long-context LLM because of limited per-module memory capacity and the inflexibility of fixed-functional unit PIM architecture and static memory management. In this work, we propose LoL-PIM which is a multi-node PIM architecture that accelerates long context LLM through hardware-software co-design. In particular, we propose how pipeline parallelism can be exploited across a multi-PIM module while a direct PIM access (DPA) controller (or DMA for PIM) is proposed that enables dynamic PIM memory management and results in efficient PIM utilization across a diverse range of context length. We developed an MLIR-based compiler for LoL-PIM extending a commercial PIM-based compiler where the software modifications were implemented and evaluated, while the hardware changes were modeled in the simulator. Our evaluations demonstrate that LoL-PIM significantly improves throughput and reduces latency for long-context LLM inference, outperforming both multi-GPU and GPU-PIM systems (up to 8.54x and 16.0x speedup, respectively), thereby enabling more efficient deployment of LLMs in real-world applications., Comment: 15 pages, 12 figures
- Published
- 2024
3. IANUS: Integrated Accelerator based on NPU-PIM Unified Memory System
- Author
-
Seo, Minseok, Nguyen, Xuan Truong, Hwang, Seok Joong, Kwon, Yongkee, Kim, Guhyun, Park, Chanwook, Kim, Ilkon, Park, Jaehan, Kim, Jeongbin, Shin, Woojae, Won, Jongsoon, Choi, Haerang, Kim, Kyuyoung, Kwon, Daehan, Jeong, Chunseok, Lee, Sangheon, Choi, Yongseok, Byun, Wooseok, Baek, Seungcheol, Lee, Hyuk-Jae, and Kim, John
- Subjects
Computer Science - Hardware Architecture - Abstract
Accelerating end-to-end inference of transformer-based large language models (LLMs) is a critical component of AI services in datacenters. However, diverse compute characteristics of end-to-end LLM inference present challenges as previously proposed accelerators only address certain operations or stages (e.g., self-attention, generation stage, etc.). To address the unique challenges of accelerating end-to-end inference, we propose IANUS -- Integrated Accelerator based on NPU-PIM Unified Memory System. IANUS is a domain-specific system architecture that combines a Neural Processing Unit (NPU) with a Processing-in-Memory (PIM) to leverage both the NPU's high computation throughput and the PIM's high effective memory bandwidth. In particular, IANUS employs a unified main memory system where the PIM memory is used both for PIM operations and for NPU's main memory. The unified main memory system ensures that memory capacity is efficiently utilized and the movement of shared data between NPU and PIM is minimized. However, it introduces new challenges since normal memory accesses and PIM computations cannot be performed simultaneously. Thus, we propose novel PIM Access Scheduling that manages normal memory accesses and PIM computations through workload mapping and scheduling across the PIM and the NPU. Our detailed simulation evaluations show that IANUS improves the performance of GPT-2 by 6.2$\times$ and 3.2$\times$, on average, compared to the NVIDIA A100 GPU and the state-of-the-art accelerator. As a proof-of-concept, we develop a prototype of IANUS with a commercial PIM, NPU, and an FPGA-based PIM controller to demonstrate the feasibility of IANUS., Comment: Updated version of the paper accepted to ASPLOS 2024
- Published
- 2024
- Full Text
- View/download PDF
4. What Knowledge Resources Do General Chemistry Students Use to Agree or Disagree with Atomic Level Acid–Base Animations?
- Author
-
Kelly, Resa M., Kim, John H., Villalta-Cerdas, Adrian, Hansen, Sarah J. R., and Akaygun, Sevil
- Published
- 2025
- Full Text
- View/download PDF
5. Impact of Surgeon-Radiation Oncology Dyads in Oral Cavity Cancer Outcomes: Surgeon and Radiation Oncology Dyads in Oral Cancer
- Author
-
Wihlidal, Jacob, Esemezie, Alex O., Huang, Shao Hui, Watson, Erin, Gilbert, Ralph W., Waldron, John, Gullane, Patrick J., Hope, Andrew, Irish, Jonathan C., O’Sullivan, Brian, Chepeha, Douglas B., Kim, John J. H., Brown, Dale, Cho, B. C. John, Witterick, Ian J., Monteiro, Eric, Davies, Joel C., Ringash, Jolie, Goldstein, David P., Bratman, Scott, Bayley, Andrew, de Almeida, John R., Chan, Timothy C. Y., Hosni, Ali, and Yao, Christopher M. K. L.
- Published
- 2025
- Full Text
- View/download PDF
6. How Teachers Envision Using Data Visualization Discussion Tasks in Classroom Instruction: How Teachers Envision Using Data Visualization Discussion Tasks in Classroom Instruction
- Author
-
Wilkerson, Michelle H., Kim, John, Lee, Hollylynne S., Stokes, David J., and Ferrell, Matthew
- Published
- 2025
- Full Text
- View/download PDF
7. NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator
- Author
-
Shivdikar, Kaustubh, Agostini, Nicolas Bohm, Jayaweera, Malith, Jonatan, Gilbert, Abellan, Jose L., Joshi, Ajay, Kim, John, and Kaeli, David
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Machine Learning ,Computer Science - Neural and Evolutionary Computing - Abstract
Graph Neural Networks (GNNs) are emerging as a formidable tool for processing non-euclidean data across various domains, ranging from social network analysis to bioinformatics. Despite their effectiveness, their adoption has not been pervasive because of scalability challenges associated with large-scale graph datasets, particularly when leveraging message passing. To tackle these challenges, we introduce NeuraChip, a novel GNN spatial accelerator based on Gustavson's algorithm. NeuraChip decouples the multiplication and addition computations in sparse matrix multiplication. This separation allows for independent exploitation of their unique data dependencies, facilitating efficient resource allocation. We introduce a rolling eviction strategy to mitigate data idling in on-chip memory as well as address the prevalent issue of memory bloat in sparse graph computations. Furthermore, the compute resource load balancing is achieved through a dynamic reseeding hash-based mapping, ensuring uniform utilization of computing resources agnostic of sparsity patterns. Finally, we present NeuraSim, an open-source, cycle-accurate, multi-threaded, modular simulator for comprehensive performance analysis. Overall, NeuraChip presents a significant improvement, yielding an average speedup of 22.1x over Intel's MKL, 17.1x over NVIDIA's cuSPARSE, 16.7x over AMD's hipSPARSE, and 1.5x over prior state-of-the-art SpGEMM accelerator and 1.3x over GNN accelerator. The source code for our open-sourced simulator and performance visualizer is publicly accessible on GitHub https://neurachip.us, Comment: Visit https://neurachip.us for WebGUI based simulations
- Published
- 2024
8. A multidimensional classifier to support lung transplant referral in patients with pulmonary fibrosis.
- Author
-
Pugashetti, Janelle, Kim, John, Combs, Michael, Ma, Shwu-Fan, Adegunsoye, Ayodeji, Linderholm, Angela, Strek, Mary, Chen, Ching-Hsien, Dilling, Daniel, Whelan, Timothy, Flaherty, Kevin, Martinez, Fernando, Noth, Imre, and Oldham, Justin
- Subjects
biomarker ,idiopathic pulmonary fibrosis ,interstitial lung disease ,proteomics ,survival ,Humans ,Lung Transplantation ,Male ,Female ,Middle Aged ,Prospective Studies ,Referral and Consultation ,Idiopathic Pulmonary Fibrosis ,Aged ,Proteomics - Abstract
BACKGROUND: Lung transplantation remains the sole curative option for patients with idiopathic pulmonary fibrosis (IPF), but donor organs remain scarce, and many eligible patients die before transplant. Tools to optimize the timing of transplant referrals are urgently needed. METHODS: Least absolute shrinkage and selection operator was applied to clinical and proteomic data generated as part of a prospective cohort study of interstitial lung disease (ILD) to derive clinical, proteomic, and multidimensional logit models of near-term death or lung transplant within 18 months of blood draw. Model-fitted values were dichotomized at the point of maximal sensitivity and specificity, and decision curve analysis was used to select the best-performing classifier. We then applied this classifier to independent IPF and non-IPF ILD cohorts to determine test performance characteristics. Cohorts were restricted to patients aged ≤72 years with body mass index 18 to 32 to increase the likelihood of transplant eligibility. RESULTS: IPF derivation, IPF validation, and non-IPF ILD validation cohorts consisted of 314, 105, and 295 patients, respectively. A multidimensional model comprising 2 clinical variables and 20 proteins outperformed stand-alone clinical and proteomic models. Following dichotomization, the multidimensional classifier predicted near-term outcome with 70% sensitivity and 92% specificity in the IPF validation cohort and 70% sensitivity and 80% specificity in the non-IPF ILD validation cohort. CONCLUSIONS: A multidimensional classifier of near-term outcomes accurately discriminated this end-point with good test performance across independent IPF and non-IPF ILD cohorts. These findings support refinement and prospective validation of this classifier in transplant-eligible individuals.
- Published
- 2024
9. A Bayesian joint model for mediation analysis with matrix-valued mediators
- Author
-
Liu, Zijin, Liu, Zhihui, Hosni, Ali, Kim, John, Jiang, Bei, and Saarela, Olli
- Subjects
Statistics - Methodology - Abstract
Unscheduled treatment interruptions may lead to reduced quality of care in radiation therapy (RT). Identifying the RT prescription dose effects on the outcome of treatment interruptions, mediated through doses distributed into different organs-at-risk (OARs), can inform future treatment planning. The radiation exposure to OARs can be summarized by a matrix of dose-volume histograms (DVH) for each patient. Although various methods for high-dimensional mediation analysis have been proposed recently, few studies investigated how matrix-valued data can be treated as mediators. In this paper, we propose a novel Bayesian joint mediation model for high-dimensional matrix-valued mediators. In this joint model, latent features are extracted from the matrix-valued data through an adaptation of probabilistic multilinear principal components analysis (MPCA), retaining the inherent matrix structure. We derive and implement a Gibbs sampling algorithm to jointly estimate all model parameters, and introduce a Varimax rotation method to identify active indicators of mediation among the matrix-valued data. Our simulation study finds that the proposed joint model has higher efficiency in estimating causal decomposition effects compared to an alternative two-step method, and demonstrates that the mediation effects can be identified and visualized in the matrix form. We apply the method to study the effect of prescription dose on treatment interruptions in anal canal cancer patients.
- Published
- 2023
10. ASO Visual Abstract: Impact of Surgeon-Radiation Oncology Dyads in Oral Cavity Cancer Outcomes
- Author
-
Wihlidal, Jacob, Esemezie, Alex O., Huang, Shao Hui, Watson, Erin, Gilbert, Ralph W., Waldron, John, Gullane, Patrick J., Hope, Andrew, Irish, Jonathan C., O’Sullivan, Brian, Chepeha, Douglas B., Kim, John J. H., Brown, Dale, Cho, B. C. John, Witterick, Ian J., Monteiro, Eric, Davies, Joel C., Ringash, Jolie, Goldstein, David P., Bratman, Scott, Bayley, Andrew, de Almeida, John R., Chan, Timothy C. Y., Hosni, Ali, and Yao, Christopher M. K. L.
- Published
- 2025
- Full Text
- View/download PDF
11. Nonlinear associations between computed tomography-measures of adiposity and long pentraxin-3 in the Multi-Ethnic Study of Atherosclerosis.
- Author
-
Anderson, Michaela, Kim, John, Podolanczuk, Anna, Ding, Jingzhong, Al-Naamani, Nadine, Allison, Matthew, Christie, Jason, and Diamond, Joshua
- Subjects
liver attenuation ,long pentraxin‐3 ,pericardial adipose ,visceral adipose - Abstract
OBJECTIVE: Long pentraxin-3 (PTX-3) is an acute phase protein associated with cardiovascular disease, lung injury, and mortality. We evaluated the association between computed tomography (CT)-measurements of adipose tissue and plasma levels of PTX-3. METHODS: We performed a cross-sectional analysis of community-dwelling adults enrolled in the multi-center Multiethnic Study of Atherosclerosis who underwent cardiac or abdominal CT and had available PTX-3 measurements. RESULTS: There was a U-shaped association between pericardial adipose tissue volume (PAT), abdominal visceral adipose tissue area (VAT), hepatic attenuation, and PTX-3 levels, with extremes of adiposity associated with greater PTX-3 levels. Using multivariable-adjusted piecewise regression models, among participants with low PAT, every 1% increase in PAT volume was associated with a 13.8% decrease in PTX-3 (95% confidence interval [CI] -21.6 to -6.0); among participants with high PAT, every 1% increase in PAT volume was associated with a 6.0% increase in PTX-3 (95% CI -0.4 to 12.5). Results were similar for abdominal VAT and hepatic attenuation. CONCLUSIONS: In a cohort of community-dwelling adults, we demonstrated a U-shaped association between pericardial, abdominal visceral, and hepatic adiposity with PTX3 levels, suggesting that extreme adiposity is associated with greater circulating levels of PTX3. Further work is required to identify the mechanisms linking adiposity and PTX-3.
- Published
- 2024
12. GME: GPU-based Microarchitectural Extensions to Accelerate Homomorphic Encryption
- Author
-
Shivdikar, Kaustubh, Bao, Yuhui, Agrawal, Rashmi, Shen, Michael, Jonatan, Gilbert, Mora, Evelio, Ingare, Alexander, Livesay, Neal, Abellán, José L., Kim, John, Joshi, Ajay, and Kaeli, David
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Hardware Architecture - Abstract
Fully Homomorphic Encryption (FHE) enables the processing of encrypted data without decrypting it. FHE has garnered significant attention over the past decade as it supports secure outsourcing of data processing to remote cloud services. Despite its promise of strong data privacy and security guarantees, FHE introduces a slowdown of up to five orders of magnitude as compared to the same computation using plaintext data. This overhead is presently a major barrier to the commercial adoption of FHE. In this work, we leverage GPUs to accelerate FHE, capitalizing on a well-established GPU ecosystem available in the cloud. We propose GME, which combines three key microarchitectural extensions along with a compile-time optimization to the current AMD CDNA GPU architecture. First, GME integrates a lightweight on-chip compute unit (CU)-side hierarchical interconnect to retain ciphertext in cache across FHE kernels, thus eliminating redundant memory transactions. Second, to tackle compute bottlenecks, GME introduces special MOD-units that provide native custom hardware support for modular reduction operations, one of the most commonly executed sets of operations in FHE. Third, by integrating the MOD-unit with our novel pipelined $64$-bit integer arithmetic cores (WMAC-units), GME further accelerates FHE workloads by $19\%$. Finally, we propose a Locality-Aware Block Scheduler (LABS) that exploits the temporal locality available in FHE primitive blocks. Incorporating these microarchitectural features and compiler optimizations, we create a synergistic approach achieving average speedups of $796\times$, $14.2\times$, and $2.3\times$ over Intel Xeon CPU, NVIDIA V100 GPU, and Xilinx FPGA implementations, respectively.
- Published
- 2023
- Full Text
- View/download PDF
13. HiHGNN: Accelerating HGNNs through Parallelism and Data Reusability Exploitation
- Author
-
Xue, Runzhen, Han, Dengke, Yan, Mingyu, Zou, Mo, Yang, Xiaocheng, Wang, Duo, Li, Wenming, Tang, Zhimin, Kim, John, Ye, Xiaochun, and Fan, Dongrui
- Subjects
Computer Science - Hardware Architecture - Abstract
Heterogeneous graph neural networks (HGNNs) have emerged as powerful algorithms for processing heterogeneous graphs (HetGs), widely used in many critical fields. To capture both structural and semantic information in HetGs, HGNNs first aggregate the neighboring feature vectors for each vertex in each semantic graph and then fuse the aggregated results across all semantic graphs for each vertex. Unfortunately, existing graph neural network accelerators are ill-suited to accelerate HGNNs. This is because they fail to efficiently tackle the specific execution patterns and exploit the high-degree parallelism as well as data reusability inside and across the processing of semantic graphs in HGNNs. In this work, we first quantitatively characterize a set of representative HGNN models on GPU to disclose the execution bound of each stage, inter-semantic-graph parallelism, and inter-semantic-graph data reusability in HGNNs. Guided by our findings, we propose a high-performance HGNN accelerator, HiHGNN, to alleviate the execution bound and exploit the newfound parallelism and data reusability in HGNNs. Specifically, we first propose a bound-aware stage-fusion methodology that tailors to HGNN acceleration, to fuse and pipeline the execution stages being aware of their execution bounds. Second, we design an independency-aware parallel execution design to exploit the inter-semantic-graph parallelism. Finally, we present a similarity-aware execution scheduling to exploit the inter-semantic-graph data reusability. Compared to the state-of-the-art software framework running on NVIDIA GPU T4 and GPU A100, HiHGNN respectively achieves an average 41.5$\times$ and 8.6$\times$ speedup as well as 106$\times$ and 73$\times$ energy efficiency with quarter the memory bandwidth of GPU A100., Comment: 16 pages, 17 figures; To appear in IEEE TPDS 2024
- Published
- 2023
14. NASA Earth Exchange Downscaled Climate Projections 30 Arcseconds CMIP6
- Author
-
Thrasher, Bridget, Khajehei, Sepideh, Kim, John B., and Brosnan, Ian
- Published
- 2024
- Full Text
- View/download PDF
15. Carbon, climate, and natural disturbance: a review of mechanisms, challenges, and tools for understanding forest carbon stability in an uncertain future
- Author
-
Dye, Alex W., Houtman, Rachel M., Gao, Peng, Anderegg, William R. L., Fettig, Christopher J., Hicke, Jeffrey A., Kim, John B., Still, Christopher J., Young, Kevin, and Riley, Karin L.
- Published
- 2024
- Full Text
- View/download PDF
16. Proactive Polypharmacy Management Using Large Language Models: Opportunities to Enhance Geriatric Care
- Author
-
Rao, Arya, Kim, John, Lie, Winston, Pang, Michael, Fuh, Lanting, Dreyer, Keith J., and Succi, Marc D.
- Published
- 2024
- Full Text
- View/download PDF
17. Tri-system integration in metal-oxide nanocomposites via in-situ solution-processed method for ultrathin flexible transparent electrodes
- Author
-
Kim, John Jinwook, Shuji, Kojima, Zheng, Jiawei, He, Xinjun, Sajjad, Ahmad, Zhang, Hong, Su, Haibin, and Choy, Wallace C. H.
- Published
- 2024
- Full Text
- View/download PDF
18. Demographic and Clinical Factors Associated With SARS-CoV-2 Spike 1 Antibody Response Among Vaccinated US Adults: the C4R Study
- Author
-
Kim, John S., Sun, Yifei, Balte, Pallavi, Cushman, Mary, Boyle, Rebekah, Tracy, Russell P., Styer, Linda M., Bell, Taison D., Anderson, Michaela R., Allen, Norrina B., Schreiner, Pamela J., Bowler, Russell P., Schwartz, David A., Lee, Joyce S., Xanthakis, Vanessa, Doyle, Margaret F., Regan, Elizabeth A., Make, Barry J., Kanaya, Alka M., Wenzel, Sally E., Coresh, Josef, Isasi, Carmen R., Raffield, Laura M., Elkind, Mitchell S. V., Howard, Virginia J., Ortega, Victor E., Woodruff, Prescott, Cole, Shelley A., Henderson, Joel M., Mantis, Nicholas J., Parker, Monica M., Demmer, Ryan T., and Oelsner, Elizabeth C.
- Published
- 2024
- Full Text
- View/download PDF
19. Apixaban for Prevention of Thromboembolism in Pediatric Heart Disease
- Author
-
Payne, R Mark, Burns, Kristin M, Glatz, Andrew C, Male, Christoph, Donti, Andrea, Brandão, Leonardo R, Balling, Gunter, VanderPluym, Christina J, Bu'Lock, Frances, Kochilas, Lazaros K, Stiller, Brigitte, Cnota, James F, Rahkonen, Otto, Khan, Asra, Adorisio, Rachele, Stoica, Serban, May, Lindsay, Burns, Jane C, Saraiva, Jose Francisco K, McHugh, Kimberly E, Kim, John S, Rubio, Agustin, Chía-Vazquez, Nadia G, Meador, Marcie R, Dyme, Joshua L, Reedy, Alison M, Ajavon-Hartmann, Toni, Jarugula, Praneeth, Carlson-Taneja, Lauren E, Mills, Donna, Wheaton, Olivia, and Monagle, Paul
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Clinical Sciences ,Orphan Drug ,Heart Disease ,Hematology ,Pediatric ,Clinical Research ,Women's Health ,Rare Diseases ,Prevention ,Patient Safety ,Cardiovascular ,Clinical Trials and Supportive Activities ,6.1 Pharmaceuticals ,Child ,Humans ,Infant ,Newborn ,Anticoagulants ,Fibrinolytic Agents ,Heart Diseases ,Hemorrhage ,Heparin ,Low-Molecular-Weight ,Pyridones ,Quality of Life ,Venous Thromboembolism ,Vitamin K ,anticoagulation ,apixaban ,congenital ,heart ,pediatric ,thromboembolism ,Cardiorespiratory Medicine and Haematology ,Public Health and Health Services ,Cardiovascular System & Hematology ,Cardiovascular medicine and haematology - Abstract
BackgroundChildren with heart disease frequently require anticoagulation for thromboprophylaxis. Current standard of care (SOC), vitamin K antagonists or low-molecular-weight heparin, has significant disadvantages.ObjectivesThe authors sought to describe safety, pharmacokinetics (PK), pharmacodynamics, and efficacy of apixaban, an oral, direct factor Xa inhibitor, for prevention of thromboembolism in children with congenital or acquired heart disease.MethodsPhase 2, open-label trial in children (ages, 28 days to
- Published
- 2023
20. Strix: An End-to-End Streaming Architecture with Two-Level Ciphertext Batching for Fully Homomorphic Encryption with Programmable Bootstrapping
- Author
-
Putra, Adiwena, Prasetiyo, Chen, Yi, Kim, John, and Kim, Joo-Young
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Hardware Architecture - Abstract
Homomorphic encryption (HE) enables computations on encrypted data by concealing information under noise for security. However, the process of bootstrapping, which resets the noise level in the ciphertext, is computationally expensive and requires a large bootstrapping key. The TFHE scheme offers a faster and programmable bootstrapping algorithm called PBS, crucial for security-focused applications like machine learning. Nevertheless, the current TFHE scheme lacks support for ciphertext packing, resulting in low throughput. This work thoroughly analyzes TFHE bootstrapping, identifies the bottleneck in GPUs caused by the blind rotation fragmentation problem, and proposes a hardware TFHE accelerator called Strix. Strix introduces a two-level batching approach to enhance the batch size in PBS, utilizes a specialized microarchitecture for efficient streaming data processing, and incorporates a fully-pipelined FFT microarchitecture to improve performance. It achieves significantly higher throughput than state-of-the-art implementations on both CPUs and GPUs, outperforming existing TFHE accelerators by a factor of 7.4.
- Published
- 2023
21. A marginal structural model for normal tissue complication probability
- Author
-
Tang, Thai-Son, Liu, Zhihui, Hosni, Ali, Kim, John, and Saarela, Olli
- Subjects
Statistics - Methodology - Abstract
The goal of radiation therapy for cancer is to deliver prescribed radiation dose to the tumor while minimizing dose to the surrounding healthy tissues. To evaluate treatment plans, the dose distribution to healthy organs is commonly summarized as dose-volume histograms (DVHs). Normal tissue complication probability (NTCP) modelling has centered around making patient-level risk predictions with features extracted from the DVHs, but few have considered adapting a causal framework to evaluate the safety of alternative treatment plans. We propose causal estimands for NTCP based on deterministic and stochastic interventions, as well as propose estimators based on marginal structural models that impose bivariable monotonicity between dose, volume, and toxicity risk. The properties of these estimators are studied through simulations, and their use is illustrated in the context of radiotherapy treatment of anal canal cancer patients.
- Published
- 2023
22. Hera: A Heterogeneity-Aware Multi-Tenant Inference Server for Personalized Recommendations
- Author
-
Choi, Yujeong, Kim, John, and Rhu, Minsoo
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Information Retrieval ,Computer Science - Machine Learning - Abstract
While providing low latency is a fundamental requirement in deploying recommendation services, achieving high resource utility is also crucial in cost-effectively maintaining the datacenter. Co-locating multiple workers of a model is an effective way to maximize query-level parallelism and server throughput, but the interference caused by concurrent workers at shared resources can prevent server queries from meeting its SLA. Hera utilizes the heterogeneous memory requirement of multi-tenant recommendation models to intelligently determine a productive set of co-located models and its resource allocation, providing fast response time while achieving high throughput. We show that Hera achieves an average 37.3% improvement in effective machine utilization, enabling 26% reduction in required servers, significantly improving upon the baseline recommedation inference server.
- Published
- 2023
23. Associations between eGFR and albuminuria with right ventricular measures: the MESA-Right Ventricle study.
- Author
-
Husain-Syed, Faeq, DiFrancesco, Matthew, Deo, Rajat, Barr, R, Scialla, Julia, Bluemke, David, Kronmal, Richard, Lima, Joao, Praestgaard, Amy, Tracy, Russell, Shlipak, Michael, Kawut, Steven, and Kim, John
- Subjects
cardiac magnetic resonance ,cardiorenal syndromes ,chronic kidney disease ,kidney function ,pulmonary hypertension - Abstract
BACKGROUND: Chronic kidney disease (CKD) is associated with an increased risk of pulmonary hypertension, which may lead to right ventricular (RV) pressure overload and RV dysfunction. However, the presence of subclinical changes in RV structure or function in early CKD and the influence of these changes on mortality are not well studied. We hypothesized that early CKD, as indicated by elevated albuminuria or mild reductions in estimated glomerular filtration rate (eGFR), is associated with greater RV dilation and RV mass. METHODS: We included 4063 participants (age 45-84 years) without baseline clinical cardiovascular disease from the Multi-Ethnic Study of Atherosclerosis. The associations of baseline creatinine-cystatin C-based eGFR and albuminuria with cardiac magnetic resonance-derived RV measures (2000-02) were examined cross-sectionally with linear regression models. Cox regression models were used to examine whether RV parameters modified the associations of eGFR and albuminuria with all-cause mortality. RESULTS: Participants with reductions in eGFR primarily within the 60-89 mL/min/1.73 m2 category had smaller RV end-diastolic and end-systolic volumes and stroke volume (all adjusted P-trends
- Published
- 2023
24. Medical Management Considerations of Premature Infants with Congenital Heart Disease
- Author
-
Varma, Pavika and Kim, John S.
- Published
- 2024
- Full Text
- View/download PDF
25. Incidence of interstitial lung abnormalities: the MESA Lung Study
- Author
-
McGroder, Claire F, Hansen, Spencer, Stukovsky, Karen Hinckley, Zhang, David, Nath, P Hrudaya, Salvatore, Mary M, Sonavane, Sushilkumar K, Terry, Nina, Stowell, Justin T, D'Souza, Belinda M, Leb, Jay S, Dumeer, Shifali, Aziz, Muhammad U, Batra, Kiran, Hoffman, Eric A, Bernstein, Elana J, Kim, John S, Podolanczuk, Anna J, Rotter, Jerome I, Manichaikul, Ani W, Rich, Stephen S, Lederer, David J, Barr, R Graham, McClelland, Robyn L, and Garcia, Christine Kim
- Subjects
Biomedical and Clinical Sciences ,Clinical Sciences ,Cardiovascular ,Lung ,Heart Disease ,Biomedical Imaging ,Atherosclerosis ,Prevention ,Genetics ,Respiratory ,Medical and Health Sciences ,Respiratory System ,Cardiovascular medicine and haematology - Abstract
BackgroundThe incidence of newly developed interstitial lung abnormalities (ILA) and fibrotic ILA has not been previously reported.MethodsTrained thoracic radiologists evaluated 13 944 cardiac computed tomography scans for the presence of ILA in 6197 Multi-Ethnic Study of Atherosclerosis (MESA) longitudinal cohort study participants >45 years of age from 2000 to 2012. Five percent of the scans were re-read by the same or a different observer in a blinded fashion. After exclusion of participants with ILA at baseline, incidence rates and incidence rate ratios for ILA and fibrotic ILA were calculated.ResultsThe intra-reader agreement of ILA was 92.0% (Gwet's AC1 0.912, intraclass correlation coefficient (ICC) 0.982) and the inter-reader agreement of ILA was 83.5% (Gwet's AC1 0.814, ICC 0.969). Incidence of ILA and fibrotic ILA was estimated to be 13.1 and 3.5 cases per 1000 person-years, respectively. In multivariable analyses, age (hazard ratio (HR) 1.06 (95% CI 1.05-1.08); p
- Published
- 2023
26. PCSK6 and Survival in Idiopathic Pulmonary Fibrosis
- Author
-
Oldham, Justin M, Allen, Richard J, Lorenzo-Salazar, Jose M, Molyneaux, Philip L, Ma, Shwu-Fan, Joseph, Chitra, Kim, John S, Guillen-Guio, Beatriz, Hernández-Beeftink, Tamara, Kropski, Jonathan A, Huang, Yong, Lee, Cathryn T, Adegunsoye, Ayodeji, Pugashetti, Janelle Vu, Linderholm, Angela L, Vo, Vivian, Strek, Mary E, Jou, Jonathan, Muñoz-Barrera, Adrian, Rubio-Rodriguez, Luis A, Hubbard, Richard, Hirani, Nik, Whyte, Moira KB, Hart, Simon, Nicholson, Andrew G, Lancaster, Lisa, Parfrey, Helen, Rassl, Doris, Wallace, William, Valenzi, Eleanor, Zhang, Yingze, Mychaleckyj, Josyf, Stockwell, Amy, Kaminski, Naftali, Wolters, Paul J, Molina-Molina, Maria, Banovich, Nicholas E, Fahy, William A, Martinez, Fernando J, Hall, Ian P, Tobin, Martin D, Maher, Toby M, Blackwell, Timothy S, Yaspan, Brian L, Jenkins, R Gisli, Flores, Carlos, Wain, Louise V, and Noth, Imre
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Clinical Sciences ,Genetics ,Rare Diseases ,Human Genome ,Autoimmune Disease ,Lung ,2.1 Biological and endogenous factors ,Respiratory ,Good Health and Well Being ,Humans ,Genome-Wide Association Study ,Idiopathic Pulmonary Fibrosis ,Proportional Hazards Models ,Europe ,Serine Endopeptidases ,Proprotein Convertases ,IPF ,genome-wide association study ,genomics ,survival ,PCSK6 protein ,Medical and Health Sciences ,Respiratory System ,Cardiovascular medicine and haematology ,Clinical sciences - Abstract
Rationale: Idiopathic pulmonary fibrosis (IPF) is a devastating disease characterized by limited treatment options and high mortality. A better understanding of the molecular drivers of IPF progression is needed. Objectives: To identify and validate molecular determinants of IPF survival. Methods: A staged genome-wide association study was performed using paired genomic and survival data. Stage I cases were drawn from centers across the United States and Europe and stage II cases from Vanderbilt University. Cox proportional hazards regression was used to identify gene variants associated with differential transplantation-free survival (TFS). Stage I variants with nominal significance (P
- Published
- 2023
27. MUC5B, telomere length and longitudinal quantitative interstitial lung changes: the MESA Lung Study
- Author
-
Kim, John S, Manichaikul, Ani W, Hoffman, Eric A, Balte, Pallavi, Anderson, Michaela R, Bernstein, Elana J, Madahar, Purnema, Oelsner, Elizabeth C, Kawut, Steven M, Wysoczanski, Artur, Laine, Andrew F, Adegunsoye, Ayodeji, Z, Jennie, Taub, Margaret A, Mathias, Rasika A, Rich, Stephen S, Rotter, Jerome I, Noth, Imre, Garcia, Christine Kim, Barr, R Graham, and Podolanczuk, Anna J
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Clinical Sciences ,Clinical Research ,Lung ,Atherosclerosis ,Genetics ,Good Health and Well Being ,Adult ,Humans ,Lung Diseases ,Interstitial ,Genotype ,Telomere ,Mucin-5B ,Imaging/CT MRI etc ,Interstitial Fibrosis ,Respiratory System ,Cardiovascular medicine and haematology ,Clinical sciences - Abstract
BackgroundThe MUC5B promoter variant (rs35705950) and telomere length are linked to pulmonary fibrosis and CT-based qualitative assessments of interstitial abnormalities, but their associations with longitudinal quantitative changes of the lung interstitium among community-dwelling adults are unknown.MethodsWe used data from participants in the Multi-Ethnic Study of Atherosclerosis with high-attenuation areas (HAAs, Examinations 1-6 (2000-2018)) and MUC5B genotype (n=4552) and telomere length (n=4488) assessments. HAA was defined as the per cent of imaged lung with attenuation of -600 to -250 Hounsfield units. We used linear mixed-effects models to examine associations of MUC5B risk allele (T) and telomere length with longitudinal changes in HAAs. Joint models were used to examine associations of longitudinal changes in HAAs with death and interstitial lung disease (ILD).ResultsThe MUC5B risk allele (T) was associated with an absolute change in HAAs of 2.60% (95% CI 0.36% to 4.86%) per 10 years overall. This association was stronger among those with a telomere length below an age-adjusted percentile of 5% (p value for interaction=0.008). A 1% increase in HAAs per year was associated with 7% increase in mortality risk (rate ratio (RR)=1.07, 95% CI 1.02 to 1.12) for overall death and 34% increase in ILD (RR=1.34, 95% CI 1.20 to 1.50). Longer baseline telomere length was cross-sectionally associated with less HAAs from baseline scans, but not with longitudinal changes in HAAs.ConclusionsLongitudinal increases in HAAs were associated with the MUC5B risk allele and a higher risk of death and ILD.
- Published
- 2023
28. Idiopathic Pulmonary Fibrosis Is Associated with Common Genetic Variants and Limited Rare Variants.
- Author
-
Peljto, Anna L, Blumhagen, Rachel Z, Walts, Avram D, Cardwell, Jonathan, Powers, Julia, Corte, Tamera J, Dickinson, Joanne L, Glaspole, Ian, Moodley, Yuben P, Vasakova, Martina Koziar, Bendstrup, Elisabeth, Davidsen, Jesper R, Borie, Raphael, Crestani, Bruno, Dieude, Philippe, Bonella, Francesco, Costabel, Ulrich, Gudmundsson, Gunnar, Donnelly, Seamas C, Egan, Jim, Henry, Michael T, Keane, Michael P, Kennedy, Marcus P, McCarthy, Cormac, McElroy, Aoife N, Olaniyi, Joshua A, O'Reilly, Katherine MA, Richeldi, Luca, Leone, Paolo M, Poletti, Venerino, Puppo, Francesco, Tomassetti, Sara, Luzzi, Valentina, Kokturk, Nurdan, Mogulkoc, Nesrin, Fiddler, Christine A, Hirani, Nikhil, Jenkins, R Gisli, Maher, Toby M, Molyneaux, Philip L, Parfrey, Helen, Braybrooke, Rebecca, Blackwell, Timothy S, Jackson, Peter D, Nathan, Steven D, Porteous, Mary K, Brown, Kevin K, Christie, Jason D, Collard, Harold R, Eickelberg, Oliver, Foster, Elena E, Gibson, Kevin F, Glassberg, Marilyn, Kass, Daniel J, Kropski, Jonathan A, Lederer, David, Linderholm, Angela L, Loyd, Jim, Mathai, Susan K, Montesi, Sydney B, Noth, Imre, Oldham, Justin M, Palmisciano, Amy J, Reichner, Cristina A, Rojas, Mauricio, Roman, Jesse, Schluger, Neil, Shea, Barry S, Swigris, Jeffrey J, Wolters, Paul J, Zhang, Yingze, Prele, Cecilia MA, Enghelmayer, Juan I, Otaola, Maria, Ryerson, Christopher J, Salinas, Mauricio, Sterclova, Martina, Gebremariam, Tewodros H, Myllärniemi, Marjukka, Carbone, Roberto G, Furusawa, Haruhiko, Hirose, Masaki, Inoue, Yoshikazu, Miyazaki, Yasunari, Ohta, Ken, Ohta, Shin, Okamoto, Tsukasa, Kim, Dong Soon, Pardo, Annie, Selman, Moises, Aranda, Alvaro U, Park, Moo Suk, Park, Jong Sun, Song, Jin Woo, Molina-Molina, Maria, Planas-Cerezales, Lurdes, Westergren-Thorsson, Gunilla, Smith, Albert V, Manichaikul, Ani W, and Kim, John S
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Clinical Sciences ,Prevention ,Human Genome ,Genetics ,Rare Diseases ,Lung ,Autoimmune Disease ,Precision Medicine ,2.1 Biological and endogenous factors ,Good Health and Well Being ,Humans ,Idiopathic Pulmonary Fibrosis ,Whole Genome Sequencing ,Exome ,whole-genome sequencing ,interstitial lung disease ,TOPMed ,genetic association studies ,telomerase ,Medical and Health Sciences ,Respiratory System ,Cardiovascular medicine and haematology ,Clinical sciences - Abstract
Rationale: Idiopathic pulmonary fibrosis (IPF) is a rare, irreversible, and progressive disease of the lungs. Common genetic variants, in addition to nongenetic factors, have been consistently associated with IPF. Rare variants identified by candidate gene, family-based, and exome studies have also been reported to associate with IPF. However, the extent to which rare variants, genome-wide, may contribute to the risk of IPF remains unknown. Objectives: We used whole-genome sequencing to investigate the role of rare variants, genome-wide, on IPF risk. Methods: As part of the Trans-Omics for Precision Medicine Program, we sequenced 2,180 cases of IPF. Association testing focused on the aggregated effect of rare variants (minor allele frequency ⩽0.01) within genes or regions. We also identified individual rare variants that are influential within genes and estimated the heritability of IPF on the basis of rare and common variants. Measurements and Main Results: Rare variants in both TERT and RTEL1 were significantly associated with IPF. A single rare variant in each of the TERT and RTEL1 genes was found to consistently influence the aggregated test statistics. There was no significant evidence of association with other previously reported rare variants. The SNP heritability of IPF was estimated to be 32% (SE = 3%). Conclusions: Rare variants within the TERT and RTEL1 genes and well-established common variants have the largest contribution to IPF risk overall. Efforts in risk profiling or the development of therapies for IPF that focus on TERT, RTEL1, common variants, and environmental risk factors are likely to have the largest impact on this complex disease.
- Published
- 2023
29. Accelerating Polynomial Multiplication for Homomorphic Encryption on GPUs
- Author
-
Shivdikar, Kaustubh, Jonatan, Gilbert, Mora, Evelio, Livesay, Neal, Agrawal, Rashmi, Joshi, Ajay, Abellan, Jose, Kim, John, and Kaeli, David
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Hardware Architecture ,Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Performance - Abstract
Homomorphic Encryption (HE) enables users to securely outsource both the storage and computation of sensitive data to untrusted servers. Not only does HE offer an attractive solution for security in cloud systems, but lattice-based HE systems are also believed to be resistant to attacks by quantum computers. However, current HE implementations suffer from prohibitively high latency. For lattice-based HE to become viable for real-world systems, it is necessary for the key bottlenecks - particularly polynomial multiplication - to be highly efficient. In this paper, we present a characterization of GPU-based implementations of polynomial multiplication. We begin with a survey of modular reduction techniques and analyze several variants of the widely-used Barrett modular reduction algorithm. We then propose a modular reduction variant optimized for 64-bit integer words on the GPU, obtaining a 1.8x speedup over the existing comparable implementations. Next, we explore the following GPU-specific improvements for polynomial multiplication targeted at optimizing latency and throughput: 1) We present a 2D mixed-radix, multi-block implementation of NTT that results in a 1.85x average speedup over the previous state-of-the-art. 2) We explore shared memory optimizations aimed at reducing redundant memory accesses, further improving speedups by 1.2x. 3) Finally, we fuse the Hadamard product with neighboring stages of the NTT, reducing the twiddle factor memory footprint by 50%. By combining our NTT optimizations, we achieve an overall speedup of 123.13x and 2.37x over the previous state-of-the-art CPU and GPU implementations of NTT kernels, respectively., Comment: Accepted, to be pusblished at SEED 2022 conference (IEEE International Symposium on Secure and Private Execution Environment Design)
- Published
- 2022
30. STAT6 mutations enriched at diffuse large B-cell lymphoma relapse reshape the tumor microenvironment
- Author
-
Benoit, Alexandre, Abraham, Madelyn J., Li, Sheena, Kim, John, Estrada-Tejedor, Roger, Bakadlag, Rowa, Subramaniam, Nivetha, Makhani, Kiran, Guilbert, Cynthia, Tu, Raymond, Salaciak, Matthew, Klein, Kathleen Oros, Coyle, Krysta Mila, Hilton, Laura K., Santiago, Raoul, Dmitrienko, Svetlana, Assouline, Sarit, Morin, Ryan D., del Rincon, Sonia V., Johnson, Nathalie A., and Mann, Koren K.
- Published
- 2024
- Full Text
- View/download PDF
31. Answer Fast: Accelerating BERT on the Tensor Streaming Processor
- Author
-
Ahmed, Ibrahim, Parmar, Sahil, Boyd, Matthew, Beidler, Michael, Kang, Kris, Liu, Bill, Roach, Kyle, Kim, John, and Abts, Dennis
- Subjects
Computer Science - Machine Learning ,Computer Science - Computation and Language - Abstract
Transformers have become a predominant machine learning workload, they are not only the de-facto standard for natural language processing tasks, but they are also being deployed in other domains such as vision and speech recognition. Many of the transformer-based applications are real-time systems such as machine translation and web search. These real time systems often come with strict end-to-end inference latency requirements. Unfortunately, while the majority of the transformer computation comes from matrix multiplications, transformers also include several non-linear components that tend to become the bottleneck during an inference. In this work, we accelerate the inference of BERT models on the tensor streaming processor. By carefully fusing all the nonlinear components with the matrix multiplication components, we are able to efficiently utilize the on-chip matrix multiplication units resulting in a deterministic tail latency of 130 $\mu$s for a batch-1 inference through BERT-base, which is 6X faster than the current state-of-the-art.
- Published
- 2022
32. ARK: Fully Homomorphic Encryption Accelerator with Runtime Data Generation and Inter-Operation Key Reuse
- Author
-
Kim, Jongmin, Lee, Gwangho, Kim, Sangpyo, Sohn, Gina, Kim, John, Rhu, Minsoo, and Ahn, Jung Ho
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Hardware Architecture - Abstract
Homomorphic Encryption (HE) is one of the most promising post-quantum cryptographic schemes that enable privacy-preserving computation on servers. However, noise accumulates as we perform operations on HE-encrypted data, restricting the number of possible operations. Fully HE (FHE) removes this restriction by introducing the bootstrapping operation, which refreshes the data; however, FHE schemes are highly memory-bound. Bootstrapping, in particular, requires loading GBs of evaluation keys and plaintexts from off-chip memory, which makes FHE acceleration fundamentally bottlenecked by the off-chip memory bandwidth. In this paper, we propose ARK, an Accelerator for FHE with Runtime data generation and inter-operation Key reuse. ARK enables practical FHE workloads with a novel algorithm-architecture co-design to accelerate bootstrapping. We first eliminate the off-chip memory bandwidth bottleneck through runtime data generation and inter-operation key reuse. This approach enables ARK to fully exploit on-chip memory by substantially reducing the size of the working set. On top of such algorithmic enhancements, we build ARK microarchitecture that minimizes on-chip data movement through an efficient, alternating data distribution policy based on the data access patterns and a streamlined dataflow organization of the tailored functional units -- including base conversion, number-theoretic transform, and automorphism units. Overall, our co-design effectively handles the heavy computation and data movement overheads of FHE, drastically reducing the cost of HE operations, including bootstrapping., Comment: 18 pages, 9 figures
- Published
- 2022
- Full Text
- View/download PDF
33. BTS: An Accelerator for Bootstrappable Fully Homomorphic Encryption
- Author
-
Kim, Sangpyo, Kim, Jongmin, Kim, Michael Jaemin, Jung, Wonkyung, Rhu, Minsoo, Kim, John, and Ahn, Jung Ho
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Hardware Architecture - Abstract
Homomorphic encryption (HE) enables the secure offloading of computations to the cloud by providing computation on encrypted data (ciphertexts). HE is based on noisy encryption schemes in which noise accumulates as more computations are applied to the data. The limited number of operations applicable to the data prevents practical applications from exploiting HE. Bootstrapping enables an unlimited number of operations or fully HE (FHE) by refreshing the ciphertext. Unfortunately, bootstrapping requires a significant amount of additional computation and memory bandwidth as well. Prior works have proposed hardware accelerators for computation primitives of FHE. However, to the best of our knowledge, this is the first to propose a hardware FHE accelerator that supports bootstrapping as a first-class citizen. In particular, we propose BTS - Bootstrappable, Technologydriven, Secure accelerator architecture for FHE. We identify the challenges of supporting bootstrapping in the accelerator and analyze the off-chip memory bandwidth and computation required. In particular, given the limitations of modern memory technology, we identify the HE parameter sets that are efficient for FHE acceleration. Based on the insights gained from our analysis, we propose BTS, which effectively exploits the parallelism innate in HE operations by arranging a massive number of processing elements in a grid. We present the design and microarchitecture of BTS, including a network-on-chip design that exploits a deterministic communication pattern. BTS shows 5,556x and 1,306x improved execution time on ResNet-20 and logistic regression over a CPU, with a chip area of 373.6mm^2 and up to 163.2W of power., Comment: 15 pages, 10 figures
- Published
- 2021
- Full Text
- View/download PDF
34. A maternally programmed intergenerational mechanism enables male offspring to make piRNAs from Y-linked precursor RNAs in Drosophila
- Author
-
Venkei, Zsolt G., Gainetdinov, Ildar, Bagci, Ayca, Starostik, Margaret R., Choi, Charlotte P., Fingerhut, Jaclyn M., Chen, Peiwei, Balsara, Chiraag, Whitfield, Troy W., Bell, George W., Feng, Suhua, Jacobsen, Steven E., Aravin, Alexei A., Kim, John K., Zamore, Phillip D., and Yamashita, Yukiko M.
- Published
- 2023
- Full Text
- View/download PDF
35. A model for DHX15 mediated disassembly of A-complex spliceosomes
- Author
-
Maul-Newby, Hannah M, Amorello, Angela N, Sharma, Turvi, Kim, John H, Modena, Matthew S, Prichard, Beth E, and Jurica, Melissa S
- Subjects
Biochemistry and Cell Biology ,Biological Sciences ,Genetics ,Generic health relevance ,Introns ,RNA Precursors ,RNA Splicing ,RNA ,Small Nuclear ,Ribonuclease H ,Ribonucleoprotein ,U2 Small Nuclear ,Spliceosomes ,A-complex ,DExH NTPase ,DHX15 ,PRP43 ,U2 snRNP ,spliceosome ,splicing ,Developmental Biology ,Biochemistry and cell biology - Abstract
A critical step of pre-mRNA splicing is the recruitment of U2 snRNP to the branch point sequence of an intron. U2 snRNP conformation changes extensively during branch helix formation, and several RNA-dependent ATPases are implicated in the process. However, the molecular mechanisms involved remain to be fully dissected. We took advantage of the differential nucleotide triphosphates requirements for DExD/H-box enzymes to probe their contributions to in vitro spliceosome assembly. Both ATP and GTP hydrolysis support the formation of A-complex, indicating the activity of a DEAH-enzyme because DEAD-enzymes are selective for ATP. We immunodepleted DHX15 to assess its involvement, and although splicing efficiency decreases with reduced DHX15, A-complex accumulation incongruently increases. DHX15 depletion also results in the persistence of the atypical ATP-independent interaction between U2 snRNP and a minimal substrate that is otherwise destabilized in the presence of either ATP or GTP. These results lead us to hypothesize that DHX15 plays a quality control function in U2 snRNP's engagement with an intron. In efforts to identify the RNA target of DHX15, we determined that an extended polypyrimidine tract is not necessary for disruption of the atypical interaction between U2 snRNP and the minimal substrate. We also examined U2 snRNA by RNase H digestion and identified nucleotides in the branch binding region that become accessible with both ATP and GTP hydrolysis, again implicating a DEAH-enzyme. Together, our results demonstrate that multiple ATP-dependent rearrangements are likely involved in U2 snRNP addition to the spliceosome and that DHX15 may have an expanded role in maintaining splicing fidelity.
- Published
- 2022
36. Simulated dose painting of hypoxic sub-volumes in pancreatic cancer stereotactic body radiotherapy
- Author
-
Elamir, Ahmed M., Stanescu, Teodor, Shessel, Andrea, Tadic, Tony, Yeung, Ivan, Letourneau, Daniel, Kim, John, Lukovic, Jelena, Dawson, Laura A., Wong, Rebecca, Barry, Aisling, Brierley, James, Gallinger, Steven, Knox, Jennifer, O'Kane, Grainne, Dhani, Neesha, Hosni, Ali, and Taylor, Edward
- Subjects
Physics - Medical Physics - Abstract
Dose painting of hypoxic tumour sub-volumes using positron-emission tomography (PET) has been shown to improve tumour control in silico in several sites. Pancreatic cancer presents a more stringent challenge, given its proximity to critical organs-at-risk (OARs) and anatomic motion. A radiobiological model was developed to estimate clonogen survival fraction (SF), using 18F-fluoroazomycin arabinoside PET (FAZA PET) images from ten patients with pancreatic cancer to quantify oxygen enhancement effects. For each patient, four simulated five-fraction stereotactic body radiotherapy (SBRT) plans were generated: 1) a standard SBRT plan aiming to cover the planning target volume with 40 Gy, 2) dose painting plans delivering escalated doses to FAZA-avid hypoxic sub-volumes, 3) dose painting plans with simulated spacer separating the duodenum and pancreatic head, and 4), plans with integrated boosts to geometric contractions of the tumour (GTV). All plans saturated at least one OAR dose limit. SF was calculated for each plan and sensitivity of SF to simulated hypoxia quantification errors was evaluated. Dose painting resulted in a 55% reduction in SF as compared to standard SBRT; 78% with spacer. Integrated boosts to hypoxia-blind geometric contractions resulted in a 41% reduction in SF. The reduction in SF for dose-painting plans persisted for all hypoxia quantification parameters studied, including registration and rigid motion errors that resulted in shifts and rotations of the GTV and hypoxic sub-volumes by as much as 1 cm and 10 degrees. Although proximity to OARs ultimately limited dose escalation, with estimated SFs (~10^-5) well above levels required to completely ablate a ~10 cm^3 tumour, dose painting robustly reduced clonogen survival when accounting for expected treatment and imaging uncertainties and thus, may improve local response and associated morbidity.
- Published
- 2021
- Full Text
- View/download PDF
37. Vaccination Worldwide: Strategies, Distribution and Challenges
- Author
-
Samal, Chirag, Jakimowicz, Kasia, Dasgupta, Krishnendu, Vashishtha, Aniket, O., Francisco, Natarajan, Arunakiry, Nazir, Haris, Varma, Alluri Siddhartha, Dahake, Tejal, Pandey, Amitesh Anand, Singh, Ishaan, Kim, John Sangyeob, Gill, Mehrab Singh, Srivastava, Saurish, Mukhopadhyay, Orna, Patwa, Parth, Mirza, Qamil, Irshad, Sualeha, Shankar, Sheshank, Iyer, Rohan, Sukumaran, Rohan, Mehra, Ashley, Sharma, Anshuman, Singh, Abhishek, Arseni, Maurizio, T V, Sethuraman, Agrawal, Saras, Sharma, Vivek, and Raskar, Ramesh
- Subjects
Quantitative Biology - Populations and Evolution - Abstract
The Coronavirus 2019 (Covid-19) pandemic caused by the SARS-CoV-2 virus represents an unprecedented crisis for our planet. It is a bane of the \"uber connected world that we live in that this virus has affected almost all countries and caused mortality and economic upheaval at a scale whose effects are going to be felt for generations to come. While we can all be buoyed at the pace at which vaccines have been developed and brought to market, there are still challenges ahead for all countries to get their populations vaccinated equitably and effectively. This paper provides an overview of ongoing immunization efforts in various countries. In this early draft, we have identified a few key factors that we use to review different countries' current COVID-19 immunization strategies and their strengths and draw conclusions so that policymakers worldwide can learn from them. Our paper focuses on processes related to vaccine approval, allocation and prioritization, distribution strategies, population to vaccine ratio, vaccination governance, accessibility and use of digital solutions, and government policies. The statistics and numbers are dated as per the draft date [June 24th, 2021].
- Published
- 2021
38. Design and rationale for the prospective treatment efficacy in IPF using genotype for NAC selection (PRECISIONS) clinical trial
- Author
-
Podolanczuk, Anna J, Kim, John S, Cooper, Christopher B, Lasky, Joseph A, Murray, Susan, Oldham, Justin M, Raghu, Ganesh, Flaherty, Kevin R, Spino, Cathie, Noth, Imre, and Martinez, Fernando J
- Subjects
Rare Diseases ,Clinical Trials and Supportive Activities ,Genetics ,Lung ,Complementary and Integrative Health ,Clinical Research ,Autoimmune Disease ,6.1 Pharmaceuticals ,Evaluation of treatments and therapeutic interventions ,Respiratory ,Good Health and Well Being ,Humans ,Acetylcysteine ,Double-Blind Method ,Genotype ,Idiopathic Pulmonary Fibrosis ,Treatment Outcome ,Vital Capacity ,Randomized Controlled Trials as Topic ,Multicenter Studies as Topic ,Clinical Trials ,Phase III as Topic ,IPF ,Clinical trial ,Protocol ,N-acetylcysteine ,PRECISIONS Study Team ,Cardiorespiratory Medicine and Haematology ,Respiratory System - Abstract
BackgroundIdiopathic pulmonary fibrosis (IPF) is a progressive lung disease with few treatment options. N-acetylcysteine (NAC) is a well-tolerated, inexpensive treatment with antioxidant and anti-fibrotic properties. The National Heart, Lung, and Blood Institute (NHLBI)-sponsored PANTHER (Prednisone Azathioprine and NAC therapy in IPF) trial confirmed the harmful effects of immunosuppression in IPF, and did not show a benefit to treatment with NAC. However, a post hoc analysis revealed a potential beneficial effect of NAC in a subgroup of individuals carrying a specific genetic variant, TOLLIP rs3750920 TT genotype, present in about 25% of patients with IPF. Here, we present the design and rationale for the Phase III, multi-center, randomized, double-blind, placebo-controlled Prospective Treatment Efficacy in IPF Using Genotype for NAC Selection (PRECISIONS) clinical trial.MethodsThe PRECISIONS trial will randomize 200 patients with IPF and the TOLLIP rs3750920 TT genotype 1:1 to oral N-acetylcysteine (600 mg tablets taken three times a day) or placebo for a 24-month duration. The primary endpoint is the composite of time to 10% relative decline in forced vital capacity (FVC), first respiratory hospitalization, lung transplantation, or death from any cause. Secondary endpoints include change in patient-reported outcome scores and proportion of participants with treatment-emergent adverse events. Biospecimens, including blood, buccal, and fecal will be collected longitudinally for future research purposes. Study participants will be offered enrollment in a home spirometry substudy, which explores time to 10% relative FVC decline measured at home, and its comparison with study visit FVC.DiscussionThe sentinel observation of a potential pharmacogenetic interaction between NAC and TOLLIP polymorphism highlights the urgent, unmet need for better, molecularly focused, and precise therapeutic strategies in IPF. The PRECISIONS clinical trial is the first study to use molecularly-focused techniques to identify patients with IPF most likely to benefit from treatment. PRECISIONS has the potential to shift the paradigm in how trials in this condition are designed and executed, and is the first step toward personalized medicine for patients with IPF. Trial Registration ClinicalTrials.gov identifier: NCT04300920. Registered March 9, 2020. https://clinicaltrials.gov/ct2/show/NCT04300920.
- Published
- 2022
39. A systematic review of postoperative outcomes of kidney stone surgery and meta-analysis of outcomes of percutaneous nephrolithotomy in individuals with spinal cord injury
- Author
-
Kim, John, Stewart, Veronica, Talwar, Gaurav, Uy, Michael, Hoogenes, Jen, and Matsumoto, Edward D.
- Published
- 2023
- Full Text
- View/download PDF
40. MGPU-TSM: A Multi-GPU System with Truly Shared Memory
- Author
-
Mojumder, Saiful A., Sun, Yifan, Delshadtehrani, Leila, Ma, Yenai, Baruah, Trinayan, Abellán, José L., Kim, John, Kaeli, David, and Joshi, Ajay
- Subjects
Computer Science - Hardware Architecture - Abstract
The sizes of GPU applications are rapidly growing. They are exhausting the compute and memory resources of a single GPU, and are demanding the move to multiple GPUs. However, the performance of these applications scales sub-linearly with GPU count because of the overhead of data movement across multiple GPUs. Moreover, a lack of hardware support for coherency exacerbates the problem because a programmer must either replicate the data across GPUs or fetch the remote data using high-overhead off-chip links. To address these problems, we propose a multi-GPU system with truly shared memory (MGPU-TSM), where the main memory is physically shared across all the GPUs. We eliminate remote accesses and avoid data replication using an MGPU-TSM system, which simplifies the memory hierarchy. Our preliminary analysis shows that MGPU-TSM with 4 GPUs performs, on average, 3.9x? better than the current best performing multi-GPU configuration for standard application benchmarks., Comment: 4 pages, 3 figures
- Published
- 2020
41. HALCONE : A Hardware-Level Timestamp-based Cache Coherence Scheme for Multi-GPU systems
- Author
-
Mojumder, Saiful A., Sun, Yifan, Delshadtehrani, Leila, Ma, Yenai, Baruah, Trinayan, Abellán, José L., Kim, John, Kaeli, David, and Joshi, Ajay
- Subjects
Computer Science - Hardware Architecture - Abstract
While multi-GPU (MGPU) systems are extremely popular for compute-intensive workloads, several inefficiencies in the memory hierarchy and data movement result in a waste of GPU resources and difficulties in programming MGPU systems. First, due to the lack of hardware-level coherence, the MGPU programming model requires the programmer to replicate and repeatedly transfer data between the GPUs' memory. This leads to inefficient use of precious GPU memory. Second, to maintain coherency across an MGPU system, transferring data using low-bandwidth and high-latency off-chip links leads to degradation in system performance. Third, since the programmer needs to manually maintain data coherence, the programming of an MGPU system to maximize its throughput is extremely challenging. To address the above issues, we propose a novel lightweight timestamp-based coherence protocol, HALCONE, for MGPU systems and modify the memory hierarchy of the GPUs to support physically shared memory. HALCONE replaces the Compute Unit (CU) level logical time counters with cache level logical time counters to reduce coherence traffic. Furthermore, HALCONE introduces a novel timestamp storage unit (TSU) with no additional performance overhead in the main memory to perform coherence actions. Our proposed HALCONE protocol maintains the data coherence in the memory hierarchy of the MGPU with minimal performance overhead (less than 1\%). Using a set of standard MGPU benchmarks, we observe that a 4-GPU MGPU system with shared memory and HALCONE performs, on average, 4.6$\times$ and 3$\times$ better than a 4-GPU MGPU system with existing RDMA and with the recently proposed HMG coherence protocol, respectively. We demonstrate the scalability of HALCONE using different GPU counts (2, 4, 8, and 16) and different CU counts (32, 48, and 64 CUs per GPU) for 11 standard benchmarks., Comment: 13 pages, 9 figures
- Published
- 2020
42. Adiposity and Interstitial Lung Abnormalities in Community-Dwelling Adults The MESA Cohort Study
- Author
-
Anderson, Michaela R, Kim, John S, Allison, Matthew, Giles, Jon T, Hoffman, Eric A, Ding, Jingzhong, Barr, R Graham, and Podolanczuk, Anna
- Subjects
Biomedical and Clinical Sciences ,Clinical Sciences ,Obesity ,Clinical Research ,Prevention ,Nutrition ,Lung ,2.1 Biological and endogenous factors ,Adiposity ,Aged ,Biomarkers ,Female ,Humans ,Independent Living ,Male ,Middle Aged ,Phenotype ,Prospective Studies ,Respiratory System Abnormalities ,Tomography ,X-Ray Computed ,high-attenuation areas ,IL-6 ,interstitial lung abnormalities ,interstitial lung disease ,leptin ,obesity ,Respiratory System ,Cardiovascular medicine and haematology ,Clinical sciences - Abstract
BackgroundObesity is associated with restrictive ventilatory defects and a faster rate of decline in FVC. This association is not exclusively mediated by mechanical factors and may reflect direct pulmonary injury by adipose-derived mediators.Research questionIs adipose tissue involved in the pathogenesis of interstitial lung disease (ILD)?Study design and methodsWe evaluated the association of CT measures of pericardial, abdominal visceral, and abdominal subcutaneous adipose tissue with high-attenuation areas (HAAs) and interstitial lung abnormalities (ILAs) in a large multicenter cohort study of community-dwelling adults, using multivariable-adjusted models. We secondarily evaluated the association of adipose depot size with FVC and biomarkers of obesity and inflammation.ResultsIn fully adjusted models, every doubling in pericardial adipose tissue volume was associated with a 63.4-unit increase in HAA (95% CI, 55.5-71.3), 20% increased odds of ILA (95% CI, -2% to 50%), and a 5.5% decrease in percent predicted FVC (95% CI, -6.8% to -4.3%). IL-6 levels accounted for 8% of the association between pericardial adipose tissue and HAA. Every doubling in visceral adipose tissue area was associated with a 41.5-unit increase in HAA (95% CI, 28.3-54.7), 30% increased odds of ILA (95% CI, -10% to 80%), and a 5.4% decrease in percent predicted FVC (95% CI, -6.6% to -4.3%). IL-6 and leptin accounted for 17% and 18%, respectively, of the association between visceral adipose tissue and HAA.InterpretationGreater amounts of pericardial and abdominal visceral adipose tissue were associated with CT measures of early lung injury and lower FVC in a cohort of community-dwelling adults. Adipose tissue may represent a modifiable risk factor for ILD.
- Published
- 2021
43. Cost Effectiveness of Implementing a Universal Birth Hepatitis B Vaccination Program in Ontario
- Author
-
Kim, John J., Alsabbagh, Wasem, and Wong, William W. L.
- Published
- 2023
- Full Text
- View/download PDF
44. Deep Learning Training in Facebook Data Centers: Design of Scale-up and Scale-out Systems
- Author
-
Naumov, Maxim, Kim, John, Mudigere, Dheevatsa, Sridharan, Srinivas, Wang, Xiaodong, Zhao, Whitney, Yilmaz, Serhat, Kim, Changkyu, Yuen, Hector, Ozdal, Mustafa, Nair, Krishnakumar, Gao, Isabel, Su, Bor-Yiing, Yang, Jiyan, and Smelyanskiy, Mikhail
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,68T05, 68M10 ,H.3.3 ,I.2.6 ,C.2.1 - Abstract
Large-scale training is important to ensure high performance and accuracy of machine-learning models. At Facebook we use many different models, including computer vision, video and language models. However, in this paper we focus on the deep learning recommendation models (DLRMs), which are responsible for more than 50% of the training demand in our data centers. Recommendation models present unique challenges in training because they exercise not only compute but also memory capacity as well as memory and network bandwidth. As model size and complexity increase, efficiently scaling training becomes a challenge. To address it we design Zion - Facebook's next-generation large-memory training platform that consists of both CPUs and accelerators. Also, we discuss the design requirements of future scale-out training systems., Comment: 10 pages, 14 figures; adjusted Fig. 10, added reference; fixed typos
- Published
- 2020
45. Superhydrophobic drag reduction in high-speed towing tank
- Author
-
Xu, Muchen, Yu, Ning, Kim, John, and Kim, Chang-Jin ''CJ''
- Subjects
turbulence control ,drag reduction ,MEMS ,NEMS ,Mathematical Sciences ,Engineering ,Fluids & Plasmas - Abstract
Abstract
- Published
- 2021
46. NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units
- Author
-
Hyun, Bongjoon, Kwon, Youngeun, Choi, Yujeong, Kim, John, and Rhu, Minsoo
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Machine Learning ,Computer Science - Neural and Evolutionary Computing - Abstract
To satisfy the compute and memory demands of deep neural networks, neural processing units (NPUs) are widely being utilized for accelerating deep learning algorithms. Similar to how GPUs have evolved from a slave device into a mainstream processor architecture, it is likely that NPUs will become first class citizens in this fast-evolving heterogeneous architecture space. This paper makes a case for enabling address translation in NPUs to decouple the virtual and physical memory address space. Through a careful data-driven application characterization study, we root-cause several limitations of prior GPU-centric address translation schemes and propose a memory management unit (MMU) that is tailored for NPUs. Compared to an oracular MMU design point, our proposal incurs only an average 0.06% performance overhead.
- Published
- 2019
47. LYTNet: A Convolutional Neural Network for Real-Time Pedestrian Traffic Lights and Zebra Crossing Recognition for the Visually Impaired
- Author
-
Yu, Samuel, Lee, Heon, and Kim, John
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Currently, the visually impaired rely on either a sighted human, guide dog, or white cane to safely navigate. However, the training of guide dogs is extremely expensive, and canes cannot provide essential information regarding the color of traffic lights and direction of crosswalks. In this paper, we propose a deep learning based solution that provides information regarding the traffic light mode and the position of the zebra crossing. Previous solutions that utilize machine learning only provide one piece of information and are mostly binary: only detecting red or green lights. The proposed convolutional neural network, LYTNet, is designed for comprehensiveness, accuracy, and computational efficiency. LYTNet delivers both of the two most important pieces of information for the visually impaired to cross the road. We provide five classes of pedestrian traffic lights rather than the commonly seen three or four, and a direction vector representing the midline of the zebra crossing that is converted from the 2D image plane to real-world positions. We created our own dataset of pedestrian traffic lights containing over 5000 photos taken at hundreds of intersections in Shanghai. The experiments carried out achieve a classification accuracy of 94%, average angle error of 6.35 degrees, with a frame rate of 20 frames per second when testing the network on an iPhone 7 with additional post-processing steps., Comment: 12 pages, 5 figures, 6 tables, International Conference on Computer Analysis of Images and Patterns (CAIP)
- Published
- 2019
- Full Text
- View/download PDF
48. A prospective longitudinal in vivo 1H MR spectroscopy study of the SIV/macaque model of neuroAIDS
- Author
-
Halpern Elkan, Masliah Eliezer, Sehgal Prabhat K, He Julian, Lentz Margaret R, Kim John P, Greco Jane B, Ratai Eva, Westmoreland Susan V, Fuller Robert A, Lackner Andrew A, and González R Gilberto
- Subjects
Neurosciences. Biological psychiatry. Neuropsychiatry ,RC321-571 ,Neurophysiology and neuropsychology ,QP351-495 - Abstract
Abstract Background The neurological complications of HIV infection remain poorly understood. Clinically, in vivo 1H magnetic resonance spectroscopy (MRS) demonstrates brain injury caused by HIV infection even when the MRI is normal. Our goal was to undertsand the dynamics of cerebral injury by performing a longitudinal in vivo 1H MRS study of the SIV/macaque model of neuroAIDS. Results Eight rhesus macaques were infected with SIVmac251 and serially imaged with MRI and 1H MRS to terminal AIDS or the endpoint of 2 years. During acute infection, there were stereotypical brain MRS changes, dominated by a significant elevation of the Cho/Cr ratio in the frontal cortex. Subsequently, brain metabolic patterns diverged between animals. There was an elevation of basal ganglia Cho/Cr four weeks post-inoculation in 2 animals that developed SIV encephalitis (p = 0.022). Metabolite ratios averaged across all 8 animals were not significantly different from baseline at any time point after 2 weeks post inoculation. However, linear regression analysis on all 8 animals revealed a positive correlation between a change in frontal lobe Cho/Cr and plasma viral load (P < 0.001, R = 0.80), and a negative correlation between NAA/Cr in the basal ganglia and the plasma viral load (P < 0.02, R = -0.73). No MRI abnormalities were detected at any time. Conclusions After infection with SIV, macaque brain metabolism changes in a complex manner that is dependent on brain region, host factors and viral load. An elevation of basal ganglia Cho/Cr 4 weeks after SIV infection may be marker of a propensity to develop SIV encephalitis. Elevations of Cho/Cr, often observed in CNS inflammation, were associated with increased plasma viral load during acute and chronic infection. Evidence of neuronal injury in the basal ganglia was associated with increased plasma viral load in the chronic stage of infection. These observations support the use of drugs capable of controlling the viral replication and trafficking of virus into the CNS, and may help explain the reduction in incidence of HIV-associated dementia in the era of HAART despite the inability of most of those drugs to effectively enter the CNS.
- Published
- 2004
- Full Text
- View/download PDF
49. Migration and health study: a socio-ecological analysis of sexual health among migrants in Manitoba, Canada
- Author
-
Souleymanov, Rusty, Akinyele-Akanbi, Bolaji, Njeze, Chinyere, Ukoli, Patricia, Migliardi, Paula, Kim, John, Payne, Michael, Ringaert, Laurie, Restall, Gayle, Larcombe, Linda, Lachowsky, Nathan, Khan, Mohammad Nuruzzaman, Lorway, Robert, and Pino, Fritz
- Published
- 2023
- Full Text
- View/download PDF
50. Transcriptome profiling of a synergistic volumetric muscle loss repair strategy
- Author
-
Roberts, Kevin, Kim, John Taehwan, Huynh, Tai, Schluns, Jacob, Dunlap, Grady, Hestekin, Jamie, and Wolchok, Jeffrey C
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.