279 results on '"Yoo, Shinjae"'
Search Results
252. Solar irradiance forecast system based on geostationary satellite
- Author
-
Peng, Zhenzhou, primary, Yoo, Shinjae, additional, Yu, Dantong, additional, and Huang, Dong, additional
- Published
- 2013
- Full Text
- View/download PDF
253. Cloud motion estimation for short term solar irradiation prediction
- Author
-
Huang, Hao, primary, Xu, Jin, additional, Peng, Zhenzhou, additional, Yoo, Shinjae, additional, Yu, Dantong, additional, Huang, Dong, additional, and Qin, Hong, additional
- Published
- 2013
- Full Text
- View/download PDF
254. Synchrotron X-ray microfluorescence measurement of metal distributions in Phragmites australis root system in the Yangtze River intertidal zone.
- Author
-
Feng, Huan, Zhang, Weiguo, Qian, Yu, Liu, Wenliang, Yu, Lizhong, Yoo, Shinjae, Wang, Jun, Wang, Jia-Jun, Eng, Christopher, Liu, Chang-Jun, and Tappero, Ryan
- Subjects
SYNCHROTRONS ,INTERTIDAL zonation ,PHRAGMITES australis ,ELECTROMAGNETIC waves ,X-rays ,SOIL sampling - Abstract
This study investigates the distributions of Br, Ca, Cl, Cr, Cu, K, Fe, Mn, Pb, Ti, V and Zn in Phragmites australis root system and the function of Fe nanoparticles in scavenging metals in the root epidermis using synchrotron X-ray microfluorescence, synchrotron transmission X-ray microscope measurement and synchrotron X-ray absorption near-edge structure techniques. The purpose of this study is to understand the mobility of metals in wetland plant root systems after their uptake from rhizosphere soils. Phragmites australis samples were collected in the Yangtze River intertidal zone in July 2013. The results indicate that Fe nanoparticles are present in the root epidermis and that other metals correlate significantly with Fe, suggesting that Fe nanoparticles play an important role in metal scavenging in the epidermis. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
255. A New Anomaly Detection Algorithm Based on Quantum Mechanics
- Author
-
Huang, Hao, primary, Qin, Hong, additional, Yoo, Shinjae, additional, and Yu, Dantong, additional
- Published
- 2012
- Full Text
- View/download PDF
256. Local anomaly descriptor
- Author
-
Huang, Hao, primary, Qin, Hong, additional, Yoo, Shinjae, additional, and Yu, Dantong, additional
- Published
- 2012
- Full Text
- View/download PDF
257. Correlation and local feature based cloud motion estimation
- Author
-
Huang, Hao, primary, Yoo, Shinjae, additional, Yu, Dantong, additional, Huang, Dong, additional, and Qin, Hong, additional
- Published
- 2012
- Full Text
- View/download PDF
258. A Robust Clustering Algorithm Based on Aggregated Heat Kernel Mapping
- Author
-
Huang, Hao, primary, Yoo, Shinjae, additional, Qin, Hong, additional, and Yu, Dantong, additional
- Published
- 2011
- Full Text
- View/download PDF
259. Modeling personalized email prioritization
- Author
-
Yoo, Shinjae, primary, Yang, Yiming, additional, and Carbonell, Jaime, additional
- Published
- 2011
- Full Text
- View/download PDF
260. Mining social networks for personalized email prioritization
- Author
-
Yoo, Shinjae, primary, Yang, Yiming, additional, Lin, Frank, additional, and Moon, Il-Chul, additional
- Published
- 2009
- Full Text
- View/download PDF
261. Robustness of adaptive filtering methods in a cross-benchmark evaluation
- Author
-
Yang, Yiming, primary, Yoo, Shinjae, additional, Zhang, Jian, additional, and Kisiel, Bryan, additional
- Published
- 2005
- Full Text
- View/download PDF
262. Quantum machine learning with differential privacy.
- Author
-
Watkins, William M., Chen, Samuel Yen-Chi, and Yoo, Shinjae
- Subjects
- *
MACHINE learning , *IMAGE recognition (Computer vision) , *CROWDSOURCING , *PRIVACY , *QUANTUM computers , *AUTOMATIC speech recognition , *MATHEMATICAL optimization - Abstract
Quantum machine learning (QML) can complement the growing trend of using learned models for a myriad of classification tasks, from image recognition to natural speech processing. There exists the potential for a quantum advantage due to the intractability of quantum operations on a classical computer. Many datasets used in machine learning are crowd sourced or contain some private information, but to the best of our knowledge, no current QML models are equipped with privacy-preserving features. This raises concerns as it is paramount that models do not expose sensitive information. Thus, privacy-preserving algorithms need to be implemented with QML. One solution is to make the machine learning algorithm differentially private, meaning the effect of a single data point on the training dataset is minimized. Differentially private machine learning models have been investigated, but differential privacy has not been thoroughly studied in the context of QML. In this study, we develop a hybrid quantum-classical model that is trained to preserve privacy using differentially private optimization algorithm. This marks the first proof-of-principle demonstration of privacy-preserving QML. The experiments demonstrate that differentially private QML can protect user-sensitive information without signficiantly diminishing model accuracy. Although the quantum model is simulated and tested on a classical computer, it demonstrates potential to be efficiently implemented on near-term quantum devices [noisy intermediate-scale quantum (NISQ)]. The approach's success is illustrated via the classification of spatially classed two-dimensional datasets and a binary MNIST classification. This implementation of privacy-preserving QML will ensure confidentiality and accurate learning on NISQ technology. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
263. Effectiveness and safety of immune checkpoint inhibitors in Black patients versus White patients in a US national health system: a retrospective cohort study.
- Author
-
Miller, Sean, Jiang, Ralph, Schipper, Matthew, Fritsche, Lars G, Strohbehn, Garth, Wallace, Beth, Brinzevich, Daria, Falvello, Virginia, McMahon, Benjamin H, Zamora-Resendiz, Rafael, Ramnath, Nithya, Dai, Xin, Sankar, Kamya, Edwards, Donna M, Allen, Steven G, Yoo, Shinjae, Crivelli, Silvia, Green, Michael D, and Bryant, Alex K
- Abstract
Black patients were severely under-represented in the clinical trials that led to the approval of immune checkpoint inhibitors (ICIs) for all cancers. The aim of this study was to characterise the effectiveness and safety of ICIs in Black patients. We did a retrospective cohort study of patients in the US Veterans Health Administration (VHA) system's Corporate Data Warehouse containing electronic medical records for all patients who self-identified as non-Hispanic Black or African American (referred to as Black) or non-Hispanic White (referred to as White) and received PD-1, PD-L1, CTLA-4, or LAG-3 inhibitors between Jan 1, 2010, and Dec 31, 2023. Effectiveness outcomes were overall survival, time to treatment discontinuation, and time to next treatment. The safety outcome was the frequency of immune-related adverse events; assessed among a random sample of 1000 Black patients and 1000 White patients, 892 pairs were matched on the basis of baseline characteristics using 1:1 exact matching without replacement. After manual chart review, patients who did not receive ICI therapy or who had inadequate follow-up were excluded. The adjusted effect of race on each effectiveness outcome was assessed in the whole ICI-treated cohort with propensity-weighted Cox regression with robust standard errors. Immune-related adverse events outcomes were analysed in the random matched sample with multivariable Cox regression, adjusting for baseline characteristics. We identified 26 398 patients, of whom 4943 (18·7%) patients were Black, 21 455 (81·3%) were White, 895 (3·4%) were female, 25 503 (96·6%) were male, 11 859 (45%) had non-small-cell lung cancer, and 26 045 (98·7%) received PD-1 or PD-L1 inhibitors. As of data cutoff (Aug 28, 2024), median follow-up was 40·3 months (95% CI 38·3–42·3) for Black patients and 43·9 months (43·0–45·1) for White patients. Compared with White patients, Black patients had longer time to treatment discontinuation (2-year unadjusted rates 10·7% [95% CI 9·8–11·7] for Black patients vs 8·6% [8·2–9·0] for White patients; adjusted hazard ratio [HR] 0·91, 95% CI 0·87–0·95, p<0·0001), similar time to next treatment (23·5% [22·3–24·8] for Black patients vs 25·6% [25·0–26·2] for White patients; 1·00, 0·95–1·05, p=0·96), and slightly improved overall survival (36·5% [35·2–38·1] for Black patients vs 36·5% [35·8–37·1]; 0·95, 0·90–0·99, p=0·036). 1710 patients (n=862 Black and n=848 White) were analysed for safety outcomes. Compared with White patients, Black patients had a reduced risk of all-grade immune-related adverse events (unadjusted 2-year rate 33·1% [95% CI 28·9–37·1] vs 44·1% [95% CI 39·1–48·7]; adjusted HR 0·75, 95% CI 0·62–0·90, p=0·0026), immune-related adverse events requiring treatment with systemic steroids (0·61, 0·46–0·81, p=0·00051), and immune-related adverse events resulting in permanent ICI discontinuation (0·58, 0·44–0·78, p=0·00024). In exploratory analyses of irAE subtypes, a significant risk reduction in Black patients was found for colitis (0·46, 0·27–0·76, p=0·0026) and hyperthyroidism or hypothyroidism (0·63, 0·44–0·90, p=0·011), and no significant differences were found for any other immune-related adverse event subtypes analysed. Similar results were found in analyses using a steroid-based definition of immune-related adverse events among the entire ICI-treated cohort. Compared with White patients, Black patients had similar ICI effectiveness and lower toxicities among those treated in the national VHA system, potentially reflecting an important difference in the therapeutic ratio (ratio of benefit to harm) of ICIs. Our findings of decreased toxicity among Black patients require further investigation to assess their generalisability. Million Veteran Program, Office of Research and Development, Veterans Health Administration and the LUNGevity foundation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
264. Pan‐Cancer Survival Impact of Immune Checkpoint Inhibitors in a National Healthcare System.
- Author
-
Miller, Sean R., Schipper, Matthew, Fritsche, Lars G., Jiang, Ralph, Strohbehn, Garth, Ötleş, Erkin, McMahon, Benjamin H., Crivelli, Silvia, Zamora‐Resendiz, Rafael, Ramnath, Nithya, Yoo, Shinjae, Dai, Xin, Sankar, Kamya, Edwards, Donna M., Allen, Steven G., Green, Michael D., and Bryant, Alex K.
- Subjects
- *
SURVIVAL rate , *IMMUNE checkpoint inhibitors , *OVERALL survival , *VETERANS' health , *PROGNOSIS - Abstract
Background: The cumulative, health system‐wide survival benefit of immune checkpoint inhibitors (ICIs) is unclear, particularly among real‐world patients with limited life expectancies and among subgroups poorly represented on clinical trials. We sought to determine the health system‐wide survival impact of ICIs. Methods: We identified all patients receiving PD‐1/PD‐L1 or CTLA‐4 inhibitors from 2010 to 2023 in the national Veterans Health Administration (VHA) system (ICI cohort) and all patients who received non‐ICI systemic therapy in the years before ICI approval (historical control). ICI and historical control cohorts were matched on multiple cancer‐related prognostic factors, comorbidities, and demographics. The effect of ICI on overall survival was quantified with Cox regression incorporating matching weights. Cumulative life‐years gained system‐wide were calculated from the difference in adjusted 5‐year restricted mean survival times. Results: There were 27,322 patients in the ICI cohort and 69,801 patients in the historical control cohort. Among ICI patients, the most common cancer types were NSCLC (46%) and melanoma (10%). ICI demonstrated a large OS benefit in most cancer types with heterogeneity across cancer types (NSCLC: adjusted HR [aHR] 0.56, 95% confidence interval [CI] 0.54–0.58, p < 0.001; urothelial: aHR 0.91, 95% CI 0.83–1.01, p = 0.066). The relative benefit of ICI was stable across patient age, comorbidity, and self‐reported race subgroups. Across VHA, 15,859 life‐years gained were attributable to ICI within 5‐years of treatment, with NSCLC contributing the most life‐years gained. Conclusion: We demonstrated substantial increase in survival due to ICIs across a national health system, including in patient subgroups poorly represented on clinical trials. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
265. Towards Real Time Quantitative Analysis of Supported Nanoparticle Ensemble Evolution Investigated by Environmental TEM.
- Author
-
Zakharov, Dmitri N., Lin, Yuewei, Megret, Remi, Yoo, Shinjae, Voorhees, Peter, Horwath, James P., and Stach, Eric A.
- Published
- 2019
- Full Text
- View/download PDF
266. P1‐270: MACHINE LEARNING PREDICTION OF FUTURE INCIDENCE OF ALZHEIMER'S DISEASE USING POPULATION‐WIDE ELECTRONIC HEALTH RECORDS.
- Author
-
Park, Ji-Hwan, Cho, Han-Eol, Cha, Jun Min, Kim, Jong Hun, Yoo, Shinjae, Kim, Hyoung-Seop, and Cha, Jiook
- Published
- 2019
- Full Text
- View/download PDF
267. Understanding Data Access Patterns for dCache System.
- Author
-
Bellavita, Julian, Sim, Caitlin, Wu, Kesheng, Sim, Alex, Yoo, Shinjae, Ito, Hiro, Garonne, Vincent, and Lancon, Eric
- Subjects
- *
BACK up systems , *DATA warehousing , *HEURISTIC , *METHODOLOGY , *DATA analysis - Abstract
The storage management system dCache acts as a disk cache for high-energy physics (HEP) data from the US ATLAS community. Since its disk capacity is considerably smaller than the total volume of ATLAS data, a heuristic is needed to determine what data should be kept on disks. An effective heuristic would be to keep the data files that are expected to be heavily accessed in the near future. Through a careful study of access statistics, we find a few most popular datasets are accessed much more frequently than others, even though these popular datasets change over time. If we could predict the near-term popularity of datasets, we could pin the most popular ones in the disk cache to prevent their accidental removal and guarantee their availability. To predict a dataset popularity, we present several methods for forecasting the number of times a dataset will be accessed in the next day. Test results show that these methods could predict the next-day access counts of popular datasets reliably. This observation is confirmed with dCache logs from two separate time ranges. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
268. Machine learning prediction of incidence of Alzheimer’s disease using large-scale administrative health data
- Author
-
Park, Ji-Hwan, Cho, Han Eol, Kim, Jong Hun, Wall, Melanie M., Stern, Yaakov, Lim, Hyun-Sun, Yoo, Shinjae, Kim, Hyoung Seop, and Cha, Jiook
- Subjects
Alzheimer's disease ,Diseases--Risk factors ,Cohort analysis ,3. Good health - Abstract
Nationwide population-based cohort provides a new opportunity to build an automated risk prediction model based on individuals’ history of health and healthcare beyond existing risk prediction models. We tested the possibility of machine learning models to predict future incidence of Alzheimer’s disease (AD) using large-scale administrative health data. From the Korean National Health Insurance Service database between 2002 and 2010, we obtained de-identified health data in elders above 65 years (N = 40,736) containing 4,894 unique clinical features including ICD-10 codes, medication codes, laboratory values, history of personal and family illness and socio-demographics. To define incident AD we considered two operational definitions: “definite AD” with diagnostic codes and dementia medication (n = 614) and “probable AD” with only diagnosis (n = 2026). We trained and validated random forest, support vector machine and logistic regression to predict incident AD in 1, 2, 3, and 4 subsequent years. For predicting future incidence of AD in balanced samples (bootstrapping), the machine learning models showed reasonable performance in 1-year prediction with AUC of 0.775 and 0.759, based on “definite AD” and “probable AD” outcomes, respectively; in 2-year, 0.730 and 0.693; in 3-year, 0.677 and 0.644; in 4-year, 0.725 and 0.683. The results were similar when the entire (unbalanced) samples were used. Important clinical features selected in logistic regression included hemoglobin level, age and urine protein level. This study may shed a light on the utility of the data-driven machine learning model based on large-scale administrative health data in AD risk prediction, which may enable better selection of individuals at risk for AD in clinical trials or early detection in clinical settings.
269. Artificial intelligence to unlock real‐world evidence in clinical oncology: A primer on recent advances.
- Author
-
Bryant, Alex K., Zamora‐Resendiz, Rafael, Dai, Xin, Morrow, Destinee, Lin, Yuewei, Jungles, Kassidy M., Rae, James M., Tate, Akshay, Pearson, Ashley N., Jiang, Ralph, Fritsche, Lars, Lawrence, Theodore S., Zou, Weiping, Schipper, Matthew, Ramnath, Nithya, Yoo, Shinjae, Crivelli, Silvia, and Green, Michael D.
- Subjects
- *
ARTIFICIAL intelligence , *LANGUAGE models , *ONCOLOGY - Abstract
Purpose: Real world evidence is crucial to understanding the diffusion of new oncologic therapies, monitoring cancer outcomes, and detecting unexpected toxicities. In practice, real world evidence is challenging to collect rapidly and comprehensively, often requiring expensive and time‐consuming manual case‐finding and annotation of clinical text. In this Review, we summarise recent developments in the use of artificial intelligence to collect and analyze real world evidence in oncology. Methods: We performed a narrative review of the major current trends and recent literature in artificial intelligence applications in oncology. Results: Artificial intelligence (AI) approaches are increasingly used to efficiently phenotype patients and tumors at large scale. These tools also may provide novel biological insights and improve risk prediction through multimodal integration of radiographic, pathological, and genomic datasets. Custom language processing pipelines and large language models hold great promise for clinical prediction and phenotyping. Conclusions: Despite rapid advances, continued progress in computation, generalizability, interpretability, and reliability as well as prospective validation are needed to integrate AI approaches into routine clinical care and real‐time monitoring of novel therapies. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
270. Overestimated prediction using polygenic prediction derived from summary statistics.
- Author
-
Park, David Keetae, Chen, Mingshen, Kim, Seungsoo, Joo, Yoonjung Yoonie, Loving, Rebekah K., Kim, Hyoung Seop, Cha, Jiook, Yoo, Shinjae, and Kim, Jong Hun
- Subjects
- *
DISEASE risk factors , *ALZHEIMER'S disease , *MONOGENIC & polygenic inheritance (Genetics) , *FORECASTING - Abstract
Background: When polygenic risk score (PRS) is derived from summary statistics, independence between discovery and test sets cannot be monitored. We compared two types of PRS studies derived from raw genetic data (denoted as rPRS) and the summary statistics for IGAP (sPRS). Results: Two variables with the high heritability in UK Biobank, hypertension, and height, are used to derive an exemplary scale effect of PRS. sPRS without APOE is derived from International Genomics of Alzheimer's Project (IGAP), which records ΔAUC and ΔR2 of 0.051 ± 0.013 and 0.063 ± 0.015 for Alzheimer's Disease Sequencing Project (ADSP) and 0.060 and 0.086 for Accelerating Medicine Partnership - Alzheimer's Disease (AMP-AD). On UK Biobank, rPRS performances for hypertension assuming a similar size of discovery and test sets are 0.0036 ± 0.0027 (ΔAUC) and 0.0032 ± 0.0028 (ΔR2). For height, ΔR2 is 0.029 ± 0.0037. Conclusion: Considering the high heritability of hypertension and height of UK Biobank and sample size of UK Biobank, sPRS results from AD databases are inflated. Independence between discovery and test sets is a well-known basic requirement for PRS studies. However, a lot of PRS studies cannot follow such requirements because of impossible direct comparisons when using summary statistics. Thus, for sPRS, potential duplications should be carefully considered within the same ethnic group. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
271. Use of physics to improve solar forecast: Part II, machine learning and model interpretability.
- Author
-
Liu, Weijia, Liu, Yangang, Zhang, Tao, Han, Yongxiang, Zhou, Xin, Xie, Yu, and Yoo, Shinjae
- Subjects
- *
ATMOSPHERIC radiation measurement , *FORECASTING , *MACHINE learning , *SOLAR energy - Abstract
• A hierarchy of machine learning forecast systems is presented and compared with five persistence models. • Difference between hierarchical models enhance the interpretability of forecasts. • LSTM and XGBoost outperform five persistence models. • The introduction of cloud albedo and cloud fraction significantly improves solar forecasts. • Machine learning models can somewhat learn clear-sky irradiance, making the value of adding it limited. Machine learning (ML) models have been applied to forecast solar energy; however, they often lack clarity of interpretability and underlying physics. This work addresses such challenges by developing a hierarchy of ML models that gradually introduce predictors to improve the forecast accuracy based on a physics-based framework. Three ML models (ARIMA, LSTM, and XGBoost) are examined and compared with four physics-informed persistence models reported in Part I and the simple persistence model to assess the improvement of different models. The 7-year measurements at the U.S. Department of Energy's Atmospheric Radiation Measurement's Southern Great Plains Central Facility site are used for forecasts and evaluations. The results reveal that the step-by-step introduction of predictors leads to different improvements for models at different hierarchical levels. Comparison of the ML models with persistence models shows that LSTM and XGBoost outperform all the persistence models, with LSTM having the overall best performance; however, ARIMA underperforms the four physics-informed persistence models. This study demonstrates the importance and utility of incorporating physics into ML models in improving forecast accuracy by introducing a hierarchy of physics-based predictors, distinguishing predictor contributions, and enhancing the ML interpretability. The combined use of Global Horizontal Irradiance (GHI) and Direct Normal Irradiance (DNI) significantly improves the forecast accuracy compared to using individual irradiances alone because the pair contains more information on cloud-radiation interactions. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
272. Use of physics to improve solar forecast: Physics-informed persistence models for simultaneously forecasting GHI, DNI, and DHI.
- Author
-
Liu, Weijia, Liu, Yangang, Zhou, Xin, Xie, Yu, Han, Yongxiang, Yoo, Shinjae, and Sengupta, Manajit
- Subjects
- *
ATMOSPHERIC radiation measurement , *FORECASTING , *SPECTRAL irradiance , *STATISTICAL models , *SOLAR energy - Abstract
• Physics-informed persistence forecast systems are presented. • New systems improve the forecast at long lead times. • Cloud-fraction and cloud-albedo-informed systems outperform the other models. • Model performance is dominated by the variability and error in cloud predictor. • Relationship between radiative and cloud properties can explain the model behavior. Observation-based statistical models have been widely used in forecasting solar energy; however, existing models often lack a clear relation to physics and are limited largely to global horizontal irradiance (GHI) forecasts over relatively short time horizons (<1 h). Incorporating physics into observation-based models, increasing forecast time horizons and developing a model system for forecasting not only GHI but also direct normal irradiance (DNI) and diffuse horizontal irradiance (DHI) remain challenging, especially under cloudy conditions because of complex cloud-radiation interactions. This work attempts to address these challenges by developing a hierarchy of four new physics-informed persistence models that can be used to simultaneously forecast GHI, DNI and DHI. The decade-long measurements (1998 to 2014) at the U.S. Department of Energy's Atmospheric Radiation Measurement (ARM)'s Southern Great Plains (SGP) Central Facility site are used to evaluate the performance of the new models. The results show that the new physics-informed forecast models generally outperform the simple and smart persistence models, and improve the forecast accuracy at lead times from 1.25 h up to 6 h. Further analysis reveals that the forecast error is highly related to the error and temporal variability of the assumed cloud predictor. The best model for forecasting different radiative components can be explained by the relationship between solar irradiances and cloud properties. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
273. Exploring Robust Features for Improving Adversarial Robustness.
- Author
-
Wang H, Deng Y, Yoo S, and Lin Y
- Abstract
While deep neural networks (DNNs) have revolutionized many fields, their fragility to carefully designed adversarial attacks impedes the usage of DNNs in safety-critical applications. In this article, we strive to explore the robust features that are not affected by the adversarial perturbations, that is, invariant to the clean image and its adversarial examples (AEs), to improve the model's adversarial robustness. Specifically, we propose a feature disentanglement model to segregate the robust features from nonrobust features and domain-specific features. The extensive experiments on five widely used datasets with different attacks demonstrate that robust features obtained from our model improve the model's adversarial robustness compared to the state-of-the-art approaches. Moreover, the trained domain discriminator is able to identify the domain-specific features from the clean images and AEs almost perfectly. This enables AE detection without incurring additional computational costs. With that, we can also specify different classifiers for clean images and AEs, thereby avoiding any drop in clean image accuracy.
- Published
- 2024
- Full Text
- View/download PDF
274. Pathway-based analyses of gene expression profiles at low doses of ionizing radiation.
- Author
-
Luo X, Niyakan S, Johnstone P, McCorkle S, Park G, López-Marrero V, Yoo S, Dougherty ER, Qian X, Alexander FJ, Jha S, and Yoon BJ
- Abstract
Radiation exposure poses a significant threat to human health. Emerging research indicates that even low-dose radiation once believed to be safe, may have harmful effects. This perception has spurred a growing interest in investigating the potential risks associated with low-dose radiation exposure across various scenarios. To comprehensively explore the health consequences of low-dose radiation, our study employs a robust statistical framework that examines whether specific groups of genes, belonging to known pathways, exhibit coordinated expression patterns that align with the radiation levels. Notably, our findings reveal the existence of intricate yet consistent signatures that reflect the molecular response to radiation exposure, distinguishing between low-dose and high-dose radiation. Moreover, we leverage a pathway-constrained variational autoencoder to capture the nonlinear interactions within gene expression data. By comparing these two analytical approaches, our study aims to gain valuable insights into the impact of low-dose radiation on gene expression patterns, identify pathways that are differentially affected, and harness the potential of machine learning to uncover hidden activity within biological networks. This comparative analysis contributes to a deeper understanding of the molecular consequences of low-dose radiation exposure., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2024 Luo, Niyakan, Johnstone, McCorkle, Park, López-Marrero, Yoo, Dougherty, Qian, Alexander, Jha and Yoon.)
- Published
- 2024
- Full Text
- View/download PDF
275. Multi-Agent Graph-Attention Deep Reinforcement Learning for Post-Contingency Grid Emergency Voltage Control.
- Author
-
Zhang Y, Yue M, Wang J, and Yoo S
- Abstract
Grid emergency voltage control (GEVC) is paramount in electric power systems to improve voltage stability and prevent cascading outages and blackouts in case of contingencies. While most deep reinforcement learning (DRL)-based paradigms perform single agents in a static environment, real-world agents for GEVC are expected to cooperate in a dynamically shifting grid. Moreover, due to high uncertainties from combinatory natures of various contingencies and load consumption, along with the complexity of dynamic grid operation, the data efficiency and control performance of the existing DRL-based methods are challenged. To address these limitations, we propose a multi-agent graph-attention (GATT)-based DRL algorithm for GEVC in multi-area power systems. We develop graph convolutional network (GCN)-based agents for feature representation of the graph-structured voltages to improve the decision accuracy in a data-efficient manner. Furthermore, a cutting-edge attention mechanism concentrates on effective information sharing among multiple agents, synergizing different-sized subnetworks in the grid for cooperative learning. We address several key challenges in the existing DRL-based GEVC approaches, including low scalability and poor stability against high uncertainties. Test results in the IEEE benchmark system verify the advantages of the proposed method over several recent multi-agent DRL-based algorithms.
- Published
- 2024
- Full Text
- View/download PDF
276. INSURE: An Information Theory iNspired diSentanglement and pURification modEl for Domain Generalization.
- Author
-
Yu X, Tseng HH, Yoo S, Ling H, and Lin Y
- Abstract
Domain Generalization (DG) aims to learn a generalizable model on the unseen target domain by only training on the multiple observed source domains. Although a variety of DG methods have focused on extracting domain-invariant features, the domain-specific class-relevant features have attracted attention and been argued to benefit generalization to the unseen target domain. To take into account the class-relevant domain-specific information, in this paper we propose an Information theory iNspired diSentanglement and pURification modEl (INSURE) to explicitly disentangle the latent features to obtain sufficient and compact (necessary) class-relevant feature for generalization to the unseen domain. Specifically, we first propose an information theory inspired loss function to ensure the disentangled class-relevant features contain sufficient class label information and the other disentangled auxiliary feature has sufficient domain information. We further propose a paired purification loss function to let the auxiliary feature discard all the class-relevant information and thus the class-relevant feature will contain sufficient and compact (necessary) class-relevant information. Moreover, instead of using multiple encoders, we propose to use a learnable binary mask as our disentangler to make the disentanglement more efficient and make the disentangled features complementary to each other. We conduct extensive experiments on five widely used DG benchmark datasets including PACS, VLCS, OfficeHome, TerraIncognita, and DomainNet. The proposed INSURE achieves state-of-the-art performance. We also empirically show that domain-specific class-relevant features are beneficial for domain generalization. The code is available at https://github.com/yuxi120407/INSURE.
- Published
- 2024
- Full Text
- View/download PDF
277. Survival analysis of localized prostate cancer with deep learning.
- Author
-
Dai X, Park JH, Yoo S, D'Imperio N, McMahon BH, Rentsch CT, Tate JP, and Justice AC
- Subjects
- Male, Humans, United States, Prostate-Specific Antigen, Cross-Sectional Studies, Survival Analysis, Deep Learning, Prostatic Neoplasms pathology
- Abstract
In recent years, data-driven, deep-learning-based models have shown great promise in medical risk prediction. By utilizing the large-scale Electronic Health Record data found in the U.S. Department of Veterans Affairs, the largest integrated healthcare system in the United States, we have developed an automated, personalized risk prediction model to support the clinical decision-making process for localized prostate cancer patients. This method combines the representative power of deep learning and the analytical interpretability of parametric regression models and can implement both time-dependent and static input data. To collect a comprehensive evaluation of model performances, we calculate time-dependent C-statistics [Formula: see text] over 2-, 5-, and 10-year time horizons using either a composite outcome or prostate cancer mortality as the target event. The composite outcome combines the Prostate-Specific Antigen (PSA) test, metastasis, and prostate cancer mortality. Our longitudinal model Recurrent Deep Survival Machine (RDSM) achieved [Formula: see text] 0.85 (0.83), 0.80 (0.83), and 0.76 (0.81), while the cross-sectional model Deep Survival Machine (DSM) attained [Formula: see text] 0.85 (0.82), 0.80 (0.82), and 0.76 (0.79) for the 2-, 5-, and 10-year composite (mortality) outcomes, respectively. In addition to estimating the survival probability, our method can quantify the uncertainty associated with the prediction. The uncertainty scores show a consistent correlation with the prediction accuracy. We find PSA and prostate cancer stage information are the most important indicators in risk prediction. Our work demonstrates the utility of the data-driven machine learning model in prostate cancer risk prediction, which can play a critical role in the clinical decision system., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
278. Machine learning prediction of incidence of Alzheimer's disease using large-scale administrative health data.
- Author
-
Park JH, Cho HE, Kim JH, Wall MM, Stern Y, Lim H, Yoo S, Kim HS, and Cha J
- Abstract
Nationwide population-based cohort provides a new opportunity to build an automated risk prediction model based on individuals' history of health and healthcare beyond existing risk prediction models. We tested the possibility of machine learning models to predict future incidence of Alzheimer's disease (AD) using large-scale administrative health data. From the Korean National Health Insurance Service database between 2002 and 2010, we obtained de-identified health data in elders above 65 years ( N = 40,736) containing 4,894 unique clinical features including ICD-10 codes, medication codes, laboratory values, history of personal and family illness and socio-demographics. To define incident AD we considered two operational definitions: "definite AD" with diagnostic codes and dementia medication ( n = 614) and "probable AD" with only diagnosis ( n = 2026). We trained and validated random forest, support vector machine and logistic regression to predict incident AD in 1, 2, 3, and 4 subsequent years. For predicting future incidence of AD in balanced samples (bootstrapping), the machine learning models showed reasonable performance in 1-year prediction with AUC of 0.775 and 0.759, based on "definite AD" and "probable AD" outcomes, respectively; in 2-year, 0.730 and 0.693; in 3-year, 0.677 and 0.644; in 4-year, 0.725 and 0.683. The results were similar when the entire (unbalanced) samples were used. Important clinical features selected in logistic regression included hemoglobin level, age and urine protein level. This study may shed a light on the utility of the data-driven machine learning model based on large-scale administrative health data in AD risk prediction, which may enable better selection of individuals at risk for AD in clinical trials or early detection in clinical settings., Competing Interests: Competing interestsThe authors declare no competing interests., (© The Author(s) 2020.)
- Published
- 2020
- Full Text
- View/download PDF
279. KBase: The United States Department of Energy Systems Biology Knowledgebase.
- Author
-
Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, Sneddon MW, Henderson ML, Riehl WJ, Murphy-Olson D, Chan SY, Kamimura RT, Kumari S, Drake MM, Brettin TS, Glass EM, Chivian D, Gunter D, Weston DJ, Allen BH, Baumohl J, Best AA, Bowen B, Brenner SE, Bun CC, Chandonia JM, Chia JM, Colasanti R, Conrad N, Davis JJ, Davison BH, DeJongh M, Devoid S, Dietrich E, Dubchak I, Edirisinghe JN, Fang G, Faria JP, Frybarger PM, Gerlach W, Gerstein M, Greiner A, Gurtowski J, Haun HL, He F, Jain R, Joachimiak MP, Keegan KP, Kondo S, Kumar V, Land ML, Meyer F, Mills M, Novichkov PS, Oh T, Olsen GJ, Olson R, Parrello B, Pasternak S, Pearson E, Poon SS, Price GA, Ramakrishnan S, Ranjan P, Ronald PC, Schatz MC, Seaver SMD, Shukla M, Sutormin RA, Syed MH, Thomason J, Tintle NL, Wang D, Xia F, Yoo H, Yoo S, and Yu D
- Subjects
- Humans, United States, Computational Biology methods, Database Management Systems trends, Knowledge Bases, Systems Biology trends
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.