Author: "Yang, Carl" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yang, Carl"' showing total 294 results

Start Over Author "Yang, Carl"

294 results on '"Yang, Carl"'

251. GraphBLAS C API: Ideas for future versions of the specification

Author: Mattson, Timothy G., primary, Yang, Carl, additional, McMillan, Scott, additional, Buluc, Aydin, additional, and Moreira, Jose E., additional
Published: 2017
Full Text: View/download PDF

252. Bridging Collaborative Filtering and Semi-Supervised Learning

Author: Yang, Carl, primary, Bai, Lanxiao, additional, Zhang, Chao, additional, Yuan, Quan, additional, and Han, Jiawei, additional
Published: 2017
Full Text: View/download PDF

253. Design of the GraphBLAS API for C

Author: Buluc, Aydin, primary, Mattson, Tim, additional, McMillan, Scott, additional, Moreira, Jose, additional, and Yang, Carl, additional
Published: 2017
Full Text: View/download PDF

254. Multi-GPU Graph Analytics

Author: Pan, Yuechao, primary, Wang, Yangzihao, additional, Wu, Yuduo, additional, Yang, Carl, additional, and Owens, John D., additional
Published: 2017
Full Text: View/download PDF

255. Gunrock

Author: Wang, Yangzihao, primary, Pan, Yuechao, additional, Davidson, Andrew, additional, Wu, Yuduo, additional, Yang, Carl, additional, Wang, Leyuan, additional, Osama, Muhammad, additional, Yuan, Chenshan, additional, Liu, Weitang, additional, Riffel, Andy T., additional, and Owens, John D., additional
Published: 2017
Full Text: View/download PDF

256. Bi-directional Joint Inference for User Links and Attributes on Large Social Graphs

Author: Yang, Carl, primary, Zhong, Lin, additional, Li, Li-Jia, additional, and Jie, Luo, additional
Published: 2017
Full Text: View/download PDF

257. Use of Big Data to Evaluate and Improve Performance of Traffic Signal Systems in Resource-Constrained Countries

Author: Lu, Yang Carl, primary, Krambeck, Holly, additional, and Tang, Liang, additional
Published: 2017
Full Text: View/download PDF

258. Optimal Variable Speed Limit Control System for Freeway Work Zone Operations

Author: Yang, Xianfeng, primary, Lu, Yang (Carl), additional, and Lin, Yongjie, additional
Published: 2017
Full Text: View/download PDF

259. Mathematical Foundations of the GraphBLAS

Author: Kepner, Jeremy, Aaltonen, Peter, Bader, David, Buluc, Aydın, Franchetti, Franz, Gilbert, John, Hutchison, Dylan, Kumar, Manoj, Lumsdaine, Andrew, Meyerhenke, Henning, McMillan, Scott, Moreira, Jose, Owens, John D., Yang, Carl, Zalewski, Marcin, Mattson, Timothy, Kepner, Jeremy, Aaltonen, Peter, Bader, David, Buluc, Aydın, Franchetti, Franz, Gilbert, John, Hutchison, Dylan, Kumar, Manoj, Lumsdaine, Andrew, Meyerhenke, Henning, McMillan, Scott, Moreira, Jose, Owens, John D., Yang, Carl, Zalewski, Marcin, and Mattson, Timothy
Abstract: The GraphBLAS standard (GraphBlas.org) is being developed to bring the potential of matrix based graph algorithms to the broadest possible audience. Mathematically the Graph- BLAS defines a core set of matrix-based graph operations that can be used to implement a wide class of graph algorithms in a wide range of programming environments. This paper provides an introduction to the mathematics of the GraphBLAS. Graphs represent connections between vertices with edges. Matrices can represent a wide range of graphs using adjacency matrices or incidence matrices. Adjacency matrices are often easier to analyze while incidence matrices are often better for representing data. Fortunately, the two are easily connected by matrix mul- tiplication. A key feature of matrix mathematics is that a very small number of matrix operations can be used to manipulate a very wide range of graphs. This composability of small number of operations is the foundation of the GraphBLAS. A standard such as the GraphBLAS can only be effective if it has low performance overhead. Performance measurements of prototype GraphBLAS implementations indicate that the overhead is low., Comment: 9 pages; 11 figures; accepted to IEEE High Performance Extreme Computing (HPEC) conference 2016
Published: 2016
Full Text: View/download PDF

260. Mathematical foundations of the GraphBLAS

Author: Kepner, Jeremy, primary, Meyerhenke, Henning, additional, McMillan, Scott, additional, Yang, Carl, additional, Owens, John D., additional, Zalewski, Marcin, additional, Mattson, Timothy, additional, Moreira, Jose, additional, Aaltonen, Peter, additional, Bader, David, additional, Buluc, Aydin, additional, Franchetti, Franz, additional, Gilbert, John, additional, Hutchison, Dylan, additional, Kumar, Manoj, additional, and Lumsdaine, Andrew, additional
Published: 2016
Full Text: View/download PDF

261. A Comparative Study on Exact Triangle Counting Algorithms on the GPU

Author: Wang, Leyuan, primary, Wang, Yangzihao, additional, Yang, Carl, additional, and Owens, John D., additional
Published: 2016
Full Text: View/download PDF

262. Performance Characterization of High-Level Programming Models for GPU Graph Analytics

Author: Wu, Yuduo, primary, Wang, Yangzihao, additional, Pan, Yuechao, additional, Yang, Carl, additional, and Owens, John D., additional
Published: 2015
Full Text: View/download PDF

263. Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU

Author: Yang, Carl, primary, Wang, Yangzihao, additional, and Owens, John D., additional
Published: 2015
Full Text: View/download PDF

264. Optimal Variable Speed Limit Control System for Freeway Work Zone Operations.

Author: Xianfeng Yang, Lu, Yang (Carl), and Yongjie Lin
Subjects: *VARIABLE speed limits, *ROAD work zones, *TRAFFIC flow, *KALMAN filtering, *SIMULATION methods & models, *MATHEMATICAL models
Abstract: Improving operational safety and efficiency of variable speed limit (VSL) systems are the two core control objectives at work zone areas. In response to such need, this study presents a proactive VSL control model for freeway work zone operations. The proposed model uses an embedded macroscopic traffic flow model to predict the traffic state evolutions over the projected time horizon and to determine the optimal speed limits. In addition, Kalman filter is adopted to correct the prediction inaccuracy in a timely manner. To improve the safety of operations, this study proposes a new control objective function to smooth speed transition along the target freeway sketch by minimizing the difference between actual speeds and ideal speeds. Also, the smoothness of speed transition can help prevent the formation of shockwave and consequently enhance system's operational efficiency. The authors' numerical experiment with a calibrated Verkehr In Städten-SIMulationsmodell (VISSIM) simulator reveals that the proposed VSL system can significantly reduce the speed variance among different freeway subsegments. The evaluation of several measures of effectiveness (MOEs) also shows the promising results of the VSL system on the improvement of freeway operational efficiency. [ABSTRACT FROM AUTHOR]
Published: 2017
Full Text: View/download PDF

265. Algorithm for Detector-Error Screening on Basis of Temporal and Spatial Information

Author: Lu, Yang (Carl), primary, Yang, Xianfeng, additional, and Chang, Gang-Len, additional
Published: 2014
Full Text: View/download PDF

266. Dynamic Signal Priority Control Strategy to Mitigate Off-Ramp Queue Spillback to Freeway Mainline Segment

Author: Yang, Xianfeng, primary, Lu, Yang (Carl), additional, and Chang, Gang-Len, additional
Published: 2014
Full Text: View/download PDF

267. Design and Evaluation of an Intelligent Dilemma-Zone Protection System for a High-Speed Rural Intersection

Author: Chang, Gang-Len, primary, Franz, Mark L., additional, Liu, Yue, additional, Lu, Yang (Carl), additional, and Tao, Ruihua, additional
Published: 2013
Full Text: View/download PDF

268. Exploratory analysis of an optimal variable speed control system for a recurrently congested freeway bottleneck.

Author: Yang, Xianfeng, Lu, Yang (Carl), and Chang, Gang-Len
Subjects: *AUTOMATIC control of variable speed drives, *VARIABLE speed limits, *TRAFFIC flow, *TRAFFIC monitoring, *MATHEMATICAL models of traffic congestion, *TRAFFIC estimation
Abstract: This study presents two models for proactive variable speed limit (VSL) control on a recurrently congested freeway segment. The proposed model uses embedded traffic flow relations to predict the evolution of congestion patterns over the projected time horizon, and then computes the time-varying optimal speed limit to smooth traffic flows. To contend with the uncertainties associated with drivers' responses to VSL control, this study has also proposed an advanced model that further adopts Kalman Filter to enhance the traffic state estimation. Both models have been investigated with two control objectives-travel time minimization and speed variance minimization. Our extensive simulation analysis with a VISSIM simulator, calibrated with field data from our previous VSL field demonstration, has revealed the benefits of the proposed VSL control models. Also, the experimental results indicated that the proposed advanced models with both control objectives can significantly reduce the travel time over the recurrent bottleneck locations. With respect to several selected measure of effectiveness (MOEs), such as average number of stops and average travel time, the research results confirm that the control models with the objective of minimizing speed variance can offer the promising properties for field implementation. Copyright © 2014 John Wiley & Sons, Ltd. [ABSTRACT FROM AUTHOR]
Published: 2015
Full Text: View/download PDF

269. The Effect Of The Zeros Of Transfer Function On System Response: Application To Interferometer Speed Control

Author: Bavarian, Behnam, primary, Yang, Carl K., additional, and Auth, Gerald, additional
Published: 1987
Full Text: View/download PDF

270. LENS: label sparsity-tolerant adversarial learning on spatial deceptive reviews.

Author: Prabakar, Sirish, Chen, Haiquan, Jiang, Zhe, Yang, Carl, Yu, Weikuan, and Yan, Da
Subjects: *GENERATIVE adversarial networks, *REINFORCEMENT learning, *KNOWLEDGE base, *WEBSITES
Abstract: Online businesses and websites have recently become the main target of fake reviews, where fake reviews are intentionally composed to manipulate the business ratings positively or negatively. Most of existing works to detect fake reviews are supervised methods, whose performance highly depends on the amount, quality, and variety of the labeled data, which are often non-trivial to obtain in practice. In this paper, we propose a semi-supervised label sparsity-tolerant framework, LENS, for fake review detection by mining spatial knowledge and learning distributions of embedded topics. LENS builds on two key observations. (1) Spatial knowledge revealed in spatial entities and their co-occurring latent topic distributions may indicate the review authenticity. (2) Distributions of the embedded topics (the contextual distribution) may exhibit important patterns to differentiate between real and fake reviews. Specifically, LENS first extracts embeddings for spatial named entities using a knowledge base trained from Wikipedia webpages. Second, LENS represents each input token as a distribution over the learned latent topics in the embedded topic space. To bypass the differentiation difficulty, LENS builds on two discriminators in the actor-critic architecture using reinforcement learning. Extensive experiments using the real-world spatial and non-spatial datasets show that LENS consistently outperformed the state-of-the-art semi-supervised fake review detection methods on few labels at all different labeling rates for real and fake reviews, respectively, in a label-starving setting. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

271. Characterization of Multiple-Antimicrobial-Resistant Salmonella Serovars Isolated from Retail Meats.

Author: Sheng Chen, Shaohua Zhao, White, David G., Schroeder, Carl M., Ran Lu, Hanchun Yang, Carl M., McDermott, Patrick F., Ayers, Sherry, and Jianghong Meng
Subjects: *SALMONELLA, *MEAT, *GENES, *ANTIBIOTICS
Abstract: A total of 133 Salmonella isolates recovered from retail meats purchased in the United States and the People's Republic of China were assayed for antimicrobial susceptibility, the presence of integrons and antimicrobial resistance genes, and horizontal transfer of characterized antimicrobial resistance determinants via conjugation. Seventy-three (82%) of these Salmonella isolates were resistant to at least one antimicrobial agent. Resistance to the following antibiotics was common among the United States isolates: tetracycline (68% of the isolates were resistant), streptomycin (61%), sulfamethoxazole (42%), and ampicillin (29%). Eight Salmonella isolates (6%) were resistant to ceftriaxone. Fourteen isolates (11%) from the People's Republic of China were resistant to nalidixic acid and displayed decreased susceptibility to ciprofloxacin. A total of 19 different antimicrobial resistance genes were identified in 30 multidrug-resistant Salmonella isolates. The bla[sub CMY-2] gene, encoding a class A AmpC β-lactamase, was detected in all 10 Salmonella isolates resistant to extended-spectrum β-lactams. Resistance to ampicillin was most often associated with a TEM-1 family β-lactamase gene. Six aminoglycoside resistance genes, aadA1, aadA2, aacC2, Kn, aph(3)-Ila, and aac(3)-IVa, were commonly present in the Salmonella isolates. Sixteen (54%) of 30 Salmonella isolates tested had integrons ranging in size from 0.75 to 2.7 kb. Conjugation studies demonstrated that there was plasmid-mediated transfer of genes encoding CMY-2 and TEM-1-like β-lactamases. These data indicate that Salmonella isolates recovered from retail raw meats are commonly resistant to multiple antimicrobials, including those used for treating salmonellosis, such as ceftriaxone. Genes conferring antimicrobial resistance in Salmonella are often carried on integrons and plasmids and could be transmitted through conjugation. These mobile DNA elements have likely played an important role in transmission and dissemination of antimicrobial resistance determinants among Salmonella strains. [ABSTRACT FROM AUTHOR]
Published: 2004
Full Text: View/download PDF

272. A simple but tough-to-beat baseline for fMRI time-series classification.

Author: Popov P, Mahmood U, Fu Z, Yang C, Calhoun V, and Plis S
Abstract: Current neuroimaging studies frequently use complex machine learning models to classify human fMRI data, distinguishing healthy and disordered brains, often to validate new methods or enhance prediction accuracy. Yet, where prediction accuracy is a concern, our results suggest that precision in prediction does not always require such sophistication. When a classifier as simple as logistic regression is applied to feature-engineered fMRI data, it can match or even outperform more sophisticated recent models. Classification of the raw time series fMRI data generally benefits from complex parameter-rich models. However, this complexity often pushes them into the class of black-box models. Yet, we found that a relatively simple model can consistently outperform much more complex classifiers in both accuracy and speed. This model applies the same multi-layer perceptron repeatedly across time and averages the results. Thus, the complexity and black-box nature of the parameter rich models, often perceived as a necessary trade-off for higher performance, do not invariably yield superior results on fMRI. Given the success of straightforward approaches, we challenge the merit of research that concentrates solely on complex model development driven by classification. Instead, we advocate for increased focus on designing models that prioritize the explainability of fMRI data or pursue applicable objectives beyond mere classification accuracy, unless they significantly outperform logistic regression or our proposed model. To validate our claim, we explore possible reasons for the superior performance of our straightforward model by examining the innate characteristics of fMRI time series data. Our findings suggest that the sequential information hidden in the temporal order may be far less important for the accurate fMRI classification than the stand-alone pieces of information scattered across the frames of the time series., Competing Interests: Declaration of competing interest The authors declare no conflicts of interest or competing interests., (Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.)
Published: 2024
Full Text: View/download PDF

273. Interpretable Spatio-Temporal Embedding for Brain Structural-Effective Network with Ordinary Differential Equation.

Author: Tang H, Liu G, Dai S, Ye K, Zhao K, Wang W, Yang C, He L, Leow A, Thompson P, Huang H, and Zhan L
Abstract: The MRI-derived brain network serves as a pivotal instrument in elucidating both the structural and functional aspects of the brain, encompassing the ramifications of diseases and developmental processes. However, prevailing methodologies, often focusing on synchronous BOLD signals from functional MRI (fMRI), may not capture directional influences among brain regions and rarely tackle temporal functional dynamics. In this study, we first construct the brain-effective network via the dynamic causal model. Subsequently, we introduce an interpretable graph learning framework termed Spatio-Temporal Embedding ODE (STE-ODE). This framework incorporates specifically designed directed node embedding layers, aiming at capturing the dynamic inter-play between structural and effective networks via an ordinary differential equation (ODE) model, which characterizes spatial-temporal brain dynamics. Our framework is validated on several clinical phenotype prediction tasks using two independent publicly available datasets (HCP and OASIS). The experimental results clearly demonstrate the advantages of our model compared to several state-of-the-art methods.
Published: 2024
Full Text: View/download PDF

274. Empowering Graph Neural Network-Based Computational Drug Repositioning with Large Language Model-Inferred Knowledge Representation.

Author: Gu Y, Xu Z, and Yang C
Abstract: Computational drug repositioning, through predicting drug-disease associations (DDA), offers significant potential for discovering new drug indications. Current methods incorporate graph neural networks (GNN) on drug-disease heterogeneous networks to predict DDAs, achieving notable performances compared to traditional machine learning and matrix factorization approaches. However, these methods depend heavily on network topology, hampered by incomplete and noisy network data, and overlook the wealth of biomedical knowledge available. Correspondingly, large language models (LLMs) excel in graph search and relational reasoning, which can possibly enhance the integration of comprehensive biomedical knowledge into drug and disease profiles. In this study, we first investigate the contribution of LLM-inferred knowledge representation in drug repositioning and DDA prediction. A zero-shot prompting template was designed for LLM to extract high-quality knowledge descriptions for drug and disease entities, followed by embedding generation from language models to transform the discrete text to continual numerical representation. Then, we proposed LLM-DDA with three different model architectures (LLM-DDA Node Feat , LLM-DDA Dual GNN , LLM-DDA GNN-AE ) to investigate the best fusion mode for LLM-based embeddings. Extensive experiments on four DDA benchmarks show that, LLM-DDA GNN-AE achieved the optimal performance compared to 11 baselines with the overall relative improvement in AUPR of 23.22%, F1-Score of 17.20%, and precision of 25.35%. Meanwhile, selected case studies of involving Prednisone and Allergic Rhinitis highlighted the model's capability to identify reliable DDAs and knowledge descriptions, supported by existing literature. This study showcases the utility of LLMs in drug repositioning with its generality and applicability in other biomedical relation prediction tasks., (© 2024. International Association of Scientists in the Interdisciplinary Areas.)
Published: 2024
Full Text: View/download PDF

275. FedBrain: Federated Training of Graph Neural Networks for Connectome-based Brain Imaging Analysis.

Author: Yang Y, Xie H, Cui H, and Yang C
Subjects: Humans, Computational Biology, Brain diagnostic imaging, Neural Networks, Computer, Neuroimaging, Connectome
Abstract: Recent advancements in neuroimaging techniques have sparked a growing interest in understanding the complex interactions between anatomical regions of interest (ROIs), forming into brain networks that play a crucial role in various clinical tasks, such as neural pattern discovery and disorder diagnosis. In recent years, graph neural networks (GNNs) have emerged as powerful tools for analyzing network data. However, due to the complexity of data acquisition and regulatory restrictions, brain network studies remain limited in scale and are often confined to local institutions. These limitations greatly challenge GNN models to capture useful neural circuitry patterns and deliver robust downstream performance. As a distributed machine learning paradigm, federated learning (FL) provides a promising solution in addressing resource limitation and privacy concerns, by enabling collaborative learning across local institutions (i.e., clients) without data sharing. While the data heterogeneity issues have been extensively studied in recent FL literature, cross-institutional brain network analysis presents unique data heterogeneity challenges, that is, the inconsistent ROI parcellation systems and varying predictive neural circuitry patterns across local neuroimaging studies. To this end, we propose FedBrain, a GNN-based personalized FL framework that takes into account the unique properties of brain network data. Specifically, we present a federated atlas mapping mechanism to overcome the feature and structure heterogeneity of brain networks arising from different ROI atlas systems, and a clustering approach guided by clinical prior knowledge to address varying predictive neural circuitry patterns regarding different patient groups, neuroimaging modalities and clinical outcomes. Compared to existing FL strategies, our approach demonstrates superior and more consistent performance, showcasing its strong potential and generalizability in cross-institutional connectome-based brain imaging analysis. The implementation is available here.
Published: 2024

276. BrainSTEAM: A Practical Pipeline for Connectome-based fMRI Analysis towards Subject Classification.

Author: Li A, Yang Y, Cui H, and Yang C
Subjects: Humans, Computational Biology, Neuroimaging, Brain diagnostic imaging, Magnetic Resonance Imaging, Connectome
Abstract: Functional brain networks represent dynamic and complex interactions among anatomical regions of interest (ROIs), providing crucial clinical insights for neural pattern discovery and disorder diagnosis. In recent years, graph neural networks (GNNs) have proven immense success and effectiveness in analyzing structured network data. However, due to the high complexity of data acquisition, resulting in limited training resources of neuroimaging data, GNNs, like all deep learning models, suffer from overfitting. Moreover, their capability to capture useful neural patterns for downstream prediction is also adversely affected. To address such challenge, this study proposes BrainSTEAM, an integrated framework featuring a spatio-temporal module that consists of an EdgeConv GNN model, an autoencoder network, and a Mixup strategy. In particular, the spatio-temporal module aims to dynamically segment the time series signals of the ROI features for each subject into chunked sequences. We leverage each sequence to construct correlation networks, thereby increasing the training data. Additionally, we employ the EdgeConv GNN to capture ROI connectivity structures, an autoencoder for data denoising, and mixup for enhancing model training through linear data augmentation. We evaluate our framework on two real-world neuroimaging datasets, ABIDE for Autism prediction and HCP for gender prediction. Extensive experiments demonstrate the superiority and robustness of BrainSTEAM when compared to a variety of existing models, showcasing the strong potential of our proposed mechanisms in generalizing to other studies for connectome-based fMRI analysis.
Published: 2024

277. Better with Less: A Data-Active Perspective on Pre-Training Graph Neural Networks.

Author: Xu J, Huang R, Jiang X, Cao Y, Yang C, Wang C, and Yang Y
Abstract: Pre-training on graph neural networks (GNNs) aims to learn transferable knowledge for downstream tasks with unlabeled data, and it has recently become an active research area. The success of graph pre-training models is often attributed to the massive amount of input data. In this paper, however, we identify the curse of big data phenomenon in graph pre-training: more training data do not necessarily lead to better downstream performance. Motivated by this observation, we propose a better-with-less framework for graph pre-training: fewer, but carefully chosen data are fed into a GNN model to enhance pre-training. The proposed pre-training pipeline is called the data-active graph pre-training (APT) framework, and is composed of a graph selector and a pre-training model. The graph selector chooses the most representative and instructive data points based on the inherent properties of graphs as well as predictive uncertainty . The proposed predictive uncertainty, as feedback from the pre-training model, measures the confidence level of the model in the data. When fed with the chosen data, on the other hand, the pre-training model grasps an initial understanding of the new, unseen data, and at the same time attempts to remember the knowledge learned from previous data. Therefore, the integration and interaction between these two components form a unified framework (APT), in which graph pre-training is performed in a progressive and iterative way. Experiment results show that the proposed APT is able to obtain an efficient pre-training model with fewer training data and better downstream performance.
Published: 2023

278. WalkLM: A Uniform Language Model Fine-tuning Framework for Attributed Graph Embedding.

Author: Tan Y, Zhou Z, Lv H, Liu W, and Yang C
Abstract: Graphs are widely used to model interconnected entities and improve downstream predictions in various real-world applications. However, real-world graphs nowadays are often associated with complex attributes on multiple types of nodes and even links that are hard to model uniformly, while the widely used graph neural networks (GNNs) often require sufficient training toward specific downstream predictions to achieve strong performance. In this work, we take a fundamentally different approach than GNNs, to simultaneously achieve deep joint modeling of complex attributes and flexible structures of real-world graphs and obtain unsupervised generic graph representations that are not limited to specific downstream predictions. Our framework, built on a natural integration of language models (LMs) and random walks (RWs), is straightforward, powerful and data-efficient. Specifically, we first perform attributed RWs on the graph and design an automated program to compose roughly meaningful textual sequences directly from the attributed RWs; then we fine-tune an LM using the RW-based textual sequences and extract embedding vectors from the LM, which encapsulates both attribute semantics and graph structures. In our experiments, we evaluate the learned node embeddings towards different downstream prediction tasks on multiple real-world attributed graph datasets and observe significant improvements over a comprehensive set of state-of-the-art unsupervised node embedding methods. We believe this work opens a door for more sophisticated technical designs and empirical evaluations toward the leverage of LMs for the modeling of real-world graphs.
Published: 2023

279. Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting.

Author: Cui H, Fang X, Zhang Z, Xu R, Kan X, Liu X, Yu Y, Li M, Song Y, and Yang C
Abstract: Images contain rich relational knowledge that can help machines understand the world. Existing methods on visual knowledge extraction often rely on the pre-defined format (e.g., sub-verb-obj tuples) or vocabulary (e.g., relation types), restricting the expressiveness of the extracted knowledge. In this work, we take a first exploration to a new paradigm of open visual knowledge extraction. To achieve this, we present OpenVik which consists of an open relational region detector to detect regions potentially containing relational knowledge and a visual knowledge generator that generates format-free knowledge by prompting the large multimodality model with the detected region of interest. We also explore two data enhancement techniques for diversifying the generated format-free visual knowledge. Extensive knowledge quality evaluations highlight the correctness and uniqueness of the extracted open visual knowledge by OpenVik. Moreover, integrating our extracted knowledge across various visual reasoning applications shows consistent improvements, indicating the real-world applicability of OpenVik.
Published: 2023

280. Enhancing Personalized Healthcare via Capturing Disease Severity, Interaction, and Progression.

Author: Tan Y, Zhou Z, Yu L, Liu W, Chen C, Ma G, Hu X, Hertzberg VS, and Yang C
Abstract: Personalized diagnosis prediction based on electronic health records (EHR) of patients is a promising yet challenging task for AI in healthcare. Existing studies typically ignore the heterogeneity of diseases across different patients. For example, diabetes can have different complications across different patients (e.g., hyperlipidemia and circulatory disorder), which requires personalized diagnoses and treatments. Specifically, existing models fail to consider 1) varying severity of the same diseases for different patients, 2) complex interactions among syndromic diseases, and 3) dynamic progression of chronic diseases. In this work, we propose to perform personalized diagnosis prediction based on EHR data via capturing disease severity, interaction, and progression. In particular, we enable personalized disease representations via severity-driven embeddings at the disease level. Then, at the visit level, we propose to capture higher-order interactions among diseases that can collectively affect patients' health status via hypergraph-based aggregation; at the patient level, we devise a personalized generative model based on neural ordinary differential equations to capture the continuous-time disease progressions underlying discrete and incomplete visits. Extensive experiments on two real-world EHR datasets show significant performance gains brought by our approach, yielding average improvements of 10.70% for diagnosis prediction over state-of-the-art competitors.
Published: 2023
Full Text: View/download PDF

281. R-Mixup: Riemannian Mixup for Biological Networks.

Author: Kan X, Li Z, Cui H, Yu Y, Xu R, Yu S, Zhang Z, Guo Y, and Yang C
Abstract: Biological networks are commonly used in biomedical and healthcare domains to effectively model the structure of complex biological systems with interactions linking biological entities. However, due to their characteristics of high dimensionality and low sample size, directly applying deep learning models on biological networks usually faces severe overfitting. In this work, we propose R-Mixup, a Mixup-based data augmentation technique that suits the symmetric positive definite (SPD) property of adjacency matrices from biological networks with optimized training efficiency. The interpolation process in R-Mixup leverages the log-Euclidean distance metrics from the Riemannian manifold, effectively addressing the swelling effect and arbitrarily incorrect label issues of vanilla Mixup. We demonstrate the effectiveness of R-Mixup with five real-world biological network datasets on both regression and classification tasks. Besides, we derive a commonly ignored necessary condition for identifying the SPD matrices of biological networks and empirically study its influence on the model performance. The code implementation can be found in Appendix E.
Published: 2023
Full Text: View/download PDF

282. Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications.

Author: Xie H, Ioannidis VN, Yang C, Zheng D, Song X, Xu Y, Ma J, Ping Q, Zeng B, Zhang H, Wang S, and Chilimbi T
Abstract: Model pre-training on large text corpora has been demonstrated effective for various downstream applications in the NLP domain. In the graph mining domain, a similar analogy can be drawn for pre-training graph models on large graphs in the hope of benefiting downstream graph applications, which has also been explored by several recent studies. However, no existing study has ever investigated the pre-training of text plus graph models on large heterogeneous graphs with abundant textual information (a.k.a. large graph corpora) and then fine-tuning the model on different related downstream applications with different graph schemas. To address this problem, we propose a framework of graph-aware language model pre-training (GaLM) on a large graph corpus, which incorporates large language models and graph neural networks, and a variety of fine-tuning methods on downstream applications. We conduct extensive experiments on Amazon's real internal datasets and large public datasets. Comprehensive empirical results and in-depth analysis demonstrate the effectiveness of our proposed methods along with lessons learned.
Published: 2023
Full Text: View/download PDF

283. When to Pre-Train Graph Neural Networks? From Data Generation Perspective!

Author: Cao Y, Xu J, Yang C, Wang J, Zhang Y, Wang C, Chen L, and Yang Y
Abstract: In recent years, graph pre-training has gained significant attention, focusing on acquiring transferable knowledge from unlabeled graph data to improve downstream performance. Despite these recent endeavors, the problem of negative transfer remains a major concern when utilizing graph pre-trained models to downstream tasks. Previous studies made great efforts on the issue of what to pre-train and how to pre-train by designing a variety of graph pre-training and fine-tuning strategies. However, there are cases where even the most advanced "pre-train and fine-tune" paradigms fail to yield distinct benefits. This paper introduces a generic framework W2PGNN to answer the crucial question of when to pre-train ( i.e ., in what situations could we take advantage of graph pre-training) before performing effortful pre-training or fine-tuning. We start from a new perspective to explore the complex generative mechanisms from the pre-training data to downstream data. In particular, W2PGNN first fits the pre-training data into graphon bases, each element of graphon basis ( i.e ., a graphon) identifies a fundamental transferable pattern shared by a collection of pre-training graphs. All convex combinations of graphon bases give rise to a generator space, from which graphs generated form the solution space for those downstream data that can benefit from pre-training. In this manner, the feasibility of pre-training can be quantified as the generation probability of the downstream data from any generator in the generator space. W2PGNN offers three broad applications: providing the application scope of graph pre-trained models, quantifying the feasibility of pre-training, and assistance in selecting pre-training data to enhance downstream performance. We provide a theoretically sound solution for the first application and extensive empirical justifications for the latter two applications.
Published: 2023
Full Text: View/download PDF

284. Motif-guided Heterogeneous Graph Deep Generation.

Author: Ling C, Yang C, and Zhao L
Abstract: The complex systems in the real-world are commonly associated with multiple types of objects and relations, and heterogeneous graphs are ubiquitous data structures that can inherently represent multi-modal interactions between objects. Generating high-quality heterogeneous graphs allows us to understand the implicit distribution of heterogeneous graphs and provides benchmarks for downstream heterogeneous representation learning tasks. Existing works are limited to either merely generating the graph topology with neglecting local semantic information or only generating the graph without preserving the higher-order structural information and the global heterogeneous distribution in generated graphs. To this end, we formulate a general, end-to-end framework - HGEN for generating novel heterogeneous graphs with a newly proposed heterogeneous walk generator. On top of HGEN, we further develop a network motif generator to better characterize the higher-order structural distribution. A novel heterogeneous graph assembler is further developed to adaptively assemble novel heterogeneous graphs from the generated heterogeneous walks and motifs in a stratified manner. The extended model is proven to preserve the local semantic and heterogeneous global distribution of observed graphs with the theoretical guarantee. Lastly, comprehensive experiments on both synthetic and real-world practical datasets demonstrate the power and efficiency of the proposed method.
Published: 2023
Full Text: View/download PDF

285. Weakly-Supervised Scientific Document Classification via Retrieval-Augmented Multi-Stage Training.

Author: Xu R, Yu Y, Ho J, and Yang C
Abstract: Scientific document classification is a critical task for a wide range of applications, but the cost of collecting human-labeled data can be prohibitive. We study scientific document classification using label names only. In scientific domains, label names often include domain-specific concepts that may not appear in the document corpus, making it difficult to match labels and documents precisely. To tackle this issue, we propose WanDeR, which leverages dense retrieval to perform matching in the embedding space to capture the semantics of label names. We further design the label name expansion module to enrich its representations. Lastly, a self-training step is used to refine the predictions. The experiments on three datasets show that WanDeR outperforms the best baseline by 11.9%. Our code will be published at https://github.com/ritaranx/wander.
Published: 2023
Full Text: View/download PDF

286. HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting.

Author: Lu J, Shen J, Xiong B, Ma W, Staab S, and Yang C
Abstract: Medical decision-making processes can be enhanced by comprehensive biomedical knowledge bases, which require fusing knowledge graphs constructed from different sources via a uniform index system. The index system often organizes biomedical terms in a hierarchy to provide the aligned entities with fine-grained granularity. To address the challenge of scarce supervision in the biomedical knowledge fusion (BKF) task, researchers have proposed various unsupervised methods. However, these methods heavily rely on ad-hoc lexical and structural matching algorithms, which fail to capture the rich semantics conveyed by biomedical entities and terms. Recently, neural embedding models have proved effective in semantic-rich tasks, but they rely on sufficient labeled data to be adequately trained. To bridge the gap between the scarce-labeled BKF and neural embedding models, we propose HiPrompt, a supervision-efficient knowledge fusion framework that elicits the few-shot reasoning ability of large language models through hierarchy-oriented prompts. Empirical results on the collected KG-Hi-BKF benchmark datasets demonstrate the effectiveness of HiPrompt.
Published: 2023
Full Text: View/download PDF

287. Neighborhood-Regularized Self-Training for Learning with Few Labels.

Author: Xu R, Yu Y, Cui H, Kan X, Zhu Y, Ho J, Zhang C, and Yang C
Abstract: Training deep neural networks (DNNs) with limited supervision has been a popular research topic as it can significantly alleviate the annotation burden. Self-training has been successfully applied in semi-supervised learning tasks, but one drawback of self-training is that it is vulnerable to the label noise from incorrect pseudo labels. Inspired by the fact that samples with similar labels tend to share similar representations, we develop a neighborhood-based sample selection approach to tackle the issue of noisy pseudo labels. We further stabilize self-training via aggregating the predictions from different rounds during sample selection. Experiments on eight tasks show that our proposed method outperforms the strongest self-training baseline with 1.83% and 2.51% performance gain for text and graph datasets on average. Our further analysis demonstrates that our proposed data selection strategy reduces the noise of pseudo labels by 36.8% and saves 57.3% of the time when compared with the best baseline. Our code and appendices will be uploaded to https://github.com/ritaranx/NeST.
Published: 2023
Full Text: View/download PDF

288. Hypergraph Transformers for EHR-based Clinical Predictions.

Author: Xu R, Ali MK, Ho JC, and Yang C
Abstract: Electronic health records (EHR) data contain rich information about patients' health conditions including diagnosis, procedures, medications and etc., which have been widely used to facilitate digital medicine. Despite its importance, it is often non-trivial to learn useful representations for patients' visits that support downstream clinical predictions, as each visit contains massive and diverse medical codes. As a result, the complex interactions among medical codes are often not captured, which leads to substandard predictions. To better model these complex relations, we leverage hypergraphs, which go beyond pairwise relations to jointly learn the representations for visits and medical codes. We also propose to use the self-attention mechanism to automatically identify the most relevant medical codes for each visit based on the downstream clinical predictions with better generalization power. Experiments on two EHR datasets show that our proposed method not only yields superior performance, but also provides reasonable insights towards the target tasks., (©2023 AMIA - All rights reserved.)
Published: 2023

289. Graph Neural Network Modeling of Web Search Activity for Real-time Pandemic Forecasting.

Author: Lin C, Zhou J, Zhang J, Yang C, and Agichtein E
Abstract: The utilization of web search activity for pandemic forecasting has significant implications for managing disease spread and informing policy decisions. However, web search records tend to be noisy and influenced by geographical location, making it difficult to develop large-scale models. While regularized linear models have been effective in predicting the spread of respiratory illnesses like COVID-19, they are limited to specific locations. The lack of incorporation of neighboring areas' data and the inability to transfer models to new locations with limited data has impeded further progress. To address these limitations, this study proposes a novel self-supervised message-passing neural network (SMPNN) framework for modeling local and cross-location dynamics in pandemic forecasting. The SMPNN framework utilizes an MPNN module to learn cross-location dependencies through self-supervised learning and improve local predictions with graph-generated features. The framework is designed as an end-to-end solution and is compared with state-of-the-art statistical and deep learning models using COVID-19 data from England and the US. The results of the study demonstrate that the SMPNN model outperforms other models by achieving up to a 6.9% improvement in prediction accuracy and lower prediction errors during the early stages of disease outbreaks. This approach represents a significant advancement in disease surveillance and forecasting, providing a novel methodology, datasets, and insights that combine web search data and spatial information. The proposed SMPNN framework offers a promising avenue for modeling the spread of pandemics, leveraging both local and cross-location information, and has the potential to inform public health policy decisions.
Published: 2023
Full Text: View/download PDF

290. DEEP DAG LEARNING OF EFFECTIVE BRAIN CONNECTIVITY FOR FMRI ANALYSIS.

Author: Yu Y, Kan X, Cui H, Xu R, Zheng Y, Song X, Zhu Y, Zhang K, Nabi R, Guo Y, Zhang C, and Yang C
Abstract: Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROIs), which are noisy and can lead to inferior results for GNN models. To better adapt GNNs for fMRI analysis, we propose DABNet, a D eep D A G learning framework based on B rain Net works for fMRI analysis. DABNet adopts a brain network generator module, which harnesses the DAG learning approach to transform the raw time-series into effective brain connectivities. Experiments on two fMRI datasets demonstrate the efficacy of DABNet. The generated brain networks also highlight the prediction-related brain regions and thus provide interpretations for predictions.
Published: 2023
Full Text: View/download PDF

291. SRDA: Mobile Sensing based Fluid Overload Detection for End Stage Kidney Disease Patients using Sensor Relation Dual Autoencoder.

Author: Tang M, Gao J, Dong G, Yang C, Campbell B, Bowman B, Zoellner JM, Abdel-Rahman E, and Boukhechba M
Abstract: Chronic kidney disease (CKD) is a life-threatening and prevalent disease. CKD patients, especially endstage kidney disease (ESKD) patients on hemodialysis, suffer from kidney failures and are unable to remove excessive fluid, causing fluid overload and multiple morbidities including death. Current solutions for fluid overtake monitoring such as ultrasonography and biomarkers assessment are cumbersome, discontinuous, and can only be performed in the clinic. In this paper, we propose SRDA, a latent graph learning powered fluid overload detection system based on Sensor Relation Dual Autoencoder to detect excessive fluid consumption of EKSD patients based on passively collected bio-behavioral data from smartwatch sensors. Experiments using real-world mobile sensing data indicate that SRDA outperforms the state-of-the-art baselines in both F1 score and recall, and demonstrate the potential of ubiquitous sensing for ESKD fluid intake management.
Published: 2023

292. Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions on EHR.

Author: Xu R, Yu Y, Zhang C, Ali MK, Ho JC, and Yang C
Abstract: Electronic Health Record modeling is crucial for digital medicine. However, existing models ignore higher-order interactions among medical codes and their causal relations towards downstream clinical predictions. To address such limitations, we propose a novel framework CACHE, to provide effective and insightful clinical predictions based on hypergraph representation learning and counterfactual and factual reasoning techniques. Experiments on two real EHR datasets show the superior performance of CACHE. Case studies with a domain expert illustrate a preferred capability of CACHE in generating clinically meaningful interpretations towards the correct predictions.
Published: 2022

293. Joint Embedding of Structural and Functional Brain Networks with Graph Neural Networks for Mental Illness Diagnosis.

Author: Zhu Y, Cui H, He L, Sun L, and Yang C
Subjects: Brain diagnostic imaging, Humans, Learning, Mental Disorders diagnosis, Neural Networks, Computer
Abstract: Multimodal brain networks characterize complex connectivities among different brain regions from both structural and functional aspects and provide a new means for mental disease analysis. Recently, Graph Neural Networks (GNNs) have become a de facto model for analyzing graph-structured data. However, how to employ GNNs to extract effective representations from brain networks in multiple modalities remains rarely explored. Moreover, as brain networks provide no initial node features, how to design informative node attributes and leverage edge weights for GNNs to learn is left unsolved. To this end, we develop a novel multiview GNN for multimodal brain networks. In particular, we treat each modality as a view for brain networks and employ contrastive learning for multimodal fusion. Then, we propose a GNN model which takes advantage of the message passing scheme by propagating messages based on degree statistics and brain region connectivities. Extensive experiments on two real-world disease datasets (HIV and Bipolar) demonstrate the effectiveness of our proposed method over state-of-the-art baselines.
Published: 2022
Full Text: View/download PDF

294. FBNetGen: Task-aware GNN-based fMRI Analysis via Functional Brain Network Generation.

Author: Kan X, Cui H, Lukemire J, Guo Y, and Yang C
Abstract: Functional magnetic resonance imaging (fMRI) is one of the most common imaging modalities to investigate brain functions. Recent studies in neuroscience stress the great potential of functional brain networks constructed from fMRI data for clinical predictions. Traditional functional brain networks, however, are noisy and unaware of downstream prediction tasks, while also incompatible with the deep graph neural network (GNN) models. In order to fully unleash the power of GNNs in network-based fMRI analysis, we develop FBNETGEN, a task-aware and interpretable fMRI analysis framework via deep brain network generation. In particular, we formulate (1) prominent region of interest (ROI) features extraction, (2) brain networks generation, and (3) clinical predictions with GNNs, in an end-to-end trainable model under the guidance of particular prediction tasks. Along with the process, the key novel component is the graph generator which learns to transform raw time-series features into task-oriented brain networks. Our learnable graphs also provide unique interpretations by highlighting prediction-related brain regions. Comprehensive experiments on two datasets, i.e., the recently released and currently largest publicly available fMRI dataset Adolescent Brain Cognitive Development (ABCD), and the widely-used fMRI dataset PNC, prove the superior effectiveness and interpretability of FBNETGEN. The implementation is available at https://github.com/Wayfear/FBNETGEN.
Published: 2022

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

294 results on '"Yang, Carl"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources