13 results on '"Heil BJ"'
Search Results
2. Detection of ureteral obstruction on radionuclide bone scans
- Author
-
Pollen, JJ, primary, Gerber, K, additional, Heil, BJ, additional, and Talner, LB, additional
- Published
- 1983
- Full Text
- View/download PDF
3. MousiPLIER: A Mouse Pathway-Level Information Extractor Model.
- Author
-
Zhang S, Heil BJ, Mao W, Chikina M, Greene CS, and Heller EA
- Subjects
- Animals, Mice, Gene Expression Profiling, Aging physiology, Unsupervised Machine Learning, Transcriptome, Astrocytes metabolism, Microglia metabolism, Machine Learning, Male, Mice, Inbred C57BL, Brain metabolism
- Abstract
High-throughput gene expression profiling measures individual gene expression across conditions. However, genes are regulated in complex networks, not as individual entities, limiting the interpretability of gene expression data. Machine learning models that incorporate prior biological knowledge are a powerful tool to extract meaningful biology from gene expression data. Pathway-level information extractor (PLIER) is an unsupervised machine learning method that defines biological pathways by leveraging the vast amount of published transcriptomic data. PLIER converts gene expression data into known pathway gene sets, termed latent variables (LVs), to substantially reduce data dimensionality and improve interpretability. In the current study, we trained the first mouse PLIER model on 190,111 mouse brain RNA-sequencing samples, the greatest amount of training data ever used by PLIER. We then validated the mousiPLIER approach in a study of microglia and astrocyte gene expression across mouse brain aging. mousiPLIER identified biological pathways that are significantly associated with aging, including one latent variable (LV41) corresponding to striatal signal. To gain further insight into the genes contained in LV41, we performed k -means clustering on the training data to identify studies that respond strongly to LV41. We found that the variable was relevant to striatum and aging across the scientific literature. Finally, we built a Web server (http://mousiplier.greenelab.com/) for users to easily explore the learned latent variables. Taken together, this study defines mousiPLIER as a method to uncover meaningful biological processes in mouse brain transcriptomic studies., Competing Interests: The authors declare no competing financial interests., (Copyright © 2024 Zhang et al.)
- Published
- 2024
- Full Text
- View/download PDF
4. MousiPLIER: A Mouse Pathway-Level Information Extractor Model.
- Author
-
Zhang S, Heil BJ, Mao W, Chikina M, Greene CS, and Heller EA
- Abstract
High throughput gene expression profiling is a powerful approach to generate hypotheses on the underlying causes of biological function and disease. Yet this approach is limited by its ability to infer underlying biological pathways and burden of testing tens of thousands of individual genes. Machine learning models that incorporate prior biological knowledge are necessary to extract meaningful pathways and generate rational hypothesis from the vast amount of gene expression data generated to date. We adopted an unsupervised machine learning method, Pathway-level information extractor (PLIER), to train the first mouse PLIER model on 190,111 mouse brain RNA-sequencing samples, the greatest amount of training data ever used by PLIER. mousiPLER converted gene expression data into a latent variables that align to known pathway or cell maker gene sets, substantially reducing data dimensionality and improving interpretability. To determine the utility of mousiPLIER, we applied it to a mouse brain aging study of microglia and astrocyte transcriptomic profiling. We found a specific set of latent variables that are significantly associated with aging, including one latent variable (LV41) corresponding to striatal signal. We next performed k-means clustering on the training data to identify studies that respond strongly to LV41, finding that the variable is relevant to striatum and aging across the scientific literature. Finally, we built a web server (http://mousiplier.greenelab.com/) for users to easily explore the learned latent variables. Taken together this study provides proof of concept that mousiPLIER can uncover meaningful biological processes in mouse transcriptomic studies., Competing Interests: The authors declare no competing financial interests.
- Published
- 2023
- Full Text
- View/download PDF
5. The effect of non-linear signal in classification problems using gene expression.
- Author
-
Heil BJ, Crawford J, and Greene CS
- Subjects
- Gene Expression Profiling, Neural Networks, Computer, Linear Models, Gene Expression, Nonlinear Dynamics
- Abstract
Those building predictive models from transcriptomic data are faced with two conflicting perspectives. The first, based on the inherent high dimensionality of biological systems, supposes that complex non-linear models such as neural networks will better match complex biological systems. The second, imagining that complex systems will still be well predicted by simple dividing lines prefers linear models that are easier to interpret. We compare multi-layer neural networks and logistic regression across multiple prediction tasks on GTEx and Recount3 datasets and find evidence in favor of both possibilities. We verified the presence of non-linear signal when predicting tissue and metadata sex labels from expression data by removing the predictive linear signal with Limma, and showed the removal ablated the performance of linear methods but not non-linear ones. However, we also found that the presence of non-linear signal was not necessarily sufficient for neural networks to outperform logistic regression. Our results demonstrate that while multi-layer neural networks may be useful for making predictions from gene expression data, including a linear baseline model is critical because while biological systems are high-dimensional, effective dividing lines for predictive models may not be., Competing Interests: The authors declare that they have no conflict of interest., (Copyright: © 2023 Heil et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
6. Hetnet connectivity search provides rapid insights into how two biomedical entities are related.
- Author
-
Himmelstein DS, Zietz M, Rubinetti V, Kloster K, Heil BJ, Alquaddoomi F, Hu D, Nicholson DN, Hao Y, Sullivan BD, Nagle MW, and Greene CS
- Abstract
Hetnets, short for "heterogeneous networks", contain multiple node and relationship types and offer a way to encode biomedical knowledge. One such example, Hetionet connects 11 types of nodes - including genes, diseases, drugs, pathways, and anatomical structures - with over 2 million edges of 24 types. Previous work has demonstrated that supervised machine learning methods applied to such networks can identify drug repurposing opportunities. However, a training set of known relationships does not exist for many types of node pairs, even when it would be useful to examine how nodes of those types are meaningfully connected. For example, users may be curious not only how metformin is related to breast cancer, but also how the GJA1 gene might be involved in insomnia. We developed a new procedure, termed hetnet connectivity search, that proposes important paths between any two nodes without requiring a supervised gold standard. The algorithm behind connectivity search identifies types of paths that occur more frequently than would be expected by chance (based on node degree alone). We find that predictions are broadly similar to those from previously described supervised approaches for certain node type pairs. Scoring of individual paths is based on the most specific paths of a given type. Several optimizations were required to precompute significant instances of node connectivity at the scale of large knowledge graphs. We implemented the method on Hetionet and provide an online interface at https://het.io/search . We provide an open source implementation of these methods in our new Python package named hetmatpy .
- Published
- 2023
- Full Text
- View/download PDF
7. The Field-Dependent Nature of PageRank Values in Citation Networks.
- Author
-
Heil BJ and Greene CS
- Abstract
The value of scientific research can be easier to assess at the collective level than at the level of individual contributions. Several journal-level and article-level metrics aim to measure the importance of journals or individual manuscripts. However, many are citation-based and citation practices vary between fields. To account for these differences, scientists have devised normalization schemes to make metrics more comparable across fields. We use PageRank as an example metric and examine the extent to which field-specific citation norms drive estimated importance differences. In doing so, we recapitulate differences in journal and article PageRanks between fields. We also find that manuscripts shared between fields have different PageRanks depending on which field's citation network the metric is calculated in. We implement a degree-preserving graph shuffling algorithm to generate a null distribution of similar networks and find differences more likely attributed to field-specific preferences than citation norms. Our results suggest that while differences exist between fields' metric distributions, applying metrics in a field-aware manner rather than using normalized global metrics avoids losing important information about article preferences. They also imply that assigning a single importance value to a manuscript may not be a useful construct, as the importance of each manuscript varies by the reader's field.
- Published
- 2023
- Full Text
- View/download PDF
8. Hetnet connectivity search provides rapid insights into how biomedical entities are related.
- Author
-
Himmelstein DS, Zietz M, Rubinetti V, Kloster K, Heil BJ, Alquaddoomi F, Hu D, Nicholson DN, Hao Y, Sullivan BD, Nagle MW, and Greene CS
- Subjects
- Probability, Algorithms
- Abstract
Background: Hetnets, short for "heterogeneous networks," contain multiple node and relationship types and offer a way to encode biomedical knowledge. One such example, Hetionet, connects 11 types of nodes-including genes, diseases, drugs, pathways, and anatomical structures-with over 2 million edges of 24 types. Previous work has demonstrated that supervised machine learning methods applied to such networks can identify drug repurposing opportunities. However, a training set of known relationships does not exist for many types of node pairs, even when it would be useful to examine how nodes of those types are meaningfully connected. For example, users may be curious about not only how metformin is related to breast cancer but also how a given gene might be involved in insomnia., Findings: We developed a new procedure, termed hetnet connectivity search, that proposes important paths between any 2 nodes without requiring a supervised gold standard. The algorithm behind connectivity search identifies types of paths that occur more frequently than would be expected by chance (based on node degree alone). Several optimizations were required to precompute significant instances of node connectivity at the scale of large knowledge graphs., Conclusion: We implemented the method on Hetionet and provide an online interface at https://het.io/search. We provide an open-source implementation of these methods in our new Python package named hetmatpy., (© The Author(s) 2023. Published by Oxford University Press GigaScience.)
- Published
- 2022
- Full Text
- View/download PDF
9. Reproducibility standards for machine learning in the life sciences.
- Author
-
Heil BJ, Hoffman MM, Markowetz F, Lee SI, Greene CS, and Hicks SC
- Subjects
- Reproducibility of Results, Software, Computational Biology methods, Computational Biology standards, Machine Learning standards
- Published
- 2021
- Full Text
- View/download PDF
10. Protein classification using modified n-grams and skip-grams.
- Author
-
Islam SMA, Heil BJ, Kearney CM, and Baker EJ
- Subjects
- Models, Molecular, Proteins metabolism, Molecular Sequence Annotation methods, Natural Language Processing, Protein Conformation, Proteins classification, Sequence Analysis, Protein methods, Supervised Machine Learning
- Abstract
Motivation: Classification by supervised machine learning greatly facilitates the annotation of protein characteristics from their primary sequence. However, the feature generation step in this process requires detailed knowledge of attributes used to classify the proteins. Lack of this knowledge risks the selection of irrelevant features, resulting in a faulty model. In this study, we introduce a supervised protein classification method with a novel means of automating the work-intensive feature generation step via a Natural Language Processing (NLP)-dependent model, using a modified combination of n-grams and skip-grams (m-NGSG)., Results: A meta-comparison of cross-validation accuracy with twelve training datasets from nine different published studies demonstrates a consistent increase in accuracy of m-NGSG when compared to contemporary classification and feature generation models. We expect this model to accelerate the classification of proteins from primary sequence data and increase the accessibility of protein characteristic prediction to a broader range of scientists., Availability and Implementation: m-NGSG is freely available at Bitbucket: https://bitbucket.org/sm_islam/mngsg/src. A web server is available at watson.ecs.baylor.edu/ngsg., Contact: erich_baker@baylor.edu., Supplementary Information: Supplementary data are available at Bioinformatics online.
- Published
- 2018
- Full Text
- View/download PDF
11. Treatment of benign prostatic hyperplasia.
- Author
-
Heil BJ
- Subjects
- Algorithms, Decision Trees, Diagnosis, Differential, Humans, Male, Nurse Practitioners, Prostatic Hyperplasia physiopathology, Prostatic Hyperplasia psychology, Prostatic Hyperplasia diagnosis, Prostatic Hyperplasia therapy
- Published
- 1999
12. Detection of ureteral obstruction on radionuclide bone scans.
- Author
-
Pollen JJ, Gerber K, Heil BJ, and Talner LB
- Subjects
- Bone Neoplasms secondary, Evaluation Studies as Topic, False Negative Reactions, Humans, Male, Prostatic Neoplasms complications, Radionuclide Imaging, Retrospective Studies, Technetium Tc 99m Medronate, Ureteral Obstruction etiology, Urography, Bone Neoplasms diagnostic imaging, Diphosphonates, Technetium, Ureteral Obstruction diagnostic imaging
- Abstract
The kidneys are nearly always visible on a routine radionuclide bone scan. To assess the reliability of the bone scan in detecting ureteral obstruction, 220 bone scans and excretory urograms were compared in 53 patients followed serially for prostatic cancer. There were 15 kidneys obstructed on excretory urograms. Only nine were diagnosed as obstructed on the bone scans. Seven of the nine cases of unilaterally obstructed kidneys were detected, whereas only two of the six kidneys in three patients with bilateral obstruction were correctly diagnosed. The results indicate that unilateral obstruction is more likely to be detected, whereas bilateral obstruction is more likely to be missed, on bone scans. Therefore, the routine radionuclide bone scan is an unreliable test for ureteral obstruction.
- Published
- 1983
- Full Text
- View/download PDF
13. Idiopathic dilatation of the superior vena cava.
- Author
-
Heil BJ, Felman AH, Talbert JL, Hawkins IF, and Garnica A
- Subjects
- Child, Dilatation, Pathologic, Humans, Male, Radiography, Vascular Diseases diagnostic imaging, Vena Cava, Superior diagnostic imaging
- Published
- 1978
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.