151,319 results on '"genetic data"'
Search Results
2. Diabetes exerts a causal impact on the nervous system within the right hippocampus: substantiated by genetic data
- Author
-
Long, Qian, Huang, Piao, Kuang, Jian, Huang, Yu, and Guan, Haixia
- Published
- 2024
- Full Text
- View/download PDF
3. New genetic data unveil taxonomic complexity in the high-Andean sigmodontine Abrothrix andina (Rodentia, Cricetidae)
- Author
-
Tammone, Mauro N., Cuellar Soto, Erika, Voglino, Damián, and Pardiñas, Ulyses F. J.
- Published
- 2024
- Full Text
- View/download PDF
4. Matching clinical and genetic data in pediatric patients at risk of developing cystic kidney disease
- Author
-
Bracciamà, Valeria, Vaisitti, Tiziana, Mioli, Fiorenza, Faini, Angelo Corso, del Prever, Giulia Margherita Brach, Martins, Vitor Hugo, Camilla, Roberta, Mattozzi, Francesca, Pieretti, Silvia, Luca, Maria, Romeo, Carmelo Maria, Saglia, Claudia, Migliorero, Martina, Arruga, Francesca, Carli, Diana, Amoroso, Antonio, Lonardi, Pietro, Deaglio, Silvia, and Peruzzi, Licia
- Published
- 2024
- Full Text
- View/download PDF
5. GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console
- Author
-
Nath, Anindita, Mwesigwa, Savannah, Dai, Yulin, Jiang, Xiaoqian, and Zhao, Zhongming
- Subjects
Quantitative Biology - Quantitative Methods ,Computer Science - Artificial Intelligence - Abstract
Summary: The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge. Introducing GENEVIC, an AI-driven chat framework that tackles this challenge by bridging the gap between genetic data generation and biomedical knowledge discovery. Leveraging generative AI, notably ChatGPT, it serves as a biologist's 'copilot'. It automates the analysis, retrieval, and visualization of customized domain-specific genetic information, and integrates functionalities to generate protein interaction networks, enrich gene sets, and search scientific literature from PubMed, Google Scholar, and arXiv, making it a comprehensive tool for biomedical research. In its pilot phase, GENEVIC is assessed using a curated database that ranks genetic variants associated with Alzheimer's disease, schizophrenia, and cognition, based on their effect weights from the Polygenic Score Catalog, thus enabling researchers to prioritize genetic variants in complex diseases. GENEVIC's operation is user-friendly, accessible without any specialized training, secured by Azure OpenAI's HIPAA-compliant infrastructure, and evaluated for its efficacy through real-time query testing. As a prototype, GENEVIC is set to advance genetic research, enabling informed biomedical decisions. Availability and implementation: GENEVIC is publicly accessible at https://genevic-anath2024.streamlit.app. The underlying code is open-source and available via GitHub at https://github.com/anath2110/GENEVIC.git.
- Published
- 2024
6. MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading
- Author
-
Alwazzan, Omnia, Khan, Abbas, Patras, Ioannis, and Slabaugh, Gregory
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Brain tumors are an abnormal growth of cells in the brain. They can be classified into distinct grades based on their growth. Often grading is performed based on a histological image and is one of the most significant predictors of a patients prognosis, the higher the grade, the more aggressive the tumor. Correct diagnosis of a tumor grade remains challenging. Though histopathological grading has been shown to be prognostic, results are subject to interobserver variability, even among experienced pathologists. Recently, the World Health Organization reported that advances in molecular genetics have led to improvements in tumor classification. This paper seeks to integrate histological images and genetic data for improved computer-aided diagnosis. We propose a novel Multi-modal Outer Arithmetic Block (MOAB) based on arithmetic operations to combine latent representations of the different modalities for predicting the tumor grade (Grade \rom{2}, \rom{3} and \rom{4}). Extensive experiments evaluate the effectiveness of our approach. By applying MOAB to The Cancer Genome Atlas (TCGA) glioma dataset, we show that it can improve separation between similar classes (Grade \rom{2} and \rom{3}) and outperform prior state-of-the-art grade classification techniques.
- Published
- 2024
- Full Text
- View/download PDF
7. Evaluating the Causal Effect of Type 2 Diabetes on Alzheimer’s Disease Using Large-Scale Genetic Data
- Author
-
Liu, D., Baranova, A., and Zhang, Fuquan
- Published
- 2024
- Full Text
- View/download PDF
8. Comparative Analysis of Machine Learning Techniques for Imbalanced Genetic Data
- Author
-
Kaur, Arshmeet and Sarmadi, Morteza
- Published
- 2024
- Full Text
- View/download PDF
9. A three-stage approach to identify biomarker signatures for cancer genetic data with survival endpoints
- Author
-
Wu, Xue, Chen, Chixiang, Li, Zheng, Zhang, Lijun, Chinchilli, Vernon M., and Wang, Ming
- Published
- 2024
- Full Text
- View/download PDF
10. Correlated allele frequency changes reveal clonal structure and selection in temporal genetic data.
- Author
-
Li, Yunxiao and Barton, John
- Abstract
In evolving populations where the rate of beneficial mutations is large, subpopulations of individuals with competing beneficial mutations can be maintained over long times. Evolution with this kind of clonal structure is commonly observed in a wide range of microbial and viral populations. However, it can be difficult to completely resolve clonal dynamics in data. This is due to limited read lengths in high-throughput sequencing methods, which are often insufficient to directly measure linkage disequilibrium or determine clonal structure. Here, we develop a method to infer clonal structure using correlated allele frequency changes in time-series sequence data. Simulations show that our method recovers true, underlying clonal structures when they are known and accurately estimates linkage disequilibrium. This information can then be combined with other inference methods to improve estimates of the fitness effects of individual mutations. Applications to data suggest novel clonal structures in an E. coli long-term evolution experiment, and yield improved predictions of the effects of mutations on bacterial fitness and antibiotic resistance. Moreover, our method is computationally efficient, requiring orders of magnitude less run time for large data sets than existing methods. Overall, our method provides a powerful tool to infer clonal structures from data sets where only allele frequencies are available, which can also improve downstream analyses.
- Published
- 2024
11. Advancing forensic-based investigation incorporating slime mould search for gene selection of high-dimensional genetic data
- Author
-
Feng Qiu, Ali Asghar Heidari, Yi Chen, Huiling Chen, and Guoxi Liang
- Subjects
Forensic-based investigation ,High-dimensional genetic data ,Gene selection ,Slime mould algorithm ,Global optimization ,Medicine ,Science - Abstract
Abstract Modern medicine has produced large genetic datasets of high dimensions through advanced gene sequencing technology, and processing these data is of great significance for clinical decision-making. Gene selection (GS) is an important data preprocessing technique that aims to select a subset of feature information to improve performance and reduce data dimensionality. This study proposes an improved wrapper GS method based on forensic-based investigation (FBI). The method introduces the search mechanism of the slime mould algorithm in the FBI to improve the original FBI; the newly proposed algorithm is named SMA_FBI; then GS is performed by converting the continuous optimizer to a binary version of the optimizer through a transfer function. In order to verify the superiority of SMA_FBI, experiments are first executed on the 30-function test set of CEC2017 and compared with 10 original algorithms and 10 state-of-the-art algorithms. The experimental results show that SMA_FBI is better than other algorithms in terms of finding the optimal solution, convergence speed, and robustness. In addition, BSMA_FBI (binary version of SMA_FBI) is compared with 8 binary algorithms on 18 high-dimensional genetic data from the UCI repository. The results indicate that BSMA_FBI is able to obtain high classification accuracy with fewer features selected in GS applications. Therefore, SMA_FBI is considered an optimization tool with great potential for dealing with global optimization problems, and its binary version, BSMA_FBI, can be used for GS tasks.
- Published
- 2024
- Full Text
- View/download PDF
12. Bayesian Inference of Reproduction Number from Epidemiological and Genetic Data Using Particle MCMC
- Author
-
Gill, Alicia, Koskela, Jere, Didelot, Xavier, and Everitt, Richard G.
- Subjects
Statistics - Methodology ,Quantitative Biology - Genomics ,Quantitative Biology - Populations and Evolution ,Statistics - Applications ,Statistics - Computation ,62P10, 65C05, 92D10, 92D30 - Abstract
Inference of the reproduction number through time is of vital importance during an epidemic outbreak. Typically, epidemiologists tackle this using observed prevalence or incidence data. However, prevalence and incidence data alone is often noisy or partial. Models can also have identifiability issues with determining whether a large amount of a small epidemic or a small amount of a large epidemic has been observed. Sequencing data however is becoming more abundant, so approaches which can incorporate genetic data are an active area of research. We propose using particle MCMC methods to infer the time-varying reproduction number from a combination of prevalence data reported at a set of discrete times and a dated phylogeny reconstructed from sequences. We validate our approach on simulated epidemics with a variety of scenarios. We then apply the method to a real data set of HIV-1 in North Carolina, USA, between 1957 and 2019., Comment: 28 pages, 18 figures (44 pages, 35 figures including appendices)
- Published
- 2023
13. Leveraging large-scale genetic data to assess the causal impact of COVID-19 on multisystemic diseases
- Author
-
Xiangyang Zhang, Zhaohui Jiang, Jiayao Ma, Yaru Qi, Yin Li, Yan Zhang, Yihan Liu, Chaochao Wei, Yihong Chen, Ping Liu, Yinghui Peng, Jun Tan, Ying Han, Shan Zeng, Changjing Cai, and Hong Shen
- Subjects
Genome-Wide Association Study (GWAS) ,Mendelian randomization ,Cancer ,Long-term effect ,Coronavirus disease-2019 (COVID-19) ,Heart failure Alzheimer’s disease ,Computer engineering. Computer hardware ,TK7885-7895 ,Information technology ,T58.5-58.64 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Abstract Background The long-term impacts of COVID-19 on human health are a major concern, yet comprehensive evaluations of its effects on various health conditions are lacking. Methods This study aims to evaluate the role of various diseases in relation to COVID-19 by analyzing genetic data from a large-scale population over 2,000,000 individuals. A bidirectional two-sample Mendelian randomization approach was used, with exposures including COVID-19 susceptibility, hospitalization, and severity, and outcomes encompassing 86 different diseases or traits. A reverse Mendelian randomization analysis was performed to assess the impact of these diseases on COVID-19. Results Our analysis identified causal relationships between COVID-19 susceptibility and several conditions, including breast cancer (OR = 1.0073, 95% CI = 1.0032–1.0114, p = 5 × 10 − 4), ER + breast cancer (OR = 0.5252, 95% CI = 0.3589–0.7685, p = 9 × 10 − 4), and heart failure (OR = 1.0026, 95% CI = 1.001–1.0042, p = 0.002). COVID-19 hospitalization was causally linked to heart failure (OR = 1.0017, 95% CI = 1.0006–1.0028, p = 0.002) and Alzheimer’s disease (OR = 1.5092, 95% CI = 1.1942–1.9072, p = 0.0006). COVID-19 severity had causal effects on primary biliary cirrhosis (OR = 2.6333, 95% CI = 1.8274–3.7948, p = 2.059 × 10 − 7), celiac disease (OR = 0.0708, 95% CI = 0.0538–0.0932, p = 9.438 × 10–80), and Alzheimer’s disease (OR = 1.5092, 95% CI = 1.1942–1.9072, p = 0.0006). Reverse MR analysis indicated that rheumatoid arthritis, diabetic nephropathy, multiple sclerosis, and total testosterone (female) influence COVID-19 outcomes. We assessed heterogeneity and horizontal pleiotropy to ensure result reliability and employed the Steiger directionality test to confirm the direction of causality. Conclusions This study provides a comprehensive analysis of the causal relationships between COVID-19 and diverse health conditions. Our findings highlight the long-term impacts of COVID-19 on human health, emphasizing the need for continuous monitoring and targeted interventions for affected individuals. Future research should explore these relationships to develop comprehensive healthcare strategies.
- Published
- 2024
- Full Text
- View/download PDF
14. Haplotype frequency inference from pooled genetic data with a latent multinomial model
- Author
-
Foo, Yong See and Flegg, Jennifer A.
- Subjects
Statistics - Methodology - Abstract
In genetic studies, haplotype data provide more refined information than data about separate genetic markers. However, large-scale studies that genotype hundreds to thousands of individuals may only provide results of pooled data, where only the total allele counts of each marker in each pool are reported. Methods for inferring haplotype frequencies from pooled genetic data that scale well with pool size rely on a normal approximation, which we observe to produce unreliable inference when applied to real data. We illustrate cases where the approximation breaks down, due to the normal covariance matrix being near-singular. As an alternative to approximate methods, in this paper we propose exact methods to infer haplotype frequencies from pooled genetic data based on a latent multinomial model, where the observed allele counts are considered integer combinations of latent, unobserved haplotype counts. One of our methods, latent count sampling via Markov bases, achieves approximately linear runtime with respect to pool size. Our exact methods produce more accurate inference over existing approximate methods for synthetic data and for data based on haplotype information from the 1000 Genomes Project. We also demonstrate how our methods can be applied to time-series of pooled genetic data, as a proof of concept of how our methods are relevant to more complex hierarchical settings, such as spatiotemporal models., Comment: 35 pages, 16 figures, 3 algorithms, submitted to Biometrics journal
- Published
- 2023
15. Statistical Learning of Large-Scale Genetic Data: How to Run a Genome-Wide Association Study of Gene-Expression Data Using the 1000 Genomes Project Data
- Author
-
Sugolov, Anton, Emmenegger, Eric, Paterson, Andrew D., and Sun, Lei
- Published
- 2024
- Full Text
- View/download PDF
16. On the connections between the spatial Lambda-Fleming-Viot model and other processes for analysing geo-referenced genetic data
- Author
-
Wirtz, Johannes and Guindon, Stéphane
- Subjects
Quantitative Biology - Populations and Evolution - Abstract
The introduction of the spatial Lambda-Fleming-Viot model (LV) in population genetics was mainly driven by the pioneering work of Alison Etheridge, in collaboration with Nick Barton and Amandine V\'eber about ten years ago (1,2). The LV model provides a sound mathematical framework for describing the evolution of a population of related individuals along a spatial continuum. It alleviates the "pain in the torus" issue with Wright and Mal\'ecot's isolation by distance model and is sampling consistent, making it a tool of choice for statistical inference. Yet, little is known about the potential connections between the LV and other stochastic processes generating trees and the spatial coordinates along the corresponding lineages. This work focuses on a version of the LV whereby lineages move infinitely rapidly over infinitely small distances. Using simulations, we show that the induced LV tree-generating process is well approximated by a birth-death model. Our results also indicate that Brownian motions modelling the movements of lineages along birth-death trees do not generally provide a good approximation of the LV due to habitat boundaries effects that play an increasingly important role in the long run. Finally, we describe efficient algorithms for fast simulation of the backward and forward in time versions of the LV model.
- Published
- 2023
17. mtDNA 'nomenclutter' and its consequences on the interpretation of genetic data
- Author
-
Vladimir Bajić, Vanessa Hava Schulmann, and Katja Nowick
- Subjects
Mitochondria ,MtDNA ,MtDNA nomenclature ,Haplogroup ,Classification ,Grouping ,Ecology ,QH540-549.5 ,Evolution ,QH359-425 - Abstract
Abstract Population-based studies of human mitochondrial genetic diversity often require the classification of mitochondrial DNA (mtDNA) haplotypes into more than 5400 described haplogroups, and further grouping those into hierarchically higher haplogroups. Such secondary haplogroup groupings (e.g., “macro-haplogroups”) vary across studies, as they depend on the sample quality, technical factors of haplogroup calling, the aims of the study, and the researchers' understanding of the mtDNA haplogroup nomenclature. Retention of historical nomenclature coupled with a growing number of newly described mtDNA lineages results in increasingly complex and inconsistent nomenclature that does not reflect phylogeny well. This “clutter” leaves room for grouping errors and inconsistencies across scientific publications, especially when the haplogroup names are used as a proxy for secondary groupings, and represents a source for scientific misinterpretation. Here we explore the effects of phylogenetically insensitive secondary mtDNA haplogroup groupings, and the lack of standardized secondary haplogroup groupings on downstream analyses and interpretation of genetic data. We demonstrate that frequency-based analyses produce inconsistent results when different secondary mtDNA groupings are applied, and thus allow for vastly different interpretations of the same genetic data. The lack of guidelines and recommendations on how to choose appropriate secondary haplogroup groupings presents an issue for the interpretation of results, as well as their comparison and reproducibility across studies. To reduce biases originating from arbitrarily defined secondary nomenclature-based groupings, we suggest that future updates of mtDNA phylogenies aimed for the use in mtDNA haplogroup nomenclature should also provide well-defined and standardized sets of phylogenetically meaningful algorithm-based secondary haplogroup groupings such as “macro-haplogroups”, “meso-haplogroups”, and “micro-haplogroups”. Ideally, each of the secondary haplogroup grouping levels should be informative about different human population history events. Those phylogenetically informative levels of haplogroup groupings can be easily defined using TreeCluster, and then implemented into haplogroup callers such as HaploGrep3. This would foster reproducibility across studies, provide a grouping standard for population-based studies, and reduce errors associated with haplogroup nomenclatures in future studies.
- Published
- 2024
- Full Text
- View/download PDF
18. The Association Between Periodontal Disease and Cardiovascular Disease: Insights From Imaging, Observational, and Genetic Data
- Author
-
Sanghvi, Mihir M., Ramírez, Julia, Chadalavada, Sucharitha, Aung, Nay, Munroe, Patricia B., Donos, Nikolaos, and Petersen, Steffen E.
- Published
- 2024
- Full Text
- View/download PDF
19. Integrated physiological and genetic data reveal key-traits for heat tolerance in tomato
- Author
-
Francesca, S., Vitale, L., Graci, S., Addonizio, M., Barone, A., and Rigano, M.M.
- Published
- 2024
- Full Text
- View/download PDF
20. Reflections on Turkish Personal Data Protection Law and Genetic Data in Focus Group Discussions
- Author
-
Özkan, Özlem, Şahinol, Melike, Aydinoglu, Arsev Umur, and Aydin Son, Yesim
- Published
- 2022
- Full Text
- View/download PDF
21. Past population control biases interpretations of contemporary genetic data: implications for future invasive Sitka black-tailed deer management in Haida Gwaii
- Author
-
Burgess, Brock T., Irvine, Robyn L., Martin, Jean-Louis, and Russello, Michael A.
- Published
- 2023
- Full Text
- View/download PDF
22. A systematic review on investigating major depressive disorder and bipolar disorder using MRI and genetic data from 2018 to 2024
- Author
-
Kai Sun, Xin Wang, Guifei Zhou, Wenchao Lv, Rujia Song, Wei Wei, Zhenyu Liu, and Changbin Yu
- Subjects
major depressive disorder ,bipolar disorder ,gene‐imaging ,Neurology. Diseases of the nervous system ,RC346-429 - Abstract
Abstract The incidence of affective disorders, of which major depression disorder (MDD) and bipolar disorder (BD) are two main types, has increased rapidly in recent years. They significantly impact patients, their families, and society. However, while affective disorders have become a major issue worldwide, their pathogenesis remains unclear. In the last 6 years, research using magnetic resonance imaging (MRI) and genetic data has gained prominence in understanding their pathophysiology and etiology. This systematic review collected the studies of MDD and BD research published between January 1, 2018, and February 1, 2024, focusing on studies using MRI and genetic data and indexed in the Web of Science and PubMed database. It aims to investigate the similarities and differences in their imaging phenotypes and underlying molecular bases. After exclusions, a total of 80 articles were included in this review. Research on MDD and BD reveals the critical role of epigenetic modifications, such as DNA methylation, in brain structure and function changes. The genes and pathways implicated in MDD are directly associated with depressive symptoms. In contrast, those implicated in BD are associated with mood regulation and cognitive functions. In addition, functional imaging studies have revealed that abnormalities in MDD are frequently concentrated in regions involved in emotion regulation and stress response. In contrast, those in BD are frequently concentrated in the neural circuits related to reward processing and emotional stability. Further multimodal and multiscale studies are needed to advance the field of mood disorder research.
- Published
- 2024
- Full Text
- View/download PDF
23. Musical instruments, tools, language and genetic data reveal ancient hunter-gatherer networks
- Published
- 2024
- Full Text
- View/download PDF
24. Description of regression models for predicting the dynamics of pink salmon returns in the Kamchatka region based on climate-oceanological and population-genetic data
- Author
-
A. V. Bugaev, O. B. Tepnin, N. Yu. Shpigalskaya, and V. V. Kulik
- Subjects
pink salmon ,spawning stock ,salmon return ,fishery forecasting ,regression model ,Aquaculture. Fisheries. Angling ,SH1-691 - Abstract
Several regression models for predicting returns of pink salmon in the Kamchatka region are presented. The data for 1990–2023 were analyzed. Among available climatic and oceanological indices, the most suitable for using as predictors for forecasting of pink salmon returns were the Pacific Decadal Oscillation (PDO) index, Western Pacific Cyclonic Index (WP), Arctic Oscillation (AO) index, and the sea surface temperature anomaly in the North Pacific. Multi-dimensional models of the «stock–recruitment» type were built on identified statistical patterns, which allowed to estimate potential abundance of the pink salmon returns to northeastern and western Kamchatka. Besides, methods for predicting the abundance of pink salmon returns on the data of fish counting in the sea are considered, using the materials of TINRO trawl surveys conducted in the Bering and Okhotsk Seas in the fall seasons of 2012–2023. To determine the abundance of pink salmon originated from West Kamchatka, genetic identification of regional composition of juveniles in mixed trawl catches was used. All tested methods have a high level of determination, but simpler regressive models are more prospective for practical forecasting of general trend in dynamics of pink salmon stocks in the Kamchatka region due to very weak generalization ability of more complicated models.
- Published
- 2024
- Full Text
- View/download PDF
25. The associations between dysregulation of human blood metabolites and lung cancer risk: evidence from genetic data
- Author
-
Gujie Wu, Jun Liu, Haochun Shi, Binyang Pan, Min Li, Xiaolin Wang, Yao Li, Lin Cheng, Weigang Guo, and Yiwei Huang
- Subjects
Lung cancer ,Metabolomics ,Mendelian randomization ,Genome-wide association studies ,Causality ,Neoplasms. Tumors. Oncology. Including cancer and carcinogens ,RC254-282 - Abstract
Abstract Background Metabolic dysregulation is recognized as a significant hallmark of cancer progression. Although numerous studies have linked specific metabolic pathways to cancer incidence, the causal relationship between blood metabolites and lung cancer risk remains unclear. Methods Genomic data from 29,266 lung cancer patients and 56,450 control individuals from the Transdisciplinary Research in Cancer of the Lung and the International Lung Cancer Consortium (TRICL-ILCCO) were utilized, and findings were replicated using additional data from the FinnGen consortium. The analysis focused on the associations between 486 blood metabolites and the susceptibility to overall lung cancer and its three major clinical subtypes. Various Mendelian randomization methods, including inverse-variance weighting, weighted median estimation, and MR-Egger regression, were employed to ensure the robustness of our findings. Results A total of 19 blood metabolites were identified with significant associations with lung cancer risk. Specifically, oleate (OR per SD = 2.56, 95% CI: 1.51 to 4.36), 1-arachidonoylglyceropholine (OR = 1.79, 95% CI: 1.22 to 2.65), and arachidonate (OR = 1.67, 95% CI: 1.16 to 2.40) were associated with a higher risk of lung cancer. Conversely, 1-linoleoylglycerophosphoethanolamine (OR = 0.57, 95% CI: 0.40 to 0.82), ADpSGEGDFXAEGGGVR, a fibrinogen cleavage peptide (OR = 0.60, 95% CI: 0.47 to 0.77), and isovalerylcarnitine (OR = 0.62, 95% CI: 0.49 to 0.78) were associated with a lower risk of lung cancer. Notably, isoleucine (OR = 9.64, 95% CI: 2.55 to 36.38) was associated with a significantly higher risk of lung squamous cell cancer, while acetyl phosphate (OR = 0.11, 95% CI: 0.01 to 0.89) was associated with a significantly lower risk of small cell lung cancer. Conclusion This study reveals the complex relationships between specific blood metabolites and lung cancer risk, highlighting their potential as biomarkers for lung cancer prevention, screening, and treatment. The findings not only deepen our understanding of the metabolic mechanisms of lung cancer but also provide new insights for future treatment strategies.
- Published
- 2024
- Full Text
- View/download PDF
26. Assessing the causality of interferon‐γ and its receptor 1/2 with systemic lupus erythematosus risk using genetic data
- Author
-
Minjing Chang, Kaixin Yao, Jiawei Hao, Yinqi Long, Lulin Qiao, Yaru Zhang, Kexin Ma, and Peifeng He
- Subjects
interferon‐γ ,interferon‐γ receptor ,Mendelian randomization ,risk factor ,single‐nucleotide polymorphism ,systemic lupus erythematosus ,Immunologic diseases. Allergy ,RC581-607 ,Diseases of the musculoskeletal system ,RC925-935 - Abstract
Abstract Background The interferon‐γ (IFN‐γ) signaling pathway is activated in systemic lupus erythematosus (SLE). This study aimed to assess the causal association between IFN‐γ, IFN‐γ receptor 1 (IFN‐γR1), and IFN‐γR2 and SLE using a bidirectional Mendelian randomization (MR) design. Methods Genetic instruments for exposure to IFN‐γ, IFN‐γR1, and IFN‐γR2 were derived from a large genome‐wide association study (GWAS) that included a sample size of 3301 participants. Instrumental variables for SLE were selected from another independent GWAS analysis comprising 5201 cases and 6099 controls with European ancestry. Bidirectional two‐sample MR was performed using inverse variance weighting, MR‐Egger regression, and weighted median methods. A series of sensitivity analyses were conducted to assess the robustness of the results. Results The inverse variance weighting showed that IFN‐γ had a positive causal association with the risk of SLE (odd ratio [OR] = 1.24, 95% confidence interval [CI]: 1.03–1.47, p = 0.018). IFN‐γR2 levels were not associated with SLE risk after adjustment for multiple comparisons (OR = 0.85, 95% CI: 0.73–0.99, p = 0.034). No genetic association was also detected between IFN‐γR1 and SLE (OR = 0.97, 95% CI: 0.79–1.19, p = 0.768). Evidence from bidirectional MR did not support reverse causality. The weighted median regression also showed directionally similar estimates. Conclusion Higher levels of IFN‐γ are significantly associated with an increased risk of SLE, providing insights into the pathogenesis of SLE.
- Published
- 2024
- Full Text
- View/download PDF
27. Learning semi-supervised enrichment of longitudinal imaging-genetic data for improved prediction of cognitive decline
- Author
-
Seo, Hoon, Brand, Lodewijk, and Wang, Hua
- Published
- 2024
- Full Text
- View/download PDF
28. On the connections between the spatial Lambda–Fleming–Viot model and other processes for analysing geo-referenced genetic data
- Author
-
Wirtz, Johannes and Guindon, Stéphane
- Published
- 2024
- Full Text
- View/download PDF
29. Can Psychopathy be Prevented? Clinical, Neuroimaging and Genetic Data
- Author
-
Feggy Ostrosky Shejet, Head of Neuropsychology and Psychophisiology Laboratory
- Published
- 2023
30. Investigating infectious diseases using genetic data
- Author
-
Lin, ShangKuan, Wilson, Daniel, Mentzer, Alexander, and Ansari, Mohamad
- Subjects
Infectious diseases ,Hepatitis B virus ,Hepatitis C virus ,Computational biology ,Bioinformatics - Abstract
Infectious diseases have been a major cause of mortality and morbidity throughout human history. The dramatic advances in technology, especially in genome sequencing, have opened a new avenue to study infectious diseases and changed the landscape of public health and medical science. Compared to non-communicable human diseases where human genetics is the sole focus of genetic studies, infectious diseases add another layer of complexity due to the host-pathogen dynamic. Since infection can be seen as the result of the complex interplay of host and pathogen and the environment, a comprehensive understanding requires the incorporation of all these factors. In this thesis, I explore the different applications of genetic data in studying infectious diseases through the lens of host-pathogen dynamics and their clinical or public health implication. I first used virus whole genome data to rebuild the epidemiological history of Hepatitis C virus subtype 3a (HCV-3a). I showed how World War II had likely led to the spread of HCV-3a from South Asia and becoming a global epidemic. Notably, I also incorporated host genetic data into the analysis and showed how it helped us identify and address the confounding factor of human migration when conducting an evolutionary analysis on HCV-3a. I then integrated previously published methods to investigate protein residue covariation to investigate the signals co-evolution within the Hepatitis B virus (HBV). I built a co-evolution profile for HBV and link them to previous studies to demonstrate how drug resistance is one of the major driving forces of protein residue co-evolution in HBV. Lastly, I explored the potential of integration of national-scale databases that collect pathogen (Second Generation Surveillance System) and host data (UK Biobank) respectively and conducted multiple genome-wide association studies (GWASs) and lay out how such integration can help improve our understanding of human genetic risk factors for infectious diseases. To conclude, I discuss the importance of host-pathogen data integration as well as the need for developing statistical models that can account for both the data volume as well as complexity resulted from such integration.
- Published
- 2023
31. Modern Bioinformatics Solutions Used for Genetic Data Analysis
- Author
-
Ya. A. Kibirev, A. V. Kuznetsovskiy, S. G. Isupov, and I. V. Darmov
- Subjects
bioinformatics ,genetic analysis ,identification ,microorganisms ,nucleic acids ,sequencing ,software ,Military Science - Abstract
Effective counteraction to biological threats, both natural and man-made, requires the availability of means and methods for rapid and reliable microorganism identification and a comprehensive study of their basic biological properties. Over the past decade, the arsenal of domestic microbiologists has been supplemented by numerous methods for analyzing the genomes of pathogens, primarily based on nucleic acid sequencing. The purpose of this work is to provide the reader with information about capabilities of modern technical and methodological arsenal used for in-depth molecular genetic study of microorganisms, including bioinformatics solutions used for the genetic data analysis. The source base for this research is English-language scientific literature available via the Internet, bioinformation software documentation. The research method is an analysis of scientific sources from the general to the specific. We considered the features of sequencing platforms, the main stages of genetic information analysis, current bioinformation utilities, their interaction and organization into a single workflow. Results and discussion. The performance of modern genetic analyzers allows for complete decoding of the bacterial genome within one day, including the time required to prepare the sample for research. The key factor that largely determines the effectiveness of the genetic analysis methods used is the competent use of the necessary bioinformatics software utilities. Standard stages of primary genetic data analysis are assessment of the quality control, data preprocessing, mapping to a reference genome or de novo genome assembly, genome annotation, typing and identification of significant genetic determinants (resistance to antibacterial drugs, pathogenicity factors, etc.), phylogenetic analysis. For each stage bioinformation utilities have been developed, differing in implemented analysis algorithms. Conclusion. Open source utilities that do not require access to remote resources for their operation are of greatest interest due to activities specifics of NBC protection corps units.
- Published
- 2024
- Full Text
- View/download PDF
32. Consent for Use of Genetic Data among US Hispanics/Latinos : Results from the Hispanic Community Health Study/Study of Latinos
- Author
-
Gonzalez, Sara, Strizich, Garrett, Isasi, Carmen R., Hua, Simin, Comas, Betsy, Sofer, Tamar, Thyagarajan, Bharat, Perreira, Krista M., Talavera, Gregory A., Daviglus, Martha L., Nelson, Sarah C., Giachello, Aida L., Schneiderman, Neil, and Kaplan, Robert C.
- Published
- 2021
33. Importance of feature selection stability in the classifier evaluation on high-dimensional genetic data
- Author
-
Tomasz Łukaszuk and Jerzy Krawczuk
- Subjects
Classification of objects ,Feature selection stability ,Classifier evaluation ,Gene expression data ,Feature selection ,Medicine ,Biology (General) ,QH301-705.5 - Abstract
Classifiers trained on high-dimensional data, such as genetic datasets, often encounter situations where the number of features exceeds the number of objects. In these cases, classifiers typically rely on a small subset of features. For a robust algorithm, this subset should remain relatively stable with minor changes in the training data, such as the replacement of a few samples. While the stability of feature selection is a common focus in studies of feature selection algorithms, it is less emphasized in classifier evaluation, where only metrics such as accuracy are commonly used. We investigate the importance of feature selection stability through an empirical study of four classifiers (logistic regression, support vector machine, convex and piecewise Linear, and Random Forest) on seven high dimensional, publicly available, gene datasets. We measure the stability of feature selection using Lustgarten, Nogueira and Jaccard Index measures. We employed our own cross-validation procedure that guarantees a difference of exactly p objects between any two training sets which allows us to control the level of disturbance in the data. Our results show the existence of a drop in feature selection stability when we increase disturbance in the data in all 28 experiment configurations (seven datasets and four classifiers). The relationship is not linear, but resembles more of a hyperbolic pattern. In the evaluation of the tested classifiers, logistic regression demonstrated the highest stability. It was followed in order by support vector machine, convex and piecewise linear, with Random Forest exhibiting the lowest stability among them. This work provides evidence that all tested classifiers are very sensitive to even small changes in learning data in terms of features used by the model, while showing almost no sensitivity in terms of accuracy. The data and codes to reproduce the results in the article are available publicly on GitHub: https://github.com/tlukaszuk/feature-selection-stability-in-classifier-evaluation.
- Published
- 2024
- Full Text
- View/download PDF
34. Using Genetic Data to Determine Origin for Out‐Migrating Smolt and Returning Adult Steelhead Trout (Oncorhynchus mykiss) in a Southeast Alaska Drainage
- Author
-
Evan J. Barfuss, Bridey E. Brown, Shriya Sachdeva, Asher B. Smith, Frank P. Thrower, Charles D. Waters, Krista M. Nichols, and Matthew C. Hale
- Subjects
conservation ,DMAS‐qPCR ,rainbow trout ,smoltification ,steelhead trout ,Ecology ,QH540-549.5 - Abstract
ABSTRACT Oncorhynchus mykiss is a partially migratory salmonid species, and many migratory populations (known as steelhead) have declined in recent decades in the western United States and Canada. Closely related resident populations (known as rainbow trout) may be an effective resource in the recovery of these declining migratory populations. However, the extent to which different populations of resident rainbow trout produce migratory individuals and how likely these individuals are to return as adults to spawn remains unknown. One limitation to answering these questions is the identification of loci that accurately segregate between migratory and resident populations. To address this limitation, we used existing genomic data from a well‐studied population of O. mykiss from Southeastern Alaska (Sashin Creek) to identify loci that segregate between phenotypes. We then utilized Double Mismatch Allele‐Specific qPCR (DMAS‐qPCR) to genotype 233 smolts out‐migrating from Sashin Creek and 99 returning adult steelhead trout across a five‐year period to determine (a) the origin of out‐migrating smolts and returning adults and (b) to quantify the extent to which the resident population contributes to the migratory population. Our results show that 37.3% of out‐migrating smolts were produced from resident parents, whereas 19.3% of returning adults had resident parents. Ultimately, these results demonstrate that resident populations of rainbow trout produce migrant offspring that successfully complete their migration and return to spawn, increasing population sizes and likely improving genetic diversity. Therefore, conservation efforts should consider landlocked resident populations for producing smolts when developing recovery plans for migratory steelhead populations.
- Published
- 2024
- Full Text
- View/download PDF
35. Integrated physiological and genetic data reveal key-traits for heat tolerance in tomato
- Author
-
S. Francesca, L. Vitale, S. Graci, M. Addonizio, A. Barone, and M.M. Rigano
- Subjects
Heat stress ,Novel genotypes ,Photosynthesis ,Genotyping-by-sequencing ,Solanum lycopersicum ,Plant ecology ,QK900-989 - Abstract
As global temperatures continue to rise, high summer temperatures severely affect crop growth, reducing yield and quality. The projections of annual declines in crop yield require more in-depth multidisciplinary studies on plant tolerance to abiotic stresses. As tomato is a major crop in the Mediterranean region, its response to heat stress has become important to be addressed in order to identify those traits affecting stress tolerance. In this study, physiological and genomic analyses were performed on two heat-tolerant genotypes (LA3120 and E42) grown at high temperatures during the entire life cycle. The results showed that heat stress diversely affected gas exchange and fluorescence parameters in the two genotypes. In particular, E42 regulated the photosynthetic machinery under heat stress by modulating the electron transport chain, whereas LA3120 was less affected by the applied stress. Genotyping data obtained from a GBS (genotyping by sequencing) analysis were used to explore the genetic variability of both genotypes with the aim of identifying candidate genes that might regulate their stress response. These results further deepen our understanding of the physiological mechanisms activated in response to heat stress and allowed to select key traits that could be used in breeding program to select thermotolerant tomato genotypes.
- Published
- 2024
- Full Text
- View/download PDF
36. Multimodal integration of neuroimaging and genetic data for the diagnosis of mood disorders based on computer vision models
- Author
-
Lee, Seungeun, Cho, Yongwon, Ji, Yuyoung, Jeon, Minhyek, Kim, Aram, Ham, Byung-Joo, and Joo, Yoonjung Yoonie
- Published
- 2024
- Full Text
- View/download PDF
37. IG-GRD: A Model Based on Disentangled Graph Representation Learning for Imaging Genetic Data Fusion.
- Author
-
Shuang Feng, Letian Wang, Chang Li, Xiaohua Wan, Fa Zhang 0001, and Bin Hu 0001
- Published
- 2024
- Full Text
- View/download PDF
38. Genetic Data Analysis
- Author
-
Shamila, M., primary and Tyagi, Amit Kumar, additional
- Published
- 2023
- Full Text
- View/download PDF
39. Learning semi-supervised enrichment of longitudinal imaging-genetic data for improved prediction of cognitive decline
- Author
-
Hoon Seo, Lodewijk Brand, Hua Wang, and for the Alzheimer’s Disease Neuroimaging Initiative
- Subjects
Alzheimer’s disease ,Multi-modal ,Longitudinal learning ,Enrichment ,Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Abstract Background Alzheimer’s Disease (AD) is a progressive memory disorder that causes irreversible cognitive decline. Given that there is currently no cure, it is critical to detect AD in its early stage during the disease progression. Recently, many statistical learning methods have been presented to identify cognitive decline with temporal data, but few of these methods integrate heterogeneous phenotype and genetic information together to improve the accuracy of prediction. In addition, many of these models are often unable to handle incomplete temporal data; this often manifests itself in the removal of records to ensure consistency in the number of records across participants. Results To address these issues, in this work we propose a novel approach to integrate the genetic data and the longitudinal phenotype data to learn a fixed-length “enriched” biomarker representation derived from the temporal heterogeneous neuroimaging records. Armed with this enriched representation, as a fixed-length vector per participant, conventional machine learning models can be used to predict clinical outcomes associated with AD. Conclusion The proposed method shows improved prediction performance when applied to data derived from Alzheimer’s Disease Neruoimaging Initiative cohort. In addition, our approach can be easily interpreted to allow for the identification and validation of biomarkers associated with cognitive decline.
- Published
- 2024
- Full Text
- View/download PDF
40. An annotated catalogue of selected historical type specimens, including genetic data, housed in the Natural History Museum Vienna
- Author
-
Anja Palandačić, Min J. Chai, Gennadiy A. Shandikov, Nesrine Akkari, Pedro R. Frade, Susanne Randolf, Hans-Martin Berg, Ernst Mikschi, and Nina G. Bogutskaya
- Subjects
Zoology ,QL1-991 - Abstract
Museum collections are an important source for resolving taxonomic issues and species delimitation. Type specimens as name-bearing specimens, traditionally used in morphology-based taxonomy, are, due to the progress in historical DNA methodology, increasingly used in molecular taxonomic studies. Museum collections are subject to constant deterioration and major disasters. The digitisation of collections offers a partial solution to these problems and makes museum collections more accessible to the wider scientific community. The Extended Specimen Approach (ESA) is a method of digitisation that goes beyond the physical specimen to include the historical information stored in the collection. The collections of the Natural History Museum Vienna represent one of the largest non-university research centres in Europe and, due to their size and numerous type specimens, are frequently used for taxonomic studies by visiting and resident scientists. Recently, a version of ESA was presented in the common catalogue of the Fish and Evertebrata Varia collections and extended to include genetic information on type specimens in a case study of a torpedo ray. Here the case study was extended to a heterogeneous selection of historical type series from different collections with the type locality of Vienna. The goal was to apply the ESA, including genetic data on a selected set of type material: three parasitic worms, three myriapods, two insects, twelve fishes, and one bird species. Five hundred digital items (photographs, X-rays, scans) were produced, and genetic analysis was successful in eleven of the 21 type series. In one case a complete mitochondrial genome was assembled, and in another case ten short fragments (100–230 bp) of the cytochrome oxidase I gene were amplified and sequenced. For five type series, genetic analysis confirmed their taxonomic status as previously recognised synonyms, and for one the analysis supported its status as a distinct species. For two species, genetic information was provided for the first time. This catalogue thus demonstrates the usefulness of ESA in providing digitised data of types that can be easily made available to scientists worldwide for further study.
- Published
- 2024
- Full Text
- View/download PDF
41. Concerns raised about 23andMe's genetic data, NY Times reports
- Subjects
Genetic research ,Business ,News, opinion and commentary - Abstract
A decline in 23andMe's valuation, a recent board resignation, and a data breach have all raised questions about the future of genetic data collected from millions of customers, The New [...]
- Published
- 2024
42. Refugia, colonization and diversification of an arid-adapted bird: coincident patterns between genetic data and ecological niche modelling
- Author
-
Kvist, Laura, Barbosa, Andrés, Valera, Francisco, Khoury, Fhares, Varela, Sara, Moreno, Eulalia, Barrientos Yuste, Rafael, Kvist, Laura, Barbosa, Andrés, Valera, Francisco, Khoury, Fhares, Varela, Sara, Moreno, Eulalia, and Barrientos Yuste, Rafael
- Abstract
Microsatellite data, DNA sequences and their corresponding GenBank Accession nos. are deposited in the CSIC institutional repository (http://digital.csic.es/handle/10261/85523). Geographical coordinates of the locations used to model the ecological niche of the trumpeter finch from its current range are provided in Table S4 (Supporting Information) and the R-script for running the ecological niche models are available in GitHub (https://github.com/SaraVarela/Bucanetes)., Phylogeographical studies are common in boreal and temperate species from the Palaearctic, but scarce in arid-adapted species. We used nuclear and mitochondrial markers to investigate phylogeography and to estimate chronology of colonization events of the trumpeter finch Bucanetes githagineus, an arid-adapted bird. We used 271 samples from 16 populations, most of which were fresh samples but including some museum specimens. Microsatellite data showed no clear grouping according to the sampling locations. Microsatellite and mitochondrial data showed the clearest differentiation between Maghreb and Canary Islands and between Maghreb and Western Sahara. Mitochondrial data suggest differentiation between different Maghreb populations and among Maghreb and Near East populations, between Iberian Peninsula and Canary Islands, as well as between Western Sahara and Maghreb. Our coalescence analyses indicate that the trumpeter finch colonized North Africa during the humid Marine Isotope Stage 5 (MIS5) period of the Sahara region 125 000 years ago. We constructed an ecological niche model (ENM) to estimate the geographical distribution of climatically suitable habitats for the trumpeter finch. We tested whether changes in the species range in relation to glacial–interglacial cycles could be responsible for observed patterns of genetic diversity and structure. Modelling results matched with those from genetic data as the species' potential range increases in interglacial scenarios (in the present climatic scenario and during MIS5) and decreases in glacial climates (during the last glacial maximum, LGM, 21 000 years ago). Our results suggest that the trumpeter finch responded to Pleistocene climatic changes by expanding and contracting its range., Depto. de Biodiversidad, Ecología y Evolución, Fac. de Ciencias Biológicas, TRUE, pub
- Published
- 2024
43. A Neural Network Based Method with Transfer Learning for Genetic Data Analysis
- Author
-
Lin, Jinghang, Zhang, Shan, and Lu, Qing
- Subjects
Statistics - Applications ,Computer Science - Machine Learning ,Quantitative Biology - Genomics - Abstract
Transfer learning has emerged as a powerful technique in many application problems, such as computer vision and natural language processing. However, this technique is largely ignored in application to genetic data analysis. In this paper, we combine transfer learning technique with a neural network based method(expectile neural networks). With transfer learning, instead of starting the learning process from scratch, we start from one task that have been learned when solving a different task. We leverage previous learnings and avoid starting from scratch to improve the model performance by passing information gained in different but related task. To demonstrate the performance, we run two real data sets. By using transfer learning algorithm, the performance of expectile neural networks is improved compared to expectile neural network without using transfer learning technique.
- Published
- 2022
44. Cancer survival prediction by learning comprehensive deep feature representation for multiple types of genetic data
- Author
-
Hao, Yaru, Jing, Xiao-Yuan, and Sun, Qixing
- Published
- 2023
- Full Text
- View/download PDF
45. THE RIGHT TO BE FORGOTTEN REGARDING GENETIC DATA: A LEGAL AND ETHICAL ANALYSIS.
- Author
-
Correia, Mónica, Rego, Guilhermina, and Nunes, Rui
- Subjects
- *
RIGHT to be forgotten , *DATA privacy , *GENETIC privacy , *GENERAL Data Protection Regulation, 2016 , *RIGHT of privacy - Abstract
This article investigates an under-discussed provision of the European Union's (EU's) General Data Protection Regulation (GDPR) regarding genetic data, i.e., the right to be forgotten. The debate on this right came from the commercerelated side of data protection instead of the medical side. Thus, this article addresses the implications of the RTBF for the lawful processing of familial genetic data. The article develops a normative, ethically focused principles argument about interpreting genetic data's right to be forgotten. It gives due consideration to autonomy, privacy, and human dignity. It argues that the individualistic approach of genetic privacy materialised through the extreme solution of data erasure is challenging to combine with familial and scientific research interests. The article suggests an interpretation of the GDPR according to bioethical principles and the inclusion of a specific exception regarding genetic data to prevent patients from claiming the right to be forgotten. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. A case study of a liver transplant-treated patient with glycogen storage disease type Ia presenting with multiple inflammatory hepatic adenomas: an analysis of clinicopathologic and genetic data
- Author
-
Ao Wang, Jiamei Wu, Xiaohui Yuan, Jianping Liu, and Changli Lu
- Subjects
GSD-Ia ,Hepatic adenomas ,G6PC gene ,Liver transplant-treated ,Clinicopathologic ,Internal medicine ,RC31-1245 ,Genetics ,QH426-470 - Abstract
Abstract Background Glycogen storage disease (GSD) is a disease caused by excessive deposition of glycogen in tissues due to genetic disorders in glycogen metabolism. Glycogen storage disease type I (GSD-I) is also known as VonGeirk disease and glucose-6-phosphatase deficiency. This disease is inherited in an autosomal recessive manner, and both sexes can be affected. The main symptoms include hypoglycaemia, hepatomegaly, acidosis, hyperlipidaemia, hyperuricaemia, hyperlactataemia, coagulopathy and developmental delay. Case presentation Here, we present the case of a 13-year-old female patient with GSD Ia complicated with multiple inflammatory hepatic adenomas. She presented to the hospital with hepatomegaly, hypoglycaemia, and epistaxis. By clinical manifestations and imaging and laboratory examinations, we suspected that the patient suffered from GSD I. Finally, the diagnosis was confirmed by liver pathology and whole-exome sequencing (WES). WES revealed a synonymous mutation, c.648 G > T (p.L216 = , NM_000151.4), in exon 5 and a frameshift mutation, c.262delG (p.Val88Phefs*14, NM_000151.4), in exon 2 of the G6PC gene. According to the pedigree analysis results of first-generation sequencing, heterozygous mutations of c.648 G > T and c.262delG were obtained from the patient's father and mother. Liver pathology revealed that the solid nodules were hepatocellular hyperplastic lesions, and immunohistochemical (IHC) results revealed positive expression of CD34 (incomplete vascularization), liver fatty acid binding protein (L-FABP) and C-reactive protein (CRP) in nodule hepatocytes and negative expression of β-catenin and glutamine synthetase (GS). These findings suggest multiple inflammatory hepatocellular adenomas. PAS-stained peripheral hepatocytes that were mostly digested by PAS-D were strongly positive. This patient was finally diagnosed with GSD-Ia complicated with multiple inflammatory hepatic adenomas, briefly treated with nutritional therapy after diagnosis and then underwent living-donor liver allotransplantation. After 14 months of follow-up, the patient recovered well, liver function and blood glucose levels remained normal, and no complications occurred. Conclusion The patient was diagnosed with GSD-Ia combined with multiple inflammatory hepatic adenomas and received liver transplant treatment. For childhood patients who present with hepatomegaly, growth retardation, and laboratory test abnormalities, including hypoglycaemia, hyperuricaemia, and hyperlipidaemia, a diagnosis of GSD should be considered. Gene sequencing and liver pathology play important roles in the diagnosis and typing of GSD.
- Published
- 2024
- Full Text
- View/download PDF
47. Species delimitation, discovery and conservation in a tiger beetle species complex despite discordant genetic data
- Author
-
Daniel P. Duran, Robert A. Laroche, Stephen J. Roman, William Godwin, David P. Herrmann, Ethan Bull, and Scott P. Egan
- Subjects
Medicine ,Science - Abstract
Abstract In an age of species declines, delineating and discovering biodiversity is critical for both taxonomic accuracy and conservation. In recent years, there has been a movement away from using exclusively morphological characters to delineate and describe taxa and an increase in the use of molecular markers to describe diversity or through integrative taxonomy, which employs traditional morphological characters, as well as genetic or other data. Tiger beetles are charismatic, of conservation concern, and much work has been done on the morphological delineation of species and subspecies, but few of these taxa have been tested with genetic analyses. In this study, we tested morphologically based taxonomic hypotheses of polymorphic tiger beetles in the Eunota circumpicta (LaFerté-Sénectère, 1841) species complex using multilocus genomic and mtDNA analyses. We find multiple cryptic species within the previous taxonomic concept of Eunota circumpicta, some of which were historically recognized as subspecies. We found that the mtDNA and genomic datasets did not identify the same taxonomic units and that the mtDNA was most at odds with all other genetic and morphological patterns. Overall, we describe new cryptic diversity, which raises important conservation concerns, and provide a working example for testing species and subspecies validity despite discordant data.
- Published
- 2024
- Full Text
- View/download PDF
48. Open science and human genetic data: recommendations on South Africa’s Draft National Open Science Policy
- Author
-
Donrich Thaldar, Amy Gooden, and Michaela Steytler
- Subjects
Draft National Open Science Policy ,freedom of scientific research ,human genetic data ,open science ,ownership ,South Africa ,Genetics ,QH426-470 - Abstract
The Draft National Open Science Policy, which was shared by the South African government with stakeholders in 2022, is an encouraging step forward as it aims to promote the practice of open science in South Africa through a system of incentives. Since South Africa is constitutionally committed to be an open and democratic society, this approach is preferable to the approach of state control that characterizes the Draft National Policy on Data and Cloud—another data-related policy initiative by the South African government. However, there is room for improvement in the Draft National Open Science Policy. In particular, it should: (a) rely on the right to freedom of scientific research to strengthen the policy; (b) rectify the omission of ownership from its policy analysis; and (c) retain a clear differentiation between human and non-human genetic data. This will ensure that the final policy is clearly anchored in the South African Constitution, and that the principle of “as open as possible, as closed as necessary” can be applied to human genetic data in a legally well informed and accountable way.
- Published
- 2023
- Full Text
- View/download PDF
49. Genetic Data Governance in Japanese Hospitals
- Author
-
Suzuki, Mizuho Yamazaki, Ohnuki, Yuko, and Takeshita, Kei
- Published
- 2023
- Full Text
- View/download PDF
50. Long-term video and genetic data yield insights into complex sociality of a solitary large carnivore
- Author
-
Reynolds-Hogland, Melissa, Brooks, Carly, Ramsey, Alan B., Hogland, John S., Pilgrim, Kristine L., Engkjer, Cory, and Ramsey, Philip W.
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.