175 results on '"Yakhini Z"'
Search Results
2. SNPs in genes coding for ROS metabolism and signalling in association with docetaxel clearance
- Author
-
Edvardsen, H, Brunsvig, P F, Solvang, H, Tsalenko, A, Andersen, A, Syvanen, A-C, Yakhini, Z, Børresen-Dale, A-L, Olsen, H, Aamdal, S, and Kristensen, V N
- Published
- 2010
- Full Text
- View/download PDF
3. A steroid metabolomic approach to 17α-hydroxylase/17,20 lyase deficiency
- Author
-
Tiosano, D., Navon, R., Flor, O., Knopf, C., Hartmann, M. F., Wudy, S. A., Yakhini, Z., and Hochberg, Z.
- Published
- 2010
- Full Text
- View/download PDF
4. Chromosomal aberrations and gene expression profiles in non-small cell lung cancer
- Author
-
Dehan, E., Ben-Dor, A., Liao, W., Lipson, D., Frimer, H., Rienstein, S., Simansky, D., Krupsky, M., Yaron, P., Friedman, E., Rechavi, G., Perlman, M., Aviram-Goldring, A., Izraeli, S., Bittner, M., Yakhini, Z., and Kaminski, N.
- Published
- 2007
- Full Text
- View/download PDF
5. Molecular classification of cutaneous malignant melanoma by gene expression profiling
- Author
-
Bittner, M., Meltzer, P., Chen, Y., Jiang, Y., Seftor, E., Hendrix, M., Radmacher, M., Simon, R., Yakhini, Z., Ben-Dor, A., Sampas, N., Dougherty, E., Wang, E., Marincola, F., Gooden, C., Lueders, J., Glatfelter, A., Pollock, P., Carpten, J., Gillanders, E., Leja, D., Dietrich, K., Beaudry, C., Berens, M., Alberts, D., Sondak, V., Hayward, N., and Trent, J.
- Published
- 2000
- Full Text
- View/download PDF
6. Methods for analysis and visualization of SNP genotype data for complex diseases
- Author
-
Tsalenko, A., Ben-Dor, A., Yakhini, Z., and Cox, N.J.
- Subjects
Human genetics -- Research ,Diseases -- Genetic aspects ,Algorithms -- Usage ,Genetic research -- Methods ,Biological sciences - Published
- 2001
7. Alu elements implicated in the generation of 9/20 CNV deletions investigated by breakpoint sequencing
- Author
-
de Smith, A, Walters, R, Coin, L, Steinfeld, I, Yakhini, Z, Sladek, R, Froguel, P, and Blakemore, A
- Published
- 2008
8. Global methylation patterns in idiopathic pulmonary fibrosis
- Author
-
Rabinovich, EI, Kapetanaki, MG, Steinfeld, I, Gibson, KF, Pandit, KV, Yu, G, Yakhini, Z, Kaminski, N, Rabinovich, EI, Kapetanaki, MG, Steinfeld, I, Gibson, KF, Pandit, KV, Yu, G, Yakhini, Z, and Kaminski, N
- Abstract
Background: Idiopathic Pulmonary Fibrosis (IPF) is characterized by profound changes in the lung phenotype including excessive extracellular matrix deposition, myofibroblast foci, alveolar epithelial cell hyperplasia and extensive remodeling. The role of epigenetic changes in determining the lung phenotype in IPF is unknown. In this study we determine whether IPF lungs exhibit an altered global methylation profile. Methodology/Principal Findings: Immunoprecipitated methylated DNA from 12 IPF lungs, 10 lung adenocarcinomas and 10 normal histology lungs was hybridized to Agilent human CpG Islands Microarrays and data analysis was performed using BRB-Array Tools and DAVID Bioinformatics Resources software packages. Array results were validated using the EpiTYPER MassARRAY platform for 3 CpG islands. 625 CpG islands were differentially methylated between IPF and control lungs with an estimated False Discovery Rate less than 5%. The genes associated with the differentially methylated CpG islands are involved in regulation of apoptosis, morphogenesis and cellular biosynthetic processes. The expression of three genes (STK17B, STK3 and HIST1H2AH) with hypomethylated promoters was increased in IPF lungs. Comparison of IPF methylation patterns to lung cancer or control samples, revealed that IPF lungs display an intermediate methylation profile, partly similar to lung cancer and partly similar to control with 402 differentially methylated CpG islands overlapping between IPF and cancer. Despite their similarity to cancer, IPF lungs did not exhibit hypomethylation of long interspersed nuclear element 1 (LINE-1) retrotransposon while lung cancer samples did, suggesting that the global hypomethylation observed in cancer was not typical of IPF. Conclusions/Significance: Our results provide evidence that epigenetic changes in IPF are widespread and potentially important. The partial similarity to cancer may signify similar pathogenetic mechanisms while the differences constitute IP
- Published
- 2012
9. SNPs in genes coding for ROS metabolism and signalling in association with docetaxel clearance
- Author
-
Edvardsen, H., Brunsvig, P F., Solvang, H., Tsalenko, A., Andersen, A., Syvänen, Ann-Christine, Yakhini, Z., Børresen-Dale, A-L., Olsen, H., Aamdal, S., Kristensen, V. N., Edvardsen, H., Brunsvig, P F., Solvang, H., Tsalenko, A., Andersen, A., Syvänen, Ann-Christine, Yakhini, Z., Børresen-Dale, A-L., Olsen, H., Aamdal, S., and Kristensen, V. N.
- Abstract
The dose of docetaxel is currently calculated based on body surface area and does not reflect the pharmacokinetic, metabolic potential or genetic background of the patients. The influence of genetic variation on the clearance of docetaxel was analysed in a two-stage analysis. In step one, 583 single-nucleotide polymorphisms (SNPs) in 203 genes were genotyped on samples from 24 patients with locally advanced non-small cell lung cancer. We found that many of the genes harbour several SNPs associated with clearance of docetaxel. Most notably these were four SNPs in EGF, three SNPs in PRDX4 and XPC, and two SNPs in GSTA4, TGFBR2, TNFAIP2, BCL2, DPYD and EGFR. The multiple SNPs per gene suggested the existence of common haplotypes associated with clearance. These were confirmed with detailed haplotype analysis. On the basis of analysis of variance (ANOVA), quantitative mutual information score (QMIS) and Kruskal-Wallis (KW) analysis SNPs significantly associated with clearance of docetaxel were confirmed for GSTA4, PRDX4, TGFBR2 and XPC and additional putative markers were found in CYP2C8, EPHX1, IGF2, IL1R2, MAPK7, NDUFB4, TGFBR3, TPMT (2 SNPs), (P<0.05 or borderline significant for all three methods, 14 SNPs in total). In step two, these 14 SNPs were genotyped in additional 9 samples and the results combined with the genotyping results from the first step. For 7 of the 14 SNPs, the results are still significant/borderline significant by all three methods: ANOVA, QMIS and KW analysis strengthening our hypothesis that they are associated with the clearance of docetaxel..
- Published
- 2010
- Full Text
- View/download PDF
10. Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements
- Author
-
Lichten, M, de Smith, AJ, Walters, RG, Coin, LJM, Steinfeld, I, Yakhini, Z, Sladek, R, Froguel, P, Blakemore, AIF, Lichten, M, de Smith, AJ, Walters, RG, Coin, LJM, Steinfeld, I, Yakhini, Z, Sladek, R, Froguel, P, and Blakemore, AIF
- Abstract
Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms.
- Published
- 2008
11. Deubiquitination of EGFR by Cezanne-1 contributes to cancer progression
- Author
-
Pareja, F, primary, Ferraro, D A, additional, Rubin, C, additional, Cohen-Dvashi, H, additional, Zhang, F, additional, Aulmann, S, additional, Ben-Chetrit, N, additional, Pines, G, additional, Navon, R, additional, Crosetto, N, additional, Köstler, W, additional, Carvalho, S, additional, Lavi, S, additional, Schmitt, F, additional, Dikic, I, additional, Yakhini, Z, additional, Sinn, P, additional, Mills, G B, additional, and Yarden, Y, additional
- Published
- 2011
- Full Text
- View/download PDF
12. 648 Molecular characterization of breast cancer subtypes derived from joint analysis of high throughput miRNA and mRNA data
- Author
-
Enerly, E., primary, Steinfeld, I., additional, Kleivi, K., additional, Aure, M.R., additional, Leivonen, S.K., additional, Johnsen, H., additional, Kallioniemi, O., additional, Kristensen, V.N., additional, Yakhini, Z., additional, and Børresen-Dale, A.L., additional
- Published
- 2010
- Full Text
- View/download PDF
13. The Role of miR-154 microRNA Family in IPF.
- Author
-
Milosevic, J, primary, Pandit, KV, additional, Jacobs, R, additional, Benos, PV, additional, Yakhini, Z, additional, Chensny, L, additional, and Kaminski, N, additional
- Published
- 2009
- Full Text
- View/download PDF
14. Optimization of probe coverage for high-resolution oligonucleotide aCGH
- Author
-
Lipson, D., primary, Yakhini, Z., additional, and Aumann, Y., additional
- Published
- 2007
- Full Text
- View/download PDF
15. Genetic variation in putative regulatory loci controlling gene expression in breast cancer
- Author
-
Kristensen, V. N., primary, Edvardsen, H., additional, Tsalenko, A., additional, Nordgard, S. H., additional, Sorlie, T., additional, Sharan, R., additional, Vailaya, A., additional, Ben-Dor, A., additional, Lonning, P. E., additional, Lien, S., additional, Omholt, S., additional, Syvanen, A.-C., additional, Yakhini, Z., additional, and Borresen-Dale, A.-L., additional
- Published
- 2006
- Full Text
- View/download PDF
16. Su-S3:2 Data integration and data analysis for collaborative studies in cardiovascular diseases
- Author
-
Yakhini, Z., primary
- Published
- 2006
- Full Text
- View/download PDF
17. Tu-P7:206 Molecular signatures determining coronary artery and saphenous vein smooth muscle cell phenotypes: Distinct responses to stimuli
- Author
-
Deng, D.X., primary, Spin, J.M., additional, Tsalenko, A., additional, Vailaya, A., additional, Ben-Dor, A., additional, Yakhini, Z., additional, Tsao, P., additional, Bruhn, L., additional, and Quertermous, T., additional
- Published
- 2006
- Full Text
- View/download PDF
18. Comparative genomic hybridization using oligonucleotide arrays and total genomic DNA
- Author
-
Barrett, MT, primary, Sampas, N, additional, Ben-Dor, A, additional, Scheffer, A, additional, Anderson, P, additional, Tsang, P, additional, Gooden, C, additional, Walker, R, additional, Curry, B, additional, Kincaid, R, additional, Lipson, D, additional, Bittner, M, additional, Yakhini, Z, additional, Meltzer, PS, additional, Bruhn, L, additional, and Laderman, S, additional
- Published
- 2005
- Full Text
- View/download PDF
19. Analysis of SNP-expression association matrices
- Author
-
Tsalenko, A., primary, Sharan, R., additional, Edvardsen, H., additional, Kristensen, V., additional, Boerresen-Dale, A.-L., additional, Ben-Dor, A., additional, and Yakhini, Z., additional
- Published
- 2005
- Full Text
- View/download PDF
20. MULTIPLEXING SCHEMES FOR GENERIC SNP GENOTYPING ASSAYS
- Author
-
SHARAN, R., primary, BEN-DOR, A., additional, and YAKHINI, Z., additional
- Published
- 2003
- Full Text
- View/download PDF
21. Cancer computational biology
- Author
-
Jurisica Igor and Yakhini Zohar
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Published
- 2011
- Full Text
- View/download PDF
22. GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists
- Author
-
Steinfeld Israel, Navon Roy, Eden Eran, Lipson Doron, and Yakhini Zohar
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results. Results GOrilla is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). GOrilla employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the top of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, GOrilla computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms. Conclusion GOrilla is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. GOrilla's unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. GOrilla is publicly available at: http://cbl-gorilla.cs.technion.ac.il
- Published
- 2009
- Full Text
- View/download PDF
23. Multilocus analysis of SNP and metabolic data within a given pathway
- Author
-
Fjeldstad Ståle, Lingjærde Ole, Grenaker Grethe, Faldaas Anne, Geisler Jurgen, Tsalenko Anya, Kristensen Vessela N, Yakhini Zohar, Lønning Per, and Børresen-Dale Anne-Lise
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Complex traits, which are under the influence of multiple and possibly interacting genes, have become a subject of new statistical methodological research. One of the greatest challenges facing human geneticists is the identification and characterization of susceptibility genes for common multifactorial diseases and their association to different quantitative phenotypic traits. Results Two types of data from the same metabolic pathway were used in the analysis: categorical measurements of 18 SNPs; and quantitative measurements of plasma levels of several steroids and their precursors. Using the combinatorial partitioning method we tested various thresholds for each metabolic trait and each individual SNP locus. One SNP in CYP19, 3UTR, two SNPs in CYP1B1 (R48G and A119S) and one in CYP1A1 (T461N) were significantly differently distributed between the high and low level metabolic groups. The leave one out cross validation method showed that 6 SNPs in concert make 65% correct prediction of phenotype. Further we used pattern recognition, computing the p-value by Monte Carlo simulation to identify sets of SNPs and physiological characteristics such as age and weight that contribute to a given metabolic level. Since the SNPs detected by both methods reside either in the same gene (CYP1B1) or in 3 different genes in immediate vicinity on chromosome 15 (CYP19, CYP11 and CYP1A1) we investigated the possibility that they form intragenic and intergenic haplotypes, which may jointly account for a higher activity in the pathway. We identified such haplotypes associated with metabolic levels. Conclusion The methods reported here may enable to study multiple low-penetrance genetic factors that together determine various quantitative phenotypic traits. Our preliminary data suggest that several genes coding for proteins involved in a common pathway, that happen to be located on common chromosomal areas and may form intragenic haplotypes, together account for a higher activity of the whole pathway.
- Published
- 2006
- Full Text
- View/download PDF
24. Detecting significant expression patterns in single-cell and spatial transcriptomics with a flexible computational approach.
- Author
-
Biran H, Hashimshony T, Lahav T, Efrat O, Mandel-Gutfreund Y, and Yakhini Z
- Subjects
- Humans, Cluster Analysis, Single-Cell Analysis methods, Algorithms, Gene Expression Profiling methods, Computational Biology methods, Transcriptome
- Abstract
Gene expression data holds the potential to shed light on multiple biological processes at once. However, data analysis methods for single cell sequencing mostly focus on finding cell clusters or the principal progression line of the data. Data analysis for spatial transcriptomics mostly addresses clustering and finding spatially variable genes. Existing data analysis methods are effective in finding the main data features, but they might miss less pronounced, albeit significant, processes, possibly involving a subset of the samples. In this work we present SPIRAL: Significant Process InfeRence ALgorithm. SPIRAL is based on Gaussian statistics to detect all statistically significant biological processes in single cell, bulk and spatial transcriptomics data. The algorithm outputs a list of structures, each defined by a set of genes working simultaneously in a specific population of cells. SPIRAL is unique in its flexibility: the structures are constructed by selecting subsets of genes and cells based on statistically significant and consistent differential expression. Every gene and every cell may be part of one structure, more or none. SPIRAL also provides several visual representations of structures and pathway enrichment information. We validated the statistical soundness of SPIRAL on synthetic datasets and applied it to single cell, spatial and bulk RNA-sequencing datasets. SPIRAL is available at https://spiral.technion.ac.il/ ., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
25. HIPI: Spatially resolved multiplexed protein expression inferred from H&E WSIs.
- Author
-
Zeira R, Anavy L, Yakhini Z, Rivlin E, and Freedman D
- Subjects
- Humans, Tumor Microenvironment, Image Processing, Computer-Assisted methods, Algorithms, Colorectal Neoplasms metabolism, Colorectal Neoplasms pathology, Computational Biology methods, Biomarkers, Tumor metabolism
- Abstract
Solid tumors are characterized by complex interactions between the tumor, the immune system and the microenvironment. These interactions and intra-tumor variations have both diagnostic and prognostic significance and implications. However, quantifying the underlying processes in patient samples requires expensive and complicated molecular experiments. In contrast, H&E staining is typically performed as part of the routine standard process, and is very cheap. Here we present HIPI (H&E Image Interpretation and Protein Expression Inference) for predicting cell marker expression from tumor H&E images. We process paired H&E and CyCIF images taken from serial sections of colorectal cancers to train our model. We show that our model accurately predicts the spatial distribution of several important cell markers, on both held-out tumor regions as well as new tumor samples taken from different patients. Moreover, using only the tissue image morphology, HIPI is able to colocalize the interactions between different cell types, further demonstrating its potential clinical significance., Competing Interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: All authors are employed by Verily Life Sciences., (Copyright: © 2024 Zeira et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2024
- Full Text
- View/download PDF
26. Quantifying allele-specific CRISPR editing activity with CRISPECTOR2.0.
- Author
-
Assa G, Kalter N, Rosenberg M, Beck A, Markovich O, Gontmakher T, Hendel A, and Yakhini Z
- Subjects
- Humans, Polymorphism, Single Nucleotide, Algorithms, Gene Editing methods, Alleles, CRISPR-Cas Systems, Software
- Abstract
Off-target effects present a significant impediment to the safe and efficient use of CRISPR-Cas genome editing. Since off-target activity is influenced by the genomic sequence, the presence of sequence variants leads to varying on- and off-target profiles among different alleles or individuals. However, a reliable tool that quantifies genome editing activity in an allelic context is not available. Here, we introduce CRISPECTOR2.0, an extended version of our previously published software tool CRISPECTOR, with an allele-specific editing activity quantification option. CRISPECTOR2.0 enables reference-free, allele-aware, precise quantification of on- and off-target activity, by using de novo sample-specific single nucleotide variant (SNV) detection and statistical-based allele-calling algorithms. We demonstrate CRISPECTOR2.0 efficacy in analyzing samples containing multiple alleles and quantifying allele-specific editing activity, using data from diverse cell types, including primary human cells, plants, and an original extensive human cell line database. We identified instances where an SNV induced changes in the protospacer adjacent motif sequence, resulting in allele-specific editing. Intriguingly, differential allelic editing was also observed in regions carrying distal SNVs, hinting at the involvement of additional epigenetic factors. Our findings highlight the importance of allele-specific editing measurement as a milestone in the adaptation of efficient, accurate, and safe personalized genome editing., (© The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2024
- Full Text
- View/download PDF
27. Differential Expression Analysis of Cutaneous Squamous Cell Carcinoma and Basal Cell Carcinoma Proteomic Profiles Sampled with Electroporation-Based Biopsy.
- Author
-
Vitkin E, Wise J, Berl A, Shir-Az O, Gabay B, Singh A, Kravtsov V, Yakhini Z, Shalom A, and Golberg A
- Abstract
Clinical misdiagnosis between cutaneous squamous cell carcinoma (cSCC) and basal cell carcinoma (BCC) affects treatment plans. We report a tissue sampling approach with molecular biopsy using electroporation. This method, coined electroporation-based biopsy (e-biopsy), enables nondestructive nonthermal permeabilization of cells in the skin for vacuum-assisted extraction of biomolecules. We used e-biopsy for ex vivo proteome extraction from 3 locations per patient in 21 cSCC, 20 BCC, and 7 actinic keratosis human skin samples. Using liquid chromatography with tandem mass spectrometry, we identified 5966 proteins observed with nonzero intensity in at least 1 sample. The intrapatient Pearson correlation of 0.888 ± 0.065 for patients with BCC, 0.858 ± 0.077 for patients with cSCC, and 0.876 ± 0.116 for those with solar actinic keratosis indicates high consistency of the e-biopsy sampling. The mass spectra presented significantly different proteome profiles for cSCC, BCC, and solar actinic keratosis, with several hundreds of proteins differentially expressed. Notably, our study showed that proteomes sampled with e-biopsy from cSCC and BCC lesions are different and that proteins of CRNN , SULT1E1 , and ITPK1 genes are significantly overexpressed in BCC in comparison with those in cSCC. Our results provide evidence that the e-biopsy approach could potentially be used as a tool to support cutaneous lesions classification with molecular pathology., (© 2024 The Authors.)
- Published
- 2024
- Full Text
- View/download PDF
28. High-throughput lipidomic profiles sampled with electroporation-based biopsy differentiate healthy skin, cutaneous squamous cell carcinoma, and basal cell carcinoma.
- Author
-
Louie L, Wise J, Berl A, Shir-Az O, Kravtsov V, Yakhini Z, Shalom A, Golberg A, and Vitkin E
- Subjects
- Humans, Biopsy, Female, Male, Middle Aged, Aged, Lipids analysis, Tandem Mass Spectrometry methods, Carcinoma, Basal Cell pathology, Carcinoma, Basal Cell metabolism, Carcinoma, Basal Cell diagnosis, Skin Neoplasms pathology, Skin Neoplasms metabolism, Carcinoma, Squamous Cell pathology, Carcinoma, Squamous Cell metabolism, Carcinoma, Squamous Cell chemistry, Lipidomics methods, Skin pathology, Skin metabolism, Skin chemistry, Electroporation methods
- Abstract
Background: The incidence rates of cutaneous squamous cell carcinoma (cSCC) and basal cell carcinoma (BCC) skin cancers are rising, while the current diagnostic process is time-consuming. We describe the development of a novel approach to high-throughput sampling of tissue lipids using electroporation-based biopsy, termed e-biopsy. We report on the ability of the e-biopsy technique to harvest large amounts of lipids from human skin samples., Materials and Methods: Here, 168 lipids were reliably identified from 12 patients providing a total of 13 samples. The extracted lipids were profiled with ultra-performance liquid chromatography and tandem mass spectrometry (UPLC-MS-MS) providing cSCC, BCC, and healthy skin lipidomic profiles., Results: Comparative analysis identified 27 differentially expressed lipids (p < 0.05). The general profile trend is low diglycerides in both cSCC and BCC, high phospholipids in BCC, and high lyso-phospholipids in cSCC compared to healthy skin tissue samples., Conclusion: The results contribute to the growing body of knowledge that can potentially lead to novel insights into these skin cancers and demonstrate the potential of the e-biopsy technique for the analysis of lipidomic profiles of human skin tissues., (© 2024 The Authors. Skin Research and Technology published by John Wiley & Sons Ltd.)
- Published
- 2024
- Full Text
- View/download PDF
29. Efficient DNA-based data storage using shortmer combinatorial encoding.
- Author
-
Preuss I, Rosenberg M, Yakhini Z, and Anavy L
- Subjects
- Sequence Analysis, DNA methods, Algorithms, Information Storage and Retrieval, DNA genetics, DNA Replication
- Abstract
Data storage in DNA has recently emerged as a promising archival solution, offering space-efficient and long-lasting digital storage solutions. Recent studies suggest leveraging the inherent redundancy of synthesis and sequencing technologies by using composite DNA alphabets. A major challenge of this approach involves the noisy inference process, obstructing large composite alphabets. This paper introduces a novel approach for DNA-based data storage, offering, in some implementations, a 6.5-fold increase in logical density over standard DNA-based storage systems, with near-zero reconstruction error. Combinatorial DNA encoding uses a set of clearly distinguishable DNA shortmers to construct large combinatorial alphabets, where each letter consists of a subset of shortmers. We formally define various combinatorial encoding schemes and investigate their theoretical properties. These include information density and reconstruction probabilities, as well as required synthesis and sequencing multiplicities. We then propose an end-to-end design for a combinatorial DNA-based data storage system, including encoding schemes, two-dimensional (2D) error correction codes, and reconstruction algorithms, under different error regimes. We performed simulations and show, for example, that the use of 2D Reed-Solomon error correction has significantly improved reconstruction rates. We validated our approach by constructing two combinatorial sequences using Gibson assembly, imitating a 4-cycle combinatorial synthesis process. We confirmed the successful reconstruction, and established the robustness of our approach for different error types. Subsampling experiments supported the important role of sampling rate and its effect on the overall performance. Our work demonstrates the potential of combinatorial shortmer encoding for DNA-based data storage and describes some theoretical research questions and technical challenges. Combining combinatorial principles with error-correcting strategies, and investing in the development of DNA synthesis technologies that efficiently support combinatorial synthesis, can pave the way to efficient, error-resilient DNA-based storage solutions., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
30. A Cyclic Permutation Approach to Removing Spatial Dependency between Clustered Gene Ontology Terms.
- Author
-
Rapoport R, Greenberg A, Yakhini Z, and Simon I
- Abstract
Traditional gene set enrichment analysis falters when applied to large genomic domains, where neighboring genes often share functions. This spatial dependency creates misleading enrichments, mistaking mere physical proximity for genuine biological connections. Here we present Spatial Adjusted Gene Ontology (SAGO), a novel cyclic permutation-based approach, to tackle this challenge. SAGO separates enrichments due to spatial proximity from genuine biological links by incorporating the genes' spatial arrangement into the analysis. We applied SAGO to various datasets in which the identified genomic intervals are large, including replication timing domains, large H3K9me3 and H3K27me3 domains, HiC compartments and lamina-associated domains (LADs). Intriguingly, applying SAGO to prostate cancer samples with large copy number alteration (CNA) domains eliminated most of the enriched GO terms, thus helping to accurately identify biologically relevant gene sets linked to oncogenic processes, free from spatial bias.
- Published
- 2024
- Full Text
- View/download PDF
31. Exploring multisite heterogeneity of human basal cell carcinoma proteome and transcriptome.
- Author
-
Berl A, Shir-Az O, Genish I, Biran H, Mann D, Singh A, Wise J, Kravtsov V, Kidron D, Golberg A, Vitkin E, Yakhini Z, and Shalom A
- Subjects
- Humans, Transcriptome, Proteome genetics, RNA, Carcinoma, Basal Cell pathology, Skin Neoplasms pathology
- Abstract
Basal cell carcinoma (BCC) is the most common type of skin cancer. Due to multiple, potential underlying molecular tumor aberrations, clinical treatment protocols are not well-defined. This study presents multisite molecular heterogeneity profiles of human BCC based on RNA and proteome profiling. Three areas from lesions excised from 9 patients were analyzed. The focus was gene expression profiles based on proteome and RNA measurements of intra-tumor heterogeneity from the same patient and inter-tumor heterogeneity in nodular, infiltrative, and superficial BCC tumor subtypes from different patients. We observed significant overlap in intra- and inter-tumor variability of proteome and RNA expression profiles, showing significant multisite heterogeneity of protein expression in the BCC tumors. Inter-subtype analysis has also identified unique proteins for each BCC subtype. This profiling leads to a deeper understanding of BCC molecular heterogeneity and potentially contributes to developing new sampling tools for personalized diagnostics therapeutic approaches to BCC., Competing Interests: The authors have declared that no competing interests exist., (Copyright: © 2023 Berl et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
32. Analysis of Spatial Molecular Data.
- Author
-
Levy-Jurgenson A, Tekpli X, Kristensen VN, and Yakhini Z
- Subjects
- Humans, Phenotype, Algorithms, Microscopy methods, Neoplasms diagnosis, Neoplasms genetics
- Abstract
Digital analysis of pathology whole-slide images has been recently gaining interest in the context of cancer diagnosis and treatment. In particular, deep learning methods have demonstrated significant potential in supporting pathology analysis, recently detecting molecular traits never before recognized in pathology H&E whole-slide images (WSIs). Alongside these advancements in the digital analysis of WSIs, it is becoming increasingly evident that both spatial and overall tumor heterogeneity may be significant determinants of cancer prognosis and treatment outcome. In this chapter, we describe methods that aim to support these two elements. We describe both an end-to-end deep learning pipeline for producing limited spatial transcriptomics from WSIs with associated bulk gene expression data, as well as an algorithm for quantifying spatial tumor heterogeneity based on the results of this pipeline., (© 2023. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.)
- Published
- 2023
- Full Text
- View/download PDF
33. Nondestructive protein sampling with electroporation facilitates profiling of spatial differential protein expression in breast tumors in vivo.
- Author
-
Vitkin E, Singh A, Wise J, Ben-Elazar S, Yakhini Z, and Golberg A
- Subjects
- Animals, Electroporation, Mice, Neoplasm Proteins, Precision Medicine, Neoplasms pathology, Proteomics
- Abstract
Excision tissue biopsy, while central to cancer treatment and precision medicine, presents risks to the patient and does not provide a sufficiently broad and faithful representation of the heterogeneity of solid tumors. Here we introduce e-biopsy-a novel concept for molecular profiling of solid tumors using molecular sampling with electroporation. As e-biopsy provides access to the molecular composition of a solid tumor by permeabilization of the cell membrane, it facilitates tumor diagnostics without tissue resection. Furthermore, thanks to its non tissue destructive characteristics, e-biopsy enables probing the solid tumor multiple times in several distinct locations in the same procedure, thereby enabling the spatial profiling of tumor molecular heterogeneity.We demonstrate e-biopsy in vivo, using the 4T1 breast cancer model in mice to assess its performance, as well as the inferred spatial differential protein expression. In particular, we show that proteomic profiles obtained via e-biopsy in vivo distinguish the tumors from healthy breast tissue and reflect spatial tumor differential protein expression. E-biopsy provides a completely new molecular sampling modality for solid tumors molecular cartography, providing information that potentially enables more rapid and sensitive detection at lesser risk, as well as more precise personalized medicine., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
34. Generating Alerts from Breathing Pattern Outliers.
- Author
-
Benmussa C, Cauchard JR, and Yakhini Z
- Subjects
- Humans, Models, Statistical, Rest, Respiration, Respiratory Rate
- Abstract
Analysing human physiological data allows access to the health state and the state of mind of the subject individual. Whenever a person is sick, having a panic attack, happy or scared, physiological signals will be different. In terms of physiological signals, we focus, in this manuscript, on monitoring breathing patterns. The scope can be extended to also address heart rate and other variables. We describe an analysis of breathing rate patterns during activities including resting, walking, running and watching a movie. We model normal breathing behaviours by statistically analysing signals, processed to represent quantities of interest. We consider moving maximum/minimum, the amplitude and the Fourier transform of the respiration signal, working with different window sizes. We then learn a statistical model for the basal behaviour, per individual, and detect outliers. When outliers are detected, a system that incorporates our approach would send a visible signal through a smart garment or through other means. We describe alert generation performance in two datasets-one literature dataset and one collected as a field study for this work. In particular, when learning personal rest distributions for the breathing signals of 14 subjects, we see alerts generated more often when the same individual is running than when they are tested in rest conditions.
- Published
- 2022
- Full Text
- View/download PDF
35. Electroporation-based proteome sampling ex vivo enables the detection of brain melanoma protein signatures in a location proximate to visible tumor margins.
- Author
-
Genish I, Gabay B, Ruban A, Goldshmit Y, Singh A, Wise J, Levkov K, Shalom A, Vitkin E, Yakhini Z, and Golberg A
- Subjects
- Animals, Brain pathology, Electroporation, Margins of Excision, Mice, Proteomics, Melanoma pathology, Proteome
- Abstract
A major concern in tissue biopsies with a needle is missing the most lethal clone of a tumor, leading to a false negative result. This concern is well justified, since needle-based biopsies gather tissue information limited to needle size. In this work, we show that molecular harvesting with electroporation, e-biopsy, could increase the sampled tissue volume in comparison to tissue sampling by a needle alone. Suggested by numerical models of electric fields distribution, the increased sampled volume is achieved by electroporation-driven permeabilization of cellular membranes in the tissue around the sampling needle. We show that proteomic profiles, sampled by e-biopsy from the brain tissue, ex vivo, at 0.5mm distance outside the visible margins of mice brain melanoma metastasis, have protein patterns similar to melanoma tumor center and different from the healthy brain tissue. In addition, we show that e-biopsy probed proteome signature differentiates between melanoma tumor center and healthy brain in mice. This study suggests that e-biopsy could provide a novel tool for a minimally invasive sampling of molecules in tissue in larger volumes than achieved with traditional needle biopsies., Competing Interests: A patent application was filed to protect the e-biopsy technology described herein as invented by AG, JS, and ZY. KL and AG have a patent application for the high-voltage pulsed electric field generator. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
- Published
- 2022
- Full Text
- View/download PDF
36. On the stability of log-rank test under labeling errors.
- Author
-
Galili B, Samohi A, and Yakhini Z
- Subjects
- Uncertainty, Algorithms
- Abstract
Motivation: Log-rank test is a widely used test that serves to assess the statistical significance of observed differences in survival, when comparing two or more groups. The log-rank test is based on several assumptions that support the validity of the calculations. It is naturally assumed, implicitly, that no errors occur in the labeling of the samples. That is, the mapping between samples and groups is perfectly correct. In this work, we investigate how test results may be affected when considering some errors in the original labeling., Results: We introduce and define the uncertainty that arises from labeling errors in log-rank test. In order to deal with this uncertainty, we develop a novel algorithm for efficiently calculating a stability interval around the original log-rank P-value and prove its correctness. We demonstrate our algorithm on several datasets., Availability and Implementation: We provide a Python implementation, called LoRSI, for calculating the stability interval using our algorithm https://github.com/YakhiniGroup/LoRSI., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2021. Published by Oxford University Press.)
- Published
- 2021
- Full Text
- View/download PDF
37. Erratum to: Assessing heterogeneity in spatial data using the HTA index with applications to spatial transcriptomics and imaging.
- Author
-
Levy-Jurgenson A, Tekpli X, and Yakhini Z
- Published
- 2021
- Full Text
- View/download PDF
38. Assessing heterogeneity in spatial data using the HTA index with applications to spatial transcriptomics and imaging.
- Author
-
Levy-Jurgenson A, Tekpli X, and Yakhini Z
- Subjects
- Humans, Technology Assessment, Biomedical, Genomics, Neuroimaging, Transcriptome, Neoplasms
- Abstract
Motivation: Tumour heterogeneity is being increasingly recognized as an important characteristic of cancer and as a determinant of prognosis and treatment outcome. Emerging spatial transcriptomics data hold the potential to further our understanding of tumour heterogeneity and its implications. However, existing statistical tools are not sufficiently powerful to capture heterogeneity in the complex setting of spatial molecular biology., Results: We provide a statistical solution, the HeTerogeneity Average index (HTA), specifically designed to handle the multivariate nature of spatial transcriptomics. We prove that HTA has an approximately normal distribution, therefore lending itself to efficient statistical assessment and inference. We first demonstrate that HTA accurately reflects the level of heterogeneity in simulated data. We then use HTA to analyze heterogeneity in two cancer spatial transcriptomics datasets: spatial RNA sequencing by 10x Genomics and spatial transcriptomics inferred from H&E. Finally, we demonstrate that HTA also applies to 3D spatial data using brain MRI. In spatial RNA sequencing, we use a known combination of molecular traits to assert that HTA aligns with the expected outcome for this combination. We also show that HTA captures immune-cell infiltration at multiple resolutions. In digital pathology, we show how HTA can be used in survival analysis and demonstrate that high levels of heterogeneity may be linked to poor survival. In brain MRI, we show that HTA differentiates between normal ageing, Alzheimer's disease and two tumours. HTA also extends beyond molecular biology and medical imaging, and can be applied to many domains, including GIS., Availability and Implementation: Python package and source code are available at: https://github.com/alonalj/hta., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2021. Published by Oxford University Press.)
- Published
- 2021
- Full Text
- View/download PDF
39. A broad analysis of splicing regulation in yeast using a large library of synthetic introns.
- Author
-
Schirman D, Yakhini Z, Pilpel Y, and Dahan O
- Subjects
- Computational Biology methods, Evolution, Molecular, Genes, Fungal, High-Throughput Nucleotide Sequencing, Introns, RNA, Messenger genetics, RNA Splicing, Saccharomyces cerevisiae genetics
- Abstract
RNA splicing is a key process in eukaryotic gene expression, in which an intron is spliced out of a pre-mRNA molecule to eventually produce a mature mRNA. Most intron-containing genes are constitutively spliced, hence efficient splicing of an intron is crucial for efficient regulation of gene expression. Here we use a large synthetic oligo library of ~20,000 variants to explore how different intronic sequence features affect splicing efficiency and mRNA expression levels in S. cerevisiae. Introns are defined by three functional sites, the 5' donor site, the branch site, and the 3' acceptor site. Using a combinatorial design of synthetic introns, we demonstrate how non-consensus splice site sequences in each of these sites affect splicing efficiency. We then show that S. cerevisiae splicing machinery tends to select alternative 3' splice sites downstream of the original site, and we suggest that this tendency created a selective pressure, leading to the avoidance of cryptic splice site motifs near introns' 3' ends. We further use natural intronic sequences from other yeast species, whose splicing machineries have diverged to various extents, to show how intron architectures in the various species have been adapted to the organism's splicing machinery. We suggest that the observed tendency for cryptic splicing is a result of a loss of a specific splicing factor, U2AF1. Lastly, we show that synthetic sequences containing two introns give rise to alternative RNA isoforms in S. cerevisiae, demonstrating that merely a synthetic fusion of two introns might be suffice to facilitate alternative splicing in yeast. Our study reveals novel mechanisms by which introns are shaped in evolution to allow cells to regulate their transcriptome. In addition, it provides a valuable resource to study the regulation of constitutive and alternative splicing in a model organism., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2021
- Full Text
- View/download PDF
40. CRISPECTOR provides accurate estimation of genome editing translocation and off-target activity from comparative NGS data.
- Author
-
Amit I, Iancu O, Levy-Jurgenson A, Kurgan G, McNeill MS, Rettig GR, Allen D, Breier D, Ben Haim N, Wang Y, Anavy L, Hendel A, and Yakhini Z
- Subjects
- Algorithms, DNA-Binding Proteins genetics, HEK293 Cells, Homeodomain Proteins genetics, Humans, Nuclear Proteins genetics, Software, Transcription Factors genetics, CRISPR-Cas Systems, Computational Biology methods, Gene Editing methods
- Abstract
Controlling off-target editing activity is one of the central challenges in making CRISPR technology accurate and applicable in medical practice. Current algorithms for analyzing off-target activity do not provide statistical quantification, are not sufficiently sensitive in separating signal from noise in experiments with low editing rates, and do not address the detection of translocations. Here we present CRISPECTOR, a software tool that supports the detection and quantification of on- and off-target genome-editing activity from NGS data using paired treatment/control CRISPR experiments. In particular, CRISPECTOR facilitates the statistical analysis of NGS data from multiplex-PCR comparative experiments to detect and quantify adverse translocation events. We validate the observed results and show independent evidence of the occurrence of translocations in human cell lines, after genome editing. Our methodology is based on a statistical model comparison approach leading to better false-negative rates in sites with weak yet significant off-target activity.
- Published
- 2021
- Full Text
- View/download PDF
41. SOLQC: Synthetic Oligo Library Quality Control tool.
- Author
-
Sabary O, Orlev Y, Shafir R, Anavy L, Yaakobi E, and Yakhini Z
- Subjects
- Gene Library, Quality Control, Synthetic Biology, Libraries, Software
- Abstract
Motivation: Recent years have seen a growing number and an expanding scope of studies using synthetic oligo libraries for a range of applications in synthetic biology. As experiments are growing by numbers and complexity, analysis tools can facilitate quality control and support better assessment and inference., Results: We present a novel analysis tool, called SOLQC, which enables fast and comprehensive analysis of synthetic oligo libraries, based on NGS analysis performed by the user. SOLQC provides statistical information such as the distribution of variant representation, different error rates and their dependence on sequence or library properties. SOLQC produces graphical reports from the analysis, in a flexible format. We demonstrate SOLQC by analyzing literature libraries. We also discuss the potential benefits and relevance of the different components of the analysis., Availability and Implementation: SOLQC is a free software for non-commercial use, available at https://app.gitbook.com/@yoav-orlev/s/solqc/. For commercial use please contact the authors., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)
- Published
- 2021
- Full Text
- View/download PDF
42. Overcoming the design, build, test bottleneck for synthesis of nonrepetitive protein-RNA cassettes.
- Author
-
Katz N, Tripto E, Granik N, Goldberg S, Atar O, Yakhini Z, Orenstein Y, and Amit R
- Subjects
- Attachment Sites, Microbiological genetics, Binding Sites genetics, Cell Line, Tumor, Escherichia coli genetics, Gene Library, Humans, Machine Learning, Plasmids genetics, Allolevivirus genetics, Capsid Proteins metabolism, Escherichia coli virology, Levivirus genetics, RNA metabolism
- Abstract
We apply an oligo-library and machine learning-approach to characterize the sequence and structural determinants of binding of the phage coat proteins (CPs) of bacteriophages MS2 (MCP), PP7 (PCP), and Qβ (QCP) to RNA. Using the oligo library, we generate thousands of candidate binding sites for each CP, and screen for binding using a high-throughput dose-response Sort-seq assay (iSort-seq). We then apply a neural network to expand this space of binding sites, which allowed us to identify the critical structural and sequence features for binding of each CP. To verify our model and experimental findings, we design several non-repetitive binding site cassettes and validate their functionality in mammalian cells. We find that the binding of each CP to RNA is characterized by a unique space of sequence and structural determinants, thus providing a more complete description of CP-RNA interaction as compared with previous low-throughput findings. Finally, based on the binding spaces we demonstrate a computational tool for the successful design and rapid synthesis of functional non-repetitive binding-site cassettes.
- Published
- 2021
- Full Text
- View/download PDF
43. miRNA normalization enables joint analysis of several datasets to increase sensitivity and to reveal novel miRNAs differentially expressed in breast cancer.
- Author
-
Ben-Elazar S, Aure MR, Jonsdottir K, Leivonen SK, Kristensen VN, Janssen EAM, Kleivi Sahlberg K, Lingjærde OC, and Yakhini Z
- Subjects
- Algorithms, Biomarkers metabolism, Biomarkers, Tumor genetics, Cell Line, Tumor, Computer Simulation, Estrogen Receptor alpha metabolism, Female, Humans, MCF-7 Cells, Oligonucleotide Array Sequence Analysis, Programming Languages, RNA, Messenger genetics, Breast Neoplasms genetics, Breast Neoplasms metabolism, Gene Expression Profiling, Gene Expression Regulation, Neoplastic, MicroRNAs metabolism
- Abstract
Different miRNA profiling protocols and technologies introduce differences in the resulting quantitative expression profiles. These include differences in the presence (and measurability) of certain miRNAs. We present and examine a method based on quantile normalization, Adjusted Quantile Normalization (AQuN), to combine miRNA expression data from multiple studies in breast cancer into a single joint dataset for integrative analysis. By pooling multiple datasets, we obtain increased statistical power, surfacing patterns that do not emerge as statistically significant when separately analyzing these datasets. To merge several datasets, as we do here, one needs to overcome both technical and batch differences between these datasets. We compare several approaches for merging and jointly analyzing miRNA datasets. We investigate the statistical confidence for known results and highlight potential new findings that resulted from the joint analysis using AQuN. In particular, we detect several miRNAs to be differentially expressed in estrogen receptor (ER) positive versus ER negative samples. In addition, we identify new potential biomarkers and therapeutic targets for both clinical groups. As a specific example, using the AQuN-derived dataset we detect hsa-miR-193b-5p to have a statistically significant over-expression in the ER positive group, a phenomenon that was not previously reported. Furthermore, as demonstrated by functional assays in breast cancer cell lines, overexpression of hsa-miR-193b-5p in breast cancer cell lines resulted in decreased cell viability in addition to inducing apoptosis. Together, these observations suggest a novel functional role for this miRNA in breast cancer. Packages implementing AQuN are provided for Python and Matlab: https://github.com/YakhiniGroup/PyAQN., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2021
- Full Text
- View/download PDF
44. Efficient gene expression signature for a breast cancer immuno-subtype.
- Author
-
Galili B, Tekpli X, Kristensen VN, and Yakhini Z
- Subjects
- Disease-Free Survival, Female, Humans, Survival Rate, Biomarkers, Tumor immunology, Breast Neoplasms classification, Breast Neoplasms immunology, Breast Neoplasms mortality, Gene Expression Regulation, Neoplastic immunology, Transcriptome immunology
- Abstract
Motivation and Background: The patient's immune system plays an important role in cancer pathogenesis, prognosis and susceptibility to treatment. Recent work introduced an immune related breast cancer. This subtyping is based on the expression profiles of the tumor samples. Specifically, one study showed that analyzing 658 genes can lead to a signature for subtyping tumors. Furthermore, this classification is independent of other known molecular and clinical breast cancer subtyping. Finally, that study shows that the suggested subtyping has significant prognostic implications., Results: In this work we develop an efficient signature associated with survival in breast cancer. We begin by developing a more efficient signature for the above-mentioned breast cancer immune-based subtyping. This signature represents better performance with a set of 579 genes that obtains an improved Area Under Curve (AUC). We then determine a set of 193 genes and an associated classification rule that yield subtypes with a much stronger statistically significant (log rank p-value < 2 × 10-4 in a test cohort) difference in survival. To obtain these improved results we develop a feature selection process that matches the high-dimensionality character of the data and the dual performance objectives, driven by survival and anchored by the literature subtyping., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2021
- Full Text
- View/download PDF
45. Spatial transcriptomics inferred from pathology whole-slide images links tumor heterogeneity to survival in breast and lung cancer.
- Author
-
Levy-Jurgenson A, Tekpli X, Kristensen VN, and Yakhini Z
- Subjects
- Female, Humans, Deep Learning, Gene Expression, MicroRNAs genetics, MicroRNAs metabolism, RNA, Messenger genetics, RNA, Messenger metabolism, Survival Rate, Breast Neoplasms diagnostic imaging, Breast Neoplasms genetics, Breast Neoplasms mortality, Genetic Heterogeneity, Image Processing, Computer-Assisted methods, Lung Neoplasms diagnostic imaging, Lung Neoplasms genetics, Lung Neoplasms mortality
- Abstract
Digital analysis of pathology whole-slide images is fast becoming a game changer in cancer diagnosis and treatment. Specifically, deep learning methods have shown great potential to support pathology analysis, with recent studies identifying molecular traits that were not previously recognized in pathology H&E whole-slide images. Simultaneous to these developments, it is becoming increasingly evident that tumor heterogeneity is an important determinant of cancer prognosis and susceptibility to treatment, and should therefore play a role in the evolving practices of matching treatment protocols to patients. State of the art diagnostic procedures, however, do not provide automated methods for characterizing and/or quantifying tumor heterogeneity, certainly not in a spatial context. Further, existing methods for analyzing pathology whole-slide images from bulk measurements require many training samples and complex pipelines. Our work addresses these two challenges. First, we train deep learning models to spatially resolve bulk mRNA and miRNA expression levels on pathology whole-slide images (WSIs). Our models reach up to 0.95 AUC on held-out test sets from two cancer cohorts using a simple training pipeline and a small number of training samples. Using the inferred gene expression levels, we further develop a method to spatially characterize tumor heterogeneity. Specifically, we produce tumor molecular cartographies and heterogeneity maps of WSIs and formulate a heterogeneity index (HTI) that quantifies the level of heterogeneity within these maps. Applying our methods to breast and lung cancer slides, we show a significant statistical link between heterogeneity and survival. Our methods potentially open a new and accessible approach to investigating tumor heterogeneity and other spatial molecular properties and their link to clinical characteristics, including treatment susceptibility and survival.
- Published
- 2020
- Full Text
- View/download PDF
46. Increasing CRISPR Efficiency and Measuring Its Specificity in HSPCs Using a Clinically Relevant System.
- Author
-
Shapiro J, Iancu O, Jacobi AM, McNeill MS, Turk R, Rettig GR, Amit I, Tovin-Recht A, Yakhini Z, Behlke MA, and Hendel A
- Abstract
Genome editing of human cluster of differentiation 34
+ (CD34+ ) hematopoietic stem and progenitor cells (HSPCs) holds great therapeutic potential. This study aimed to optimize on-target, ex vivo genome editing using the CRISPR-Cas9 system in CD34+ HSPCs and to create a clear workflow for precise identification of off-target effects. Modified synthetic guide RNAs (gRNAs), either 2-part gRNA or single-guide RNA (sgRNA), were delivered to CD34+ HSPCs as part of ribonucleoprotein (RNP) complexes, targeting therapeutically relevant genes. The addition of an Alt-R electroporation enhancer (EE), a short, single-stranded oligodeoxynucleotide (ssODN), significantly increased editing efficiency in CD34+ HSPCs. Notably, similar editing improvement was observed when excess gRNA over Cas9 protein was used, providing a DNA-free alternative suitable for therapeutic applications. Furthermore, we demonstrated that sgRNA may be preferable over 2-part gRNA in a locus-specific manner. Finally, we present a clear experimental framework suitable for the unbiased identification of bona fide off-target sites by Genome-Wide, Unbiased Identification of Double-Strand Breaks (DSBs) Enabled by Sequencing (GUIDE-seq), as well as subsequent editing quantification in CD34+ HSPCs using rhAmpSeq. These findings may facilitate the implementation of genome editing in CD34+ HSPCs for research and therapy and can be adapted for other hematopoietic cells., (© 2020 The Author(s).)- Published
- 2020
- Full Text
- View/download PDF
47. Distributed flux balance analysis simulations of serial biomass fermentation by two organisms.
- Author
-
Vitkin E, Gillis A, Polikovsky M, Bender B, Golberg A, and Yakhini Z
- Subjects
- Hydrolysis, Rhodophyta metabolism, Ulva metabolism, Zea mays metabolism, Computer Simulation, Computer-Aided Design, Escherichia coli growth & development, Ethanol metabolism, Fermentation, Models, Biological, Saccharomyces cerevisiae growth & development
- Abstract
Intelligent biorefinery design that addresses both the composition of the biomass feedstock as well as fermentation microorganisms could benefit from dedicated tools for computational simulation and computer-assisted optimization. Here we present the BioLego Vn2.0 framework, based on Microsoft Azure Cloud, which supports large-scale simulations of biomass serial fermentation processes by two different organisms. BioLego enables the simultaneous analysis of multiple fermentation scenarios and the comparison of fermentation potential of multiple feedstock compositions. Thanks to the effective use of cloud computing it further allows resource intensive analysis and exploration of media and organism modifications. We use BioLego to obtain biological and validation results, including (1) exploratory search for the optimal utilization of corn biomasses-corn cobs, corn fiber and corn stover-in fermentation biorefineries; (2) analysis of the possible effects of changes in the composition of K. alvarezi biomass on the ethanol production yield in an anaerobic two-step process (S. cerevisiae followed by E. coli); (3) analysis of the impact, on the estimated ethanol production yield, of knocking out single organism reactions either in one or in both organisms in an anaerobic two-step fermentation process of Ulva sp. into ethanol (S. cerevisiae followed by E. coli); and (4) comparison of several experimentally measured ethanol fermentation rates with the predictions of BioLego., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2020
- Full Text
- View/download PDF
48. Design and Analysis of Offshore Macroalgae Biorefineries.
- Author
-
Golberg A, Liberzon A, Vitkin E, and Yakhini Z
- Subjects
- Algorithms, Biomass, Geography, Marine Biology, Metagenomics, Biotechnology, Fermentation, Models, Theoretical, Seaweed
- Abstract
Displacing fossil fuels and their derivatives with renewables, and increasing sustainable food production are among the major challenges facing the world in the coming decades. A possible, sustainable direction for addressing this challenge is the production of biomass and the conversion of this biomass to the required products through a complex system coined biorefinery. Terrestrial biomass and microalgae are possible sources; however, concerns over net energy balance, potable water use, environmental hazards, and uncertainty in the processing technologies raise questions regarding their actual potential to meet the anticipated food, feed, and energy challenges in a sustainable way. Alternative sustainable sources for biorefineries are macroalgae grown and processed offshore. However, implementation of the offshore biorefineries requires detailed analysis of their technological, economic, and environmental performance. In this chapter, the basic principles of marine biorefineries design are shown. The methods to integrate thermodynamic efficiency, investment, and environmental aspects are discussed. The performance improvement by development of new cultivation methods that fit macroalgae physiology and development of new fermentation methods that address macroalgae unique chemical composition is shown.
- Published
- 2020
- Full Text
- View/download PDF
49. Molecular harvesting with electroporation for tissue profiling.
- Author
-
Golberg A, Sheviryov J, Solomon O, Anavy L, and Yakhini Z
- Subjects
- Animals, Female, Gene Ontology, Genomics, Hep G2 Cells, Humans, Kidney pathology, Liver pathology, Mice, Mice, Nude, Neoplasm Proteins metabolism, Neoplasms genetics, Neoplasms metabolism, Neoplasms pathology, Proteomics, RNA, Neoplasm metabolism, Transplantation, Heterologous, Electroporation methods, Kidney metabolism, Liver metabolism
- Abstract
Recent developments in personalized medicine are based on molecular measurement steps that guide personally adjusted medical decisions. A central approach to molecular profiling consists of measuring DNA, RNA, and/or proteins in tissue samples, most notably in and around tumors. This measurement yields molecular biomarkers that are potentially predictive of response and of tumor type. Current methods in cancer therapy mostly use tissue biopsy as the starting point of molecular profiling. Tissue biopsies involve a physical resection of a small tissue sample, leading to localized tissue injury, bleeding, inflammation and stress, as well as to an increased risk of metastasis. Here we developed a technology for harvesting biomolecules from tissues using electroporation. We show that tissue electroporation, achieved using a combination of high-voltage short pulses, 50 pulses 500 V cm
-1 , 30 µs, 1 Hz, with low-voltage long pulses 50 pulses 50 V cm-1 , 10 ms, delivered at 1 Hz, allows for tissue-specific extraction of RNA and proteins. We specifically tested RNA and protein extraction from excised kidney and liver samples and from excised HepG2 tumors in mice. Further in vivo development of extraction methods based on electroporation can drive novel approaches to the molecular profiling of tumors and of tumor environment and to related diagnosis practices.- Published
- 2019
- Full Text
- View/download PDF
50. Data storage in DNA with fewer synthesis cycles using composite DNA letters.
- Author
-
Anavy L, Vaknin I, Atar O, Amit R, and Yakhini Z
- Subjects
- Algorithms, Base Sequence, High-Throughput Nucleotide Sequencing methods, Humans, Information Storage and Retrieval, Sequence Analysis, DNA methods, DNA chemical synthesis
- Abstract
The density and long-term stability of DNA make it an appealing storage medium, particularly for long-term data archiving. Existing DNA storage technologies involve the synthesis and sequencing of multiple nominally identical molecules in parallel, resulting in information redundancy. We report the development of encoding and decoding methods that exploit this redundancy using composite DNA letters. A composite DNA letter is a representation of a position in a sequence that consists of a mixture of all four DNA nucleotides in a predetermined ratio. Our methods encode data using fewer synthesis cycles. We encode 6.4 MB into composite DNA, with distinguishable composition medians, using 20% fewer synthesis cycles per unit of data, as compared to previous reports. We also simulate encoding with larger composite alphabets, with distinguishable composition deciles, to show that 75% fewer synthesis cycles are potentially sufficient. We describe applicable error-correcting codes and inference methods, and investigate error patterns in the context of composite DNA letters.
- Published
- 2019
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.