11 results on '"Durif G"'
Search Results
2. Noninvasive prenatal diagnosis of genetic diseases induced by triplet repeat expansion by linked read haplotyping and Bayesian approach
- Author
-
Liautard-Haag, C., Durif, G., VanGoethem, C., Baux, D., Louis, A., Cayrefourcq, L., Lamairia, M., Willems, M., Zordan, C., Dorian, V., Rooryck, C., Goizet, C., Chaussenot, A., Monteil, L., Calvas, P., Miry, C., Favre, R., Le Boette, E., Fradin, M., Roux, A. F., Cossée, M., Koenig, M., Alix-Panabière, C., Guissart, C., and Vincent, M. C.
- Published
- 2022
- Full Text
- View/download PDF
3. High Dimensional Classification with combined Adaptive Sparse PLS and Logistic Regression
- Author
-
Durif, G., Modolo, L., Michaelsson, J., Mold, J. E., Lambert-Lacroix, S., and Picard, F.
- Subjects
Statistics - Methodology ,Statistics - Computation - Abstract
Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection, which combined constitute a powerful framework for classification, as well as data visualization and interpretation. However, current proposed combinations lead to instable and non convergent methods due to inappropriate computational frameworks. We hereby propose a stable and convergent approach for classification in high dimensional based on sparse Partial Least Squares (sparse PLS). Results: We start by proposing a new solution for the sparse PLS problem that is based on proximal operators for the case of univariate responses. Then we develop an adaptive version of the sparse PLS for classification, which combines iterative optimization of logistic regression and sparse PLS to ensure convergence and stability. Our results are confirmed on synthetic and experimental data. In particular we show how crucial convergence and stability can be when cross-validation is involved for calibration purposes. Using gene expression data we explore the prediction of breast cancer relapse. We also propose a multicategorial version of our method on the prediction of cell-types based on single-cell expression data. Availability: Our approach is implemented in the plsgenomics R-package., Comment: 9 pages, 3 figures, 4 tables + Supplementary Materials 8 pages, 3 figures, 10 tables
- Published
- 2015
- Full Text
- View/download PDF
4. DECONbench: a benchmarking platform dedicated to deconvolution methods for tumor heterogeneity quantification.
- Author
-
Decamps, Clémentine, Arnaud, Alexis, Petitprez, Florent, Ayadi, Mira, Baurès, Aurélia, Armenoult, Lucile, HADACA consortium, Alcala, N., Arnaud, A., Avila Cobos, F., Batista, Luciana, Batto, A.-F., Blum, Y., Chuffart, F., Cros, J., Decamps, C., Dirian, L., Doncevic, D., Durif, G., and Bahena Hernandez, S. Y.
- Subjects
HETEROGENEITY ,CANCER invasiveness ,TRANSCRIPTOMES ,DECONVOLUTION (Mathematics) ,MEDICAL sciences ,SOURCE code - Abstract
Background: Quantification of tumor heterogeneity is essential to better understand cancer progression and to adapt therapeutic treatments to patient specificities. Bioinformatic tools to assess the different cell populations from single-omic datasets as bulk transcriptome or methylome samples have been recently developed, including reference-based and reference-free methods. Improved methods using multi-omic datasets are yet to be developed in the future and the community would need systematic tools to perform a comparative evaluation of these algorithms on controlled data. Results: We present DECONbench, a standardized unbiased benchmarking resource, applied to the evaluation of computational methods quantifying cell-type heterogeneity in cancer. DECONbench includes gold standard simulated benchmark datasets, consisting of transcriptome and methylome profiles mimicking pancreatic adenocarcinoma molecular heterogeneity, and a set of baseline deconvolution methods (reference-free algorithms inferring cell-type proportions). DECONbench performs a systematic performance evaluation of each new methodological contribution and provides the possibility to publicly share source code and scoring. Conclusion: DECONbench allows continuous submission of new methods in a user-friendly fashion, each novel contribution being automatically compared to the reference baseline methods, which enables crowdsourced benchmarking. DECONbench is designed to serve as a reference platform for the benchmarking of deconvolution methods in the evaluation of cancer heterogeneity. We believe it will contribute to leverage the benchmarking practices in the biomedical and life science communities. DECONbench is hosted on the open source Codalab competition platform. It is freely available at: https://competitions.codalab.org/competitions/27453. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
5. Probabilistic Count Matrix Factorization for Single Cell Expression Data Analysis
- Author
-
Durif, G., primary, Modolo, L., additional, Mold, J. E., additional, Lambert-Lacroix, S., additional, and Picard, F., additional
- Published
- 2017
- Full Text
- View/download PDF
6. DECONbench: a benchmarking platform dedicated to deconvolution methods for tumor heterogeneity quantification
- Author
-
Decamps, Clémentine, Arnaud, Alexis, Petitprez, Florent, Ayadi, Mira, Baurès, Aurélia, Armenoult, Lucile, Alcala, N., Arnaud, A., Avila Cobos, Francisco, Batista, Luciana, Batto, A.-F., Blum, Y., Chuffart, F., Cros, J., Decamps, C., Dirian, L., Doncevic, D., Durif, G., Bahena Hernandez, S. Y., Jakobi, M., Jardillier, R., Jeanmougin, M., Jedynak, P., Jumentier, B., Kakoichankava, A., Kondili, Maria, Liu, J., Maie, T., Marécaille, J., Merlevede, J., Meylan, M., Nazarov, P., Newar, K., Nyrén, K., Petitprez, F., Novella Rausell, C., Richard, M., Scherer, M., Sompairac, N., Waury, K., Xie, T., Zacharouli, M.-A., Escalera, Sergio, Guyon, Isabelle, Nicolle, Rémy, Tomasini, Richard, de Reyniès, Aurélien, Cros, Jérôme, Blum, Yuna, Richard, Magali, HADACA consortium, [missing], Translational Innovation in Medicine and Complexity / Recherche Translationnelle et Innovation en Médecine et Complexité - UMR 5525 (TIMC ), VetAgro Sup - Institut national d'enseignement supérieur et de recherche en alimentation, santé animale, sciences agronomiques et de l'environnement (VAS)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Ligue Nationale Contre le Cancer - Paris, Ligue Nationnale Contre le Cancer, University of Barcelona, Laboratoire Interdisciplinaire des Sciences du Numérique (LISN), CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), A&O (Apprentissage et Optimisation) (A&O), CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Science des Données (SDD), CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Algorithmes, Apprentissage et Calcul (AAC), CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), Centre de Recherche en Cancérologie de Marseille (CRCM), Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Institut Paoli-Calmettes, Fédération nationale des Centres de lutte contre le Cancer (FNCLCC)-Fédération nationale des Centres de lutte contre le Cancer (FNCLCC)-Aix Marseille Université (AMU), Institut de Génétique et Développement de Rennes (IGDR), Structure Fédérative de Recherche en Biologie et Santé de Rennes ( Biosit : Biologie - Santé - Innovation Technologique )-Centre National de la Recherche Scientifique (CNRS)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES), The research leading to these results was supported by Univ. Grenoble-Alpes via the Grenoble Alpes Data Institute [MR, AA] (ANR-15-IDEX-02), EIT Health Campus HADACA and COMETH programs [MR, YB], activities 19359 and 20377 and the Ligue Nationale Contre le Cancer. Other fundings: South-Eastern Norway Regional Health Authority (project number 2019030 [MJ]), European IMI IMMUCAN project [NS], European Union's Horizon 2020 program (Grant 826121, iPC project, [JM, FAC]). This article did not receive specific sponsorship in the design of the study, analysis, interpretation of data and in writing the manuscript., ANR-15-IDEX-0002,UGA,IDEX UGA(2015), European Project: 826121,iPaediatricCurie, Biologie Computationnelle et Modélisation (TIMC-BCM ), Université Grenoble Alpes (UGA)-VetAgro Sup - Institut national d'enseignement supérieur et de recherche en alimentation, santé animale, sciences agronomiques et de l'environnement (VAS)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Ligue Nationale Contre le Cancer (LNCC), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Science des Données (SDD), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Algorithmes, Apprentissage et Calcul (AAC), Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS), Aix Marseille Université (AMU)-Institut Paoli-Calmettes, Fédération nationale des Centres de lutte contre le Cancer (FNCLCC)-Fédération nationale des Centres de lutte contre le Cancer (FNCLCC)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Université de Rennes (UR)-Centre National de la Recherche Scientifique (CNRS)-Structure Fédérative de Recherche en Biologie et Santé de Rennes ( Biosit : Biologie - Santé - Innovation Technologique ), Richard, Magali, IDEX UGA - - UGA2015 - ANR-15-IDEX-0002 - IDEX - VALID, and European Union’s Horizon 2020 program (grant No. 826121, iPC project) - iPaediatricCurie - 826121 - INCOMING
- Subjects
Classificació de tumors ,Source code ,Computer science ,Deconvolution ,computer.software_genre ,Biochemistry ,Omics integration ,0302 clinical medicine ,Resource (project management) ,Structural Biology ,Medicine and Health Sciences ,Biology (General) ,Càncer ,media_common ,Cancer ,0303 health sciences ,[SDV.BIBS] Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,DNA methylation ,Applied Mathematics ,Benchmarking ,[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM] ,Computer algorithms ,3. Good health ,Computer Science Applications ,Benchmarking platform ,030220 oncology & carcinogenesis ,Benchmark (computing) ,Algorithms ,EXPRESSION ,Cellular heterogeneity ,QH301-705.5 ,media_common.quotation_subject ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Adenocarcinoma ,Machine learning ,Set (abstract data type) ,03 medical and health sciences ,Computational Deconvolution ,Humans ,Leverage (statistics) ,Algorismes computacionals ,CELL ,Molecular Biology ,030304 developmental biology ,business.industry ,Tumors classification ,Computational Biology ,Gold standard (test) ,Pancreatic Neoplasms ,Artificial intelligence ,business ,Transcriptome ,computer ,Software - Abstract
Background Quantification of tumor heterogeneity is essential to better understand cancer progression and to adapt therapeutic treatments to patient specificities. Bioinformatic tools to assess the different cell populations from single-omic datasets as bulk transcriptome or methylome samples have been recently developed, including reference-based and reference-free methods. Improved methods using multi-omic datasets are yet to be developed in the future and the community would need systematic tools to perform a comparative evaluation of these algorithms on controlled data. Results We present DECONbench, a standardized unbiased benchmarking resource, applied to the evaluation of computational methods quantifying cell-type heterogeneity in cancer. DECONbench includes gold standard simulated benchmark datasets, consisting of transcriptome and methylome profiles mimicking pancreatic adenocarcinoma molecular heterogeneity, and a set of baseline deconvolution methods (reference-free algorithms inferring cell-type proportions). DECONbench performs a systematic performance evaluation of each new methodological contribution and provides the possibility to publicly share source code and scoring. Conclusion DECONbench allows continuous submission of new methods in a user-friendly fashion, each novel contribution being automatically compared to the reference baseline methods, which enables crowdsourced benchmarking. DECONbench is designed to serve as a reference platform for the benchmarking of deconvolution methods in the evaluation of cancer heterogeneity. We believe it will contribute to leverage the benchmarking practices in the biomedical and life science communities. DECONbench is hosted on the open source Codalab competition platform. It is freely available at: https://competitions.codalab.org/competitions/27453.
- Full Text
- View/download PDF
7. The Fate of a Polygenic Phenotype Within the Genomic Landscapes of Introgression in the European Seabass Hybrid Zone.
- Author
-
Leitwein M, Durif G, Delpuech E, Gagnaire PA, Ernande B, Vandeputte M, Vergnet A, Duranton M, Clota F, and Allal F
- Subjects
- Animals, Polymorphism, Single Nucleotide, Selection, Genetic, Mediterranean Sea, Genome, Bass genetics, Hybridization, Genetic, Genetic Introgression, Multifactorial Inheritance, Phenotype
- Abstract
Unraveling the evolutionary mechanisms and consequences of hybridization is a major concern in biology. Many studies have documented the interplay between recombination and selection in modulating the genomic landscape of introgression, but few have considered how associations with phenotype may affect this landscape. Here, we use the European seabass (Dicentrarchus labrax), a key species in marine aquaculture that undergoes natural hybridization, to determine how selection on phenotype modulates the introgression landscape between Atlantic and Mediterranean lineages. We use a high-density single nucleotide polymorphism array to assess individual local ancestry along the genome and improve the mapping of muscle fat content, a polygenic trait that is divergent between lineages. Taking into account variation in recombination rates, we reveal a purging of Atlantic ancestry in the admixed Mediterranean populations. While Atlantic individuals had higher muscle fat content, we observed that genomic regions associated with this trait in Mediterranean populations displayed reduced introgression of Atlantic ancestry. These results emphasize how selection against maladapted alleles shapes the genomic landscape of introgression., Competing Interests: Conflict of Interest The authors declare that there is no conflict of interest., (© The Author(s) 2024. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.)
- Published
- 2024
- Full Text
- View/download PDF
8. Extending approximate Bayesian computation with supervised machine learning to infer demographic history from genetic polymorphisms using DIYABC Random Forest.
- Author
-
Collin FD, Durif G, Raynal L, Lombaert E, Gautier M, Vitalis R, Marin JM, and Estoup A
- Subjects
- Bayes Theorem, Computer Simulation, Demography, Polymorphism, Single Nucleotide, Supervised Machine Learning, Algorithms, Genetics, Population
- Abstract
Simulation-based methods such as approximate Bayesian computation (ABC) are well-adapted to the analysis of complex scenarios of populations and species genetic history. In this context, supervised machine learning (SML) methods provide attractive statistical solutions to conduct efficient inferences about scenario choice and parameter estimation. The Random Forest methodology (RF) is a powerful ensemble of SML algorithms used for classification or regression problems. Random Forest allows conducting inferences at a low computational cost, without preliminary selection of the relevant components of the ABC summary statistics, and bypassing the derivation of ABC tolerance levels. We have implemented a set of RF algorithms to process inferences using simulated data sets generated from an extended version of the population genetic simulator implemented in DIYABC v2.1.0. The resulting computer package, named DIYABC Random Forest v1.0, integrates two functionalities into a user-friendly interface: the simulation under custom evolutionary scenarios of different types of molecular data (microsatellites, DNA sequences or SNPs) and RF treatments including statistical tools to evaluate the power and accuracy of inferences. We illustrate the functionalities of DIYABC Random Forest v1.0 for both scenario choice and parameter estimation through the analysis of pseudo-observed and real data sets corresponding to pool-sequencing and individual-sequencing SNP data sets. Because of the properties inherent to the implemented RF methods and the large feature vector (including various summary statistics and their linear combinations) available for SNP data, DIYABC Random Forest v1.0 can efficiently contribute to the analysis of large SNP data sets to make inferences about complex population genetic histories., (© The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.)
- Published
- 2021
- Full Text
- View/download PDF
9. Divergent clonal differentiation trajectories establish CD8 + memory T cell heterogeneity during acute viral infections in humans.
- Author
-
Mold JE, Modolo L, Hård J, Zamboni M, Larsson AJM, Stenudd M, Eriksson CJ, Durif G, Ståhl PL, Borgström E, Picelli S, Reinius B, Sandberg R, Réu P, Talavera-Lopez C, Andersson B, Blom K, Sandberg JK, Picard F, Michaëlsson J, and Frisén J
- Subjects
- Acute Disease, Cell Differentiation, Cells, Cultured, Humans, CD8-Positive T-Lymphocytes immunology, Virus Diseases virology, Yellow Fever virology
- Abstract
The CD8
+ T cell response to an antigen is composed of many T cell clones with unique T cell receptors, together forming a heterogeneous repertoire of effector and memory cells. How individual T cell clones contribute to this heterogeneity throughout immune responses remains largely unknown. In this study, we longitudinally track human CD8+ T cell clones expanding in response to yellow fever virus (YFV) vaccination at the single-cell level. We observed a drop in clonal diversity in blood from the acute to memory phase, suggesting that clonal selection shapes the circulating memory repertoire. Clones in the memory phase display biased differentiation trajectories along a gradient from stem cell to terminally differentiated effector memory fates. In secondary responses, YFV- and influenza-specific CD8+ T cell clones are poised to recapitulate skewed differentiation trajectories. Collectively, we show that the sum of distinct clonal phenotypes results in the multifaceted human T cell response to acute viral infections., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2021 The Author(s). Published by Elsevier Inc. All rights reserved.)- Published
- 2021
- Full Text
- View/download PDF
10. Probabilistic count matrix factorization for single cell expression data analysis.
- Author
-
Durif G, Modolo L, Mold JE, Lambert-Lacroix S, and Picard F
- Subjects
- Algorithms, High-Throughput Nucleotide Sequencing, Single-Cell Analysis, Data Analysis, Software
- Abstract
Motivation: The development of high-throughput single-cell sequencing technologies now allows the investigation of the population diversity of cellular transcriptomes. The expression dynamics (gene-to-gene variability) can be quantified more accurately, thanks to the measurement of lowly expressed genes. In addition, the cell-to-cell variability is high, with a low proportion of cells expressing the same genes at the same time/level. Those emerging patterns appear to be very challenging from the statistical point of view, especially to represent a summarized view of single-cell expression data. Principal component analysis (PCA) is a most powerful tool for high dimensional data representation, by searching for latent directions catching the most variability in the data. Unfortunately, classical PCA is based on Euclidean distance and projections that poorly work in presence of over-dispersed count data with dropout events like single-cell expression data., Results: We propose a probabilistic Count Matrix Factorization (pCMF) approach for single-cell expression data analysis that relies on a sparse Gamma-Poisson factor model. This hierarchical model is inferred using a variational EM algorithm. It is able to jointly build a low dimensional representation of cells and genes. We show how this probabilistic framework induces a geometry that is suitable for single-cell data visualization, and produces a compression of the data that is very powerful for clustering purposes. Our method is competed against other standard representation methods like t-SNE, and we illustrate its performance for the representation of single-cell expression data., Availability and Implementation: Our work is implemented in the pCMF R-package (https://github.com/gdurif/pCMF)., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)
- Published
- 2019
- Full Text
- View/download PDF
11. High dimensional classification with combined adaptive sparse PLS and logistic regression.
- Author
-
Durif G, Modolo L, Michaelsson J, Mold JE, Lambert-Lacroix S, and Picard F
- Subjects
- Calibration, Genomics methods, Genomics standards, Least-Squares Analysis, Sequence Analysis, DNA standards, Logistic Models, Sequence Analysis, DNA methods, Software
- Abstract
Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection, which combined constitute a powerful framework for classification, as well as data visualization and interpretation. However, current proposed combinations lead to unstable and non convergent methods due to inappropriate computational frameworks. We hereby propose a computationally stable and convergent approach for classification in high dimensional based on sparse Partial Least Squares (sparse PLS)., Results: We start by proposing a new solution for the sparse PLS problem that is based on proximal operators for the case of univariate responses. Then we develop an adaptive version of the sparse PLS for classification, called logit-SPLS, which combines iterative optimization of logistic regression and sparse PLS to ensure computational convergence and stability. Our results are confirmed on synthetic and experimental data. In particular, we show how crucial convergence and stability can be when cross-validation is involved for calibration purposes. Using gene expression data, we explore the prediction of breast cancer relapse. We also propose a multicategorial version of our method, used to predict cell-types based on single-cell expression data., Availability and Implementation: Our approach is implemented in the plsgenomics R-package., Contact: ghislain.durif@inria.fr., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com)
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.