Author: "Precioso, Frédéric" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Precioso, Frédéric"' showing total 164 results

Start Over Author "Precioso, Frédéric"

164 results on '"Precioso, Frédéric"'

1. Domain-specific long text classification from sparse relevant information

Author: D'Cruz, Célia, Bereder, Jean-Marc, Precioso, Frédéric, and Riveill, Michel
Subjects: Computer Science - Computation and Language
Abstract: Large Language Models have undoubtedly revolutionized the Natural Language Processing field, the current trend being to promote one-model-for-all tasks (sentiment analysis, translation, etc.). However, the statistical mechanisms at work in the larger language models struggle to exploit the relevant information when it is very sparse, when it is a weak signal. This is the case, for example, for the classification of long domain-specific documents, when the relevance relies on a single relevant word or on very few relevant words from technical jargon. In the medical domain, it is essential to determine whether a given report contains critical information about a patient's condition. This critical information is often based on one or few specific isolated terms. In this paper, we propose a hierarchical model which exploits a short list of potential target terms to retrieve candidate sentences and represent them into the contextualized embedding of the target term(s) they contain. A pooling of the term(s) embedding(s) entails the document representation to be classified. We evaluate our model on one public medical document benchmark in English and on one private French medical dataset. We show that our narrower hierarchical model is better than larger language models for retrieving relevant long documents in a domain-specific context., Comment: Submitted to conference ECAI 2024: 27TH European Conference on Artificial Intelligence
Published: 2024

2. Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust Classifiers

Author: Ngnawé, Jonas, Sahoo, Sabyasachi, Pequignot, Yann, Precioso, Frédéric, and Gagné, Christian
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite extensive research on adversarial training strategies to improve robustness, the decisions of even the most robust deep learning models can still be quite sensitive to imperceptible perturbations, creating serious risks when deploying them for high-stakes real-world applications. While detecting such cases may be critical, evaluating a model's vulnerability at a per-instance level using adversarial attacks is computationally too intensive and unsuitable for real-time deployment scenarios. The input space margin is the exact score to detect non-robust samples and is intractable for deep neural networks. This paper introduces the concept of margin consistency -- a property that links the input space margins and the logit margins in robust models -- for efficient detection of vulnerable samples. First, we establish that margin consistency is a necessary and sufficient condition to use a model's logit margin as a score for identifying non-robust samples. Next, through comprehensive empirical analysis of various robustly trained models on CIFAR10 and CIFAR100 datasets, we show that they indicate high margin consistency with a strong correlation between their input space margins and the logit margins. Then, we show that we can effectively and confidently use the logit margin to detect brittle decisions with such models. Finally, we address cases where the model is not sufficiently margin-consistent by learning a pseudo-margin from the feature representation. Our findings highlight the potential of leveraging deep representations to assess adversarial vulnerability in deployment scenarios efficiently., Comment: 10 pages, 6 figures, 2 tables. Version Update: Neurips Camera Ready
Published: 2024

3. Layerwise Early Stopping for Test Time Adaptation

Author: Sahoo, Sabyasachi, ElAraby, Mostafa, Ngnawe, Jonas, Pequignot, Yann, Precioso, Frederic, and Gagne, Christian
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Test Time Adaptation (TTA) addresses the problem of distribution shift by enabling pretrained models to learn new features on an unseen domain at test time. However, it poses a significant challenge to maintain a balance between learning new features and retaining useful pretrained features. In this paper, we propose Layerwise EArly STopping (LEAST) for TTA to address this problem. The key idea is to stop adapting individual layers during TTA if the features being learned do not appear beneficial for the new domain. For that purpose, we propose using a novel gradient-based metric to measure the relevance of the current learnt features to the new domain without the need for supervised labels. More specifically, we propose to use this metric to determine dynamically when to stop updating each layer during TTA. This enables a more balanced adaptation, restricted to layers benefiting from it, and only for a certain number of steps. Such an approach also has the added effect of limiting the forgetting of pretrained features useful for dealing with new domains. Through extensive experiments, we demonstrate that Layerwise Early Stopping improves the performance of existing TTA approaches across multiple datasets, domain shifts, model architectures, and TTA losses., Comment: 14 pages, 5 figures
Published: 2024

4. Kernel KMeans clustering splits for end-to-end unsupervised decision trees

Author: Ohl, Louis, Mattei, Pierre-Alexandre, Leclercq, Mickaël, Droit, Arnaud, and Precioso, Frédéric
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, 62h30, G.3
Abstract: Trees are convenient models for obtaining explainable predictions on relatively small datasets. Although there are many proposals for the end-to-end construction of such trees in supervised learning, learning a tree end-to-end for clustering without labels remains an open challenge. As most works focus on interpreting with trees the result of another clustering algorithm, we present here a novel end-to-end trained unsupervised binary tree for clustering: Kauri. This method performs a greedy maximisation of the kernel KMeans objective without requiring the definition of centroids. We compare this model on multiple datasets with recent unsupervised trees and show that Kauri performs identically when using a linear kernel. For other kernels, Kauri often outperforms the concatenation of kernel KMeans and a CART decision tree.
Published: 2024

5. Attention Meets Post-hoc Interpretability: A Mathematical Perspective

Author: Lopardo, Gianluigi, Precioso, Frederic, and Garreau, Damien
Subjects: Statistics - Machine Learning, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Attention-based architectures, in particular transformers, are at the heart of a technological revolution. Interestingly, in addition to helping obtain state-of-the-art results on a wide range of applications, the attention mechanism intrinsically provides meaningful insights on the internal behavior of the model. Can these insights be used as explanations? Debate rages on. In this paper, we mathematically study a simple attention-based architecture and pinpoint the differences between post-hoc and attention-based explanations. We show that they provide quite different results, and that, despite their limitations, post-hoc methods are capable of capturing more useful insights than merely examining the attention weights., Comment: Accepted at ICML 2024
Published: 2024

6. Visual Objectification in Films: Towards a New AI Task for Video Interpretation

Author: Tores, Julie, Sassatelli, Lucile, Wu, Hui-Yin, Bergman, Clement, Andolfi, Lea, Ecrement, Victor, Precioso, Frederic, Devars, Thierry, Guaresi, Magali, Julliard, Virginie, and Lecossais, Sarah
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In film gender studies, the concept of 'male gaze' refers to the way the characters are portrayed on-screen as objects of desire rather than subjects. In this article, we introduce a novel video-interpretation task, to detect character objectification in films. The purpose is to reveal and quantify the usage of complex temporal patterns operated in cinema to produce the cognitive perception of objectification. We introduce the ObyGaze12 dataset, made of 1914 movie clips densely annotated by experts for objectification concepts identified in film studies and psychology. We evaluate recent vision models, show the feasibility of the task and where the challenges remain with concept bottleneck models. Our new dataset and code are made available to the community., Comment: 12 pages, 3 figures, 2 tables
Published: 2024

7. Mind the map! Accounting for existing map information when estimating online HDMaps from sensor

Author: Sun, Rémy, Yang, Li, Lingrand, Diane, and Precioso, Frédéric
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: While HDMaps are a crucial component of autonomous driving, they are expensive to acquire and maintain. Estimating these maps from sensors therefore promises to significantly lighten costs. These estimations however overlook existing HDMaps, with current methods at most geolocalizing low quality maps or considering a general database of known maps. In this paper, we propose to account for existing maps of the precise situation studied when estimating HDMaps. We identify 3 reasonable types of useful existing maps (minimalist, noisy, and outdated). We also introduce MapEX, a novel online HDMap estimation framework that accounts for existing maps. MapEX achieves this by encoding map elements into query tokens and by refining the matching algorithm used to train classic query based map estimation models. We demonstrate that MapEX brings significant improvements on the nuScenes dataset. For instance, MapEX - given noisy maps - improves by 38% over the MapTRv2 detector it is based on and by 8% over the current SOTA., Comment: 23 pages, 4 figures, 6 tables
Published: 2023

8. Faithful and Robust Local Interpretability for Textual Predictions

Author: Lopardo, Gianluigi, Precioso, Frederic, and Garreau, Damien
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Interpretability is essential for machine learning models to be trusted and deployed in critical domains. However, existing methods for interpreting text models are often complex, lack mathematical foundations, and their performance is not guaranteed. In this paper, we propose FRED (Faithful and Robust Explainer for textual Documents), a novel method for interpreting predictions over text. FRED offers three key insights to explain a model prediction: (1) it identifies the minimal set of words in a document whose removal has the strongest influence on the prediction, (2) it assigns an importance score to each token, reflecting its influence on the model's output, and (3) it provides counterfactual explanations by generating examples similar to the original document, but leading to a different prediction. We establish the reliability of FRED through formal definitions and theoretical analyses on interpretable classifiers. Additionally, our empirical evaluation against state-of-the-art methods demonstrates the effectiveness of FRED in providing insights into text models.
Published: 2023

9. Generalised Mutual Information: a Framework for Discriminative Clustering

Author: Ohl, Louis, Mattei, Pierre-Alexandre, Bouveyron, Charles, Harchaoui, Warith, Leclercq, Mickaël, Droit, Arnaud, and Precioso, Frédéric
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Information Theory, Computer Science - Machine Learning, Statistics - Methodology, 62H30, G.3
Abstract: In the last decade, recent successes in deep clustering majorly involved the Mutual Information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight how the maximisation of MI does not lead to satisfying clusters. We identified the Kullback-Leibler divergence as the main reason of this behaviour. Hence, we generalise the mutual information by changing its core distance, introducing the Generalised Mutual Information (GEMINI): a set of metrics for unsupervised neural network training. Unlike MI, some GEMINIs do not require regularisations when training as they are geometry-aware thanks to distances or kernels in the data space. Finally, we highlight that GEMINIs can automatically select a relevant number of clusters, a property that has been little studied in deep discriminative clustering context where the number of clusters is a priori unknown., Comment: Submitted for review at the IEEE Transactions on Pattern Analysis and Machine Intelligence. This article is an extension of an original NeurIPS 2022 article [arXiv:2210.06300]
Published: 2023

10. Sparse and geometry-aware generalisation of the mutual information for joint discriminative clustering and feature selection

Author: Ohl, Louis, Mattei, Pierre-Alexandre, Bouveyron, Charles, Leclercq, Mickaël, Droit, Arnaud, and Precioso, Frédéric
Published: 2024
Full Text: View/download PDF

11. BERNN: Enhancing classification of Liquid Chromatography Mass Spectrometry data with batch effect removal neural networks

Author: Pelletier, Simon J., Leclercq, Mickaël, Roux-Dalvai, Florence, de Geus, Matthijs B., Leslie, Shannon, Wang, Weiwei, Lam, TuKiet T., Nairn, Angus C., Arnold, Steven E., Carlyle, Becky C., Precioso, Frédéric, and Droit, Arnaud
Published: 2024
Full Text: View/download PDF

12. Interpretable Neural-Symbolic Concept Reasoning

Author: Barbiero, Pietro, Ciravegna, Gabriele, Giannini, Francesco, Zarlenga, Mateo Espinosa, Magister, Lucie Charlotte, Tonda, Alberto, Lio', Pietro, Precioso, Frederic, Jamnik, Mateja, and Marra, Giuseppe
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, Statistics - Machine Learning
Abstract: Deep learning methods are highly accurate, yet their opaque decision process prevents them from earning full human trust. Concept-based models aim to address this issue by learning tasks based on a set of human-understandable concepts. However, state-of-the-art concept-based models rely on high-dimensional concept embedding representations which lack a clear semantic meaning, thus questioning the interpretability of their decision process. To overcome this limitation, we propose the Deep Concept Reasoner (DCR), the first interpretable concept-based model that builds upon concept embeddings. In DCR, neural networks do not make task predictions directly, but they build syntactic rule structures using concept embeddings. DCR then executes these rules on meaningful concept truth degrees to provide a final interpretable and semantically-consistent prediction in a differentiable manner. Our experiments show that DCR: (i) improves up to +25% w.r.t. state-of-the-art interpretable concept-based models on challenging benchmarks (ii) discovers meaningful logic rules matching known ground truths even in the absence of concept supervision during training, and (iii), facilitates the generation of counterfactual examples providing the learnt rules as guidance.
Published: 2023
Full Text: View/download PDF

13. Understanding Post-hoc Explainers: The Case of Anchors

Author: Lopardo, Gianluigi, Precioso, Frederic, and Garreau, Damien
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: In many scenarios, the interpretability of machine learning models is a highly required but difficult task. To explain the individual predictions of such models, local model-agnostic approaches have been proposed. However, the process generating the explanations can be, for a user, as mysterious as the prediction to be explained. Furthermore, interpretability methods frequently lack theoretical guarantees, and their behavior on simple models is frequently unknown. While it is difficult, if not impossible, to ensure that an explainer behaves as expected on a cutting-edge model, we can at least ensure that everything works on simple, already interpretable models. In this paper, we present a theoretical analysis of Anchors (Ribeiro et al., 2018): a popular rule-based interpretability method that highlights a small set of words to explain a text classifier's decision. After formalizing its algorithm and providing useful insights, we demonstrate mathematically that Anchors produces meaningful results when used with linear text classifiers on top of a TF-IDF vectorization. We believe that our analysis framework can aid in the development of new explainability methods based on solid theoretical foundations., Comment: arXiv admin note: substantial text overlap with arXiv:2205.13789
Published: 2023

14. Sparse and geometry-aware generalisation of the mutual information for joint discriminative clustering and feature selection

Author: Ohl, Louis, Mattei, Pierre-Alexandre, Bouveyron, Charles, Leclercq, Mickaël, Droit, Arnaud, and Precioso, Frédéric
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Statistics - Computation, Statistics - Methodology, 62H30, G.3
Abstract: Feature selection in clustering is a hard task which involves simultaneously the discovery of relevant clusters as well as relevant variables with respect to these clusters. While feature selection algorithms are often model-based through optimised model selection or strong assumptions on the data distribution, we introduce a discriminative clustering model trying to maximise a geometry-aware generalisation of the mutual information called GEMINI with a simple l1 penalty: the Sparse GEMINI. This algorithm avoids the burden of combinatorial feature subset exploration and is easily scalable to high-dimensional data and large amounts of samples while only designing a discriminative clustering model. We demonstrate the performances of Sparse GEMINI on synthetic datasets and large-scale datasets. Our results show that Sparse GEMINI is a competitive algorithm and has the ability to select relevant subsets of variables with respect to the clustering without using relevance criteria or prior hypotheses., Comment: Published in Statistics and Computing, Volume 34, article number 155, (2024), https://doi.org/10.1007/s11222-024-10467-9
Published: 2023

15. Generalised Mutual Information for Discriminative Clustering

Author: Ohl, Louis, Mattei, Pierre-Alexandre, Bouveyron, Charles, Harchaoui, Warith, Leclercq, Mickaël, Droit, Arnaud, and Precioso, Frederic
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Information Theory, Computer Science - Machine Learning, Statistics - Methodology, 62H30, G.3
Abstract: In the last decade, recent successes in deep clustering majorly involved the mutual information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight how the maximisation of MI does not lead to satisfying clusters. We identified the Kullback-Leibler divergence as the main reason of this behaviour. Hence, we generalise the mutual information by changing its core distance, introducing the generalised mutual information (GEMINI): a set of metrics for unsupervised neural network training. Unlike MI, some GEMINIs do not require regularisations when training. Some of these metrics are geometry-aware thanks to distances or kernels in the data space. Finally, we highlight that GEMINIs can automatically select a relevant number of clusters, a property that has been little studied in deep clustering context where the number of clusters is a priori unknown., Comment: To be published in Neural Information Processing Systems 2022
Published: 2022

16. Concept Embedding Models: Beyond the Accuracy-Explainability Trade-Off

Author: Zarlenga, Mateo Espinosa, Barbiero, Pietro, Ciravegna, Gabriele, Marra, Giuseppe, Giannini, Francesco, Diligenti, Michelangelo, Shams, Zohreh, Precioso, Frederic, Melacci, Stefano, Weller, Adrian, Lio, Pietro, and Jamnik, Mateja
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, 68T07, I.2.6
Abstract: Deploying AI-powered systems requires trustworthy models supporting effective human interactions, going beyond raw prediction accuracy. Concept bottleneck models promote trustworthiness by conditioning classification tasks on an intermediate level of human-like concepts. This enables human interventions which can correct mispredicted concepts to improve the model's performance. However, existing concept bottleneck models are unable to find optimal compromises between high task accuracy, robust concept-based explanations, and effective interventions on concepts -- particularly in real-world conditions where complete and accurate concept supervisions are scarce. To address this, we propose Concept Embedding Models, a novel family of concept bottleneck models which goes beyond the current accuracy-vs-interpretability trade-off by learning interpretable high-dimensional concept representations. Our experiments demonstrate that Concept Embedding Models (1) attain better or competitive task accuracy w.r.t. standard neural models without concepts, (2) provide concept representations capturing meaningful semantics including and beyond their ground truth labels, (3) support test-time concept interventions whose effect in test accuracy surpasses that in standard concept bottleneck models, and (4) scale to real-world conditions where complete concept supervisions are scarce., Comment: To appear at NeurIPS 2022
Published: 2022

17. A Sea of Words: An In-Depth Analysis of Anchors for Text Data

Author: Lopardo, Gianluigi, Precioso, Frederic, and Garreau, Damien
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Anchors (Ribeiro et al., 2018) is a post-hoc, rule-based interpretability method. For text data, it proposes to explain a decision by highlighting a small set of words (an anchor) such that the model to explain has similar outputs when they are present in a document. In this paper, we present the first theoretical analysis of Anchors, considering that the search for the best anchor is exhaustive. After formalizing the algorithm for text classification, we present explicit results on different classes of models when the vectorization step is TF-IDF, and words are replaced by a fixed out-of-dictionary token when removed. Our inquiry covers models such as elementary if-then rules and linear classifiers. We then leverage this analysis to gain insights on the behavior of Anchors for any differentiable classifiers. For neural networks, we empirically show that the words corresponding to the highest partial derivatives of the model with respect to the input, reweighted by the inverse document frequencies, are selected by Anchors., Comment: Accepted to AISTATS 2023. 9+2 page paper, 21-page appendix
Published: 2022

18. A Multi-stage deep architecture for summary generation of soccer videos

Author: Sanabria, Melissa, Precioso, Frédéric, Mattei, Pierre-Alexandre, and Menguy, Thomas
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multimedia
Abstract: Video content is present in an ever-increasing number of fields, both scientific and commercial. Sports, particularly soccer, is one of the industries that has invested the most in the field of video analytics, due to the massive popularity of the game and the emergence of new markets. Previous state-of-the-art methods on soccer matches video summarization rely on handcrafted heuristics to generate summaries which are poorly generalizable, but these works have yet proven that multiple modalities help detect the best actions of the game. On the other hand, machine learning models with higher generalization potential have entered the field of summarization of general-purpose videos, offering several deep learning approaches. However, most of them exploit content specificities that are not appropriate for sport whole-match videos. Although video content has been for many years the main source for automatizing knowledge extraction in soccer, the data that records all the events happening on the field has become lately very important in sports analytics, since this event data provides richer context information and requires less processing. We propose a method to generate the summary of a soccer match exploiting both the audio and the event metadata. The results show that our method can detect the actions of the match, identify which of these actions should belong to the summary and then propose multiple candidate summaries which are similar enough but with relevant variability to provide different options to the final editor. Furthermore, we show the generalization capability of our work since it can transfer knowledge between datasets from different broadcasting companies, different competitions, acquired in different conditions, and corresponding to summaries of different lengths
Published: 2022

19. AI-Enhanced Prediction of Aortic Stenosis Progression: Insights From the PROGRESSA Study

Author: Sanabria, Melissa, Tastet, Lionel, Pelletier, Simon, Leclercq, Mickael, Ohl, Louis, Hermann, Lara, Mattei, Pierre-Alexandre, Precioso, Frederic, Coté, Nancy, Pibarot, Philippe, and Droit, Arnaud
Published: 2024
Full Text: View/download PDF

20. SMACE: A New Method for the Interpretability of Composite Decision Systems

Author: Lopardo, Gianluigi, Garreau, Damien, Precioso, Frederic, and Ottosson, Greger
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Interpretability is a pressing issue for decision systems. Many post hoc methods have been proposed to explain the predictions of a single machine learning model. However, business processes and decision systems are rarely centered around a unique model. These systems combine multiple models that produce key predictions, and then apply decision rules to generate the final decision. To explain such decisions, we propose the Semi-Model-Agnostic Contextual Explainer (SMACE), a new interpretability method that combines a geometric approach for decision rules with existing interpretability methods for machine learning models to generate an intuitive feature ranking tailored to the end user. We show that established model-agnostic approaches produce poor results on tabular data in this setting, in particular giving the same importance to several features, whereas SMACE can rank them in a meaningful way., Comment: Accepted to ECML PKDD 2022, the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
Published: 2021
Full Text: View/download PDF

21. Knowledge-driven Active Learning

Author: Ciravegna, Gabriele, Precioso, Frédéric, Betti, Alessandro, Mottin, Kevin, and Gori, Marco
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The deployment of Deep Learning (DL) models is still precluded in those contexts where the amount of supervised data is limited. To answer this issue, active learning strategies aim at minimizing the amount of labelled data required to train a DL model. Most active strategies are based on uncertain sample selection, and even often restricted to samples lying close to the decision boundary. These techniques are theoretically sound, but an understanding of the selected samples based on their content is not straightforward, further driving non-experts to consider DL as a black-box. For the first time, here we propose to take into consideration common domain-knowledge and enable non-expert users to train a model with fewer samples. In our Knowledge-driven Active Learning (KAL) framework, rule-based knowledge is converted into logic constraints and their violation is checked as a natural guide for sample selection. We show that even simple relationships among data and output classes offer a way to spot predictions for which the model need supervision. We empirically show that KAL (i) outperforms many active learning strategies, particularly in those contexts where domain knowledge is rich, (ii) it discovers data distribution lying far from the initial training data, (iii) it ensures domain experts that the provided knowledge is acquired by the model, (iv) it is suitable for regression and object recognition tasks unlike uncertainty-based strategies, and (v) its computational demand is low., Comment: Accepted at ECML2023 for presentation! Check also the github repo: https://github.com/gabrieleciravegna/Knowledge-driven-Active-Learning
Published: 2021
Full Text: View/download PDF

22. Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-based Multimodal Fusion

Author: Pouthier, Baptiste, Pilati, Laurent, Gudupudi, Leela K., Bouveyron, Charles, and Precioso, Frederic
Subjects: Computer Science - Sound, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: It is now well established from a variety of studies that there is a significant benefit from combining video and audio data in detecting active speakers. However, either of the modalities can potentially mislead audiovisual fusion by inducing unreliable or deceptive information. This paper outlines active speaker detection as a multi-objective learning problem to leverage best of each modalities using a novel self-attention, uncertainty-based multimodal fusion scheme. Results obtained show that the proposed multi-objective learning architecture outperforms traditional approaches in improving both mAP and AUC scores. We further demonstrate that our fusion strategy surpasses, in active speaker detection, other modality fusion methods reported in various disciplines. We finally show that the proposed method significantly improves the state-of-the-art on the AVA-ActiveSpeaker dataset., Comment: In INTERSPEECH 2021
Published: 2021
Full Text: View/download PDF

23. Toward Novel Optimizers: A Moreau-Yosida View of Gradient-Based Learning

Author: Betti, Alessandro, Ciravegna, Gabriele, Gori, Marco, Melacci, Stefano, Mottin, Kevin, Precioso, Frédéric, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Basili, Roberto, editor, Lembo, Domenico, editor, Limongelli, Carla, editor, and Orlandini, Andrea, editor
Published: 2023
Full Text: View/download PDF

24. Knowledge-Driven Active Learning

Author: Ciravegna, Gabriele, Precioso, Frédéric, Betti, Alessandro, Mottin, Kevin, Gori, Marco, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Koutra, Danai, editor, Plant, Claudia, editor, Gomez Rodriguez, Manuel, editor, Baralis, Elena, editor, and Bonchi, Francesco, editor
Published: 2023
Full Text: View/download PDF

25. SMACE: A New Method for the Interpretability of Composite Decision Systems

Author: Lopardo, Gianluigi, Garreau, Damien, Precioso, Frédéric, Ottosson, Greger, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Amini, Massih-Reza, editor, Canu, Stéphane, editor, Fischer, Asja, editor, Guns, Tias, editor, Kralj Novak, Petra, editor, and Tsoumakas, Grigorios, editor
Published: 2023
Full Text: View/download PDF

26. From text saliency to linguistic objects: learning linguistic interpretable markers with a multi-channels convolutional architecture

Author: Vanni, Laurent, Corneli, Marco, Mayaffre, Damon, and Precioso, Frédéric
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: A lot of effort is currently made to provide methods to analyze and understand deep neural network impressive performances for tasks such as image or text classification. These methods are mainly based on visualizing the important input features taken into account by the network to build a decision. However these techniques, let us cite LIME, SHAP, Grad-CAM, or TDS, require extra effort to interpret the visualization with respect to expert knowledge. In this paper, we propose a novel approach to inspect the hidden layers of a fitted CNN in order to extract interpretable linguistic objects from texts exploiting classification process. In particular, we detail a weighted extension of the Text Deconvolution Saliency (wTDS) measure which can be used to highlight the relevant features used by the CNN to perform the classification task. We empirically demonstrate the efficiency of our approach on corpora from two different languages: English and French. On all datasets, wTDS automatically encodes complex linguistic objects based on co-occurrences and possibly on grammatical and syntax analysis., Comment: 7 pages, 22 figures
Published: 2020

27. Deep Learning

Author: Gori, Marco, primary, Precioso, Frédéric, additional, and Trentin, Edmondo, additional
Published: 2023
Full Text: View/download PDF

28. Revisiting Deep Architectures for Head Motion Prediction in 360{\deg} Videos

Author: Rondon, Miguel Fabian Romero, Sassatelli, Lucile, Pardo, Ramon Aparicio, and Precioso, Frederic
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing, Statistics - Machine Learning
Abstract: We consider predicting the user's head motion in 360-degree videos, with 2 modalities only: the past user's positions and the video content (not knowing other users' traces). We make two main contributions. First, we re-examine existing deep-learning approaches for this problem and identify hidden flaws from a thorough root-cause analysis. Second, from the results of this analysis, we design a new proposal establishing state-of-the-art performance. First, re-assessing the existing methods that use both modalities, we obtain the surprising result that they all perform worse than baselines using the user's trajectory only. A root-cause analysis of the metrics, datasets and neural architectures shows in particular that (i) the content can inform the prediction for horizons longer than 2 to 3 sec. (existing methods consider shorter horizons), and that (ii) to compete with the baselines, it is necessary to have a recurrent unit dedicated to process the positions, but this is not sufficient. Second, from a re-examination of the problem supported with the concept of Structural-RNN, we design a new deep neural architecture, named TRACK. TRACK achieves state-of-the-art performance on all considered datasets and prediction horizons, outperforming competitors by up to 20 percent on focus-type videos and horizons 2-5 seconds. The entire framework (codes and datasets) is online and received an ACM reproducibility badge.
Published: 2019

29. DeepWILD: Wildlife Identification, Localisation and estimation on camera trap videos using Deep learning

Author: Simões, Fanny, Bouveyron, Charles, and Precioso, Frédéric
Published: 2023
Full Text: View/download PDF

30. Adaptive Bayesian Linear Regression for Automated Machine Learning

Author: Zhou, Weilin and Precioso, Frederic
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: To solve a machine learning problem, one typically needs to perform data preprocessing, modeling, and hyperparameter tuning, which is known as model selection and hyperparameter optimization.The goal of automated machine learning (AutoML) is to design methods that can automatically perform model selection and hyperparameter optimization without human interventions for a given dataset. In this paper, we propose a meta-learning method that can search for a high-performance machine learning pipeline from the predefined set of candidate pipelines for supervised classification datasets in an efficient way by leveraging meta-data collected from previous experiments. More specifically, our method combines an adaptive Bayesian regression model with a neural network basis function and the acquisition function from Bayesian optimization. The adaptive Bayesian regression model is able to capture knowledge from previous meta-data and thus make predictions of the performances of machine learning pipelines on a new dataset. The acquisition function is then used to guide the search of possible pipelines based on the predictions.The experiments demonstrate that our approach can quickly identify high-performance pipelines for a range of test datasets and outperforms the baseline methods., Comment: Added references;Corrected typos.Revised argument,results unchanged
Published: 2019

31. Toward Novel Optimizers: A Moreau-Yosida View of Gradient-Based Learning

Author: Betti, Alessandro, primary, Ciravegna, Gabriele, additional, Gori, Marco, additional, Melacci, Stefano, additional, Mottin, Kevin, additional, and Precioso, Frédéric, additional
Published: 2023
Full Text: View/download PDF

32. SMACE: A New Method for the Interpretability of Composite Decision Systems

Author: Lopardo, Gianluigi, primary, Garreau, Damien, additional, Precioso, Frédéric, additional, and Ottosson, Greger, additional
Published: 2023
Full Text: View/download PDF

33. Knowledge-Driven Active Learning

Author: Ciravegna, Gabriele, primary, Precioso, Frédéric, additional, Betti, Alessandro, additional, Mottin, Kevin, additional, and Gori, Marco, additional
Published: 2023
Full Text: View/download PDF

34. Adversarial Active Learning for Deep Networks: a Margin Based Approach

Author: Ducoffe, Melanie and Precioso, Frederic
Subjects: Computer Science - Learning, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: We propose a new active learning strategy designed for deep neural networks. The goal is to minimize the number of data annotation queried from an oracle during training. Previous active learning strategies scalable for deep networks were mostly based on uncertain sample selection. In this work, we focus on examples lying close to the decision boundary. Based on theoretical works on margin theory for active learning, we know that such examples may help to considerably decrease the number of annotations. While measuring the exact distance to the decision boundaries is intractable, we propose to rely on adversarial examples. We do not consider anymore them as a threat instead we exploit the information they provide on the distribution of the input space in order to approximate the distance to decision boundaries. We demonstrate empirically that adversarial active queries yield faster convergence of CNNs trained on MNIST, the Shoe-Bag and the Quick-Draw datasets.
Published: 2018

35. DiagnoseNET: Automatic Framework to Scale Neural Networks on Heterogeneous Systems Applied to Medical Diagnosis

Author: Garcia Henao, John Anderson, Precioso, Frédéric, Staccini, Pascal, Riveill, Michel, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zhang, Junjie James, Series Editor, Kim, Hyuncheol, editor, and Kim, Kuinam J., editor
Published: 2021
Full Text: View/download PDF

36. Semi-supervised consensus clustering based on closed patterns

Author: Yang, Tianshu, Pasquier, Nicolas, and Precioso, Frédéric
Published: 2022
Full Text: View/download PDF

37. Key Passages : From Statistics to Deep Learning

Author: Vanni, Laurent, Corneli, Marco, Longrée, Dominique, Mayaffre, Damon, Precioso, Frédéric, Gaul, Wolfgang, Managing Editor, Vichi, Maurizio, Managing Editor, Weihs, Claus, Managing Editor, Baier, Daniel, Editorial Board Member, Critchley, Frank, Editorial Board Member, Decker, Reinhold, Editorial Board Member, Diday, Edwin, Editorial Board Member, Greenacre, Michael, Editorial Board Member, Lauro, Carlo Natale, Editorial Board Member, Meulman, Jacqueline, Editorial Board Member, Monari, Paola, Editorial Board Member, Nishisato, Shizuhiko, Editorial Board Member, Ohsumi, Noboru, Editorial Board Member, Opitz, Otto, Editorial Board Member, Ritter, Gunter, Editorial Board Member, Schader, Martin, Editorial Board Member, Iezzi, Domenica Fioredistella, editor, Mayaffre, Damon, editor, and Misuraca, Michelangelo, editor
Published: 2020
Full Text: View/download PDF

38. Visualizing ECG Contribution into Convolutional Neural Network Classification

Author: Li, Yaowei, Precioso, Frédéric, Liu, Chengyu, Liu, Chengyu, editor, and Li, Jianqing, editor
Published: 2020
Full Text: View/download PDF

39. A new perspective on optimizers: leveraging moreau-yosida approximation in gradient-based learning

Author: Betti, Alessandro, Ciravegna, Gabriele, Gori, Marco, Melacci, Stefano, Mottin, Kevin, and Precioso, Frédéric
Abstract: Machine Learning (ML) heavily relies on optimization techniques built upon gradient descent. Numerous gradient-based update methods have been proposed in the scientific literature, particularly in the context of neural networks, and have gained widespread adoption as optimizers in ML software libraries. This paper introduces a novel perspective by framing gradient-based update strategies using the Moreau-Yosida (MY) approximation of the loss function. Leveraging a first-order Taylor expansion, we demonstrate the concrete exploitability of the MY approximation to generalize the model update process. This enables the evaluation and comparison of regularization properties underlying popular optimizers like gradient descent with momentum, ADAGRAD, RMSprop, and ADAM. The MY-based unifying view opens up possibilities for designing new update schemes with customizable regularization properties. To illustrate this potential, we propose a case study that redefines the concept of closeness in the parameter space using network outputs. We present a proof-of-concept experimental procedure, demonstrating the effectiveness of this approach in continual learning scenarios. Specifically, we employ the well-known permuted MNIST dataset, a progressively-permuted MNIST and CIFAR-10 benchmarks, and a non i.i.d. stream. Additionally, we validate the update scheme’s efficacy in an offline-learning scenario. By embracing the MY-based unifying view, we pave the way for advancements in optimization techniques for machine learning.
Published: 2024
Full Text: View/download PDF

40. QBDC: Query by dropout committee for training deep supervised architecture

Author: Ducoffe, Melanie and Precioso, Frederic
Subjects: Computer Science - Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: While the current trend is to increase the depth of neural networks to increase their performance, the size of their training database has to grow accordingly. We notice an emergence of tremendous databases, although providing labels to build a training set still remains a very expensive task. We tackle the problem of selecting the samples to be labelled in an online fashion. In this paper, we present an active learning strategy based on query by committee and dropout technique to train a Convolutional Neural Network (CNN). We derive a commmittee of partial CNNs resulting from batchwise dropout runs on the initial CNN. We evaluate our active learning strategy for CNN on MNIST benchmark, showing in particular that selecting less than 30 % from the annotated database is enough to get similar error rate as using the full training set on MNIST. We also studied the robustness of our method against adversarial examples., Comment: Submitted to ICLR2016
Published: 2015

41. Parallel and Distributed Processing for Unsupervised Patient Phenotype Representation

Author: García Heano, John Anderson, Precioso, Frédéric, Staccini, Pascal, Riveill, Michel, Barbosa, Simone Diniz Junqueira, Editorial Board Member, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Yuan, Junsong, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Washio, Takashi, Founding Editor, Meneses, Esteban, editor, Castro, Harold, editor, Barrios Hernández, Carlos Jaime, editor, and Ramos-Pollan, Raul, editor
Published: 2019
Full Text: View/download PDF

42. A Co-evolutionary Approach to Analyzing the Impact of Rationality on the Italian Electricity Market

Author: da Costa Pereira, Célia, Bevilacqua, Sara, Guerci, Eric, Precioso, Frédéric, Sartori, Claudio, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Baldoni, Matteo, editor, Dastani, Mehdi, editor, Liao, Beishui, editor, Sakurai, Yuko, editor, and Zalila Wenkstern, Rym, editor
Published: 2019
Full Text: View/download PDF

43. Analysing the Impact of Rationality on the Italian Electricity Market

Author: Bevilacqua, Sara, da Costa Pereira, Célia, Guerci, Eric, Precioso, Frédéric, Sartori, Claudio, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Torra, Vicenç, editor, Narukawa, Yasuo, editor, Pasi, Gabriella, editor, and Viviani, Marco, editor
Published: 2019
Full Text: View/download PDF

44. Assessment of algorithms for mitosis detection in breast cancer histopathology images

Author: Veta, Mitko, van Diest, Paul J., Willems, Stefan M., Wang, Haibo, Madabhushi, Anant, Cruz-Roa, Angel, Gonzalez, Fabio, Larsen, Anders B. L., Vestergaard, Jacob S., Dahl, Anders B., Cireşan, Dan C., Schmidhuber, Jürgen, Giusti, Alessandro, Gambardella, Luca M., Tek, F. Boray, Walter, Thomas, Wang, Ching-Wei, Kondo, Satoshi, Matuszewski, Bogdan J., Precioso, Frederic, Snell, Violet, Kittler, Josef, de Campos, Teofilo E., Khan, Adnan M., Rajpoot, Nasir M., Arkoumani, Evdokia, Lacle, Miangela M., Viergever, Max A., and Pluim, Josien P. W.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The proliferative activity of breast tumors, which is routinely estimated by counting of mitotic figures in hematoxylin and eosin stained histology sections, is considered to be one of the most important prognostic markers. However, mitosis counting is laborious, subjective and may suffer from low inter-observer agreement. With the wider acceptance of whole slide images in pathology labs, automatic image analysis has been proposed as a potential solution for these issues. In this paper, the results from the Assessment of Mitosis Detection Algorithms 2013 (AMIDA13) challenge are described. The challenge was based on a data set consisting of 12 training and 11 testing subjects, with more than one thousand annotated mitotic figures by multiple observers. Short descriptions and results from the evaluation of eleven methods are presented. The top performing method has an error rate that is comparable to the inter-observer agreement among pathologists., Comment: 23 pages, 5 figures, accepted for publication in the journal Medical Image Analysis
Published: 2014
Full Text: View/download PDF

45. Introducing Semantics in Short Text Classification

Author: Bouaziz, Ameni, da Costa Pereira, Célia, Dartigues-Pallez, Christel, Precioso, Frédéric, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, and Gelbukh, Alexander, editor
Published: 2018
Full Text: View/download PDF

46. DiagnoseNET: Automatic Framework to Scale Neural Networks on Heterogeneous Systems Applied to Medical Diagnosis

Author: Garcia Henao, John Anderson, primary, Precioso, Frédéric, additional, Staccini, Pascal, additional, and Riveill, Michel, additional
Published: 2020
Full Text: View/download PDF

47. Exploring the Road Graph in Trajectory Forecasting for Autonomous Driving

Author: Sun, Rémy, primary, Lingrand, Diane, additional, and Precioso, Frédéric, additional
Published: 2023
Full Text: View/download PDF

48. Handling Noisy Labels in Gaze-Based CBIR System

Author: Lopez, Stéphanie, Revel, Arnaud, Lingrand, Diane, Precioso, Frédéric, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Blanc-Talon, Jacques, editor, Penne, Rudi, editor, Philips, Wilfried, editor, Popescu, Dan, editor, and Scheunders, Paul, editor
Published: 2017
Full Text: View/download PDF

49. Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification

Author: Seddiki, Khawla, Saudemont, Philippe, Precioso, Frédéric, Ogrinc, Nina, Wisztorski, Maxence, Salzet, Michel, Fournier, Isabelle, and Droit, Arnaud
Published: 2020
Full Text: View/download PDF

50. Multiple Consensuses Clustering by Iterative Merging/Splitting of Clustering Patterns

Author: Al-najdi, Atheer, Pasquier, Nicolas, Precioso, Frédéric, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, and Perner, Petra, editor
Published: 2016
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

164 results on '"Precioso, Frédéric"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources