Author: "Mickael Rouvier" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mickael Rouvier"' showing total 116 results

Start Over Author "Mickael Rouvier"

116 results on '"Mickael Rouvier"'

101. Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech

Author: Georges Linarès, Mickael Rouvier, Benjamin Lecouteux, Laboratoire Informatique d'Avignon (LIA), and Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI
Subjects: Audio mining, Voice activity detection, Acoustics and Ultrasonics, Computer science, business.industry, Speech recognition, Search engine indexing, lcsh:QC221-246, Speech corpus, Spotting, Speech processing, computer.software_genre, lcsh:QA75.5-76.95, lcsh:Acoustics. Sound, [INFO]Computer Science [cs], Speech analytics, lcsh:Electronic computers. Computer science, Artificial intelligence, Electrical and Electronic Engineering, business, computer, Utterance, Natural language processing
Abstract: International audience; Spoken utterance retrieval was largely studied in the last decades, with the purpose of indexing large audio databases or of detecting keywords in continuous speech streams. While the indexing of closed corpora can be performed via a batch process, on-line spotting systems have to synchronously detect the targeted spoken utterances. We propose a two-level architecture for on-the-fly term spotting. The first level performs a fast detection of the speech segments that probably contain the targeted utterance. The second level refines the detection on the selected segments, by using a speech recognizer based on a query-driven decoding algorithm. Experiments are conducted on both broadcast and spontaneous speech corpora. We investigate the impact of the spontaneity level on system performance. Results show that our method remains effective even if the recognition rates are significantly degraded by disfluencies.
Published: 2010
Full Text: View/download PDF

102. 'speech is silver, but silence is golden': improving speech-to-speech translation performance by slashing users input

Author: Benoit Favre, Frédéric Béchet, Mickael Rouvier, Laboratoire d'informatique Fondamentale de Marseille - UMR 6166 (LIF), Université de la Méditerranée - Aix-Marseille 2-Université de Provence - Aix-Marseille 1-Centre National de la Recherche Scientifique (CNRS), Laboratoire Informatique d'Avignon (LIA), Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI, Laboratoire d'informatique Fondamentale de Marseille (LIF), Centre National de la Recherche Scientifique (CNRS)-École Centrale de Marseille (ECM)-Aix Marseille Université (AMU), Favre, Benoit, Traitement Automatique du Langage Ecrit et Parlé (TALEP), Laboratoire d'Informatique et Systèmes (LIS), Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS), and Aix Marseille Université (AMU)-École Centrale de Marseille (ECM)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Matching (statistics), Vocabulary, Machine translation, Computer science, media_common.quotation_subject, Speech recognition, 02 engineering and technology, computer.software_genre, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], Task (project management), 030507 speech-language pathology & audiology, 03 medical and health sciences, 0202 electrical engineering, electronic engineering, information engineering, Quality (business), Dialog box, Dialog system, ComputingMilieux_MISCELLANEOUS, media_common, business.industry, 020206 networking & telecommunications, Usability, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], 13. Climate action, 0305 other medical science, business, computer
Abstract: Speech-to-speech translation is a challenging task mixing two of the most ambitious Natural Language Processing challenges: Machine Translation (MT) and Automatic Speech Recognition (ASR). Recent advances in both fields have led to operational systems achieving good performance when used in matching conditions with those of ASR and MT models training. Regardless of the quality of these models, errors are inevitable due to some technical limitations of the systems (e.g. closed vocabulary) and intrinsic ambiguities of spoken languages. However all ASR and MT errors don’t have the same impact on the usability of a given speech-to-speech dialog system: some can be very benign, unconsciously corrected by users, some can damage the understanding between users and eventually lead the dialog to a failure. We present in this paper a strategy focusing on ASR error segments that have a high negative impact on MT performance. We propose a method that consists firstly in automatically detecting these erroneous segments then secondly estimating their impact on MT. We show that removing such segments prior to translation can lead to a significant decrease in translation error rate, even without any correction strategy.
Published: 2015
Full Text: View/download PDF

103. Correction interactive de transcriptions de parole par fusion de phrases

Author: Mickael Rouvier, Benoit Favre, Frédéric Béchet, Laboratoire Informatique d'Avignon (LIA), Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI, Laboratoire d'informatique Fondamentale de Marseille (LIF), Aix Marseille Université (AMU)-École Centrale de Marseille (ECM)-Centre National de la Recherche Scientifique (CNRS), Laboratoire d'informatique Fondamentale de Marseille - UMR 6166 (LIF), Université de la Méditerranée - Aix-Marseille 2-Université de Provence - Aix-Marseille 1-Centre National de la Recherche Scientifique (CNRS), Traitement Automatique du Langage Ecrit et Parlé (TALEP), Laboratoire d'Informatique et Systèmes (LIS), Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS), Centre National de la Recherche Scientifique (CNRS)-École Centrale de Marseille (ECM)-Aix Marseille Université (AMU), and Favre, Benoit
Subjects: [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], ComputingMilieux_MISCELLANEOUS, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Abstract: International audience; no abstract
Published: 2014

104. Scene understanding for identifying persons in TV shows: beyond face authentication

Author: Géraldine Damnati, Delphine Charlet, Meriem Bendris, Benoit Favre, Mickael Rouvier, Favre, Benoit, Laboratoire d'informatique Fondamentale de Marseille (LIF), Aix Marseille Université (AMU)-École Centrale de Marseille (ECM)-Centre National de la Recherche Scientifique (CNRS), France Télécom Recherche & Développement (FT R&D), France Télécom, France Télécom Recherche et Développement [Lannion] (FTR&D), Laboratoire d'informatique Fondamentale de Marseille - UMR 6166 (LIF), Université de la Méditerranée - Aix-Marseille 2-Université de Provence - Aix-Marseille 1-Centre National de la Recherche Scientifique (CNRS), Traitement Automatique du Langage Ecrit et Parlé (TALEP), Laboratoire d'Informatique et Systèmes (LIS), Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS), Laboratoire Informatique d'Avignon (LIA), Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI, Centre National de la Recherche Scientifique (CNRS)-École Centrale de Marseille (ECM)-Aix Marseille Université (AMU), and Centre d'Enseignement et de Recherche en Informatique - CERI-Avignon Université (AU)
Subjects: Focus (computing), Authentication, Exploit, Computer science, business.industry, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], Identification (information), [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], Face (geometry), Three-dimensional face recognition, Computer vision, Artificial intelligence, Face detection, business, Baseline (configuration management), ComputingMilieux_MISCELLANEOUS
Abstract: Our goal is to automatically identify people in TV news and debates without any predefined dictionary of people. In this paper, we focus on the problem of person identification beyond face authentication in order to improve the identification results and not only where the face is detectable. We propose to use automatic scene analysis as features for people identification. We exploit two features: scene classification (studio and report) and camera identification. Then, people are identified by propagation strategies of overlaid names (OCR results) and speakers to scene classes and specific camera shots. Experiments performed on the REPERE corpus show improvement of face identification using scene understanding features (+13.9% of F-measure compared to the baseline).
Published: 2014

105. Joint Decoding of Complementary Utterances

Author: Benoit Favre, Mickael Rouvier, Frédéric Béchet, Favre, Benoit, Laboratoire d'informatique Fondamentale de Marseille - UMR 6166 (LIF), Université de la Méditerranée - Aix-Marseille 2-Université de Provence - Aix-Marseille 1-Centre National de la Recherche Scientifique (CNRS), Traitement Automatique du Langage Ecrit et Parlé (TALEP), Laboratoire d'Informatique et Systèmes (LIS), and Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Computer science, business.industry, Speech recognition, Probabilistic logic, 020206 networking & telecommunications, 0102 computer and information sciences, 02 engineering and technology, Translation (geometry), computer.software_genre, 01 natural sciences, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], Task (project management), 010201 computation theory & mathematics, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, Joint (audio engineering), business, computer, Decoding methods, Word (computer architecture), Natural language processing, Utterance, ComputingMilieux_MISCELLANEOUS
Abstract: Errors in open-domain ASR can be corrected by asking the speaker to rephrase targeted segments in utterances where they have been detected. The utterance merging problem consists in generating a better transcript from the utterance where errors have been detected and a clarification utterance. We introduce an alignment-decoding algorithm for jointly processing the two utterances and benefit from the complementary information they contain. The algorithm aligns word lattices in the WFST framework with a probabilistic cost model. Results on the BOLT-BC speech-to-speech translation task show an improvement of 2.84 points of accuracy compared to aligning the one best without joint decoding.
Published: 2014

106. LIUM ASR System for ETAPE French Evaluation Campaign: Experiments on System Combination Using Open-Source Recognizers

Author: Mickael Rouvier, Yannick Estève, Paul Deléglise, Fethi Bougares, and AMOKRANE, HAKIM
Subjects: Open source, System combination, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], Computer science, Speech recognition, Transcription (software), ComputingMilieux_MISCELLANEOUS
Abstract: In this paper, we report the LIUM participation in the ETAPE [1] (Evaluations en Traitement Automatique de la Parole) evaluation campaign, on the rich transcription task for French track. After describing the ETAPE goals and guidelines, we present our ASR system, which ranked first in the ETAPE evaluation campaign. Two ASR systems were used for our participation in ETAPE 2011. In addition to the LIUM ASR system based on CMU Sphinx project, we utilized an additional open-source ASR system based on the RASR toolkit. We evaluate, in this paper, the gain obtained with various acoustics modeling and adaptation techniques for each of the two systems, as well as with various system combination techniques. The combination of two different ASR systems allows a significant WER reduction, from 23.6% for the best single ASR system to 22.6% for the combination.
Published: 2013
Full Text: View/download PDF

107. Noise compensation for speech recognition using subspace Gaussian mixture models

Author: Mohamed Bouallegue, Mickael Rouvier, Driss Matrouf, and Georges Linarès
Published: 2012
Full Text: View/download PDF

108. Factor analysis based session variability compensation for Automatic Speech Recognition

Author: Mohamed Bouallegue, Mickael Rouvier, Driss Matrouf, and Georges Linarès
Subjects: Normalization (statistics), Voice activity detection, Computer science, business.industry, Speech recognition, Speech coding, Acoustic model, Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing), Pattern recognition, Speaker recognition, Speech processing, Computer Science::Sound, Artificial intelligence, Hidden Markov model, business, Test data
Abstract: In this paper we propose a new feature normalization based on Factor Analysis (FA) for the problem of acoustic variability in Automatic Speech Recognition (ASR). The FA paradigm was previously used in the field of ASR, in order to model the usefull information: the HMM state dependent acoustic information. In this paper, we propose to use the FA paradigm to model the useless information (speaker- or channel-variability) in order to remove it from acoustic data frames. The transformed training data frames are then used to train new HMM models using the standard training algorithm. The transformation is also applied to the test data before the decoding process. With this approach we obtain, on french broadcast news, an absolute WER reduction of 1.3%.
Published: 2011
Full Text: View/download PDF

109. Static and dynamic video summaries

Author: Georges Linarès, Mickael Rouvier, Yingbo Li, and Bernard Merialdo
Subjects: Set (abstract data type), Information retrieval, Computer science, Relevance (information retrieval), Extension (predicate logic), Duration (project management), Automatic summarization
Abstract: Currently there are a lot of algorithms for video summarization; however most of them only represent visual information. In this paper, we propose two approaches for the construction of the summary using both video and text. One approach focuses on static summaries, where the summary is a set of selected keyframes and keywords, to be displayed in a fixed area. The second approach addresses dynamic summaries where video segments are selected based on both their visual and textual content to compose a new video sequence of predefined duration. Our approaches rely on an existing summarization algorithm, Video Maximal Marginal Relevance (Video-MMR), and its extension Text Video Maximal Marginal Relevance (TV-MMR) proposed by us. We describe the details of those approaches and present experimental results.
Published: 2011
Full Text: View/download PDF

110. Transcription-based video genre classification

Author: Mickael Rouvier, Stanislas Oger, and Georges Linarès
Subjects: Artificial neural network, Transcription (linguistics), business.industry, Computer science, Speech recognition, Feature extraction, Artificial intelligence, Pragmatics, business, computer.software_genre, computer, Natural language processing
Abstract: In this paper, we present a new method for video genre identification based on the linguistic content analysis. This approach relies on the analysis of the most frequent words in the video transcriptions provided by an automatic speech recognition system. Experiments are conducted on a corpus composed of cartoons, movies, news, commercials, documentary, sport and music. On this 7-genre identification task, the proposed transcription-based method obtains up to 80% of correct identification. Finally, this rate is increased to 95% by combining the proposed linguistic-level features with low-level acoustic features.
Published: 2010
Full Text: View/download PDF

111. On-the-fly video genre classification by combination of audio features

Author: Driss Matrouf, Mickael Rouvier, and Georges Linarès
Subjects: Motion analysis, Computer science, business.industry, Speech recognition, Feature extraction, Frame (networking), Pattern recognition, Speech processing, computer.software_genre, Identification (information), Feature (computer vision), Artificial intelligence, Mel-frequency cepstrum, Audio signal processing, business, computer
Abstract: Video genre identification methods are frequently based on image or motion analysis, which are relatively timeconsuming processes. Since such approaches are tractable by batch processing, as-soon-as-possible identification requires faster methods. In this paper, we investigate the use of audio-only methods for on-the-fly video classification. We propose to use several acoustic feature streams and we evaluate various combination schemes at the frame or at the score level. Results are compared to those obtained by humans, according to the listening duration. Although the system based on model combination slightly outperforms the humans on very soon detection. The latter remain significantly more accurate on long sessions.
Published: 2010
Full Text: View/download PDF

112. Factor Analysis for Audio-based Video Genre Classification

Author: Driss Matrouf, Mickael Rouvier, Georges Linarès, and Déposants HAL-Avignon, bibliothèque Universitaire
Subjects: Channel (digital image), Computer science, business.industry, Speech recognition, Pattern recognition, automatic classification, [INFO] Computer Science [cs], Mixture model, Domain (software engineering), Support vector machine, ComputingMethodologies_PATTERNRECOGNITION, Factor (programming language), Index Terms: video genre identification, Feature (machine learning), Artificial intelligence, business, Factor Analy-sis, computer, computer.programming_language
Abstract: Statistical classifiers operate on features that generally include both useful and useless information. These two types of information are difficult to separate in the feature domain. Recently, a new paradigm based on a Latent Factor Analysis (LFA) proposed a model decomposition into usefull and useless components. This method was successfully applied to speaker and language recognition tasks. In this paper, we study the use of LFA for video genre classification by using only the audio channel. We propose a classification method based on short-term cep-stral features and Gaussian Mixture Models (GMM) or Support Vector Machine (SVM) classifiers, that are combined with Factor Analysis (FA). Experiments are conducted on a corpus composed of 5 types of video (musics, commercials, cartoons, movies and news). The relative classification error reduction obtained by using the best factor analysis configuration with respect to the baseline system, Gaussian Mixture Model Universal Background Model (GMM-UBM), is about 56%, corresponding to a correct identification rate of about 90%.
Published: 2009

113. Reranked aligners for interactive transcript correction

Author: Benoit Favre, Mickael Rouvier, Frédéric Béchet, Laboratoire d'informatique Fondamentale de Marseille - UMR 6166 (LIF), Université de la Méditerranée - Aix-Marseille 2-Université de Provence - Aix-Marseille 1-Centre National de la Recherche Scientifique (CNRS), Traitement Automatique du Langage Ecrit et Parlé (TALEP), Laboratoire d'Informatique et Systèmes (LIS), Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU)-Université de Toulon (UTLN)-Centre National de la Recherche Scientifique (CNRS), Laboratoire Informatique d'Avignon (LIA), Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI, Laboratoire d'informatique Fondamentale de Marseille (LIF), Aix Marseille Université (AMU)-École Centrale de Marseille (ECM)-Centre National de la Recherche Scientifique (CNRS), Favre, Benoit, and Centre National de la Recherche Scientifique (CNRS)-École Centrale de Marseille (ECM)-Aix Marseille Université (AMU)
Subjects: Matching (statistics), Word embedding, Artificial neural network, Computer science, business.industry, Synonym, Speech recognition, Word error rate, Dialog systems, computer.software_genre, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], Set (abstract data type), ASR error detection, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], Reranking Levenshtein alignment, Artificial intelligence, Error correction, Error detection and correction, business, computer, Natural language processing, Word (computer architecture), Utterance
Abstract: International audience; Clarification dialogs can help address ASR errors in speech-to-speech translation systems and other interactive applications. We propose to use variants of Levenshtein alignment for merging an errorful utterance with a targeted rephrase of an error segment. ASR errors that might harm the alignment are addressed through phonetic matching, and a word embedding distance is used to account for the use of synonyms outside targeted segments. These features lead to a relative improvement of 30% of word error rate on ASR output compared to not performing the clarification. Twice as many utterance are completely corrected compared to using basic word alignment. Furthermore, we generate a set of potential merges and train a neural network on crowd-sourced rephrases in order to select the best merger, leading to 24% more instances completely corrected. The system is deployed in the framework of the BOLT project.

114. Correction automatique d'examens écrits par approche neuronale profonde et attention croisée bidirectionnelle

Author: yanis labrak, Philippe Turcotte, Richard Dufour, Mickael Rouvier, Laboratoire Informatique d'Avignon (LIA), Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI, Traitement Automatique du Langage Naturel (LS2N - équipe TALN ), Laboratoire des Sciences du Numérique de Nantes (LS2N), Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-École Centrale de Nantes (Nantes Univ - ECN), Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ)-Nantes université - UFR des Sciences et des Techniques (Nantes univ - UFR ST), Nantes Université - pôle Sciences et technologie, Nantes Université (Nantes Univ)-Nantes Université (Nantes Univ)-Nantes Université - pôle Sciences et technologie, Nantes Université (Nantes Univ)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique), and Nantes Université (Nantes Univ)
Subjects: [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing, Correction Automatique Bidirectionnal Cross-Attention, French, Attention Croisée Bidirectionnelle, Transformers, [INFO]Computer Science [cs], Short Answer Grading, BERT, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Abstract: International audience; Cet article présente les systèmes développés par l'équipe LIA-LS2N dans le cadre de la campagne d'évaluation DEFT 2022 (Grouin & Illouz, 2022). Nous avons participé à la première tâche impliquant la correction automatique de copies d'étudiants à partir de références existantes. Nous proposons trois systèmes de classification reposant sur des caractéristiques extraites de plongements de mots contextuels issus d'un modèle BERT (CamemBERT). Nos approches reposent sur les concepts suivants : extraction de mesures de similarité entre les plongements de mots, attention croisée bidirectionnelle entre les plongements et fine-tuning (affinage) des plongements de mots. Les soumissions finales comprenaient deux systèmes fusionnés combinant l'attention croisée bidirectionnelle avec nos classificateurs basés sur BERT et celui sur les mesures de similarité. Notre meilleure soumission obtient une précision de 72,6 % en combinant le classifieur basé sur un modèle CamemBERT affiné et le mécanisme d'attention croisée bidirectionnelle. Ces résultats sont proches de ceux obtenus par le meilleur système de cette édition (75,6 %).

115. Actes du Défi Fouille de Textes@TALN 2024, Toulouse, France, July 8-12, 2024

Author: Richard Dufour, Benoît Favre, Mickael Rouvier, Adrien Bazoge, and Yanis Labrak
Published: 2024

116. SPQR@Deft2023 : Automatically answering to MCQ in the medical domain with similaritymeasures and domain-specific corpora

Author: Bezançon, Julien, Boubehziz, Toufik, Chutaux, Corina, Zine, Oumaima, Acensio, Laurie, Koudoro-Parfait, Caroline, Briglia, Andrea, Lejeune, Gaël, Équipe Linguistique computationnelle (STIH-LC), Sens, Texte, Informatique, Histoire (STIH), Sorbonne Université (SU)-Sorbonne Université (SU), Sorbonne Center for Artificial Intelligence (SCAI), Muséum national d'Histoire naturelle (MNHN)-Institut de Recherche pour le Développement (IRD)-Université de Technologie de Compiègne (UTC)-Institut National de Recherche en Informatique et en Automatique (Inria)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Assistance publique - Hôpitaux de Paris (AP-HP) (AP-HP)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Institut Européen d'administration des Affaires (INSEAD), UMR 1253 IBrain Imagerie & Cerveau Equipe 1 : 'Psychiatrie Neuro-Fonctionnelle' (PNF), Imagerie et cerveau (iBrain - Inserm U1253 - UNIV Tours ), Université de Tours (UT)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Tours (UT)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre Hospitalier Régional Universitaire de Tours (CHRU Tours), Sorbonne Université (SU), ATALA, and Adrien Bazoge, Béatrice Daille, Richard Dufour, Yanis Labrak, Emmanuel Morin, Mickael Rouvier (Éds.)
Subjects: QCM, pharmacologie, n-grammes de caractères, FrenchMedMCQA, systèmes de question-réponse MCQ, [INFO]Computer Science [cs], pharmacology, [SCCO.LING]Cognitive science/Linguistics, similarity, Question-answering systems, similarité, character n-grams
Abstract: International audience; We exhibit the approach of the SPQR team in the 2023 French Text Mining Challenge (DEFT). Thischallenge focused on automatically answering Multiple Choice Questions (MCQ) in the pharmacology domain. We proposed an approach that takes advantage of domain-specific corpora in order to find similarities between possible answers and sentences in the corpora.We compute a cosine similarity on character n-gram vectors to compare them. The best scores we obtained were 0,294 for the Hammingscore on the test set (0,305 on the dev set) and 0,997 for the Exact Match ratio ( 0,16 on the dev set).; Nous présentons le travail de SPQR (Sorbonne Question-Réponses) au DÉfi Fouille de Textes 2023 sur la réponse automatique à des questionnaires à choix multiples dans le domaine de la pharmacologie. Nous proposons une approche fondée sur la constitution de corpus de spécialité et la recherche de phrases similaires entre ces corpus et les différentes réponses possibles à une question. Nous calculons une similarité cosinus sur des vecteurs en n-grammes de caractères pour déterminer les bonnes réponses. Cette approche a obtenu un score maximal en Hamming de 0,249 sur les données de test (0,305 sur le dev) et de 0,0997 en Exact Match Ratio (0,16 sur le dev).
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

116 results on '"Mickael Rouvier"'

101. Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech

102. 'speech is silver, but silence is golden': improving speech-to-speech translation performance by slashing users input

103. Correction interactive de transcriptions de parole par fusion de phrases

104. Scene understanding for identifying persons in TV shows: beyond face authentication

105. Joint Decoding of Complementary Utterances

106. LIUM ASR System for ETAPE French Evaluation Campaign: Experiments on System Combination Using Open-Source Recognizers

107. Noise compensation for speech recognition using subspace Gaussian mixture models

108. Factor analysis based session variability compensation for Automatic Speech Recognition

109. Static and dynamic video summaries

110. Transcription-based video genre classification

111. On-the-fly video genre classification by combination of audio features

112. Factor Analysis for Audio-based Video Genre Classification

113. Reranked aligners for interactive transcript correction

114. Correction automatique d'examens écrits par approche neuronale profonde et attention croisée bidirectionnelle

115. Actes du Défi Fouille de Textes@TALN 2024, Toulouse, France, July 8-12, 2024

116. SPQR@Deft2023 : Automatically answering to MCQ in the medical domain with similaritymeasures and domain-specific corpora

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

116 results on '"Mickael Rouvier"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources