Author: "Paquet, Thierry" / Language: undetermined - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Paquet, Thierry"' showing total 6 results

Start Over Author "Paquet, Thierry" Language undetermined

6 results on '"Paquet, Thierry"'

1. Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers

Author: Maarand Martin, Paquet Thierry, Kermorvant Christopher, and Boillet Melodie
Subjects: FOS: Computer and information sciences, Focus (computing), Information retrieval, Computer science, business.industry, Deep learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, 02 engineering and technology, 16. Peace & justice, Visual appearance, Pipeline (software), 030218 nuclear medicine & medical imaging, Image (mathematics), 03 medical and health sciences, 0302 clinical medicine, Market segmentation, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Segmentation, Artificial intelligence, business, Historical document
Abstract: The segmentation of complex images into semantic regions has seen a growing interest these last years with the advent of Deep Learning. Until recently, most existing methods for Historical Document Analysis focused on the visual appearance of documents, ignoring the rich information that textual content can offer. However, the segmentation of complex documents into semantic regions is sometimes impossible relying only on visual features and recent models embed both visual and textual information. In this paper, we focus on the use of both visual and textual information for segmenting historical registers into structured and meaningful units such as acts. An act is a text recording containing valuable knowledge such as demographic information (baptism, marriage or death) or royal decisions (donation or pardon). We propose a simple pipeline to enrich document images with the position of text lines containing key-phrases and show that running a standard image-based layout analysis system on these images can lead to significant gains. Our experiments show that the detection of acts increases from 38 % of mAP to 74 % when adding textual information, in real use-case conditions where text lines positions and content are extracted with an automatic recognition system.
Published: 2021
Full Text: View/download PDF

2. SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

Author: Coquenet, Denis, Chatelain, Cl��ment, and Paquet, Thierry
Subjects: FOS: Computer and information sciences, ComputingMethodologies_PATTERNRECOGNITION, Computer Vision and Pattern Recognition (cs.CV), ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Abstract: Unconstrained handwriting recognition is an essential task in document analysis. It is usually carried out in two steps. First, the document is segmented into text lines. Second, an Optical Character Recognition model is applied on these line images. We propose the Simple Predict & Align Network: an end-to-end recurrence-free Fully Convolutional Network performing OCR at paragraph level without any prior segmentation stage. The framework is as simple as the one used for the recognition of isolated lines and we achieve competitive results on three popular datasets: RIMES, IAM and READ 2016. The proposed model does not require any dataset adaptation, it can be trained from scratch, without segmentation labels, and it does not require line breaks in the transcription labels. Our code and trained model weights are available at https://github.com/FactoDeepLearning/SPAN.
Published: 2021
Full Text: View/download PDF

3. A syllable based model for handwriting recognition

Author: Swaileh, Wassim and Paquet, Thierry
Subjects: FOS: Computer and information sciences, ComputingMethodologies_PATTERNRECOGNITION, Computer Vision and Pattern Recognition (cs.CV), InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL, Computer Science - Computer Vision and Pattern Recognition, ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
Abstract: In this paper, we introduce a new modeling approach of texts for handwriting recognition based on syllables. We propose a supervised syllabification approach for the French and English languages for building a vocabulary of syllables. Statistical n-gram language models of syllables are trained on French and English Wikipedia corpora. The handwriting recognition system, based on optical HMM context independent character models, performs a two pass decoding, integrating the proposed syllabic models. Evaluation is carried out on the French RIMES dataset and English IAM dataset by analyzing the performance for various coverage of the syllable models. We also compare the syllable models with lexicon and character n-gram models. The proposed approach reaches interesting performances thanks to its capacity to cover a large amount of out of vocabulary words working with a limited amount of syllables combined with statistical n-gram of reasonable order.
Published: 2018
Full Text: View/download PDF

4. A Lexicon Verification Strategy in a BLSTM Cascade Framework

Author: Paquet Thierry, Stuner Bruno, Chatelain Clement, Equipe Apprentissage (DocApp - LITIS), Laboratoire d'Informatique, de Traitement de l'Information et des Systèmes (LITIS), Université Le Havre Normandie (ULH), Normandie Université (NU)-Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Institut national des sciences appliquées Rouen Normandie (INSA Rouen Normandie), Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Institut National des Sciences Appliquées (INSA)-Université Le Havre Normandie (ULH), and Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Institut National des Sciences Appliquées (INSA)
Subjects: 050210 logistics & transportation, Intelligent character recognition, Computer science, business.industry, Speech recognition, 05 social sciences, [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], Word error rate, 02 engineering and technology, computer.software_genre, Lexicon, ComputingMethodologies_PATTERNRECOGNITION, Recurrent neural network, Handwriting recognition, 0502 economics and business, Classifier (linguistics), Word recognition, 0202 electrical engineering, electronic engineering, information engineering, [INFO]Computer Science [cs], 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Natural language processing, Decoding methods
Abstract: International audience; Handwriting recognition always has been a difficult problem, with image related problems on the one hand and language processing on the other hand. Significant improvements have been made in handwriting recognition thanks to new recurrent neural networks based on LSTM cells. The high character recognition performances of these networks are almost systematically combined with linguistic knowledge, that is to say lexicon driven decoding method, to correct character misrecognitions. However with such high performance, we wonder on the possibility to use them without lexical decoding for word recognition. In this article, we explore this idea by proposing a lexicon verification strategy that provides a very low error rate, while conceding a consequent amount of rejects. Therefore, this verification approach perfectly fits in a cascade framework, where the rejects of a classifier are processed by the next cascade's classifier. The resulting system is nearly insensitive to the lexicon size, while providing a much faster decoding process than a standard lexicon driven decoding. Furthermore, when processing the final rejects of the cascade by a basic lexical decoding, our approach reach state of the art performance for isolated word recognition.
Published: 2016

5. Un modèle syllabique pour la reconnaissance de l’écriture

Author: Swaileh, Wassim, Ait Mohand, Kamel, and Paquet, Thierry
Abstract: RÉSUMÉ. Dans cet article nous introduisons un nouveau modèle syllabique pour la reconnaissance de l'écriture. Une méthode de syllabation orthographique supervisée du Français est proposé pour la construction d’un vocabulaire de syllabes. Un modèle de langage statistique en n-gram combinant syllabes et caractères est appris sur un corpus Wikipedia. Le système de reconnaissance d’écriture fondé sur des modèles optiques HMM de caractères procède à un décodage en deux passes en exploitant le modèle syllabique proposé. L’évaluation est réalisée sur la base RIMES en analysant les performances pour différents taux de couverture du modèle syllabique. Nous comparons le modèle proposé à un modèle lexical ainsi qu’à un modèle de caractères. L’approche proposée permet d’atteindre des performances intéressantes grâce à sa capacité à couvrir une proportion importante des mots hors lexique en travaillant avec un lexique de syllabes de taille limitée combiné à un modèle de n-gram d’ordre raisonnable.
Published: 2016
Full Text: View/download PDF

6. Détection de tableaux dans des documents complexes

Author: Kasar, Thotreingam, Barlas, Philippine, Adam, Sébastien, Chatelain, Clément, and Paquet, Thierry
Abstract: RÉSUMÉ. Dans cet article, nous présentons les résultats obtenus par un détecteur de tableau dans le cadre des campagnes MAURDOR, pour lesquelles le corpus présente la particularité de contenir des document fortement hétérogènes dans leur mise en page, leurs scripts et les langues utilisées.
Published: 2014
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"Paquet, Thierry"'

1. Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers

2. SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

3. A syllable based model for handwriting recognition

4. A Lexicon Verification Strategy in a BLSTM Cascade Framework

5. Un modèle syllabique pour la reconnaissance de l’écriture

6. Détection de tableaux dans des documents complexes

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Database

Publisher

6 results on '"Paquet, Thierry"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources