Synthetic dataset of ID and Travel Documents

Authors :
Carlos Boned
Maxime Talarmain
Nabil Ghanmi
Guillaume Chiron
Sanket Biswas
Ahmad Montaser Awal
Oriol Ramos Terrades
Source :
Scientific Data, Vol 11, Iss 1, Pp 1-10 (2024)
Publication Year :
2024
Publisher :
Nature Portfolio, 2024.

Abstract

This paper presents a new synthetic dataset of ID and travel documents, called SIDTD. The SIDTD dataset was created to help train and evaluate systems that detect forged ID documents. Such a dataset has become a necessity because ID documents contain personal information, so a public dataset of real documents cannot be released. Moreover, forged documents are scarce compared to legitimate ones, and the way they are generated varies from one fraudster to another, resulting in a class with high intra-class variability. In this paper we introduce a synthetically generated dataset that simulates the most common, and easiest, forgeries of ID and travel documents that ordinary users can make. The creation of this dataset will help the document image analysis community progress on the task of automatic ID document verification in online onboarding systems.

Subjects

Subjects :
Science

Details

Language :
English
ISSN :
2052-4463
Volume :
11
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Scientific Data
Publication Type :
Academic Journal
Accession number :
edsdoj.2e14e4f8c5e48ca99a91803ffa374d7
Document Type :
article
Full Text :
https://doi.org/10.1038/s41597-024-04160-9