Ospina Zapata, Hader Sebastián, Hoyos Sánchez, María Camila, Darío Suárez, Brayhan, María Rodríguez, Aura, Herrera Sánchez, Valentina, Mario Ospina, Carlos, Julián Barbosa, Hamilton, Carranza Martinez, Julio Cesar, Adolfo Vallejo, Gustavo, Echeverry, María Clara, Duitama, Jorge, and Alfonso Urrea, Daniel
Chagas disease is caused by the protozoan parasite Trypanosoma cruzi, class Kinetoplastea, family Trypanosomatidae. In Colombia it is estimated that there are between 700,000 and 1,200,000 inhabitants infected with T. cruzi and about 8,000,000 people at risk of acquiring the infection. Kinetoplastids are characterized by a single mitochondrion with an intricate organization, consisting of a network of dozens of maxicircles and thousands of concatenated minicircles representing up to 20-25% of the total amount of DNA per cell. Maxicircles contain the genes involved in the mitochondrial electron transport chain and their study may help to understand the phylogenetic and evolutionary relationships of this group of parasites and adaptive processes. The aim of the present work was to identify the genomic characteristics of the maxicircles and minicircles (kDNA) of T. cruzi circulating in Colombia, by using sequencing technologies based on long reads, to obtain complete genomes that encompass all complex regions that present a repetitive naturalness and low percentages of Guanine/Citokine, difficult to appreciate through studies with short reads. A strain from a rural area of the municipality of Coyaima-Tolima isolated from the natural reservoir Didelphis marsupialis was used, which was sequenced by PacBio Hifi 100x technology. As for the maxicircles, 10 molecules were assembled de novo, with a size range between 21,950-47,166 bp, whose heterogeneity in their final size is mainly due to the length of the divergent region which presents sizes ranging from 6,400 bp to 32,000 bp. Through bioinformatics tools, a final molecule with a size of 47,166 bp was obtained and its circular nature was verified by bioinformatics methods. Its annotation revealed a total of 2 ribosomal genes and 18 structural genes that showed perfect synteny with mitochondrial genomes of other trypanosomatids previously reported. However, partial and complete deletions in the ND5 and RPS12 genes were found in some assembled molecules, suggesting possible heteroplasmy phenomena within the sequenced strain. Additionally, it was possible to obtain the complete divergent region, finding that its size was being underestimated. For this reason, a deeper analysis of this region, which has been the least studied of this genome, was performed, distinguishing specific characteristics such as its repetitive nature. A representative repertoire of sequences was obtained for the minicircles, which presented genomic and size heterogeneity, finding different numbers of conserved regions (Conserved Sequence Blocks-CSB) and hypervariables. In the present work, the first report of mitochondrial genomes of a strain isolated from southern Tolima was carried out. The usefulness of sequencing based on long reads for the analysis of complex genomes, such as the kinetoplast (kDNA), which presents low percentages of Guanine/Citokine and very repetitive regions, was demonstrated, thus contributing to the study of unexplored regions such as the divergent region, laying the foundations for future studies that can evaluate the biological implications that this region has in molecular processes in trypanosomatids. [ABSTRACT FROM AUTHOR]