Lisa J. Martin, Frédéric Lecerf, Christoph Rau, Anatole Ghazalpour, Raffi Hagopian, Aldons J. Lusis, Farhad Hormozdiari, Jun Yan, Sandrine Lagarrigue, Yu Xiao, Eleazar Eskin, Thomas A. Drake, Yehudit Hasin, Physiologie, Environnement et Génétique pour l'Animal et les Systèmes d'Elevage [Rennes] (PEGASE), Institut National de la Recherche Agronomique (INRA)-AGROCAMPUS OUEST, UMR1348 Physiologie, Environnement et Génétique pour l'Animal et les Systèmes d'Elevage, AGROCAMPUS OUEST, Université européenne de Bretagne - European University of Brittany (UEB), Department of Computer Sciences, University of California, Department of Human Genetics, Department of Medicine/Division of Cardiology, Department of Pathology and Laboratory Medicine, Department of Microbiology, Immunology and Molecular Genetics, National Institutes of Health (NIH) [HL28481, HL30568, DK072206], National Science Foundation [0513612, 0731455, 0729049, 0916676, 1065276], NIH [HL080079, DA024417], French genomic agricultural society (AGENAVI), INRA, Agence Nationale de la Recherche [0426], [HD07228], Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro), and AGROCAMPUS OUEST-Institut National de la Recherche Agronomique (INRA)
Several studies have investigated RNA–DNA differences (RDD), presumably due to RNA editing, with conflicting results. We report a rigorous analysis of RDD in exonic regions in mice, taking into account critical biases in RNA-Seq analysis. Using deep-sequenced F1 reciprocal inbred mice, we mapped 40 million RNA-Seq reads per liver sample and 180 million reads per adipose sample. We found 7300 apparent hepatic RDDs using a multiple-site mapping procedure, compared with 293 RDD found using a unique-site mapping procedure. After filtering for repeat sequence, splice junction proximity, undirectional strand, and extremity read bias, 63 RDD remained. In adipose tissue unique-site mapping identified 1667 RDD, and after applying the same four filters, 188 RDDs remained. In both tissues, the filtering procedure increased the proportion of canonical (A-to-I and C-to-U) editing events. The genomic DNA of 12 RDD sites among the potential 63 hepatic RDD was tested by Sanger sequencing, three of which proved to be due to unreferenced SNPs. We validated seven liver RDD with Sequenom technology, including two noncanonical, Gm5424 C-to-I(G) and Pisd I(G)-to-A RDD. Differences in diet, sex, or genetic background had very modest effects on RDD occurrence. Only a small number of apparent RDD sites overlapped between liver and adipose, indicating a high degree of tissue specificity. Our findings underscore the importance of properly filtering for bias in RNA-Seq investigations, including the necessity of confirming the DNA sequence to eliminate unreferenced SNPs. Based on our results, we conclude that RNA editing is likely limited to hundreds of events in exonic RNA in liver and adipose.