1. Topology based identification and comprehensive classification of four-transmembrane helix containing proteins (4TMs) in the human genome
- Author
-
Arunkumar Krishnan, Helgi B. Schiöth, Valentina Pivotti, Markus Sällman Almén, Misty M. Attwood, and Samira Yazdi
- Subjects
0301 basic medicine ,Drug targets ,Proteome ,Medical Biotechnology ,Topology prediction ,Biology ,Topology ,Proteomics ,Genome ,03 medical and health sciences ,Medicinsk bioteknologi ,Human proteome project ,Genetics ,Humans ,Function ,Cancer ,4TM ,Genome, Human ,Human proteome ,Membrane Proteins ,Transmembrane protein ,Transmembrane domain ,Structure function ,030104 developmental biology ,Membrane protein ,Four transmembrane ,Human genome ,Research Article ,Biotechnology - Abstract
Background Membrane proteins are key components in a large spectrum of diverse functions and thus account for the major proportion of the drug-targeted portion of the genome. From a structural perspective, the α-helical transmembrane proteins can be categorized into major groups based on the number of transmembrane helices and these groups are often associated with specific functions. When compared to the well-characterized seven-transmembrane containing proteins (7TM), other TM groups are less explored and in particular the 4TM group. In this study, we identify the complete 4TM complement from the latest release of the human genome and assess the 4TM structure group as a whole. We functionally characterize this dataset and evaluate the resulting groups and ubiquitous functions, and furthermore describe disease and drug target involvement. Results We classified 373 proteins, which represents ~7 % of the human membrane proteome, and includes 69 more proteins than our previous estimate. We have characterized the 4TM dataset based on functional, structural, and/or evolutionary similarities. Proteins that are involved in transport activity constitute 37 % of the dataset, 23 % are receptor-related, and 13 % have enzymatic functions. Intriguingly, proteins involved in transport are more than double the 15 % of transporters in the entire human membrane proteome, which might suggest that the 4TM topological architecture is more favored for transporting molecules over other functions. Moreover, we found an interesting exception to the ubiquitous intracellular N- and C-termini localization that is found throughout the entire membrane proteome and 4TM dataset in the neurotransmitter gated ion channel families. Overall, we estimate that 58 % of the dataset has a known association to disease conditions with 19 % of the genes possibly involved in different types of cancer. Conclusions We provide here the most robust and updated classification of the 4TM complement of the human genome as a platform to further understand the characteristics of 4TM functions and to explore pharmacological opportunities. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2592-7) contains supplementary material, which is available to authorized users.
- Full Text
- View/download PDF