1. UniProt: the Universal Protein Knowledgebase in 2023
- Author
-
Fábio Madeira, Paul Denny, Yvonne Lussi, Antonia Lock, Dushyanth Jyothi, Pedro Raposo, Daniel Rice, Prabhat Totoo, Aurélien Luciani, Rafael Santos, Tunca Dogan, Sandra Orchard, Alex Bateman, Swaathi Kandasaamy, Leonardo Jose Da Costa Gonzales, Jie Luo, Jun Fan, Giuseppe Insana, and Emily Bowler-Barnett
- Subjects
Genetics - Abstract
The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication we describe enhancements made to our data processing pipeline and to our website to adapt to an ever-increasing information content. The number of sequences in UniProtKB has risen to over 227 million and we are working towards including a reference proteome for each taxonomic group. We continue to extract detailed annotations from the literature to update or create reviewed entries, while unreviewed entries are supplemented with annotations provided by automated systems using a variety of machine-learning techniques. In addition, the scientific community continues their contributions of publications and annotations to UniProt entries of their interest. Finally, we describe our new website (https://www.uniprot.org/), designed to enhance our users’ experience and make our data easily accessible to the research community. This interface includes access to AlphaFold structures for more than 85% of all entries as well as improved visualisations for subcellular localisation of proteins.
- Published
- 2022