PomBase home

Reference - PMID:36408920 - UniProt: the Universal Protein Knowledgebase in 2023.

Reference summary

PubMed ID
PMID:36408920
Title
UniProt: the Universal Protein Knowledgebase in 2023.
Authors
UniProt Consortium
Citation
Nucleic Acids Res 2023 Jan 06;51(D1):D523-D531
Publication year
2023
Abstract
The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication we describe enhancements made to our data processing pipeline and to our website to adapt to an ever-increasing information content. The number of sequences in UniProtKB has risen to over 227 million and we are working towards including a reference proteome for each taxonomic group. We continue to extract detailed annotations from the literature to update or create reviewed entries, while unreviewed entries are supplemented with annotations provided by automated systems using a variety of machine-learning techniques. In addition, the scientific community continues their contributions of publications and annotations to UniProt entries of their interest. Finally, we describe our new website (https://www.uniprot.org/), designed to enhance our users' experience and make our data easily accessible to the research community. This interface includes access to AlphaFold structures for more than 85% of all entries as well as improved visualisations for subcellular localisation of proteins.

Annotation

Modification

MOD:00226 - 1'-(8alpha-FAD)-L-histidine

Genes:

MOD:01625 - 1-thioglycine

Genes:

MOD:00153 - 3'-(8alpha-FAD)-L-histidine

Genes:

MOD:00793 - dehydroalanine (Cys)

Genes:

MOD:00257 - dipyrrolylmethanemethyl-L-cysteine

Genes:

MOD:00689 - disulfide crosslinked residues

Genes:

MOD:00441 - geranylgeranylated residue

Genes:

MOD:00818 - glycosylphosphatidylinositolated residue

Genes:

MOD:00125 - hypusine

Genes:

MOD:00156 - L-2',4',5'-topaquinone

Genes:

MOD:00042 - L-aspartic 4-phosphoric anhydride

Genes:

MOD:00234 - L-cysteine glutathione disulfide

Genes:

MOD:00114 - L-cysteine methyl ester

Genes:

MOD:00304 - L-leucine methyl ester

Genes:

MOD:01982 - N,N,N-trimethylglycine

Genes:

MOD:00050 - N-acetyl-L-alanine

Genes:

MOD:00060 - N-acetyl-L-serine

Genes:

MOD:00006 - N-glycosylated residue

Genes:

MOD:00351 - N-glycyl-1-(phosphatidyl)ethanolamine

Genes:

MOD:00068 - N-myristoylglycine

Genes:

MOD:00171 - N-seryl-glycosylphosphatidylinositolethanolamine

Genes:

MOD:00310 - N5-methyl-L-arginine

Genes:

MOD:00080 - N5-methyl-L-glutamine

Genes:

MOD:00083 - N6,N6,N6-trimethyl-L-lysine

Genes:

MOD:00084 - N6,N6-dimethyl-L-lysine

Genes:

MOD:00064 - N6-acetyl-L-lysine

Genes:

MOD:00126 - N6-biotinyl-L-lysine

Genes:

MOD:00123 - N6-carboxy-L-lysine

Genes:

MOD:00127 - N6-lipoyl-L-lysine

Genes:

MOD:00085 - N6-methyl-L-lysine

Genes:

MOD:00128 - N6-pyridoxal phosphate-L-lysine

Genes:

MOD:00046 - O-phospho-L-serine

Genes:

MOD:00047 - O-phospho-L-threonine

Genes:

MOD:00159 - O-phosphopantetheine-L-serine

Genes:

MOD:00048 - O4'-phospho-L-tyrosine

Genes:

MOD:00890 - phosphorylated L-histidine

Genes:

MOD:01154 - pyruvic acid

Genes:

MOD:00111 - S-farnesyl-L-cysteine

Genes:

MOD:00115 - S-palmitoyl-L-cysteine

Genes:

Protein sequence feature

SO:0001808 - mitochondrial_targeting_signal

Genes:

SO:0000418 - signal_peptide

Genes: