DNA Binding Sites

DNA Binding Consensus Sequences in S. pombe

Transcription Factor Binding Sites

Site Name Consensus Sequence Sequence Ontology ID
Sap1_recognition_motif TARGCAGNTNYAACGMG SO:0001864
AP_1_binding_site TGACTCA SO:0001842
calcineurin-dependent response element (CDRE motif) GNGGCKCA SO:0001865
cyclic AMP response element (CRE) TGACGTCA SO:0001843
CSL_response_element GTGRGAA SO:0001839
copper-response element (CuRE) HTHNNGCTGD SO:0001844
DNA damage response element (DRE) CGWGGWNGMM SO:0001845
FLEX_element GTAAACAAACAAAM SO:0001846
forkhead_motif TTTRTTTACA SO:0001847
homol_D_box CAGTCACA (or inverted form TGTGACTG) SO:0001848
homol_E_box ACCCTACCCT (or inverted form AGGGTAGGGT) SO:0001849
heat shock element (HSE) NGAAN (at least 3 copies) SO:0001850
iron_repressed_GATA_element WGATAA SO:0001851
mating_type_M_box ACAAT SO:0001852
sterol_regulatory_element ATCACCCCAC (and variants) SO:0001861
STREP_motif CCCCTC SO:0001859
TR_box TTCTTTGTTY SO:0001858
Ace2_UAS CCAGCC SO:0001857
CCAAT_motif CCAAT SO:0001856
MluI cell cycle box (MCB) ACGCGT SO:0001855

Other DNA Binding Sites

Site Name Consensus Sequence Sequence Ontology ID
GT_dinucleotide_repeat (GT)n SO:0001862
GTT_trinucleotide_repeat (GTT)n SO:0001863
rDNA_intergenic_spacer_element AGGTAAGGGTAATGCAC SO:0001860

SO IDs link to MISO, the Sequence Ontology Browser. Consensus sequences use the IUBMB Nomenclature for Incompletely Specified Bases in Nucleic Acid Sequences; briefly:

  • R = purine (A or G)
  • Y = pyrimidine (C or T)
  • W = A or T
  • M = A or C
  • K = G or T
  • H = A, C or T
  • B = C, G or T