The DNA nucleotide codes describe the four base compounds that make up a DNA sequence. The codes are as follows.

Name Letter Type Pairs With Formula
adenine a purine guanine C5H5N5
guanine g purine adenine C5H5N5O
cytosine c pyrimidine thymine C4H5N3O
thymine t pyrimidine cytosine C5H6N2O2

The SproutDatabase mostly manipulates fully-realized DNA sequences for complete genomes, which means only the four letters shown above will appear in our DNA sequences; however, when doing a pattern scan, you can also specify the StandardAmbiguityCodes. These are wild cards that indicate multiple possible nucleotide values, as shown in the table below. You can also use the code u to indicate uracil (C4H4N2O2), the compound that replaces thymine when RNA is created.

Code Nucleotides
M {A, C}
R {A, G}
W {A, T}
S {C, G}
Y {C, T}
K {G, T}
V {A, C, G}
H {A, C, T}
D {A, G, T}
B {C, G, T}
N {A, C, G, T}
SequencingForm
Sequence 000100
Summary Definitions of the four DNA nucleotides
Topic revision: r2 - 11 Oct 2008 - 23:47:08 - BruceParrello
 
NMPDR is a collaboration among researchers from the Computation Institute of the University of Chicago, the Fellowship for Interpretation of Genomes (FIG), Argonne National Laboratory, and the National Center for Supercomputing Applications (NCSA) at the University of Illinois. NMPDR is funded by the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract HHSN266200400042C. Banner images are copyright © Dennis Kunkel.