Sequence Search

The sequence search is designed to locate matching DNA or protein sequences among selected genomes in the NMPDR database. It includes three BLAST tools and two pattern-matching tools, protScan and dnaScan.

screen shot of the tool search
The Tool dropdown allows you to select which tool to use for the search. In each case, the genomes to be searched are selected using the Genome Control.

  • blastp compares an input protein sequence to the protein databases for the selected genomes.
  • blastn compares an input DNA sequence to the nucleotide databases for the selected genomes, which should be closely related to the organism that is the source of the query sequence.
  • blastx translates an input DNA sequence and compares all frames to the protein databases for the selected genomes, which may be distantly related to the organism that is the source of the query sequence.
  • dnaScan matches a DNA search pattern against the nucleotide databases for the selected genomes.
  • protScan matches a protein search pattern against the protein databases for the selected genomes.


For the BLAST tools, enter a DNA sequence, a protein sequence, or a FIG ID for your input (which is called the query sequence). DNA or protein sequences may be in raw format, listing just the single letter codes, or the input may be in %FIG{FASTA} format, which includes a header, as shown below.

    >2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase
    MQQHTVYIGIGSNIGDRMSHLREALAMLNNLESTSVTAISAIYMTEPVGEINQERFYNGV
    LAVTTSMQPEALRQECKAIERTIGRPATYQRWSPRVIDLDLLLFDTLVCQTDTLAIPHPE
    LHHRNFVLIPLLDIANPTHPLLGKSIRELLALCPDRSVLIKLQENIAL

For the Scan tools, enter a sequence motif using the single letter code for amino acids or nucleotides. Degenerate positions are indicated with the ambiguous nucleotide code or by "X" for amino acids. Patterns may be constucted by leaving a blank letter space between elements of the pattern. See the article on Search Patterns for a thorough discussion of creating patterns for the scan tools.

For the BLAST tools, you may specify options in the Blast Options input box.

Search results include the name of the feature nearest the matched sequence. Neighborhood Width indicates how many base pairs in each direction from the center of the matched sequence will be searched to find a feature.

The Sequence Search can be found here.

Topic revision: r8 - 01 Apr 2009 - 16:18:32 - Bruce Parrello
 
Notice to NMPDR Users - The NMPDR BRC contract has ended and bacterial data from NMPDR has been transferred to PATRIC (http://www.patricbrc.org), a new consolidated BRC for all NIAID category A-C priority pathogenic bacteria. NMPDR was a collaboration among researchers from the Computation Institute of the University of Chicago, the Fellowship for Interpretation of Genomes (FIG), Argonne National Laboratory, and the National Center for Supercomputing Applications (NCSA) at the University of Illinois. NMPDR is funded by the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract HHSN266200400042C. Banner images are copyright © Dennis Kunkel.