The sequence search is designed to locate matching DNA or protein sequences among selected genomes in the NMPDR database. It includes three BLAST? tools and two pattern-matching tools, protScan and dnaScan.

screen shot of the tool search
The Tool dropdown allows you to select which tool to use for the search. In each case, the genomes to be searched are selected using the GenomeControl.

  • blastp compares an input protein sequence to the protein databases for the selected genomes.
  • blastn compares an input DNA sequence to the nucleotide databases for the selected genomes, which should be closely related to the organism that is the source of the query sequence.
  • blastx translates an input DNA sequence and compares all frames to the protein databases for the selected genomes, which may be distantly related to the organism that is the source of the query sequence.
  • dnaScan matches a DNA search pattern against the nucleotide databases for the selected genomes.
  • protScan matches a protein search pattern against the protein databases for the selected genomes.


For the BLAST tools, enter a DnaSequence? , a ProteinSequence? , or a FigId for your input (which is called the query sequence). DNA or protein sequences may be in raw format, listing just the single letter codes, or the input may be in FastaFormat, which includes a header, as shown below.

    >2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase
    MQQHTVYIGIGSNIGDRMSHLREALAMLNNLESTSVTAISAIYMTEPVGEINQERFYNGV
    LAVTTSMQPEALRQECKAIERTIGRPATYQRWSPRVIDLDLLLFDTLVCQTDTLAIPHPE
    LHHRNFVLIPLLDIANPTHPLLGKSIRELLALCPDRSVLIKLQENIAL

For the Scan tools, enter a sequence motif using the single letter code for amino acids or nucleotides. See the article on SearchPatterns? for information about creating patterns for the scan tools.

For the BLAST tools, you may specify options in the Blast Options input box.

Search results include the name of the feature nearest the matched sequence. Neighborhood Width indicates how many base pairs in each direction from the center of the matched sequence will be searched to find a feature.

Topic revision: r1 - 21 Apr 2008 - 06:34:21 - BruceParrello
 
NMPDR is a collaboration among researchers from the Computation Institute of the University of Chicago, the Fellowship for Interpretation of Genomes (FIG), Argonne National Laboratory, and the National Center for Supercomputing Applications (NCSA) at the University of Illinois. NMPDR is funded by the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract HHSN266200400042C. Banner images are copyright © Dennis Kunkel.