Functional Roles

The functional role of a feature is the task it performs in the host organism. The role is stored in the NMPDR database as a descriptive phrase. For a Protein Encoding Gene that generates a known protein, the functional role will be the protein name. Some functional roles correspond to EC Numbers. In that case, the EC number is shown in parentheses at or near the end of the text, as shown in the table below.

FIG ID Functional Role
fig|83332.1.peg.588 hypothetical protein Rv0588
fig|83332.1.peg.58 Replicative DNA helicase (EC 3.6.1.-); intein-containing
fig|71421.1.peg.117 CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase (EC 2.7.8.5)
fig|71421.1.peg.1180 Methylglyoxal synthase (EC 4.2.3.3)

For some features, the functional role is well-known; however, in most cases the role is assigned by inference (see Feature Annotation). As more data comes in from the Genome Pipeline, our ability to infer functional roles improves. For this reason, the functional role assigned to a feature may change over time. The current best guess for a feature's functional role is called its functional assignment. On the Genome Viewer Annotation Page, the current functional assignment is shown, and a button is provided that allows you to see previous assignments (see also Assertions).

It is important to be able to find genes that perform the same functional role; however, since functional roles are described using free-form text, this is not always possible. In the NMPDR, subsystems and FIGfams are used to find functional roles that should be the same, and we use this information to derive a canonical description of the functional role.

Special Punctuation

A single gene may have multiple functional roles, or we may be unsure of the functional role. This is indicated by special punctuation characters as follows.

Character Example Description
slash (/) N-acetylglucosamine kinase bacterial type predicted (EC 2.7.1.59) / Transcriptional regulator Gene performs multiple functions. In the example, the gene is both a transcriptional regulator and a type of kinase
at (@) 3-ketoacyl-CoA thiolase (EC 2.3.1.16) @ Acetyl-CoA acetyltransferase (EC 2.3.1.9) Gene performs multiple functions using the same Protein Domain. In the example, a specific site of the protein molecule acts as both a thiolase and an acetyltransferase
semi-colon (;) Lead, cadmium, zinc and mercury transporting ATPase (EC 3.6.3.3) (EC 3.6.3.5); Copper-translocating P-type ATPase (EC 3.6.3.4) Gene function is not clear, but is believed to be one of the choices present. In the example, the function is related to some sort of metal handling, either copper or lead, cadmium, zinc and mercury
pound (#) 1-pyrroline-4-hydroxy-2-carboxylate deaminase (EC 3.5.4.22) # predicted Indicates a comment. In the example, the gene performs a deaminase function, and the text predicted is a note from the annotator

Further Reading

Topic revision: r11 - 31 Mar 2009 - 21:14:20 - Bruce Parrello
 
NMPDR is a collaboration among researchers from the Computation Institute of the University of Chicago, the Fellowship for Interpretation of Genomes (FIG), Argonne National Laboratory, and the National Center for Supercomputing Applications (NCSA) at the University of Illinois. NMPDR is funded by the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract HHSN266200400042C. Banner images are copyright © Dennis Kunkel.