Downloads
NMPDR has released new genome annotations for each of the focus organisms in modified GFF3 format. GFF3, described at the
Sequence Ontology website, is a flat file format for describing genomic features. The formatted GFF3 files accessible below contain rows of records, each with nine tab-delimited fields: seqid, source, type, start, end, score, strand, phase, and attributes. The
score and
phase fields are not in use, so in each row, those fields contain the period (
.) character. Each row describes a feature, which is a region on the DNA located between start and end nucleotide coordinates. To describe a protein-encoding gene, two rows are used to record two features at the same location: gene and CDS. FASTA formatted nucleotide and amino acid sequences follow the tab-delimited table of feature annotations.