Difference between revisions of "NCBI"
From Organic Design wiki
m |
(Adding info) |
||
Line 1: | Line 1: | ||
− | Genbank is a flat file database structure for nucleotide sequence information and auxillary information. | + | Genbank is a flat file database structure for primary nucleotide sequence information and auxillary information. These records can |
+ | display the [http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html#OriginB ORIGIN] information for different nucleotide molecular types, and have no limit on the length of sequence displayed. Entire chromosomes can be stored as a genbank record for an organism of interest, potentially making the disk storage of the record very large. | ||
− | An example Genbank [http://www.ncbi.nlm.nih.gov/ | + | An example Genbank [http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html sample record] |
== Regular expressions matching parts we care about == | == Regular expressions matching parts we care about == | ||
+ | *The unique [http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html#AccessionB ACCESSION] number | ||
+ | *The ORIGIN field | ||
+ | *Any amino acid [http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html#TranslationB /translation] in the [http://www.ncbi.nlm.nih.gov/Sitemap/samplerecord.html#FeaturesB FEATURE] table | ||
+ | |||
+ | ==See also== | ||
+ | * [http://www.ncbi.nlm.nih.gov/Genbank/index.html NCBI GenBank Overview] |
Revision as of 04:52, 5 May 2007
Genbank is a flat file database structure for primary nucleotide sequence information and auxillary information. These records can display the ORIGIN information for different nucleotide molecular types, and have no limit on the length of sequence displayed. Entire chromosomes can be stored as a genbank record for an organism of interest, potentially making the disk storage of the record very large.
An example Genbank sample record
Regular expressions matching parts we care about
- The unique ACCESSION number
- The ORIGIN field
- Any amino acid /translation in the FEATURE table