New on STN
July 2010
CAS coverage of global patent authorities has expanded to 61 with the addition of Costa Rica
Databases in Focus
Chemical Abstracts Plus
(CAplusSM) and CAS REGISTRYSM
USGENE and PCTGEN: new FASTA display formats added
01. February 2010
The new sequence display formats FASTA and FASTA2 have been added to the sequence databases USGENE and PCTGEN. These FASTA display formats are widely accepted by 3rd party sequence analysis tools, so that sequence search results from USGENE and PCTGEN can directly be used for further analysis.
The new format FASTA comprises a header line providing a unique description of the sequence (accession number, sequence identity number and patent publication number) and the actual sequence in lines of 70 characters. FASTA2 is a lower-priced alternative format which provides the same sequence information with a truncated header line.
FASTA and FASTA2 can be used in combination with the standard display formats ALL and BRIEF generating no additional costs.
Example of FASTA- and FASTA2-format:
=> D FASTA
FASTA:
>USGENE|20100017904.32958|Protein|sequence 32958 from US20100017904
mgevvatweateggagvkgpvvvtgasgflgswlvmkllqagytvratvrdpanvvktkplldlpgater
lslwkadladegsfddairgctgvfhvatpmdfeskdpenevikptvegmmsimrackeagtvrrivfts
sagtvnieerqrpvydqdnwsdvdfcqrvkmtgwmyfvskslaekaamayaaehgldfisiiptlvvgpf
lsagmppslitalalvtgneahysilkqvqfvhlddlcdahlflfehpaaagryvcsshdatihglaaml
rdrypeydiperfpgieddlqpvhfsskklldhgftfkytvedmfdaairmcrekgliplatagggralp
=> D FASTA2
FASTA2:
>USGENE|Protein
mgevvatweateggagvkgpvvvtgasgflgswlvmkllqagytvratvrdpanvvktkplldlpgater
lslwkadladegsfddairgctgvfhvatpmdfeskdpenevikptvegmmsimrackeagtvrrivfts
sagtvnieerqrpvydqdnwsdvdfcqrvkmtgwmyfvskslaekaamayaaehgldfisiiptlvvgpf
lsagmppslitalalvtgneahysilkqvqfvhlddlcdahlflfehpaaagryvcsshdatihglaaml
rdrypeydiperfpgieddlqpvhfsskklldhgftfkytvedmfdaairmcrekgliplatagggralp
The revised summary sheets are available at:
http://www.stn-international.com/sum_sheets.html
For pricing information see HELP COST in the file.
