Run CleanSeq

The input GenBank file might contain characters other than "A", "G", "C" and "T", which are used to represent gaps and degenerate bases. This kind of sequence will cause problems in the translation steps during the annotation. As a result, cpgview does not accept sequences containing characters other than "A", "G", "C" and "T". This CleanSeq module will convert a sequence with unacceptable characters to a sequence only containing "A", "G", "C" and "T".

Basicly, the frequencies of all characters in the input sequence will be counted, the degenerate bases will be replaces with the standard bases having the highest frequencies.

Option I: enter a GenBank accession number.

GenBank Accession Number (e.g. NC_000932).

Note: you can enter the GenBank accession numbers with the version ID such as NC_000932.x. However, they will be converted to the version that is stored in our database.

 - 
    
OR
 

Option II: upload a file in GenBank format.

The input file should be in GenBank format. A sample file is here (GenBank File
 -