
Table 3 Classification of sequences in the final alignment

From: Bioinformatic curation and alignment of genotyped hepatitis B virus (HBV) sequence data from the GenBank public database

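| Genotype | C..* | CN.* | C.W* | CNW* | S..* | SN.* | S.W* | SNW* | Total | C**U | S**U |
|---|---|---|---|---|---|---|---|---|---|---|---|
| A | 424 | 9 | 87 | 3 | 4676 | 64 | 596 | 9 | 5868 | 0 | 149 |
| B | 458 | 5 | 119 | 2 | 2957 | 60 | 986 | 43 | 4630 | 7 | 8 |
| C | 868 | 4 | 210 | 5 | 5684 | 92 | 916 | 41 | 7820 | 2 | 130 |
| D | 544 | 1 | 127 | 6 | 6276 | 61 | 1257 | 28 | 8300 | 35 | 163 |
| E | 165 | 0 | 41 | 0 | 1594 | 8 | 234 | 1 | 2043 | 0 | 0 |
| F | 104 | 2 | 37 | 2 | 722 | 49 | 63 | 6 | 985 | 2 | 5 |
| G | 17 | 0 | 1 | 0 | 157 | 0 | 14 | 0 | 189 | 0 | 4 |
| H | 11 | 0 | 5 | 0 | 76 | 0 | 16 | 0 | 108 | 0 | 0 |
| I | 5 | 0 | 0 | 0 | 14 | 0 | 4 | 0 | 23 | 0 | 0 |
| Total | 2596 | 21 | 627 | 18 | 22,156 | 334 | 4086 | 128 | 29,966 | 46 | 459 |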
  1. Each column header is a four-position code. The first position is “C” for “Complete” sequences or “S” for “Subgenomic” sequences. The second position is “N” if the sequence contains at least one “N” character, otherwise “.”. The third position is “W” if the sequence contains at least one wobble, otherwise “.”. The last position is “U” for an “Unverified” sequence, otherwise “.”. “*” indicates that either value for that position is included, and “**” indicates that either value is included for each of two adjoining positions. The “Total” column is the sum of the first eight columns.
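
The four-position codes used in the column headers can be illustrated with a short Python sketch. This is not the authors' implementation: the function names `classify` and `matches` are hypothetical, the completeness and "Unverified" flags are assumed to be supplied by the caller, and "wobble" is assumed here to mean any IUPAC ambiguity code other than "N".

```python
# Assumption: a "wobble" is any IUPAC ambiguity code other than N.
WOBBLE_CODES = set("RYKMSWBDHV")


def classify(sequence: str, complete: bool, unverified: bool = False) -> str:
    """Build a four-position class code such as 'C.W.' or 'SN.U' (hypothetical helper)."""
    seq = sequence.upper()
    first = "C" if complete else "S"           # Complete vs. Subgenomic
    second = "N" if "N" in seq else "."        # at least one N character
    third = "W" if any(b in WOBBLE_CODES for b in seq) else "."  # at least one wobble
    fourth = "U" if unverified else "."        # Unverified flag
    return first + second + third + fourth


def matches(code: str, header: str) -> bool:
    """Test a class code against a column header; '*' accepts either value."""
    return len(code) == len(header) and all(
        h == "*" or h == c for c, h in zip(code, header)
    )


# A subgenomic, unverified sequence containing one wobble (R) and no N.
code = classify("ACGTRACGT", complete=False, unverified=True)
print(code, matches(code, "S.W*"), matches(code, "S**U"))
```

Under these assumptions, a sequence is counted in exactly one of the first eight columns, and additionally in "C**U" or "S**U" if it is unverified.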