Table 2 Extraction of data and preparation of data set

From: Bioinformatic curation and alignment of genotyped hepatitis B virus (HBV) sequence data from the GenBank public database

 

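| Genotype | Genotyped | Overlength (removed) | Raw | No BLAST hits (removed) | Placed | Consensus with gaps (removed) | Mismatch (included) | Overhang (removed) | Final |
|---|---|---|---|---|---|---|---|---|---|
| A | 5959 | 14 | 5945 | 0 | 5945 | 77 | 0 | 0 | 5868 |
| B | 4734 | 24 | 4710 | 0 | 4710 | 79 | 0 | 1 | 4630 |
| C | 8205 | 72 | 8133 | 90 | 8043 | 222 | 0 | 1 | 7820 |
| D | 8508 | 103 | 8405 | 6 | 8399 | 99 | 0 | 0 | 8300 |
| E | 2053 | 2 | 2051 | 0 | 2051 | 8 | 0 | 0 | 2043 |
| F | 1009 | 4 | 1005 | 1 | 1004 | 19 | 0 | 0 | 985 |
| G | 250 | 2 | 248 | 4 | 244 | 55 | 0 | 0 | 189 |
| H | 113 | 3 | 110 | 0 | 110 | 2 | 0 | 0 | 108 |
| I | 25 | 1 | 24 | 0 | 24 | 1 | 0 | 0 | 23 |
| Total | 30,856 | 225 | 30,631 | 101 | 30,530 | 562 | 0 | 2 | 29,966 |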
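
The counts in Table 2 can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the original curation pipeline. It assumes, based on the column labels and the numbers themselves, that Raw = Genotyped minus Overlength, Placed = Raw minus No BLAST hits, and Final = Placed minus Consensus with gaps minus Overhang, with Mismatch sequences retained. The names `ROWS`, `TOTAL`, `row_is_consistent`, and `column_sums` are hypothetical.

```python
# Per-genotype counts transcribed from Table 2.
# Column order: genotyped, overlength, raw, no_blast_hits, placed,
#               consensus_with_gaps, mismatch, overhang, final
ROWS = {
    "A": (5959, 14, 5945, 0, 5945, 77, 0, 0, 5868),
    "B": (4734, 24, 4710, 0, 4710, 79, 0, 1, 4630),
    "C": (8205, 72, 8133, 90, 8043, 222, 0, 1, 7820),
    "D": (8508, 103, 8405, 6, 8399, 99, 0, 0, 8300),
    "E": (2053, 2, 2051, 0, 2051, 8, 0, 0, 2043),
    "F": (1009, 4, 1005, 1, 1004, 19, 0, 0, 985),
    "G": (250, 2, 248, 4, 244, 55, 0, 0, 189),
    "H": (113, 3, 110, 0, 110, 2, 0, 0, 108),
    "I": (25, 1, 24, 0, 24, 1, 0, 0, 23),
}
TOTAL = (30856, 225, 30631, 101, 30530, 562, 0, 2, 29966)


def row_is_consistent(row):
    """Check the assumed step-by-step relationships within one row."""
    (genotyped, overlength, raw, no_blast, placed,
     gaps, _mismatch, overhang, final) = row
    # Assumption: mismatch sequences are kept ("included"), so they are
    # not subtracted at any step.
    return (
        raw == genotyped - overlength
        and placed == raw - no_blast
        and final == placed - gaps - overhang
    )


def column_sums(rows):
    """Sum each column over all genotype rows."""
    return tuple(sum(column) for column in zip(*rows.values()))


if __name__ == "__main__":
    for genotype, row in ROWS.items():
        print(genotype, "consistent:", row_is_consistent(row))
    print("Column sums match Total row:", column_sums(ROWS) == TOTAL)
```

Keeping the rows as plain tuples in the table's column order makes it easy to compare the code against the table by eye; a check like this only confirms internal consistency of the reported counts, not the correctness of the filtering steps themselves.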