Table 2. Example dataset words in categories C1 and C2

From: Computing symmetrical strength of N-grams: a two pass filtering approach in automatic classification of text documents

Category  N-Gram              Documents
                              D1   D2   D3   D4   D5   D6
C1        “penalty shootout”  0    0    0    0    0    0
          “penalty corner”    1    1    0    1    2    0
          “beautifully”       0    1    1    2    0    1
          “play”              1    1    1    2    2    1

                              D7   D8   D9   D10  D11  D12
C2        “penalty shootout”  1    2    0    0    0    1
          “penalty corner”    0    0    0    0    1    0
          “beautifully”       1    0    1    2    0    0
          “play”              0    0    2    0    1    1
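
For readers who want to work with these counts, the following is a minimal sketch of one way to hold Table 2 in memory and summarize it per category. The dictionary layout and the names COUNTS and summarize are illustrative assumptions, not taken from the paper, and the two summaries computed here (total occurrences and number of documents containing the N-gram) are plain tallies rather than the paper's symmetrical strength measure.

```python
# Occurrence counts transcribed from Table 2.
# The data layout and all names below are illustrative assumptions.
COUNTS = {
    "C1": {  # columns are documents D1..D6
        "penalty shootout": [0, 0, 0, 0, 0, 0],
        "penalty corner": [1, 1, 0, 1, 2, 0],
        "beautifully": [0, 1, 1, 2, 0, 1],
        "play": [1, 1, 1, 2, 2, 1],
    },
    "C2": {  # columns are documents D7..D12
        "penalty shootout": [1, 2, 0, 0, 0, 1],
        "penalty corner": [0, 0, 0, 0, 1, 0],
        "beautifully": [1, 0, 1, 2, 0, 0],
        "play": [0, 0, 2, 0, 1, 1],
    },
}


def summarize(counts):
    """Return {category: {ngram: (total_occurrences, document_frequency)}}."""
    return {
        category: {
            # Total occurrences, and how many documents contain the N-gram.
            ngram: (sum(row), sum(1 for c in row if c > 0))
            for ngram, row in rows.items()
        }
        for category, rows in counts.items()
    }


if __name__ == "__main__":
    for category, rows in summarize(COUNTS).items():
        for ngram, (total, doc_freq) in rows.items():
            print(f"{category}  {ngram:<18} total={total}  docs={doc_freq}")
```

On this data, “penalty shootout” appears only in C2 documents, “penalty corner” appears in four C1 documents but only one C2 document, and “beautifully” and “play” occur in both categories.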