Skip to main content

Table 6 Orthographic features

From: Biomedical named entity extraction: some issues of corpus compatibilities

Feature

Example

Feature

Example

InitCap

Src

AllCaps

EBNA, LMP

InCap

mAb

CapMixAlpha

NFkappaB, EpoR

DigitOnly

1, 123

DigitSpecial

12-3

DigitAlpha

2× NFkappaB, 2A

AlphaDigitAlpha

IL23R, EIA

Hyphen

-

CapLowAlpha

Src, Ras,Epo

CapsAndDigits

32Dc13

RomanNumeral

I, II

StopWord

at, in

ATGCSeq

CCGCCC, ATAGAT

AlphaDigit

p50, p65

DigitCommaDigit

1,28

GreekLetter

alpha, beta

LowMixAlpha

mRNA, mAb