I just noticed something strange...
I filtered the scispacy.csv to show only rows containing:
- (sentences containing TNF) AND (entities containing TNF) (see scispacy_match.pdf attached)
- (sentences containing TNF) AND (entities **NOT** containing TNF) (see scispacy_mismatch.pdf attached)
The second filter turned up a bunch of results where TNF appears in the sentence but was not recognized as an entity. I don't see why it would detect the entity in some sentences and not in others.
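For anyone who wants to reproduce the two filters without a spreadsheet, here is a minimal sketch using only the standard library. The column names `sentence` and `entities` and the sample rows are assumptions for illustration; the real scispacy.csv may be laid out differently.

```python
import csv
import io

TERM = "TNF"


def split_by_entity_match(rows, term=TERM,
                          sentence_col="sentence", entities_col="entities"):
    """Split rows whose sentence mentions `term` into (match, mismatch).

    Column names are hypothetical; adjust them to the real CSV header.
    """
    match, mismatch = [], []
    for row in rows:
        if term not in row[sentence_col]:
            continue  # sentence does not mention the term at all
        if term in row[entities_col]:
            match.append(row)      # term was picked up as an entity
        else:
            mismatch.append(row)   # term is in the sentence but not in the entities
    return match, mismatch


# Made-up sample standing in for scispacy.csv (not real data from the file).
SAMPLE = """sentence,entities
TNF levels were elevated.,TNF
Treatment reduced TNF signalling.,Treatment
IL-6 was unchanged.,IL-6
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
match, mismatch = split_by_entity_match(rows)
print(len(match), len(mismatch))
```

To run it on the real file, replace the `io.StringIO(SAMPLE)` part with an open file handle for scispacy.csv. Note that this is a plain substring check, so it would also count longer tokens that happen to contain "TNF".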
I also found a bunch of sentences with this typo: TNF-<space>𝛼 (TNF- 𝛼)
scispacy caught the "TNF-" but left out the alpha because of the space after the dash. (See scispacy_TNF-space.pdf attached). I don't know if there's anything we can do about that, but I thought it should be noted.
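One possible workaround would be to close the gap in the text before it goes through scispacy. The sketch below is only an idea, not something the pipeline currently does: the helper name `close_tnf_gap` is hypothetical, it only handles "TNF-" followed by an alpha, and I have not checked whether scispacy then tags the full "TNF-𝛼" as one entity.

```python
import re

# Whitespace sitting between "TNF-" and an alpha character.
# Both the plain Greek alpha (α) and the math-italic alpha (𝛼) are covered.
_TNF_GAP = re.compile(r"(?<=TNF-)\s+(?=[α𝛼])")


def close_tnf_gap(text):
    """Remove the stray whitespace in 'TNF- 𝛼' so it reads 'TNF-𝛼'.

    Hypothetical preprocessing helper; other text is left unchanged.
    """
    return _TNF_GAP.sub("", text)


print(close_tnf_gap("TNF- 𝛼 was elevated"))
```

The pattern is kept deliberately narrow so it does not touch cases like "TNF- induced", where the space may be intentional.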
scispacy_TNF-space.pdf
scispacy_mismatch.pdf
scispacy_match.pdf