Skip to content

Korean POS tagging poor performance #20

@jdoerfler

Description

@jdoerfler

SpaCy is having trouble with POS tagging in Korean. When inserting a break point in the code and using the sample sentences from SpaCy documentation, the parser does perfectly, so this may be a bit more insidious of a bug.

The below example should be getting a number of different tokens but instead gets one big one.

실제적인효용은크않을지몰라도
Attributes for Token: 실제적인효용은크않을지몰라도
Tag: ncn+jp+ecs
POS: VERB

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions