Skip to content

Optical Character Recognition (OCR) #316

@tolik518

Description

@tolik518

Recognizing factorials directly in a image could be a fun addition and would give the bot a much wider coverage. As far as I have seen most OCR crates are fairly easy to integrate.

Downsides:

  • slower response times/latency (since currently we're working on the comments sequentially)
  • higher resource use with OCR
  • false positives/negatives
    • might make the bot look dumb

Crates

I don't have a strong preference right now, but from what I have seen floneum looks promising as it uses models which could work best with structured inputs.
Traditional OCR like tesseract would probably be a lot less resource hungry, but less accurate and would probably need more configuration. Tesseract biggest upside by far is that it should be more resource-efficient than "model-based" OCRs

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestproposalOpen for discussion

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions