Recognizing factorials directly in a image could be a fun addition and would give the bot a much wider coverage. As far as I have seen most OCR crates are fairly easy to integrate.
Downsides:
- slower response times/latency (since currently we're working on the comments sequentially)
- higher resource use with OCR
- false positives/negatives
- might make the bot look dumb
Crates
I don't have a strong preference right now, but from what I have seen floneum looks promising as it uses models which could work best with structured inputs.
Traditional OCR like tesseract would probably be a lot less resource hungry, but less accurate and would probably need more configuration. Tesseract biggest upside by far is that it should be more resource-efficient than "model-based" OCRs
Recognizing factorials directly in a image could be a fun addition and would give the bot a much wider coverage. As far as I have seen most OCR crates are fairly easy to integrate.
Downsides:
Crates
I don't have a strong preference right now, but from what I have seen floneum looks promising as it uses models which could work best with structured inputs.
Traditional OCR like tesseract would probably be a lot less resource hungry, but less accurate and would probably need more configuration. Tesseract biggest upside by far is that it should be more resource-efficient than "model-based" OCRs