Milestone: M1 — Real OCR - Implement PDF → Image conversion - Add preprocessing (grayscale, threshold, deskewing) - Integrate Tesseract - Add fallback OCR logic - Store OCR artifacts - Add OCR confidence scoring
Milestone: M1 — Real OCR