-
Notifications
You must be signed in to change notification settings - Fork 5
Description
I'm not sure I understand precisely what is happening. I have a PDF file, raster only, along with an hocr I obtained from Tesseract. When I use the python-hocr script hocr2pdf, I get the following error:
Traceback (most recent call last):
File "/Users/fms/bin/hocr2pdf", line 11, in
load_entry_point('hocr==0.2.11', 'console_scripts', 'hocr2pdf')()
File "/Users/fms/lib/python3.5/site-packages/hocr-0.2.11-py3.5.egg/hocr/commands.py", line 37, in hocr2pdf
File "/Users/fms/lib/python3.5/site-packages/hocr-0.2.11-py3.5.egg/hocr/overlay.py", line 210, in overlay
File "src/hummus/context.pyx", line 35, in hummus.context.Context.add (src/hummus/context.cpp:1510)
File "src/hummus/text.pyx", line 37, in hummus.text.Text.add_to (src/hummus/text.cpp:1525)
ValueError: ('Font not recognized by PDF', b'/Library/Fonts/Microsoft/Times')
It looks to me as though the path name is being truncated at the first space. There is a file "/Library/Fonts/Microsoft/Times New Roman.ttf" on my system. Anyone have any ideas? I wish I had a patch rather than a ticket, sorry.