Apache OpenOffice (AOO) Bugzilla – Issue 81854
Combine OCR with PDF import
Last modified: 2013-08-07 14:38:26 UTC
Since the next new release will import PDFs (I hear), then this is a good time to consider integrating OCR into the new feature. People don't necessarily know beforehand if a PDF was scanned or generated directly from Adobe. In law firms, attorneys frequently receive batches of PDFs by email to which they must respond by quoting parts of other PDFs. It's understood that OCR isn't a perfect science, but getting 90% of the characters right is lot of time and retyping that would be saved. Thus integrating OCR will (1) increase functionality by making people more efficient, (2) reduce the inevitable support questions of "why my PDF didn't import", and (3) expand the potential market segment. Thank you.
Reassigned
*** Issue 81854 has been confirmed by votes. ***
This isn't getting enough attention. I really hope that someone makes this. Linux really lacks OCR, not that it's a particularly easy thing to code...