81854 – Combine OCR with PDF import

Status:	CONFIRMED

Alias:	None

Product:	Writer
Classification:	Application
Component:	open-import
Version:	OOo 1.0.0
Hardware:	All All

Importance:	P3 Trivial with 8 votes
Target Milestone:	---
Assignee:	AOO issues mailing list
QA Contact:

URL:
Keywords:

Depends on:
Blocks:

Reported:	2007-09-22 22:31 UTC by phills
Modified:	2013-08-07 14:38 UTC
CC List:	2 users

See Also:
Issue Type:	ENHANCEMENT
Latest Confirmation in:	---
Developer Difficulty:	---

Description phills 2007-09-22 22:31:58 UTC

Since the next release will import PDFs (I hear), this is a good time to
consider integrating OCR into the new feature.  People don't necessarily know
beforehand whether a PDF was scanned or generated directly from Adobe.

In law firms, attorneys frequently receive batches of PDFs by email, to which
they must respond by quoting parts of other PDFs.  It's understood that OCR
isn't a perfect science, but getting 90% of the characters right would save a
lot of time and retyping.

Thus integrating OCR will (1) increase functionality by making people more
efficient, (2) reduce the inevitable support questions of "why didn't my PDF
import", and (3) expand the potential market segment.  Thank you.

Comment 1 eric.savary 2007-09-25 10:51:19 UTC

Reassigned

Comment 2 cianoz 2008-09-08 23:06:46 UTC

*** Issue 81854 has been confirmed by votes. ***

Comment 3 amoose136 2008-11-01 05:37:06 UTC

This isn't getting enough attention. I really hope that someone makes this.
Linux really lacks OCR, not that it's a particularly easy thing to code...