Re: Customization Guide Typo?
- Subject: Re: Customization Guide Typo?
- From: "Robert Holmgren" holmgren@xxxxxxxx
- Date: Fri, 18 Nov 2005 15:21:12 -0500
** Reply to message from "Patricia M. Godfrey" on Fri, 18 Nov 2005 12:29:23 -0500
> Why couldn't we get some volunteers to divvy up the
> work of assigning the styles?
Because it needs consistency, a single approach. I think the beauty of a
"Style" is that it can be applied rather simply to a whole document; and I
think that creative use of search-and-replace strings (which is what you'd use
for pages 1-50, or 51-100, etc., *anyway*) is just as easily applied to the
whole document.
The complexity of the process will depend largely on what the OCR software
produces. Does the output, for example, retain empty lines that appear in the
scanned text? Does it replace long runs of spaces on a single line with tab
characters?
A look at a snippet of the original OCR text, fresh from the OCR engine, would
be useful to determine how much effort this will require. (Or, perhaps, a raw
snippet PLUS the same snippet after you added tags...)
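Those two questions could be answered quickly with a small script run over a raw snippet. What follows is only a rough sketch in Python: the function name `profile_ocr_text`, the sample text, and the threshold of three or more consecutive spaces for a "long run" are all assumptions made for illustration, not anything taken from the actual OCR output.

```python
import re


def profile_ocr_text(text: str) -> dict:
    """Count layout features in a raw OCR snippet (hypothetical helper)."""
    lines = text.splitlines()
    return {
        "total_lines": len(lines),
        # Lines that are empty or contain only whitespace.
        "empty_lines": sum(1 for line in lines if not line.strip()),
        # Lines in which the OCR engine emitted at least one tab character.
        "lines_with_tabs": sum(1 for line in lines if "\t" in line),
        # Lines that still hold a run of 3+ spaces (assumed threshold).
        "lines_with_space_runs": sum(
            1 for line in lines if re.search(r" {3,}", line)
        ),
    }


# Made-up sample standing in for a real OCR snippet.
sample = "CHAPTER 1\n\nName     Value\nalpha\t1\n"
print(profile_ocr_text(sample))
```

If the empty-line count is zero where the scanned page clearly had blank lines, or tabs appear where the page had column spacing, that says a good deal about how much search-and-replace work the tagging will need.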
The OCR-ing is unquestionably the larger of the labors, by a giant factor.
-----------------------------
Robert Holmgren
holmgren@xxxxxxxx
-----------------------------