[Date Prev][Date Next][Subject Prev][Subject Next][ Date Index][ Subject Index]

Re: Customization Guide Typo?



** Reply to message from "Patricia M. Godfrey"  on Fri, 18
Nov 2005 12:29:23 -0500


> Why couldn't we get some volunteers to divvy up the
> work of assigning the styles?

Because it needs consistency, a single approach. I think the beauty of a
"Style" is that it can be applied rather simply to a whole document; and I
think that creative use of search-and-replace strings (which is what you'd use
for pages 1-50, or 51-100, etc., *anyway*) is just as easily applied to the
whole document.

The complexity of the process will be largely a function of the output of the
OCRs. Do they, for example, retain empty lines that appear in the scanned
text? Do they replace long runs of space on a single line with tab characters?
A look at a snippet of the original OCR text, fresh from the OCR engine, would
be useful to determine how much effort this will require. (Or, perhaps, a raw
snippet PLUS the same snippet after you added tags...)

The OCR-ing is unquestionably the larger of the labors, by a giant factor.

-----------------------------
Robert Holmgren
holmgren@xxxxxxxx
-----------------------------