Re: Exporting files with accented characters



Reply to note from Edward Mendelson  Thu, 3 Feb 2022 21:55:29 -0500

Edward,

> Xy4 seems to store them with unique codes that don't correspond to
> anything else.

Xy4 uses a proprietary (idiosyncratic) Speedo 3-byte character set for
character codes in the range 256-909. Each Speedo char in this range
consists of three bytes: Byte 1 is always Ascii-254, and Bytes 2 and 3
are within the range 0-255. (The factory-issued file CHARSET lists all
of the available 3-byte characters, with descriptions.) Most of the
Speedo glyphs are alien to Ascii 0-255 in Code Page 437, but some have
screen cognates. If you inspect your XYFILE.XY in a byte editor, you'll
see that the e-acute and capital U-umlaut are 1-byte characters from CP
437, while the others -- a-grave, i-umlaut, and o-grave -- are 3-byte
Speedo characters introduced by Ascii-254. In contrast, all of the
characters in NBFILE.NB are 1-byte Ascii. As the W4W RTF filter does
not recognize Speedo chars, when it encounters one, it outputs "_".
This explains the discrepancies between XYFILE.RTF and NBFILE.RTF.
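
If you'd rather not eyeball the file in a byte editor, the layout described above can be checked with a few lines of Python. This is only a rough sketch, not anything that ships with Xy4 or W4W. The function names are my own, the sample bytes after Ascii-254 are made-up placeholders rather than real Speedo codes, and it assumes every Ascii-254 followed by two more bytes starts a 3-byte character (a stray literal Ascii-254 would be misread). It makes no attempt to map Bytes 2 and 3 to a character number.

```python
SPEEDO_LEAD = 0xFE  # Ascii-254, the lead byte of a 3-byte Speedo character


def find_speedo_chars(data: bytes):
    """Return (offset, byte2, byte3) for each 3-byte sequence led by Ascii-254."""
    hits = []
    i = 0
    while i < len(data):
        # Assumption: any Ascii-254 with two bytes after it is a Speedo lead byte.
        if data[i] == SPEEDO_LEAD and i + 2 < len(data):
            hits.append((i, data[i + 1], data[i + 2]))
            i += 3  # skip the whole 3-byte character
        else:
            i += 1
    return hits


def replace_speedo_chars(data: bytes, placeholder: bytes = b"_") -> bytes:
    """Mimic the visible effect of the filter: one placeholder per Speedo char."""
    out = bytearray()
    i = 0
    while i < len(data):
        if data[i] == SPEEDO_LEAD and i + 2 < len(data):
            out += placeholder
            i += 3
        else:
            out.append(data[i])  # 1-byte characters pass through untouched
            i += 1
    return bytes(out)


if __name__ == "__main__":
    # 0x82 is e-acute in CP 437; the bytes after 0xFE are hypothetical.
    sample = b"caf\x82 \xfe\x01\x02 x"
    for offset, b2, b3 in find_speedo_chars(sample):
        print(f"offset {offset}: 254 {b2} {b3}")
    print(replace_speedo_chars(sample))
```

To try it on a real file, read it in binary mode, e.g. find_speedo_chars(open("XYFILE.XY", "rb").read()). A file with no hits, as NBFILE.NB should be, contains only 1-byte characters.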

-- 
Carl Distefano
cld@xxxxxxxxxx