NWP Pro seems to mis-translate some Word punctuation glyphs
Posted: 2010-05-29 21:36:03
My editorial work on an internet-only literary magazine involves accepting lots of files in various formats, usually Word 97, converting them to RTF, editing them, converting those RTF files to HTML, and massaging those files into XHTML.
I find NWP excellent to use as a primary editing tool, mainly because of its macro language, which I use to clean up the typing styles that various authors present.
I have run into a problem, though, and I wonder if anyone can help.
Sometimes a whole DOC file, and sometimes just part of one, carries infected characters, usually left or right single or double quotes, and sometimes en and em dashes. These characters have a hidden component: that is, when I delete one by backspacing over it, the first backspace deletes the visible character, and the second backspace seems to do nothing but in fact deletes another, invisible character.
On the other hand, if I open the DOC file with TextEdit, save the file as RTF, then open that RTF file in NWP, the infected characters have disappeared and the text is clean.
This suggests to me that NWP is failing to interpret Word 97 (or some other version of Word) punctuation glyphs correctly, glyphs that TextEdit can read correctly.
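One way to find out what the invisible second character actually is would be to list the code points sitting next to each curly quote or dash. Below is a minimal Python sketch of that check. It assumes the text has already been extracted as a Unicode string, and that the hidden component is a control, format, private-use, or unassigned character; that is a guess, since I have not identified it. The function names are made up for illustration and have nothing to do with NWP's macro language.

```python
import unicodedata

# Curly single/double quotes, en dash, em dash
PUNCTUATION = "\u2018\u2019\u201c\u201d\u2013\u2014"

def is_invisible(ch):
    """True for characters that normally draw nothing (assumed culprits)."""
    if ch in "\n\r\t":
        return False  # ordinary whitespace controls are legitimate
    # Cc = control, Cf = format (zero-width space, soft hyphen, ...),
    # Co = private use, Cn = unassigned
    return unicodedata.category(ch) in {"Cc", "Cf", "Co", "Cn"}

def find_hidden_companions(text):
    """Report invisible characters directly before or after a quote or dash.

    Returns tuples of (punctuation index, punctuation character,
    invisible index, invisible code point as 'U+XXXX').
    """
    hits = []
    for i, ch in enumerate(text):
        if ch in PUNCTUATION:
            for j in (i - 1, i + 1):
                if 0 <= j < len(text) and is_invisible(text[j]):
                    hits.append((i, ch, j, f"U+{ord(text[j]):04X}"))
    return hits

def strip_hidden_companions(text):
    """Return the text with the reported invisible characters removed."""
    drop = {j for _, _, j, _ in find_hidden_companions(text)}
    return "".join(c for k, c in enumerate(text) if k not in drop)

if __name__ == "__main__":
    # Made-up sample: a zero-width space after the closing quote and a
    # soft hyphen after the em dash
    sample = "\u201cHello\u201d\u200b world \u2014\u00ad ok"
    for hit in find_hidden_companions(sample):
        print(hit)
    print(repr(strip_hidden_companions(sample)))
```

If the reported code points turn out to be the same one or two values every time, that would at least show what NWP is inserting when it reads the Word punctuation.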
I attach two files:
tranter-nwp-flaws1.rtf
part of a Microsoft Word file opened with NWP v1.4.1 and saved as an RTF file,
showing the faulty characters.
and
tranter-nwp-flaws2.rtf
part of the same RTF file saved by NWP and then opened with TextEdit v1.6 (264) and saved as an RTF file, showing no faulty characters.
Uhhh... unbelievably, in a forum devoted to NWP, which saves all files in RTF, the RTF extension is not allowed as an upload! So I have converted the files to DOC format and uploaded them.