PDF anomaly with Chinese
Posted: 2021-03-02 05:17:47
An anomaly came to light through a post on the Scrivener forum. A Chinese user found that s/he couldn't copy and paste Chinese text from a PDF compiled from Scrivener. I did a bit of detective work and found that this happens when running Catalina 10.15.7, but not when running Big Sur 11.2.2. I then experimented further and found that the same is true of Chinese printed to PDF by NWP under 10.15.7. In other words, this is an Apple bug, which they seem to have largely solved in Big Sur. I'm posting this here so that anyone using other CJK or non-Roman languages on 10.15.7 (or perhaps earlier versions of macOS) can check whether the same is true for them (if it is of any relevance!).
To this end I attach a zip containing: (1) pdf_test.rtf; (2) pdf_test.pdf, a PDF printed from NWP under 10.15.7; (3) pdf_test_2.pdf, a short file printed from NWP after import from DOCX, to see if that made any difference; (4) pdf_test_BS.pdf, the longer text printed from NWP running under 11.2.2.
If you open the three PDFs in Preview, select and copy the text, and paste it into a TextEdit file, the first two will show that there is something there, but the Chinese is not visible. The third (11.2.2) shows the Chinese text, though there seems to be a problem with the font in the heading: on 10.15.7 it is not displayed, and on 11.2.2 my version of TextEdit doesn't recognise the font! You can try it yourself, if you're interested, using the RTF.
I think the same will be true of many other programs based on Apple's TextKit.
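
For anyone who wants to see what actually ends up in the pasted text rather than judging by eye, here is a minimal Python sketch that counts what kinds of characters a string contains. The function name classify_pasted_text and the four categories are purely illustrative, and it is only an assumption that the "invisible" text might consist of Private Use Area code points or U+FFFD replacement characters; that has not been confirmed for these PDFs.

```python
import unicodedata


def classify_pasted_text(text):
    """Count non-whitespace characters in `text` by rough category.

    Hypothetical helper for illustration: it only distinguishes common
    CJK ideograph blocks, Private Use code points, U+FFFD, and the rest.
    """
    counts = {"cjk": 0, "private_use": 0, "replacement": 0, "other": 0}
    for ch in text:
        if ch.isspace():
            continue
        cp = ord(ch)
        # CJK Unified Ideographs, Extension A, and Extension B
        if (0x4E00 <= cp <= 0x9FFF
                or 0x3400 <= cp <= 0x4DBF
                or 0x20000 <= cp <= 0x2A6DF):
            counts["cjk"] += 1
        elif unicodedata.category(ch) == "Co":
            # "Co" is the Unicode general category for Private Use
            counts["private_use"] += 1
        elif cp == 0xFFFD:
            counts["replacement"] += 1
        else:
            counts["other"] += 1
    return counts


if __name__ == "__main__":
    import sys
    # Paste the copied text on standard input, then end input (Ctrl-D).
    print(classify_pasted_text(sys.stdin.read()))
```

If the text copied from a 10.15.7 PDF reports no "cjk" characters while the text from the 11.2.2 PDF does, that would be consistent with what shows up in TextEdit.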

Mark