Some UTF8 signs not transformed into PDF

U+0084 (German start quotation mark), U+21FE (rightwards open-headed arrow) e.g. are not found in the PDF, The german quatotation mark is also missing in the epub3, while the arrow is not. I wonder about the origin of this lacking. Bernhard -- spitzhalde9 D-79853 lenzkirch bernhard.kleine@gmx.net www.b-kleine.com, www.urseetal.net - thunderbird mit enigmail GPG schlüssel: D5257409 fingerprint: 08 B7 F8 70 22 7A FC C1 15 49 CA A6 C7 6F A0 2E D5 25 74 09

U+0084 is the low double-9 quote only in Windows CP 1252 encoding. In Unicode it’s U+201E („). So maybe your input is not encoded as UTF-8? Sent from MailDroid -----Original Message----- From: Bernhard Kleine <bernhard.kleine@gmx.net> To: "oxygen-user@oxygenxml.com" <oxygen-user@oxygenxml.com> Sent: Sat, 17 Feb 2018 16:47 Subject: [oXygen-user] Some UTF8 signs not transformed into PDF U+0084 (German start quotation mark), U+21FE (rightwards open-headed arrow) e.g. are not found in the PDF, The german quatotation mark is also missing in the epub3, while the arrow is not. I wonder about the origin of this lacking. Bernhard -- spitzhalde9 D-79853 lenzkirch bernhard.kleine@gmx.net www.b-kleine.com, www.urseetal.net - thunderbird mit enigmail GPG schlüssel: D5257409 fingerprint: 08 B7 F8 70 22 7A FC C1 15 49 CA A6 C7 6F A0 2E D5 25 74 09

I am quite convinced that all I write is coded in UTF8, but how can I see whether that is true? I am open to suggestions. Am 17.02.2018 um 19:59 schrieb gerrit.imsieke@le-tex.de:
U+0084 is the low double-9 quote only in Windows CP 1252 encoding. In Unicode it’s U+201E („). So maybe your input is not encoded as UTF-8?
Sent from MailDroid <https://goo.gl/ODgwBb>
-----Original Message----- From: Bernhard Kleine <bernhard.kleine@gmx.net> To: "oxygen-user@oxygenxml.com" <oxygen-user@oxygenxml.com> Sent: Sat, 17 Feb 2018 16:47 Subject: [oXygen-user] Some UTF8 signs not transformed into PDF
U+0084 (German start quotation mark), U+21FE (rightwards open-headed arrow) e.g. are not found in the PDF,
The german quatotation mark is also missing in the epub3, while the arrow is not.
I wonder about the origin of this lacking.
Bernhard
-- spitzhalde9 D-79853 lenzkirch bernhard.kleine@gmx.net <mailto:bernhard.kleine@gmx.net> www.b-kleine.com <http://www.b-kleine.com>, www.urseetal.net <http://www.urseetal.net> - thunderbird mit enigmail GPG schlüssel: D5257409 <tel:5257409> fingerprint: 08 B7 F8 70 22 7A FC C1 15 49 CA A6 C7 6F A0 2E D5 25 74 09 <tel:5257409>
-- spitzhalde9 D-79853 lenzkirch bernhard.kleine@gmx.net www.b-kleine.com, www.urseetal.net - thunderbird mit enigmail GPG schlüssel: D5257409 fingerprint: 08 B7 F8 70 22 7A FC C1 15 49 CA A6 C7 6F A0 2E D5 25 74 09

Am 17.02.2018 um 19:59 schrieb gerrit.imsieke@le-tex.de:
U+0084 is the low double-9 quote only in Windows CP 1252 encoding I took it from the code table in Oxygen. There is no indication that I will not be displayed when the document is transformed to pdf. It has also a UTF code. I have changed it however to the U+201E code.
-- spitzhalde9 D-79853 lenzkirch bernhard.kleine@gmx.net www.b-kleine.com, www.urseetal.net - thunderbird mit enigmail GPG schlüssel: D5257409 fingerprint: 08 B7 F8 70 22 7A FC C1 15 49 CA A6 C7 6F A0 2E D5 25 74 09

Hi Bernhard, If your XML documents start with the processing-instruction:
<?xml version='1.0' encoding='UTF-8'?>
then it's UTF-8 encoded meaning that you can save in it any character that you want. If it does not start with an <?xml...?> encoding declaration, then it's again UTF-8 encoded because it's the default encoding for XML. So you can save any character you want in the XML but this does not mean that Oxygen or the PDF output will be able to render the character. For rendering the character you need to use appropriate fonts. In the Oxygen Preferences->"Fonts" page you can configure the fonts used for editing. Also for building the PDF output from Docbook, if you edit the transformation scenario, in the "Parameters" list you can search for "font" and you will find various font-related parameters. For each of the parameters there are lists of fonts, used as fallbacks. Basically when a character needs to be rendered, the first font in the list is asked, if it does not support it, then the next font is asked and so on. Ideally you should avoid using fancy characters which are not supported in all fonts, maybe replace them with images instead. Regards, Radu Radu Coravu <oXygen/> XML Editor, Schema Editor and XSLT Editor/Debugger http://www.oxygenxml.com On 2/18/2018 5:49 PM, Bernhard Kleine wrote:
Am 17.02.2018 um 19:59 schrieb gerrit.imsieke@le-tex.de:
U+0084 is the low double-9 quote only in Windows CP 1252 encoding I took it from the code table in Oxygen. There is no indication that I will not be displayed when the document is transformed to pdf. It has also a UTF code. I have changed it however to the U+201E code.
_______________________________________________ oXygen-user mailing list oXygen-user@oxygenxml.com https://www.oxygenxml.com/mailman/listinfo/oxygen-user
participants (3)
-
Bernhard Kleine
-
gerrit.imsieke@le-tex.de
-
Oxygen XML Editor Support (Radu Coravu)