Kbase P106420: The SAX parser does not read in a UTF-8 XML file correctly
Author   Progress Software Corporation - Progress
Access   Public
Published   26/02/2010
Status: Verified

SYMPTOM(s):

The SAX parser does not read in a UTF-8 XML file correctly

Extended characters in a UTF-8 XML file are not read correctly by the SAX parser

After importing French accented characters from a UTF-8 XML file, the characters do not display correctly

FACT(s) (Environment):

Progress 9.x
OpenEdge 10.x
All Supported Operating Systems
OpenEdge Category: I18N

CAUSE:

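The XML file was not created correctly. The accented characters in it are not encoded in UTF-8, so although the XML declaration says the file is UTF-8, the bytes for those characters are not. The SAX parser trusts the declaration and therefore misreads them. It is likely that the accented characters were added later by hand, by cut and paste from a source that uses a different code page.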
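
One way to confirm this cause is to check whether the bytes of the file really decode as UTF-8. The following is a minimal sketch in Python, not part of the original Kbase entry; the function name first_invalid_utf8_offset and the sample documents are hypothetical, and the "bad" sample assumes the accented character was saved in a single-byte code page (Latin-1 here).

```python
from typing import Optional


def first_invalid_utf8_offset(data: bytes) -> Optional[int]:
    """Return the byte offset of the first invalid UTF-8 sequence, or None."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the index of the first byte that could not be decoded.
        return err.start
    return None


# Hypothetical sample: the same document saved in two different encodings.
# "\u00e9" is e-acute; it is two bytes in UTF-8 but one byte (0xE9) in Latin-1.
document = '<?xml version="1.0" encoding="UTF-8"?><nom>Ren\u00e9</nom>'
good = document.encode("utf-8")
bad = document.encode("latin-1")  # the declaration still claims UTF-8

print(first_invalid_utf8_offset(good))
print(first_invalid_utf8_offset(bad))
```

For the correctly encoded sample the function returns None. For the mislabeled sample it returns the offset of the 0xE9 byte, which is the position to inspect in the real file.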

FIX:

The only real solution is to fix the process that creates the XML file so that it writes all of its data, including any accented characters, in the UTF-8 code page that the XML declaration specifies.
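
What "really creates its data in UTF-8" means on the producing side can be sketched as follows. This is an illustrative Python example only, since the Kbase entry does not say which tool generated the file; the function write_utf8_xml, the element names, and the file name customer.xml are hypothetical. The point is that the encoding is requested explicitly when the document is serialized, so the declaration and the bytes cannot disagree.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def write_utf8_xml(path: str, name: str) -> None:
    """Write a small XML document whose bytes and declaration are both UTF-8."""
    root = ET.Element("customer")
    ET.SubElement(root, "name").text = name
    # The serializer encodes the text itself and emits a matching declaration.
    ET.ElementTree(root).write(path, encoding="utf-8", xml_declaration=True)


# Hypothetical usage: write a name containing e-acute, then inspect the raw bytes.
path = os.path.join(tempfile.mkdtemp(), "customer.xml")
write_utf8_xml(path, "Ren\u00e9e")

with open(path, "rb") as handle:
    raw = handle.read()

# e-acute must appear as the two-byte UTF-8 sequence 0xC3 0xA9.
print(b"Ren\xc3\xa9e" in raw)
```

Letting the XML serializer do the encoding, rather than pasting text into the file afterwards, avoids the mismatch described under CAUSE.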