[Expat-discuss] Processing of incorrect characters
Michael B. Allen
miallen at eskimo.com
Tue Feb 25 16:12:16 EST 2003
On Tue, 25 Feb 2003 12:41:20 +0300
Ivan Murashko <Ivan.Murashko at Arcadia.spb.ru> wrote:
> I process a big xml file (more than 1 Gb) by means of expat. This file
> contains erroneous character, for an example character with code 0x03.
>
> Parser returns XML_ERROR_INVALID_TOKEN error.
>
> Can I say to expat to ignore these characters?
I don't think so. Perhaps you can search for the invalid characters
in the buffers passed to XML_ParseBuffer and convert them in-place to
something benign.
Mike
--
A program should be written to model the concepts of the task it
performs rather than the physical world or a process because this
maximizes the potential for it to be applied to tasks that are
conceptually similar and, more important, to tasks that have not
yet been conceived.
More information about the Expat-discuss
mailing list