Java White Papers
Instantiating JavaCC Tokenizers/Parsers to Read From Unicode Source Files
Overview: JavaCC is a popular parser generator used to implement parsers for programming languages. Unicode is now in everyday use, and Unicode-capable text editors and graphical user interfaces (GUIs) are available on all popular platforms. This makes it possible to define new programming languages that contain Unicode strings, or even Unicode identifiers and operators. As a result, some JavaCC parsers need to read from Unicode files and other Unicode sources. This paper explains how to instantiate a JavaCC parser so that it reads from a source, such as a file or another stream, whose character encoding is not necessarily the default encoding of the operating system.
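
The general idea can be sketched in plain Java. This is a minimal illustration and not the paper's own code: it assumes the source is UTF-8 and decodes it through an `InputStreamReader` constructed with an explicit charset, so the platform default is never consulted. JavaCC-generated parsers provide a constructor that takes a `java.io.Reader`; the class name `MyParser` in the comment is hypothetical, and `readAll` only stands in for a parser so that the sketch runs without a generated grammar.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class UnicodeSourceDemo {

    /** Wraps a byte stream in a Reader that decodes with an explicit charset. */
    static Reader openReader(InputStream in, Charset charset) {
        return new InputStreamReader(in, charset);
    }

    /** Stand-in for a parser (assumption): drains the Reader into a String. */
    static String readAll(Reader reader) {
        StringBuilder sb = new StringBuilder();
        char[] buf = new char[1024];
        try {
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Sample source text with non-ASCII characters, encoded as UTF-8 bytes.
        String source = "\u03bbx \u2192 x";
        byte[] utf8 = source.getBytes(StandardCharsets.UTF_8);

        Reader reader = openReader(new ByteArrayInputStream(utf8), StandardCharsets.UTF_8);

        // With a real grammar, the Reader would be handed to the generated parser:
        //   MyParser parser = new MyParser(reader);   // MyParser is hypothetical
        System.out.println(readAll(reader).equals(source));
    }
}
```

For a file on disk, the `ByteArrayInputStream` would be replaced by a `FileInputStream`, with the charset chosen to match how the file was actually saved.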
| Field | Value |
|---|---|
| Publisher | Xerox |
| Date Published | October 2005 |
| Format | White Papers |


