Java White Papers

Instantiating JavaCC Tokenizers/Parsers to Read From Unicode Source Files

Overview JavaCC is a popular parser-generator used to implement parsers for programming languages. Unicode is a practical reality, and Unicode-capable text editors and Graphical User Interfaces (GUIs) are available on all popular platforms. This makes it possible to define new programming languages that contain Unicode strings or even Unicode identifiers and operators. Thus some JavaCC parsers need to read from Unicode files and other Unicode sources. This paper explains how to instantiate a JavaCC parser to read from a source (file or whatever) that is not necessarily in the default character encoding of one's operating system.

Further White Paper Details
PublisherXerox File FormatPDF
Date PublishedOctober 2005 Downloads3
FormatWhite Papers   
Topics
Thin clients switch on digitally excluded

Thin clients switch on digitally excluded

Case study: Digital inclusion project tackles social exclusion in Liverpool more

Renault goes multilingual

Renault goes multilingual

Case study: Translation tech turns docs into 23 languages… more


Quick Sitemap Links: