Text Processing

Detailed Description

This set of classes and functions extends the string and localization support of the C++ standard library to work with unicode characters and strings. A unicode character type and string class (a specialization of std::basic_string) can be used to hold unicode text. A set of functions allows to transform and classify individual characters. Text can be converted e.g. between different encodings using i/o streams and text codecs. A regular expression class allows to search and match patterns in unicode strings. Localization facets are available for the systems which support standard C++ locales.

Classes

class  Base64Codec
 A codec for base-64 encoding. More...
 
class  BasicTextBuffer< CharT, ByteT >
 Converts character sequences with different encodings. More...
 
class  BasicTextIStream< CharT, ByteT >
 Converts character sequences using a codec. More...
 
class  BasicTextOStream< CharT, ByteT >
 Converts character sequences using a codec. More...
 
class  BasicTextStream< CharT, ByteT >
 Converts character sequences using a codec. More...
 
class  Char
 Unicode character type. More...
 
class  InvalidRegex
 Invalid regular expression. More...
 
class  IStringStream
 Unicode string input stream. More...
 
class  OStringStream
 Unicode string output stream. More...
 
class  Regex
 Regular Expressions for Unicode Strings. More...
 
class  RegexSMatch
 Result of a regular expression match. More...
 
class  String
 Unicode capable basic_string. More...
 
class  StringStream
 Unicode string stream. More...
 
class  TextBuffer
 Converts character sequences with different encodings. More...
 
class  TextCodec< CharT, ByteT >
 Converts between character encodings. More...
 
class  TextIStream
 Text input stream for unicode character conversion. More...
 
class  TextOStream
 Text output stream for unicode character conversion. More...
 
class  TextStream
 Text stream for unicode character conversion. More...
 
class  Utf16BECodec
 UTF-16 big endian codec. More...
 
class  Utf16LECodec
 UTF-16 little endian codec. More...
 
class  Utf32BECodec
 UTF-32 big endian codec. More...
 
class  Utf32LECodec
 UTF-32 little endian codec. More...
 
class  Utf8Appender
 UTF-8 string output iterator. More...
 
class  Utf8Codec
 Convert between unicode and UTF-8. More...
 
class  Utf8Convert
 UTF-8 string converter. More...