Boundary Analysis


Classes

class  boost::locale::boundary::mapping< RangeIterator >
 Class that holds the boundary mapping of the text and can be used with iterators.
class  boost::locale::boundary::token_iterator< IteratorType, ValueType >
 token_iterator is an iterator that returns the text chunks between boundary positions.
class  boost::locale::boundary::break_iterator< IteratorType >
 break_iterator is a bidirectional iterator that returns text boundary positions.

Enumerations

enum  boost::locale::boundary::boundary_type { boost::locale::boundary::character, boost::locale::boundary::word, boost::locale::boundary::sentence, boost::locale::boundary::line }
enum  boost::locale::boundary::word_type {
  boost::locale::boundary::word_none = 0x0000F, boost::locale::boundary::word_number = 0x000F0, boost::locale::boundary::word_letter = 0x00F00, boost::locale::boundary::word_kana = 0x0F000,
  boost::locale::boundary::word_ideo = 0xF0000, boost::locale::boundary::word_any = 0xFFFF0, boost::locale::boundary::word_letters = 0xFFF00, boost::locale::boundary::word_kana_ideo = 0xFF000,
  boost::locale::boundary::word_mask = 0xFFFFF
}
enum  boost::locale::boundary::line_break_type { boost::locale::boundary::line_soft = 0x0F, boost::locale::boundary::line_hard = 0xF0, boost::locale::boundary::line_any = 0xFF, line_mask = 0xFF }
enum  boost::locale::boundary::sentence_break_type { boost::locale::boundary::sentence_term = 0x0F, boost::locale::boundary::sentence_sep = 0xF0, boost::locale::boundary::sentence_any = 0xFF, sentence_mask = 0xFF }
enum  boost::locale::boundary::character_break_type { boost::locale::boundary::character_any = 0xF, character_mask = 0xF }

Functions

unsigned boost::locale::boundary::boundary_mask (boundary_type t)

Detailed Description

This module contains all operations required for boundary analysis of text: character, word, line and sentence boundaries.

Enumeration Type Documentation

enum boost::locale::boundary::boundary_type

The enum that describes the possible break types.

Enumerator:
character  Find character boundaries.
word  Find word boundaries.
sentence  Find sentence boundaries.
line  Find positions suitable for line breaks.

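enum boost::locale::boundary::character_break_type

Flags that describe the type of a character break. At this point the break iterator does not distinguish between different kinds of characters, so this enum exists only for consistency with the other break types.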

Enumerator:
character_any  Not in use, just for consistency.

enum boost::locale::boundary::line_break_type

Flags that describe the type of a line break.

Enumerator:
line_soft  Soft line break: optional but not required.
line_hard  Hard line break: a line break is required (as per CR/LF).
line_any  Soft or Hard line break.
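
To illustrate the difference between the two flags, here is a minimal sketch that splits text only where a break is required and ignores optional break opportunities. It does not use the Boost.Locale classes: the `line_break` record, the `split_at_hard_breaks` helper and the way positions are reported are assumptions made for illustration, and the flag values are copied from the enum listing on this page.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Local copies of the flag values listed on this page; real code would
// use the enumerators from boost::locale::boundary instead.
enum line_break_type : unsigned {
    line_soft = 0x0F,
    line_hard = 0xF0,
    line_any  = 0xFF
};

// Hypothetical record: offset of a break position plus its flag.
struct line_break {
    std::size_t pos;
    unsigned    flag;
};

// Split `text` at every hard break; soft breaks are only opportunities
// and are skipped here. Assumes `breaks` is sorted by position.
inline std::vector<std::string> split_at_hard_breaks(
    const std::string& text, const std::vector<line_break>& breaks)
{
    std::vector<std::string> parts;
    std::size_t start = 0;
    for (const line_break& b : breaks) {
        bool hard = (b.flag & line_hard) != 0;
        if (hard && b.pos > start && b.pos <= text.size()) {
            parts.push_back(text.substr(start, b.pos - start));
            start = b.pos;
        }
    }
    if (start < text.size())
        parts.push_back(text.substr(start)); // trailing text without a hard break
    return parts;
}
```

A line-wrapping routine would treat the skipped soft breaks as candidate wrap points, while still always breaking at the hard ones.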

enum boost::locale::boundary::sentence_break_type

Flags that describe the type of a sentence break.

Enumerator:
sentence_term  The sentence was terminated with a sentence terminator like ".", "!", possibly followed by a hard separator like CR, LF, PS.
sentence_sep  The sentence does not contain a terminator like ".", "!" but ended with a hard separator like CR, LF, PS or the end of input.
sentence_any  Either of the two sentence break types above.

enum boost::locale::boundary::word_type

Flags used with word boundary analysis: the type of the word found.

Enumerator:
word_none  Not a word.
word_number  Word that appears to be a number.
word_letter  Word that contains letters, excluding kana and ideographic characters.
word_kana  Word that contains kana characters.
word_ideo  Word that contains ideographic characters.
word_any  Any word, including numbers. 0 is a special flag, equivalent to 15.
word_letters  Any word, excluding numbers but including letters, kana and ideograms.
word_kana_ideo  Word that includes kana or ideographic characters.
word_mask  Maximal used mask.
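
The word flags are bit masks, so a combined value such as word_letters selects several word kinds at once. The sketch below shows how a flag can be tested against a mask with a bitwise AND. It is self-contained and does not call Boost.Locale: the `chunk` record and the `matches` and `select` helpers are hypothetical, and it assumes that the flag reported for a chunk has its bits inside the range of exactly one category.

```cpp
#include <string>
#include <vector>

// Local copies of the flag values listed on this page; real code would
// use the enumerators from boost::locale::boundary instead.
enum word_type : unsigned {
    word_none      = 0x0000F,
    word_number    = 0x000F0,
    word_letter    = 0x00F00,
    word_kana      = 0x0F000,
    word_ideo      = 0xF0000,
    word_any       = 0xFFFF0,
    word_letters   = 0xFFF00,
    word_kana_ideo = 0xFF000,
    word_mask      = 0xFFFFF
};

// Hypothetical record: a chunk of text plus the flag reported for it.
struct chunk {
    std::string text;
    unsigned    flag;
};

// True when at least one bit of the flag falls inside the mask.
inline bool matches(unsigned flag, unsigned mask) {
    return (flag & mask) != 0;
}

// Keep only the text of the chunks selected by `mask`.
inline std::vector<std::string> select(const std::vector<chunk>& chunks,
                                       unsigned mask)
{
    std::vector<std::string> out;
    for (const chunk& c : chunks)
        if (matches(c.flag, mask))
            out.push_back(c.text);
    return out;
}
```

With this layout, `select(chunks, word_any)` drops whitespace and punctuation but keeps numbers, while `select(chunks, word_letters)` drops numbers as well.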


Function Documentation

unsigned boost::locale::boundary::boundary_mask ( boundary_type  t  )  [inline]

This function returns the mask that covers all variants of a specific boundary type.
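
One plausible reading of this description is a simple mapping from each boundary type to the corresponding `*_mask` enumerator. The following is a sketch under that assumption, not the library's actual implementation; the function is named `boundary_mask_sketch` to make that clear, and the mask values are copied from the enum listing on this page.

```cpp
// Local copies of the declarations listed on this page.
enum boundary_type { character, word, sentence, line };

enum : unsigned {
    character_mask = 0xF,
    word_mask      = 0xFFFFF,
    sentence_mask  = 0xFF,
    line_mask      = 0xFF
};

// Assumed behavior: return the mask covering every flag of the given
// boundary type.
inline unsigned boundary_mask_sketch(boundary_type t) {
    switch (t) {
    case character: return character_mask;
    case word:      return word_mask;
    case sentence:  return sentence_mask;
    case line:      return line_mask;
    }
    return 0; // not reached for valid enumerators
}
```

A caller that wants every boundary of a given kind, regardless of sub-type, could pass the returned value wherever a flag mask is expected.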


Generated on Thu Mar 18 23:02:03 2010 for Boost.Locale by doxygen 1.5.6