Boundary Analysis
|
Classes |
class | boost::locale::boundary::mapping< RangeIterator > |
| Class the holds boundary mapping of the text that can be used with iterators. More...
|
class | boost::locale::boundary::token_iterator< IteratorType, ValueType > |
| token_iterator is an iterator that returns text chunks between boundary positions More...
|
class | boost::locale::boundary::break_iterator< IteratorType > |
| break_iterator is bidirectional iterator that returns text boundary positions More...
|
Enumerations |
enum | boost::locale::boundary::boundary_type { boost::locale::boundary::character,
boost::locale::boundary::word,
boost::locale::boundary::sentence,
boost::locale::boundary::line
} |
enum | boost::locale::boundary::word_type {
boost::locale::boundary::word_none = 0x0000F,
boost::locale::boundary::word_number = 0x000F0,
boost::locale::boundary::word_letter = 0x00F00,
boost::locale::boundary::word_kana = 0x0F000,
boost::locale::boundary::word_ideo = 0xF0000,
boost::locale::boundary::word_any = 0xFFFF0,
boost::locale::boundary::word_letters = 0xFFF00,
boost::locale::boundary::word_kana_ideo = 0xFF000,
boost::locale::boundary::word_mask = 0xFFFFF
} |
enum | boost::locale::boundary::line_break_type { boost::locale::boundary::line_soft = 0x0F,
boost::locale::boundary::line_hard = 0xF0,
boost::locale::boundary::line_any = 0xFF,
line_mask = 0xFF
} |
enum | boost::locale::boundary::sentence_break_type { boost::locale::boundary::sentence_term = 0x0F,
boost::locale::boundary::sentence_sep = 0xF0,
boost::locale::boundary::sentence_any = 0xFF,
sentence_mask = 0xFF
} |
enum | boost::locale::boundary::character_break_type { boost::locale::boundary::character_any = 0xF,
character_mask = 0xF
} |
Functions |
unsigned | boost::locale::boundary::boundary_mask (boundary_type t) |
Detailed Description
This module contains all operations required for
boundary analysis of text: character, word, like and sentence boundaries
Enumeration Type Documentation
The enum that describes possible break types
- Enumerator:
-
character |
Find character boundaries. |
word |
Find word boundaries. |
sentence |
Find sentence boundaries. |
line |
Find a positions suitable for line breaks. |
Flags that describe a type of character break. At this point break iterator does not distinguish different kinds of characters so it is used for consistency.
- Enumerator:
-
character_any |
Not in use, just for consistency. |
Flags that describe a type of line break
- Enumerator:
-
line_soft |
Soft line break: optional but not required. |
line_hard |
Hard line break: like break is required (as per CR/LF). |
line_any |
Soft or Hard line break. |
Flags that describe a type of sentence break
- Enumerator:
-
sentence_term |
The sentence was terminated with a sentence terminator like ".", "!" possible followed by hard separator like CR, LF, PS |
sentence_sep |
The sentence does not contain terminator like ".", "!" but ended with hard separator like CR, LF, PS or end of input. |
sentence_any |
Either first or second sentence break type;. |
Flags used with word boundary analysis -- the type of the word found
- Enumerator:
-
word_none |
Not a word. |
word_number |
Word that appear to be a number. |
word_letter |
Word that contains letters, excluding kana and ideographic characters. |
word_kana |
Word that contains kana characters. |
word_ideo |
Word that contains ideographic characters. |
word_any |
Any word including numbers, 0 is special flag, equivalent to 15. |
word_letters |
Any word, excluding numbers but including letters, kana and ideograms. |
word_kana_ideo |
Word that includes kana or ideographic characters. |
word_mask |
Maximal used mask. |
Function Documentation
unsigned boost::locale::boundary::boundary_mask |
( |
boundary_type |
t |
) |
[inline] |
This function returns the mask that covers all variants for specific boundary type